Fitting Bell Curves to Data Distributions using Visualization

01/11/2023
by   Eric Newburger, et al.
0

Idealized probability distributions, such as normal or other curves, lie at the root of confirmatory statistical tests. But how well do people understand these idealized curves? In practical terms, does the human visual system allow us to match sample data distributions with hypothesized population distributions from which those samples might have been drawn? And how do different visualization techniques impact this capability? This paper shares the results of a crowdsourced experiment that tested the ability of respondents to fit normal curves to four different data distribution visualizations: bar histograms, dotplot histograms, strip plots, and boxplots. We find that the crowd can estimate the center (mean) of a distribution with some success and little bias. We also find that people generally overestimate the standard deviation, which we dub the "umbrella effect" because people tend to want to cover the whole distribution using the curve, as if sheltering it from the heavens above, and that strip plots yield the best accuracy.

READ FULL TEXT

page 7

page 12

research
02/16/2023

A numerical approximation method for the Fisher-Rao distance between multivariate normal distributions

We present a simple method to approximate Rao's distance between multiva...
research
06/21/2020

Equivalence of several curves assessing the similarity between probability distributions

The recent advent of powerful generative models has triggered the renewe...
research
09/13/2018

Receiver Operating Characteristic (ROC) Curves

Receiver operating characteristic (ROC) curves are used ubiquitously to ...
research
10/19/2018

Population and Empirical PR Curves for Assessment of Ranking Algorithms

The ROC curve is widely used to assess the quality of prediction/classif...
research
12/03/2021

Inference for ROC Curves Based on Estimated Predictive Indices

We provide a comprehensive theory of conducting in-sample statistical in...
research
02/01/2017

Information-theoretic interpretation of tuning curves for multiple motion directions

We have developed an efficient information-maximization method for compu...

Please sign up or login with your details

Forgot password? Click here to reset