Statistics How To

Data Distribution

In statistics, a data distribution is a function or listing that shows all the possible values (or intervals) of the data and, importantly, how often each value occurs. The data in a distribution are often ordered from smallest to largest, and graphs and charts make it easy to see both the values and the frequency with which they appear.
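As a minimal sketch of the "listing" form of a distribution, the Python snippet below counts how often each value occurs in a small sample and orders the result from smallest to largest. The sample itself (sibling counts for ten people) is made up for illustration.

```python
from collections import Counter

# Hypothetical sample: number of siblings reported by ten people.
data = [2, 0, 1, 1, 3, 1, 0, 2, 1, 2]

# Count how often each value occurs, then order the values from smallest to largest.
counts = Counter(data)
distribution = sorted(counts.items())

# Print each value next to its frequency.
for value, frequency in distribution:
    print(value, frequency)
```

Each printed row pairs a value with its frequency, which is the same information a bar chart of the data would show.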

From a distribution we can calculate the probability of any particular observation in the sample space, or the probability that an observation will have a value less than (or greater than) a point of interest.
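The sketch below illustrates these calculations for a small, made-up sample, using relative frequencies as estimates of probability. The helper names (`prob_equal`, `prob_less_than`, `prob_greater_than`) are hypothetical and chosen only for this example.

```python
from collections import Counter

# Hypothetical sample: number of siblings reported by ten people.
data = [2, 0, 1, 1, 3, 1, 0, 2, 1, 2]
counts = Counter(data)
n = len(data)

def prob_equal(x):
    """Relative frequency of observations equal to x."""
    return counts[x] / n

def prob_less_than(x):
    """Relative frequency of observations strictly below x."""
    return sum(c for v, c in counts.items() if v < x) / n

def prob_greater_than(x):
    """Relative frequency of observations strictly above x."""
    return sum(c for v, c in counts.items() if v > x) / n

print(prob_equal(1), prob_less_than(2), prob_greater_than(2))
```

With a sample this small the estimates are rough; they become more reliable as the number of observations grows.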

For continuous data, the function that describes the density of the values is called a probability density function, sometimes abbreviated PDF.
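In symbols, and using notation that is not in the original text (f for the density, X for the random variable, a and b for two points of interest), a probability density function for a continuous variable satisfies the following standard properties:

```latex
f(x) \ge 0, \qquad
\int_{-\infty}^{\infty} f(x)\,dx = 1, \qquad
P(a \le X \le b) = \int_{a}^{b} f(x)\,dx
```

In other words, probabilities correspond to areas under the curve rather than to the height of the curve at a single point.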

One Often-Encountered Data Distribution

Some statistical distributions come up so often that they have their own names; one of these is the normal distribution, also called the bell curve. When graphed (with values running from smallest to largest along the x axis and the frequency with which they occur on the y axis), it looks like a tidy bell shape, with tails on both sides. The graph is continuous, which means every point is included and there are no gaps between points. It is also symmetric about a central point (the mean).
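The bell shape and the symmetry can be checked numerically. The sketch below writes out the standard normal density formula by hand; the function name `normal_pdf` and the example mean and standard deviation (100 and 15) are assumptions made for illustration.

```python
import math

def normal_pdf(x, mean, sd):
    """Density of a normal distribution with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2 * math.pi))

# Hypothetical parameters chosen for the example.
mean, sd = 100.0, 15.0

# Points the same distance below and above the mean have the same density.
left = normal_pdf(mean - 10, mean, sd)
right = normal_pdf(mean + 10, mean, sd)

# The curve is highest at the mean and falls away in both tails.
peak = normal_pdf(mean, mean, sd)
tail = normal_pdf(mean + 3 * sd, mean, sd)

print(left, right, peak, tail)
```

Here `left` and `right` match, `peak` is the largest value, and `tail` is small but not zero, which is what the bell shape with tails on both sides describes.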

The normal distribution is actually an infinite family of distributions rather than just one. Each member is fully defined by its mean and standard deviation, but all of them share many of the same properties.
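One such shared property is that the probability of falling within one standard deviation of the mean is the same for every member of the family. The sketch below checks this with `statistics.NormalDist` from the Python standard library (Python 3.8 or later); the three parameter pairs and the helper name `prob_within_one_sd` are arbitrary choices for illustration.

```python
from statistics import NormalDist

# Three hypothetical members of the normal family, each defined by a mean and a standard deviation.
family = [
    NormalDist(mu=0, sigma=1),
    NormalDist(mu=100, sigma=15),
    NormalDist(mu=-3, sigma=0.5),
]

def prob_within_one_sd(dist):
    """Probability that a value lies within one standard deviation of the mean."""
    return dist.cdf(dist.mean + dist.stdev) - dist.cdf(dist.mean - dist.stdev)

shares = [prob_within_one_sd(d) for d in family]
print(shares)
```

All three values come out at roughly 0.68, even though the curves differ in location and spread.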
