Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
When you describe a dataset, the mean or median only tells half the story. You also need to know how spread out the values are. Measures of dispersion answer the question: how much do individual data points vary from the center? This concept underpins everything from understanding sampling variability to interpreting confidence intervals and hypothesis tests.
Different measures of spread serve different purposes. Some are sensitive to outliers, others resist them. Some preserve original units, others standardize for comparison. Don't just memorize formulas. Understand when to use each measure and what it tells you about the variability of your distribution.
These measures use specific data points (like maximums, minimums, or quartiles) to capture spread. They're intuitive and easy to calculate, but they don't use every observation in the dataset.
The range is the simplest measure of spread. You only need two values to calculate it.
For example, if exam scores run from 52 to 98, the range is . But if one student scored 12, the range jumps to 86, even though the bulk of scores didn't change.
The IQR measures the spread of the middle 50% of your data, cutting off the top and bottom quarters.
Compare: Range vs. IQR โ both measure spread using specific data points, but range uses extremes while IQR uses quartiles. If a question gives you a dataset with obvious outliers and asks which measure better represents typical spread, IQR is your answer.
These measures calculate how far each data point falls from the mean, then summarize those deviations. They use every observation, making them more informative but also more sensitive to extreme values.
Variance measures the average squared deviation from the mean. Squaring serves two purposes: it eliminates negative deviations (which would otherwise cancel out), and it gives extra weight to points far from the mean.
The difference in denominators matters. Sample variance divides by (called Bessel's correction) because a sample tends to underestimate the true population spread. Using corrects for this bias, giving you an unbiased estimate of the population variance.
The downside of variance is that its units are squared. If your data is in centimeters, variance is in square centimeters, which is hard to interpret directly.
Standard deviation is simply the square root of variance, which brings you back to the original units of your data.
The empirical rule (68-95-99.7 rule) gives standard deviation its most useful interpretation for normal distributions:
So if exam scores have a mean of 75 and a standard deviation of 10, roughly 68% of students scored between 65 and 85.
MAD averages the absolute deviations from the mean instead of squaring them.
MAD is less commonly tested than standard deviation, but it's useful conceptually. It gives you a more intuitive sense of "average distance from the center."
Compare: Variance vs. Standard Deviation โ variance squares the units (making interpretation awkward), while standard deviation restores original units. Always report standard deviation when describing spread to an audience; use variance primarily in calculations and statistical formulas.
These measures let you compare variability across datasets with different scales or units. They answer the question: which dataset is relatively more spread out?
The CV expresses the standard deviation as a percentage of the mean.
For example, if test scores have a mean of 75 and SD of 10, the CV is . If reaction times have a mean of 250 ms and SD of 40 ms, the CV is . Even though reaction times have a larger SD in absolute terms, they also show greater relative variability.
Compare: Standard Deviation vs. Coefficient of Variation โ standard deviation measures absolute spread in original units, while CV measures relative spread as a percentage of the mean. Use CV when comparing variability between datasets with different units or vastly different means.
These measures describe where data points fall within the distribution, helping you understand both spread and relative standing.
Percentiles divide data into 100 equal parts. The th percentile is the value below which of observations fall. If you scored at the 90th percentile on a test, 90% of test-takers scored at or below your score.
Quartiles are three specific percentiles that split the data into four equal groups:
The five-number summary pulls these together with the extremes: minimum, , median, , maximum. This summary is the foundation of a box plot and gives you a quick picture of both center and spread.
Compare: Percentiles vs. Z-scores โ both describe position, but percentiles tell you what percentage of data falls below a value, while z-scores tell you how many standard deviations a value is from the mean. Percentiles work for any distribution shape; z-scores are most useful when you can connect them to a known distribution (like the normal curve).
| Concept | Best Examples |
|---|---|
| Simple spread (uses extremes) | Range |
| Resistant to outliers | IQR, MAD |
| Uses all data points | Variance, Standard Deviation, MAD |
| Same units as original data | Standard Deviation, MAD, Range, IQR |
| Squared units | Variance |
| Comparing across different scales | Coefficient of Variation |
| Describes position in distribution | Percentiles, Quartiles |
| Used in the 1.5 ร IQR outlier rule | IQR, , |
A dataset contains one extreme outlier. Which two measures of spread would be most affected, and which two would be most resistant?
You're comparing the variability of test scores (mean = 75, SD = 10) with the variability of reaction times in milliseconds (mean = 250, SD = 40). Which dataset shows greater relative variability, and what measure would you use to determine this?
Explain why sample variance divides by instead of . What problem does this correction solve?
Compare and contrast standard deviation and IQR as measures of spread. Under what conditions would you choose one over the other?
If a value falls at the 90th percentile, what does this tell you? How would you describe this same value's position using quartiles?