Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
When you describe a dataset, the mean or median only tells half the storyโyou also need to know how spread out the values are. Measures of dispersion answer the critical question: how much do individual data points vary from the center? This concept underpins everything from understanding sampling variability to interpreting confidence intervals and hypothesis tests. On exams, you'll need to calculate these measures, choose the right one for different situations, and explain what they reveal about your data.
The key insight here is that different measures of spread serve different purposes. Some are sensitive to outliers, others resist them. Some preserve original units, others standardize for comparison. Don't just memorize formulasโunderstand when to use each measure and what it tells you about the shape and variability of your distribution. That's what separates a correct answer from a complete one.
These measures use specific data points (like maximums, minimums, or quartiles) to capture spread. They're intuitive and easy to calculate, but they don't use every observation in the dataset.
Compare: Range vs. IQRโboth measure spread using specific data points, but range uses extremes while IQR uses quartiles. If an FRQ gives you a dataset with obvious outliers and asks which measure better represents typical spread, IQR is your answer.
These measures calculate how far each data point falls from the mean, then summarize those deviations. They use every observation, making them more informative but also more sensitive to extreme values.
Compare: Variance vs. Standard Deviationโvariance squares the units (making interpretation awkward), while standard deviation restores original units. Always report standard deviation when describing spread; use variance primarily in calculations and statistical formulas.
These measures allow you to compare variability across datasets with different scales or unitsโessential when asking "which dataset is relatively more spread out?"
Compare: Standard Deviation vs. Coefficient of Variationโstandard deviation measures absolute spread in original units, while CV measures relative spread as a percentage of the mean. Use CV when comparing variability between datasets with different units or vastly different means.
These measures describe where data points fall within the distribution, helping you understand both spread and relative standing.
Compare: Percentiles vs. Z-scoresโboth describe position, but percentiles tell you what percentage of data falls below a value, while z-scores tell you how many standard deviations a value is from the mean. Percentiles work for any distribution; z-scores assume you're working with the standard deviation.
| Concept | Best Examples |
|---|---|
| Simple spread (uses extremes) | Range |
| Resistant to outliers | IQR, MAD |
| Uses all data points | Variance, Standard Deviation, MAD |
| Same units as original data | Standard Deviation, MAD, Range, IQR |
| Squared units | Variance |
| Comparing across different scales | Coefficient of Variation |
| Describes position in distribution | Percentiles, Quartiles |
| Used in the 1.5 ร IQR outlier rule | IQR, , |
A dataset contains one extreme outlier. Which two measures of spread would be most affected, and which two would be most resistant?
You're comparing the variability of test scores (mean = 75, SD = 10) with the variability of reaction times in milliseconds (mean = 250, SD = 40). Which dataset shows greater relative variability, and what measure would you use to determine this?
Explain why sample variance divides by instead of . What problem does this correction solve?
Compare and contrast standard deviation and IQR as measures of spread. Under what conditions would you choose one over the other?
If a value falls at the 90th percentile, what does this tell you? How would you describe this same value's position using quartiles?