Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Statistics isn't just about crunching numbers—it's about extracting meaning from data. Every statistical measure you learn serves a specific purpose: some tell you where the "center" of your data lives, others reveal how spread out or clustered your values are, and still others help you understand relationships between variables. On exams, you're being tested on your ability to choose the right measure for the situation and interpret what that measure actually tells you about the underlying data.
The key concepts here fall into three categories: measures of central tendency (where's the middle?), measures of spread (how variable is the data?), and measures of position and relationship (where does a value rank, and how do variables relate?). Don't just memorize formulas—know when each measure is appropriate, what makes it sensitive to outliers, and how to interpret results in context.
These measures answer the fundamental question: what's a typical value in this dataset? Each one captures "center" differently, and choosing the right one depends on your data's shape and what you're trying to communicate.
Compare: Mean vs. Median—both measure center, but mean uses all values while median uses only position. When an FRQ gives you a skewed distribution or mentions outliers, median is usually the better choice for representing "typical."
Knowing the center isn't enough—you need to understand how much variation exists around that center. These measures quantify dispersion, from simple to sophisticated.
Compare: Range vs. Standard Deviation—range is quick but crude (two points only), while standard deviation incorporates every data point. If an exam asks for a "robust" or "reliable" measure of spread, standard deviation wins.
Compare: Standard Deviation vs. IQR—both measure spread, but SD is sensitive to outliers while IQR is resistant. For skewed data or when outliers are present, IQR gives a more accurate picture of typical variability.
These measures tell you where a specific value ranks within the larger dataset—essential for comparing individuals across different scales or distributions.
Compare: Percentiles vs. Quartiles—quartiles are just specific percentiles (25th, 50th, 75th). Know that is always the median, and IQR is always .
When you have two variables, you need tools to understand how they move together—this is where correlation comes in.
Compare: Correlation vs. Causation—a high value does NOT prove one variable causes changes in another. This is a classic exam trap: correlation describes association, not causation.
| Concept | Best Examples |
|---|---|
| Central tendency (balanced data) | Mean |
| Central tendency (skewed/outliers) | Median |
| Central tendency (categorical) | Mode |
| Simple spread measure | Range |
| Spread in original units | Standard Deviation |
| Spread resistant to outliers | IQR |
| Position within distribution | Percentiles, Quartiles |
| Relationship between variables | Correlation Coefficient |
A dataset of home prices includes one mansion worth $15 million. Which measure of central tendency would best represent a "typical" home price, and why?
Compare variance and standard deviation: why do we bother calculating standard deviation when variance already measures spread?
A student scores in the 85th percentile on an exam. Another student's score equals . Which student performed better, and how do you know?
You're analyzing two datasets with identical means. Dataset A has and Dataset B has . What does this tell you about how the data points are distributed differently?
Two variables have a correlation coefficient of . Describe the relationship and explain why this does NOT prove that changes in one variable cause changes in the other.