Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
When you encounter a dataset on the AP Statistics exam, your first task is almost always to describe its centerโbut which measure of center you choose reveals how well you understand the data's shape and context. The College Board tests your ability to select the appropriate measure based on distribution characteristics like skewness, outliers, and data type, not just your ability to calculate a mean. This connects directly to Unit 1's emphasis on describing distributions and carries through to sampling distributions in Unit 5, where the sample mean becomes the foundation for inference.
Beyond calculation, you're being tested on statistical reasoning: When does the median tell a more honest story than the mean? How do outliers pull the mean but leave the median unchanged? Why does the Central Limit Theorem focus on the mean rather than other measures? Don't just memorize formulasโknow what concept each measure illustrates and when each is the most appropriate choice for summarizing data.
These measures incorporate all values in the dataset, making them mathematically complete but also vulnerable to the influence of extreme values. The key principle: using all data maximizes information but can distort the "typical" value when distributions are skewed.
Compare: Mean vs. Weighted Meanโboth use all data points, but the weighted mean acknowledges that some observations matter more. On FRQs involving stratified samples or combining statistics from subgroups, the weighted mean is your tool.
These measures remain stable even when extreme values are present, making them robust statistics. The key principle: resistance comes from ignoring or minimizing the influence of values far from the center.
Compare: Mean vs. Medianโboth measure center, but the mean uses all values while the median uses only position. If an FRQ asks you to justify which measure better represents "typical," check for skewness or outliersโthe median wins when either is present.
These measures apply when data involves categories or repeated values rather than continuous measurements. The key principle: some data types restrict which measures are mathematically meaningful.
Compare: Mode vs. Mean/Medianโthe mode is the only measure you can use for categorical data like "favorite color" or "political party." When describing a distribution's shape, use modal language (unimodal, bimodal) rather than mean or median language.
These variations of the mean are designed for particular types of data or relationships. The key principle: the arithmetic mean isn't always the most meaningful averageโcontext determines the appropriate calculation.
Compare: Arithmetic vs. Geometric vs. Harmonic Meanโfor the same positive dataset, Harmonic โค Geometric โค Arithmetic (with equality only when all values are identical). The arithmetic mean works for additive relationships, geometric for multiplicative, and harmonic for rates with varying denominators.
These measures can be computed with minimal data but sacrifice robustness for simplicity. The key principle: ease of calculation often comes at the cost of reliability.
| Concept | Best Examples |
|---|---|
| Uses all data points | Mean, Weighted Mean |
| Resistant to outliers | Median, Trimmed Mean |
| Works with categorical data | Mode |
| Indicates skewness direction | Mean vs. Median comparison |
| Multiplicative relationships | Geometric Mean |
| Averaging rates | Harmonic Mean |
| Foundation for inference | Mean (connects to CLT and sampling distributions) |
| Shape description | Mode (unimodal, bimodal, multimodal) |
A dataset of home prices in a neighborhood includes one mansion worth $5 million among otherwise modest homes. Which measure of central tendency would best represent the "typical" home price, and why?
Compare and contrast the mean and median: Under what conditions will they be approximately equal, and under what conditions will they differ substantially?
A student's grades are: Quiz (weight 10%) = 85, Midterm (weight 30%) = 78, Final (weight 60%) = 92. Why is the weighted mean more appropriate than the arithmetic mean here, and what is the weighted average?
You're told a distribution is right-skewed. Without seeing the data, what can you predict about the relationship between the mean and median? How would this influence your choice of summary statistic?
FRQ-style: A researcher reports both the mean and median income for a sample of workers. The mean is $72,000 and the median is $54,000. Describe the likely shape of the income distribution and explain which measure better represents a "typical" worker's income. Justify your reasoning.