AP Statistics Unit 5 ReviewSampling Distributions

Verified for the 2027 examCompiled by AP educators~7–12% of the exam
Pep mascot
Upgrade your Fiveable account to print any study guide

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Click below to go to billing portal → update your plan → choose Yearly→ and select "Fiveable Share Plan". Only pay the difference

Plan is open to all students, teachers, parents, etc
Pep mascot
Upgrade your Fiveable account to export vocabulary

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Plan is open to all students, teachers, parents, etc

AP Statistics Unit 5, Sampling Distributions, covers the normal distribution and how sample statistics vary across samples, making up 7-12% of the AP exam across 8 topics. The central limit theorem is the big idea here: with large enough samples, the distribution of sample means becomes approximately normal, even when the original population isn't. AP Stats Unit 5 also gets into biased vs. unbiased estimates, sampling distributions for proportions, and differences between two samples or two proportions.

unit 5 review

AP Statistics Unit 5, Sampling Distributions, is about what happens when you take a statistic (like a sample mean or sample proportion) and ask how it would vary if you repeated your sample over and over. The single biggest idea is the Central Limit Theorem, which says that with a large enough sample size, the distribution of sample means is approximately normal even when the population isn't. This unit is 7-12% of the AP exam, and it's the bridge that turns probability (Unit 4) into inference (Units 6-9).

What this unit covers

Why your sample isn't like mine (and the normal model that handles it)

  • Two people taking random samples from the same population get different statistics. That's sampling variability, and it's the whole reason this unit exists. The variation may be random (just chance) or non-random (a flaw in how data was collected).
  • The normal distribution gets a comeback tour from Unit 1, but now you treat it as a continuous random variable. The area under the curve over an interval IS the probability that a value lands in that interval.
  • You'll calculate normal probabilities in both directions. Given a value, find the probability (using z-scores, a table, or a calculator's normalcdf). Given a probability or percentile, find the boundary value (invNorm). For example, "the lowest 10% of values" means finding xax_a where P(X<xa)=0.10P(X < x_a) = 0.10.
  • You also have to judge when a normal model is appropriate. Normal curves are symmetric and bell-shaped, so they only approximate distributions that share those features.

The sampling distribution idea and the Central Limit Theorem

  • A sampling distribution of a statistic is the distribution of that statistic's values across all possible samples of a given size from a population. Picture taking thousands of samples, computing xˉ\bar{x} each time, and making a histogram of those means. That histogram is the sampling distribution.
  • The Central Limit Theorem (CLT) says that if sample values are independent and n is sufficiently large (n ≥ 30 is the standard cutoff for means), the sampling distribution of the sample mean is approximately normal, no matter what shape the population has. Skewed population, bimodal population, doesn't matter.
  • You can also estimate sampling distributions using simulation, which is how the CLT usually gets demonstrated. A randomization distribution is built the same way, by repeatedly reshuffling and recomputing a statistic.

Point estimates and bias

  • A sample statistic is a point estimator of the corresponding population parameter. xˉ\bar{x} estimates μ\mu, and p^\hat{p} estimates pp.
  • An estimator is unbiased if, on average across all possible samples, its value equals the population parameter. Bias is about the center of the sampling distribution, not about any single sample being "wrong."
  • Even unbiased estimators vary from sample to sample. That variability can be modeled with probability, which is what the standard deviation formulas in this unit do.

The four sampling distributions you must know

  • Sample proportion p^\hat{p} has mean μp^=p\mu_{\hat{p}} = p and standard deviation σp^=p(1p)/n\sigma_{\hat{p}} = \sqrt{p(1-p)/n}. It's approximately normal when np10np \geq 10 and n(1p)10n(1-p) \geq 10 (the Large Counts condition).
  • Sample mean xˉ\bar{x} has mean μxˉ=μ\mu_{\bar{x}} = \mu and standard deviation σxˉ=σ/n\sigma_{\bar{x}} = \sigma/\sqrt{n}. It's normal if the population is normal, or approximately normal when n30n \geq 30 by the CLT.
  • Difference in sample proportions p^1p^2\hat{p}_1 - \hat{p}_2 has mean p1p2p_1 - p_2 and standard deviation p1(1p1)n1+p2(1p2)n2\sqrt{\frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2}}, with the Large Counts check applied to both samples.
  • Difference in sample means xˉ1xˉ2\bar{x}_1 - \bar{x}_2 has mean μ1μ2\mu_1 - \mu_2 and standard deviation σ12n1+σ22n2\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}. Normal if both populations are normal, approximately normal if both sample sizes are at least 30.
  • One fine-print rule covers all four. The standard deviation formulas assume sampling with replacement (independence). When sampling without replacement, the true standard deviation is a bit smaller, but if the sample is less than 10% of the population, the difference is negligible. That's the 10% condition.

Unit 5, Sampling Distributions at a glance

StatisticMean of sampling distributionStandard deviationNormality check
Sample proportion p^\hat{p}ppp(1p)n\sqrt{\frac{p(1-p)}{n}}np10np \geq 10 and n(1p)10n(1-p) \geq 10
Sample mean xˉ\bar{x}μ\muσn\frac{\sigma}{\sqrt{n}}Population normal, OR n30n \geq 30 (CLT)
Difference in proportions p^1p^2\hat{p}_1 - \hat{p}_2p1p2p_1 - p_2p1(1p1)n1+p2(1p2)n2\sqrt{\frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2}}All four counts 10\geq 10
Difference in means xˉ1xˉ2\bar{x}_1 - \bar{x}_2μ1μ2\mu_1 - \mu_2σ12n1+σ22n2\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}Both populations normal, OR both n30n \geq 30

Why Unit 5, Sampling Distributions matters in AP Stats

This is the unit where AP Stats pivots from describing data to making conclusions about populations. Every confidence interval and significance test in the rest of the course is just a sampling distribution wearing a different outfit. If you understand why σ/n\sigma/\sqrt{n} shrinks as n grows, the second half of the course makes sense instead of feeling like a pile of formulas.

  • It answers the course's central question, which is how confident you can be in a conclusion when variation is always present.
  • Probabilistic reasoning lets you anticipate patterns. You can't predict one sample's mean, but you can predict how thousands of sample means will behave.
  • The conditions you check here (random, 10% condition, Large Counts or normality) are the exact same conditions you'll write out on every inference problem for the rest of the year.

How this unit connects across the course

  • The normal distribution, z-scores, and the empirical rule from one-variable data (Unit 1) come back here, except now you apply them to distributions of statistics instead of distributions of individual values.
  • Random sampling methods (Unit 3) are what make sampling distributions trustworthy. A biased sampling method produces a biased estimator, and no amount of CLT magic fixes that.
  • Random variables, expected value, and combining independent variables (Unit 4) supply the machinery. The formulas for differences in this unit come straight from the rules for variances of sums and differences of independent random variables.
  • Everything here pays off immediately in inference for proportions (Unit 6) and means (Unit 7), where the standard deviations from this unit become standard errors inside confidence intervals and test statistics. The same logic extends to chi-square (Unit 8) and slopes (Unit 9).

Key formulas and procedures

  • z=xμσz = \frac{x - \mu}{\sigma} standardizes a value so you can find normal probabilities; for a statistic, replace x, μ\mu, and σ\sigma with the statistic, its mean, and its standard deviation.
  • μp^=p\mu_{\hat{p}} = p and σp^=p(1p)n\sigma_{\hat{p}} = \sqrt{\frac{p(1-p)}{n}} give the center and spread of the sampling distribution of a sample proportion.
  • μxˉ=μ\mu_{\bar{x}} = \mu and σxˉ=σn\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} give the center and spread of the sampling distribution of a sample mean.
  • σp^1p^2=p1(1p1)n1+p2(1p2)n2\sigma_{\hat{p}_1 - \hat{p}_2} = \sqrt{\frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2}} is the standard deviation for a difference in proportions from two independent samples.
  • σxˉ1xˉ2=σ12n1+σ22n2\sigma_{\bar{x}_1 - \bar{x}_2} = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} is the standard deviation for a difference in means. Notice you always add inside the radical, even for a difference.
  • Large Counts condition (np10np \geq 10 and n(1p)10n(1-p) \geq 10) verifies approximate normality for proportions; check it for both groups in two-sample problems.
  • CLT condition (n30n \geq 30, or a normal population) verifies approximate normality for means.
  • 10% condition (sample size less than 10% of population) lets you use the standard deviation formulas when sampling without replacement.
  • Procedure for any probability question about a statistic: state the mean and standard deviation of the sampling distribution, verify the shape is approximately normal, compute a z-score, find the area, and interpret in context.

Unit 5, Sampling Distributions on the AP exam

Unit 5 is 7-12% of the AP exam. On the multiple-choice section, expect questions that ask you to compute a probability involving xˉ\bar{x} or p^\hat{p}, identify the correct mean and standard deviation of a sampling distribution, recognize what happens to the spread when n changes (quadrupling n cuts the standard deviation in half), and pick out which scenario satisfies the conditions for approximate normality. Conceptual CLT questions are common, like identifying which histogram could be a sampling distribution of means from a skewed population.

On free-response questions, sampling distribution work usually appears inside a larger problem. You might calculate P(xˉ>some value)P(\bar{x} > \text{some value}) as one part of a question, justify why a sampling distribution is approximately normal by naming and checking conditions, or explain in context what a probability means. Sampling distributions also anchor investigative-task-style questions that build a new statistic and ask you to reason about its distribution from a simulation. Wherever it shows up, full credit requires three moves: correct parameters, a stated and verified shape condition, and an interpretation tied to the specific context. "There is about a 4% chance that a random sample of 50 batteries has a mean lifetime above 510 hours" earns points; a bare number doesn't.

Essential questions

  • Why do statistics vary from sample to sample, and how can that variation itself be predictable?
  • How does the Central Limit Theorem let us use the normal model even when we know nothing about the population's shape?
  • What makes an estimator trustworthy, and how do bias and variability describe different kinds of "wrong"?
  • How does sample size change what we can conclude from a sample?

Key terms to know

  • Sampling distribution: The distribution of values of a statistic across all possible samples of a given size from a population.
  • Central Limit Theorem (CLT): With independent observations and a large enough sample size, the sampling distribution of the sample mean is approximately normal regardless of the population's shape.
  • Parameter: A number describing a population, like μ\mu or pp; usually unknown.
  • Statistic: A number computed from a sample, like xˉ\bar{x} or p^\hat{p}; it estimates a parameter.
  • Point estimator: A statistic used as a single-value estimate of a population parameter.
  • Unbiased estimator: An estimator whose average value across all possible samples equals the population parameter.
  • Sampling variability: The natural sample-to-sample variation in a statistic's value.
  • Standard deviation of a statistic: A measure of how far a statistic typically falls from the parameter; it shrinks as sample size grows.
  • Large Counts condition: The check np10np \geq 10 and n(1p)10n(1-p) \geq 10 that justifies a normal approximation for sample proportions.
  • 10% condition: When sampling without replacement, the sample should be less than 10% of the population so the standard deviation formulas still work.
  • Continuous random variable: A variable that can take any value in an interval, with probabilities given by areas under a density curve.
  • z-score: The number of standard deviations a value sits from the mean, used to find normal probabilities.
  • Randomization distribution: A simulated distribution of a statistic built by repeatedly re-randomizing data, used to estimate sampling distributions.

Common mix-ups

  • The CLT is about the SHAPE of the sampling distribution of the mean. The facts that μxˉ=μ\mu_{\bar{x}} = \mu and σxˉ=σ/n\sigma_{\bar{x}} = \sigma/\sqrt{n} are true for any sample size; the CLT only adds approximate normality when n is large.
  • Don't confuse the population distribution, the distribution of one sample's data, and the sampling distribution of a statistic. The first two can be any shape; only the third gets squeezed toward normal as n grows.
  • For differences of independent statistics, variances ADD even though you're subtracting. The standard deviation formula always has a plus sign under the square root.
  • Bias and variability are separate problems. A high-variability estimator can still be unbiased (it's centered correctly, just noisy), and a low-variability estimator can be badly biased (consistent, but consistently wrong).

Frequently Asked Questions

What topics are covered in AP Stats Unit 5?

AP Stats Unit 5 covers 8 topics on sampling distributions: the normal distribution, the Central Limit Theorem, biased and unbiased point estimates, sampling distributions for sample proportions, differences in sample proportions, sample means, and differences in sample means. The unit builds the statistical foundation you need for inference. Here's the full topic list: - 5.1 Introducing Statistics: Why Is My Sample Not Like Yours? - 5.2 The Normal Distribution, Revisited - 5.3 The Central Limit Theorem - 5.4 Biased and Unbiased Point Estimates - 5.5 Sampling Distributions for Sample Proportions - 5.6 Sampling Distributions for Differences in Sample Proportions - 5.7 Sampling Distributions for Sample Means - 5.8 Sampling Distributions for Differences in Sample Means See all the matched practice at AP Stats Unit 5.

How much of the AP Stats exam is Unit 5?

AP Stats Unit 5 makes up 7-12% of the AP exam. That weight covers sampling distributions, the normal distribution, and the Central Limit Theorem. These concepts are also the backbone of Units 6-9, so understanding them well pays off across a much larger portion of the exam than that percentage suggests.

What's on the AP Stats Unit 5 progress check (MCQ and FRQ)?

The AP Stats Unit 5 progress check on AP Classroom includes both MCQ and FRQ parts drawn from all 8 topics in the unit. MCQ questions test the normal distribution, Central Limit Theorem, and sampling distributions for proportions and means. FRQ prompts typically ask you to identify, set up, and interpret a sampling distribution in context. The progress check pulls heavily from topics 5.2 through 5.8, so make sure you're comfortable calculating probabilities using the normal distribution and explaining why the Central Limit Theorem applies for a given sample size. For matched practice problems that mirror the progress check format, visit AP Stats Unit 5.

How do I practice AP Stats Unit 5 FRQs?

AP Stats Unit 5 FRQs most often come from the normal distribution, the Central Limit Theorem, and sampling distributions for sample proportions and means. A typical prompt gives you a real-world scenario and asks you to describe the shape, center, and spread of a sampling distribution, then calculate a probability or explain what the Central Limit Theorem guarantees. To practice effectively, work through each step out loud: state conditions, show the formula, calculate, and interpret in context. That last step, writing a sentence that ties your number back to the scenario, is where most points are lost. You'll find FRQ-style practice problems organized by topic at AP Stats Unit 5.

Where can I find AP Stats Unit 5 practice questions?

The best place to find AP Stats Unit 5 practice questions, including multiple-choice and practice test sets, is AP Stats Unit 5. That page organizes MCQ and FRQ practice by topic, covering the normal distribution, Central Limit Theorem, and all four sampling distribution types (proportions, differences in proportions, means, and differences in means). For a focused practice test experience, work through topic-by-topic MCQs first to spot gaps, then move to full FRQ prompts. Targeting topics 5.3, 5.5, and 5.7 first gives you the highest return since those show up most on both the progress check and the actual AP exam.

How should I study AP Stats Unit 5?

Start AP Stats Unit 5 by locking in the normal distribution (topic 5.2) before anything else, since every later topic builds on it. Then work through the Central Limit Theorem (5.3) carefully and practice explaining in plain English why a large enough sample size makes the sampling distribution approximately normal. Here's a practical study sequence: 1. Review the normal distribution and practice z-score probability calculations. 2. Study the Central Limit Theorem and know the conditions: random sample, independence, and large enough n. 3. Work topics 5.4-5.8 in order, sketching the sampling distribution (shape, mean, standard deviation) for each scenario before calculating. 4. For every practice problem, write a one-sentence interpretation of your answer in context. The most common mistake is skipping the conditions check. On the AP exam, stating and verifying conditions is worth points on its own. Visit AP Stats Unit 5 for topic-by-topic practice to reinforce each step.