Updates for 2027 AP exams coming soon

AP Statistics Unit 5 Review: Regression Analysis

Review AP Statistics Unit 5 to understand how and why sample statistics vary from sample to sample, and how the normal distribution and Central Limit Theorem let you model that variation precisely. This unit is the bridge between probability and every inference procedure in Units 6 through 9.

Use the topic guides, key terms, and practice questions available for this unit to build fluency with sampling distribution formulas and conditions before moving to inference.

start review notes review topics

Updates for 2027 AP exams coming soon

What is AP Statistics unit 5?

When you take a sample from a population, the statistic you calculate will not match the true parameter exactly, and it will not match the statistic from a different sample either. Unit 5 explains why that happens and gives you the mathematical tools to describe and quantify that variation.

A sampling distribution is the distribution of a statistic across all possible samples of a given size. Unit 5 covers the shape, center, and spread of sampling distributions for sample proportions, sample means, and their differences, and establishes the conditions under which a normal model applies.

Sampling variability is expected

Two samples from the same population will almost always produce different statistics. That variation can be random (chance) or non-random (bias from a flawed design). Recognizing the source of variation is the first step before any inference.

The normal distribution is the core model

A normal distribution is symmetric and bell-shaped, described by mean mu and standard deviation sigma. Area under the curve over an interval equals the probability a value falls there. Z-scores and calculator functions like normalcdf and invNorm let you find those areas precisely.

Conditions determine when normal models apply

For proportions, the Large Counts condition (np >= 10 and n(1-p) >= 10) must hold. For means, either the population is normal or n >= 30 by the CLT. The 10% condition (n < 10% of population) justifies treating observations as independent when sampling without replacement.

Why sampling distributions matter

Every confidence interval and hypothesis test in Units 6 through 9 rests on knowing the center, spread, and shape of a sampling distribution. If you can state the mean and standard deviation of the distribution of p-hat or x-bar, and justify a normal model, you can calculate any probability about a sample result and set up any inference procedure.

AP Statistics unit 5 topics

5.1

Why Is My Sample Not Like Yours?

Introduces sampling variability: statistics from repeated samples of the same population differ due to random chance or non-random bias. Establishes why conclusions from a single sample carry uncertainty.

open guide

5.2

The Normal Distribution, Revisited

Reviews the normal distribution as a model for continuous random variables. Covers z-scores, area under the curve as probability, and using normalcdf and invNorm to find probabilities and boundary values.

open guide

5.3

The Central Limit Theorem

Defines sampling distributions and states the CLT: with independent observations and sufficiently large n, the sampling distribution of x-bar is approximately normal. Introduces randomization distributions via simulation.

open guide

5.4

Biased and Unbiased Point Estimates

Distinguishes biased from unbiased estimators. The sample mean and sample proportion are unbiased. Larger samples reduce variability but cannot correct bias from a flawed sampling method.

open guide

5.5

Sampling Distributions for Sample Proportions

Describes the center (p), spread (sqrt(p(1-p)/n)), and shape (approximately normal when Large Counts holds) of the sampling distribution of p-hat. Applies the 10% condition for without-replacement sampling.

open guide

5.6

Sampling Distributions for Differences in Sample Proportions

Extends proportion sampling distributions to two independent groups. Mean is p1 minus p2, standard deviation uses both group formulas, and normality requires all four Large Counts checks to pass.

open guide

5.7

Sampling Distributions for Sample Means

Describes the center (mu), spread (sigma/sqrt(n)), and shape of the sampling distribution of x-bar. Normal when population is normal; approximately normal for any population when n >= 30 by the CLT.

open guide

5.8

Sampling Distributions for Differences in Sample Means

Extends mean sampling distributions to two independent groups. Mean is mu1 minus mu2, standard deviation is sqrt(sigma1^2/n1 + sigma2^2/n2), and normality requires both populations normal or both n >= 30.

open guide

guide