---
title: "Unit 5 – Regression Analysis"
description: "Unit 5 – Regression Analysis - AP Statistics unit content"
canonical: "https://fiveable.me/ap-stats/unit-5"
type: "unit"
subject: "AP Statistics"
unit: "Unit 5 – Regression Analysis"
---

# Unit 5 – Regression Analysis

## Overview

Unit 5 develops the logic of sampling distributions: why statistics vary across samples, how the normal distribution models that variation, and what conditions justify using a normal model for sample proportions and sample means. The Central Limit Theorem and the 10% and Large Counts conditions appear throughout and carry directly into every inference unit.

## AP CED Alignment

This unit hub is organized around AP Course and Exam Description topics, skills, and exam task types when they are available in the source data.
- 5.1: Why Is My Sample Not Like Yours?
- 5.2: The Normal Distribution, Revisited
- 5.3: The Central Limit Theorem
- 5.4: Biased and Unbiased Point Estimates
- 5.5: Sampling Distributions for Sample Proportions
- 5.6: Sampling Distributions for Differences in Sample Proportions
- 5.7: Sampling Distributions for Sample Means
- 5.8: Sampling Distributions for Differences in Sample Means
- Topic 5.1: Why samples differ: sampling variability
- Topic 5.2: The normal distribution: probabilities and intervals
- Topic 5.3: The Central Limit Theorem and sampling distributions
- Topic 5.4: Biased and unbiased point estimates
- Topic 5.5: Sampling distributions for sample proportions
- Topic 5.6: Sampling distributions for differences in sample proportions
- Topic 5.7: Sampling distributions for sample means
- Topic 5.8: Sampling distributions for differences in sample means
- Skill Category 2 - Data Analysis
- Skill Category 3 - Using Probability and Simulation
- FRQ 3 – Focus on Probability and Sampling Distributions
- FRQ 6 – Investigative Task

## Topics

- [5.1: Why Is My Sample Not Like Yours?](/ap-stats/unit-5/why-is-my-sample-not-like-yours/study-guide/Mrybsi6gfieJDqF2LNju): Introduces sampling variability: statistics from repeated samples of the same population differ due to random chance or non-random bias. Establishes why conclusions from a single sample carry uncertainty.
- [5.2: The Normal Distribution, Revisited](/ap-stats/unit-5/normal-distribution-revisited/study-guide/dx4vMcx3WjSw68f1Ov66): Reviews the normal distribution as a model for continuous random variables. Covers z-scores, area under the curve as probability, and using normalcdf and invNorm to find probabilities and boundary values.
- [5.3: The Central Limit Theorem](/ap-stats/unit-5/central-limit-theorem/study-guide/DPmpebCrsJBYfpSgOKn3): Defines sampling distributions and states the CLT: with independent observations and sufficiently large n, the sampling distribution of x-bar is approximately normal. Introduces randomization distributions via simulation.
- [5.4: Biased and Unbiased Point Estimates](/ap-stats/unit-5/biased-unbiased-point-estimates/study-guide/eZ5sR9XOkLB1o9KKpMHF): Distinguishes biased from unbiased estimators. The sample mean and sample proportion are unbiased. Larger samples reduce variability but cannot correct bias from a flawed sampling method.
- [5.5: Sampling Distributions for Sample Proportions](/ap-stats/unit-5/sampling-distributions-for-sample-proportions/study-guide/Ezxev8MPpv3mFKjV4Gq3): Describes the center (p), spread (sqrt(p(1-p)/n)), and shape (approximately normal when Large Counts holds) of the sampling distribution of p-hat. Applies the 10% condition for without-replacement sampling.
- [5.6: Sampling Distributions for Differences in Sample Proportions](/ap-stats/unit-5/sampling-distributions-for-differences-sample-proportions/study-guide/VOvA8du6YHMjhEwB7lEW): Extends proportion sampling distributions to two independent groups. Mean is p1 minus p2, standard deviation uses both group formulas, and normality requires all four Large Counts checks to pass.
- [5.7: Sampling Distributions for Sample Means](/ap-stats/unit-5/sampling-distributions-for-sample-means/study-guide/JcwkFAqbdjfLgHUdojkE): Describes the center (mu), spread (sigma/sqrt(n)), and shape of the sampling distribution of x-bar. Normal when population is normal; approximately normal for any population when n >= 30 by the CLT.
- [5.8: Sampling Distributions for Differences in Sample Means](/ap-stats/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT): Extends mean sampling distributions to two independent groups. Mean is mu1 minus mu2, standard deviation is sqrt(sigma1^2/n1 + sigma2^2/n2), and normality requires both populations normal or both n >= 30.

## Hardest Topics And Analytics

Snapshot: practice snapshot
This snapshot uses Fiveable practice activity to show where students tend to miss questions and which review moves are worth prioritizing first.
- **58% average MCQ accuracy** (Across 7.7k multiple-choice practice attempts for this unit.)
- **7.7k MCQ attempts** (Practice activity included in this snapshot.)
- **61% average FRQ score** (Across 45 scored free-response attempts for this unit.)
- **5.4: Biased and Unbiased Point Estimates**: 48% MCQ miss rate across 720 attempts. Review Biased and Unbiased Point Estimates with attention to how the concept appears in AP-style source and evidence questions.
- **5.5: Sampling Distributions for Sample Proportions**: 42% MCQ miss rate across 2243 attempts. Review Sampling Distributions for Sample Proportions with attention to how the concept appears in AP-style source and evidence questions.
- **5.8: Sampling Distributions for Differences in Sample Means**: 40% MCQ miss rate across 892 attempts. Review Sampling Distributions for Differences in Sample Means with attention to how the concept appears in AP-style source and evidence questions.
- **5.7: Sampling Distributions for Sample Means**: 39% MCQ miss rate across 786 attempts. Review Sampling Distributions for Sample Means with attention to how the concept appears in AP-style source and evidence questions.

## Review Notes

### Topic 5.1: Why samples differ: sampling variability

Every sample statistic is subject to sampling variability: repeated samples from the same population produce different values of x-bar or p-hat. That variation may be random (chance) or non-random (systematic bias from poor design). A single sample always carries uncertainty about the true population parameter.

- **Sampling variability**: The natural tendency for a statistic to take different values across different samples drawn from the same population.
- **Random variation**: Differences in sample statistics caused by chance alone, not by a flaw in the sampling method.
- **Non-random variation**: Systematic differences caused by bias in data collection, such as nonresponse bias or a convenience sample, that consistently push estimates away from the true parameter.
- **Parameter vs. statistic**: A parameter (mu, p) describes the population; a statistic (x-bar, p-hat) describes the sample and is used to estimate the parameter.

**Checkpoint:** Can you identify whether a described source of variation in a sample result is random or non-random, and explain what that means for conclusions?

### Topic 5.2: The normal distribution: probabilities and intervals

A normal distribution is fully described by its mean mu and standard deviation sigma. The area under the curve over any interval equals the probability a value falls there. Use z = (x - mu)/sigma to standardize, then a z-table or normalcdf to find area. Use invNorm to work backward from a given probability to a boundary value.

- **Z-score**: z = (x - mu)/sigma; tells you how many standard deviations a value is above or below the mean.
- **Empirical rule**: In any normal distribution, approximately 68% of values fall within 1 sigma, 95% within 2 sigma, and 99.7% within 3 sigma of the mean.
- **normalcdf**: Calculator function that returns the area (probability) under a normal curve between two boundary values.
- **invNorm**: Calculator function that returns the x-value corresponding to a given cumulative area from the left tail.
- **Normality**: A distribution is approximately normal when it is symmetric and bell-shaped; this justifies using normal probability calculations.

**Checkpoint:** Given mu = 70 and sigma = 8, can you find P(X > 82) and the value below which the lowest 10% of values fall?

Task | Formula or tool | Example
--- | --- | ---
Find P(X < a) | normalcdf(-inf, a, mu, sigma) | P(X < 78) with mu=70, sigma=8
Find P(a < X < b) | normalcdf(a, b, mu, sigma) | P(65 < X < 85)
Find P(X > b) | normalcdf(b, inf, mu, sigma) | P(X > 90)
Find boundary for given area | invNorm(area, mu, sigma) | Find x where P(X < x) = 0.25
Standardize a value | z = (x - mu)/sigma | z = (82 - 70)/8 = 1.5

### Topic 5.3: The Central Limit Theorem and sampling distributions

A sampling distribution of a statistic is the distribution of that statistic across all possible samples of size n from a given population. The Central Limit Theorem (CLT) states that when n is sufficiently large and observations are independent, the sampling distribution of the sample mean is approximately normal regardless of the population's shape. A randomization distribution is a simulated version of this, generated by repeatedly reassigning response values to treatment groups.

- **Sampling distribution**: The distribution of values a statistic takes across all possible samples of a fixed size from a population.
- **Central Limit Theorem**: When sample values are independent and n is sufficiently large (rule of thumb: n >= 30), the sampling distribution of x-bar is approximately normal.
- **Randomization distribution**: A simulated sampling distribution created by repeatedly and randomly reallocating response values to treatment groups, used to assess results from a randomized experiment.
- **Independence condition**: Sample values must be independent of each other; satisfied by random sampling or, for without-replacement sampling, by the 10% condition.

**Checkpoint:** A population is strongly right-skewed. If you take samples of size n = 40, what does the CLT say about the shape of the sampling distribution of x-bar?

### Topic 5.4: Biased and unbiased point estimates

A point estimator is a sample statistic used to estimate a population parameter. An estimator is unbiased if its sampling distribution is centered at the true parameter value, meaning the average of all possible estimates equals the parameter. The sample mean x-bar and sample proportion p-hat are both unbiased estimators. Increasing sample size reduces variability in the estimator but does not fix bias if the sampling method is flawed.

- **Unbiased estimator**: A statistic whose sampling distribution has a mean equal to the population parameter it estimates; x-bar is unbiased for mu, p-hat is unbiased for p.
- **Bias**: Systematic error in an estimator; a biased estimator consistently overestimates or underestimates the parameter across repeated samples.
- **Variability of an estimator**: How spread out the sampling distribution of the estimator is; larger sample sizes produce less variability.
- **Point estimate**: A single value calculated from sample data used as the best guess for a population parameter.

**Checkpoint:** Explain why increasing sample size from 30 to 300 reduces variability in x-bar but does not correct bias caused by a convenience sample.

### Topic 5.5: Sampling distributions for sample proportions

When you take random samples of size n from a population with proportion p, the sample proportion p-hat has a predictable sampling distribution. Its mean equals p (unbiased), its standard deviation equals sqrt(p(1-p)/n), and it is approximately normal when the Large Counts condition holds. The 10% condition justifies using this formula when sampling without replacement.

- **Sample proportion (p-hat)**: The proportion of successes in a sample; an unbiased estimator of the population proportion p.
- **Standard deviation of p-hat**: sigma_p-hat = sqrt(p(1-p)/n); decreases as sample size increases.
- **Large Counts condition**: np >= 10 and n(1-p) >= 10; required for the sampling distribution of p-hat to be approximately normal.
- **10% condition**: Sample size must be less than 10% of the population size to treat observations as independent when sampling without replacement.

**Checkpoint:** A population has p = 0.3. For n = 50, verify the Large Counts condition and find the standard deviation of p-hat.

### Topic 5.6: Sampling distributions for differences in sample proportions

When comparing two independent groups, the statistic of interest is p-hat1 minus p-hat2. Its sampling distribution has mean equal to p1 minus p2 and standard deviation equal to sqrt(p1(1-p1)/n1 + p2(1-p2)/n2). It is approximately normal when all four Large Counts checks pass: n1p1 >= 10, n1(1-p1) >= 10, n2p2 >= 10, and n2(1-p2) >= 10.

- **Sampling distribution for the difference in sample proportions**: The distribution of p-hat1 minus p-hat2 across all possible pairs of independent samples; centered at p1 minus p2.
- **Standard deviation of p-hat1 minus p-hat2**: sqrt(p1(1-p1)/n1 + p2(1-p2)/n2); variances of independent distributions add.
- **Four-part Large Counts check**: All four products n1p1, n1(1-p1), n2p2, n2(1-p2) must be at least 10 for the normal model to apply.

**Checkpoint:** Two groups have p1 = 0.4, n1 = 60, p2 = 0.5, n2 = 80. Verify all four Large Counts conditions and find the standard deviation of p-hat1 minus p-hat2.

### Topic 5.7: Sampling distributions for sample means

For a numerical variable sampled from a population with mean mu and standard deviation sigma, the sampling distribution of x-bar has mean mu_x-bar = mu and standard deviation sigma_x-bar = sigma/sqrt(n). It is exactly normal if the population is normal, and approximately normal for any population shape when n >= 30 by the CLT. Use z = (x-bar minus mu)/(sigma/sqrt(n)) to find probabilities.

- **Sample mean (x-bar)**: The average of values in a sample; an unbiased estimator of the population mean mu.
- **Standard error of the mean**: sigma/sqrt(n); the standard deviation of the sampling distribution of x-bar. Larger n produces a smaller standard error.
- **Normal population implies normal sampling distribution**: If the population is normally distributed, x-bar is exactly normal for any sample size n.
- **CLT for means**: If the population is not normal, the sampling distribution of x-bar is approximately normal when n >= 30.

**Checkpoint:** A population has mu = 50 and sigma = 12. For samples of size n = 36, find P(x-bar > 53) using the correct standard error.

### Topic 5.8: Sampling distributions for differences in sample means

When comparing two independent groups on a numerical variable, the statistic x-bar1 minus x-bar2 has a sampling distribution centered at mu1 minus mu2 with standard deviation sqrt(sigma1^2/n1 + sigma2^2/n2). The distribution is approximately normal when both populations are normal or when both sample sizes are at least 30.

- **Mean of x-bar1 minus x-bar2**: mu1 minus mu2; the sampling distribution is centered at the true difference in population means.
- **Standard deviation of x-bar1 minus x-bar2**: sqrt(sigma1^2/n1 + sigma2^2/n2); variances of independent sampling distributions add.
- **Normality condition for differences in means**: Normal model applies if both populations are normal, or if both n1 >= 30 and n2 >= 30 by the CLT.

**Checkpoint:** Two populations have mu1 = 80, sigma1 = 10, n1 = 40 and mu2 = 75, sigma2 = 8, n2 = 35. Find the mean and standard deviation of the sampling distribution of x-bar1 minus x-bar2 and justify the normal model.

Statistic | Mean of sampling distribution | Standard deviation of sampling distribution | Normal model condition
--- | --- | --- | ---
p-hat | p | sqrt(p(1-p)/n) | np >= 10 and n(1-p) >= 10
p-hat1 minus p-hat2 | p1 minus p2 | sqrt(p1(1-p1)/n1 + p2(1-p2)/n2) | All four large counts >= 10
x-bar | mu | sigma/sqrt(n) | Population normal OR n >= 30
x-bar1 minus x-bar2 | mu1 minus mu2 | sqrt(sigma1^2/n1 + sigma2^2/n2) | Both populations normal OR both n >= 30

## Study Guides

- [Unit 5 Overview: Regression Analysis](/ap-stats/unit-5/review/study-guide/DTw89sv8RD3Eq3WC58AB)
- [5.3 Linear Regression Models](/ap-stats/unit-5/linear-regression-models/study-guide/PSt5cfDuvB5nu60DHulR)
- [5.4 Residuals](/ap-stats/unit-5/residuals/study-guide/zdTJQZw0UVGswyK6kkEF)
- [5.1 Graphical Representations Between Two Quantitative Variables](/ap-stats/unit-5/representing-relationship-between-two-quantitative-variables/study-guide/3rWWsKXcnbYlqY64hQ1j)
- [5.2 Correlation](/ap-stats/unit-5/correlation/study-guide/LlS81pC6QricXgIKNuFM)
- [5.5 Least-Squares Regression](/ap-stats/unit-5/least-squares-regression/study-guide/cRc4EhpHno3A4KvWrqyj)

## Practice Preview

### Multiple-choice practice

- **AP-style practice question**: Skill Category 2 - Data Analysis | The amount of active ingredient in a pharmaceutical pill has a population mean of $$250$$ mg and a standard deviation of $$8$$ mg. For a random sample of $$16$$ pills, what are the mean and standard deviation of the sampling distribution of the sample mean?
- **AP-style practice question**: Skill Category 3 - Using Probability and Simulation | A statistician assumes population proportions of $$p_1 = 0.40$$ and $$p_2 = 0.50$$. They require the standard deviation of the difference in sample proportions to be exactly $$0.05$$. If equal sample sizes $$n$$ are used for both populations, what is the required value of $$n$$?
- **AP-style practice question**: Skill Category 3 - Using Probability and Simulation | We wish to approximate the sampling distribution of $$\hat{p}_1 - \hat{p}_2$$ with a normal model. Given population proportions $$p_1 = 0.04$$ and $$p_2 = 0.04$$, which of the following pairs of sample sizes $$n_1$$ and $$n_2$$ are sufficient to satisfy the large counts condition?
- **AP-style practice question**: Skill Category 3 - Using Probability and Simulation | Population J has a success proportion of $$p_J = 0.8$$ and Population K has a success proportion of $$p_K = 0.5$$. Independent random samples of size $$100$$ are taken from each population. What is the value of the standard deviation of the sampling distribution of the difference in sample proportions $$\hat{p}_J - \hat{p}_K$$?
- **AP-style practice question**: Skill Category 3 - Using Probability and Simulation | A study compares the proportion of voters supporting a new policy in Region A ($$p_A$$) and Region B ($$p_B$$). Random samples of size $$n_A$$ and $$n_B$$ are selected from the respective large populations. Which expression represents the standard deviation of the sampling distribution of the difference in sample proportions $$\hat{p}_A - \hat{p}_B$$?
- **AP-style practice question**: Skill Category 3 - Using Probability and Simulation | A statistician calculates the value $$\sqrt{\frac{0.2(0.8)}{50} + \frac{0.3(0.7)}{60}}$$ based on known population proportions. Which of the following best describes what this value represents?

### FRQ practice

- **University major distribution and random sampling**: FRQ 3 – Focus on Probability and Sampling Distributions | University major distribution and random sampling
- **Sample mean versus median as population estimators**: FRQ 6 – Investigative Task | Sample mean versus median as population estimators

## Key Terms

- **Parameter**: A numerical value that describes a characteristic of a population, such as mu or p; the target that sample statistics are used to estimate.
- **Sample Statistic**: A numerical value calculated from sample data, such as x-bar or p-hat, used as a point estimate of the corresponding population parameter.
- **Unbiased Estimator**: A statistic whose sampling distribution is centered at the population parameter it estimates; x-bar is unbiased for mu and p-hat is unbiased for p.
- **Standard Error**: The standard deviation of a sampling distribution; for x-bar it equals sigma/sqrt(n), and for p-hat it equals sqrt(p(1-p)/n). Larger n produces a smaller standard error.
- **Z-scores**: A standardized value calculated as (data value minus mean) divided by standard deviation; used to find probabilities under a normal curve.
- **Empirical Rule**: In a normal distribution, approximately 68% of values fall within 1 standard deviation of the mean, 95% within 2, and 99.7% within 3.
- **Large Counts Condition**: The requirement that np >= 10 and n(1-p) >= 10 for the sampling distribution of p-hat to be approximately normal; all four analogous counts must hold for differences in proportions.
- **ten percent condition**: The requirement that sample size be less than 10% of the population size to treat observations as independent when sampling without replacement.
- **Sample Proportion**: p-hat; the ratio of successes in a sample to total observations, used to estimate the population proportion p.
- **Sample Mean**: x-bar; the average of values in a sample, used to estimate the population mean mu. Its sampling distribution has mean mu and standard deviation sigma/sqrt(n).
- **Normality**: The property of a distribution being symmetric and bell-shaped; required (or approximately satisfied) before using normal probability calculations for sampling distributions.
- **randomization distribution**: A simulated sampling distribution created by repeatedly and randomly reallocating response values to treatment groups in a randomized experiment; used to assess whether an observed result is unusual.
- **Bias**: Systematic error in an estimator or sampling method that consistently pushes estimates away from the true parameter value; not reduced by increasing sample size.

## Common Mistakes

- **Using sigma instead of sigma/sqrt(n) for sample means**: When finding probabilities about x-bar, the standard deviation of the sampling distribution is sigma/sqrt(n), not sigma. Using sigma gives the spread of individual values, not the spread of sample means. Always divide by sqrt(n) when working with x-bar.
- **Skipping condition checks before applying a normal model**: The normal model for p-hat requires np >= 10 and n(1-p) >= 10. For x-bar, you need either a normal population or n >= 30. For differences, all conditions must hold for both groups. Stating a normal model without verifying conditions loses credit on the exam.
- **Confusing the 10% condition with the Large Counts condition**: The 10% condition (n < 10% of population) addresses independence when sampling without replacement. The Large Counts condition (np >= 10 and n(1-p) >= 10) addresses whether the normal model applies. These are separate checks that serve different purposes.
- **Treating a biased sampling method as fixable by increasing n**: A larger sample size reduces variability in an estimator but does not remove bias. If the sampling method systematically excludes part of the population, the estimator will still be off-center no matter how large n gets.
- **Forgetting to interpret probabilities in context**: A probability statement about p-hat or x-bar must reference the specific population and variable described in the problem. Writing P(x-bar > 53) = 0.067 without saying what that means in context is incomplete on a free-response question.

## Exam Connections

- **Condition verification as a required step**: Free-response questions on sampling distributions routinely require you to check and justify conditions before proceeding. For proportions, state and verify the Large Counts condition and 10% condition explicitly. For means, state whether the population is normal or invoke the CLT with n >= 30. Skipping this step or stating conditions without numbers from the problem typically results in lost credit.
- **Probability calculations with correct standard error**: A common task is computing the probability that a sample statistic falls in a given range. You must use the standard deviation of the sampling distribution (sigma/sqrt(n) for x-bar, sqrt(p(1-p)/n) for p-hat), not the population standard deviation. Show the z-score formula with numbers substituted, then state the probability with a calculator result and interpret it in context.
- **Interpreting parameters and probabilities in context**: Exam questions expect you to connect numerical results to the real-world scenario described. A probability about x-bar should be stated as the likelihood that the sample mean of a specific variable in a specific population falls in a given range, not just as a decimal. Similarly, the mean and standard deviation of a sampling distribution should be described with units and context, not just as formulas.

## Final Review Checklist

- **Identify sources of sampling variability**: Distinguish random variation (chance) from non-random variation (bias) and explain what each implies for the reliability of a sample statistic.
- **Calculate normal distribution probabilities**: Use z = (x - mu)/sigma and normalcdf or a z-table to find P(X < a), P(a < X < b), and P(X > b). Use invNorm to find boundary values from a given probability.
- **State and apply the Central Limit Theorem**: Explain that the sampling distribution of x-bar is approximately normal when observations are independent and n >= 30, regardless of population shape. Identify when CLT applies and when it does not.
- **Distinguish biased from unbiased estimators**: Explain that x-bar and p-hat are unbiased because their sampling distributions are centered at the true parameter. Explain that larger n reduces variability but does not fix bias.
- **Describe sampling distributions for proportions and means**: State the mean and standard deviation formulas for p-hat and x-bar, verify the Large Counts and 10% conditions, and justify a normal model before calculating probabilities.
- **Extend to differences in proportions and means**: Apply the formulas for the mean and standard deviation of p-hat1 minus p-hat2 and x-bar1 minus x-bar2. Verify all required conditions for both groups before using a normal model.
- **Interpret results in context with units**: State probabilities and parameters with appropriate units and in the context of the specific population described in the problem, not just as abstract numbers.

## Study Plan

- **Start with sampling variability and the normal distribution (5.1-5.2)**: Read the topic guides for 5.1 and 5.2. Practice identifying random versus non-random variation in sample results. Then work through normal distribution probability problems using z-scores, normalcdf, and invNorm until the process is automatic.
- **Build the CLT and estimator concepts (5.3-5.4)**: Read the topic guides for 5.3 and 5.4. Sketch what happens to the shape of the sampling distribution of x-bar as n increases for a skewed population. Practice explaining in writing why x-bar is unbiased and why larger n reduces variability but not bias.
- **Understand the proportion sampling distribution formulas (5.5-5.6)**: Read the topic guides for 5.5 and 5.6. For each problem, write out the mean and standard deviation formulas, check the Large Counts and 10% conditions explicitly, then calculate the requested probability using a z-score and normalcdf.
- **Understand the mean sampling distribution formulas (5.7-5.8)**: Read the topic guides for 5.7 and 5.8. Practice the sigma/sqrt(n) formula for x-bar and the sqrt(sigma1^2/n1 + sigma2^2/n2) formula for differences. For each problem, justify the normal model before computing any probability.
- **Review all four formulas together and practice with available questions**: Use the comparison table covering p-hat, p-hat1 minus p-hat2, x-bar, and x-bar1 minus x-bar2 to review all four sampling distributions side by side. Work through the available practice questions for this unit, focusing on condition verification and contextual interpretation.

## More Ways To Review

- [Topic study guides](/ap-stats/unit-5#topics)
- [FRQ practice](/ap-stats/frq-practice)
- [Cram archive videos](/cram-archives?subject=ap-statistics&unit=unit-5)
- [Cheatsheets](/ap-stats/cheatsheets/unit-5)
- [Key terms](/ap-stats/key-terms)

## FAQs

### What topics are covered in AP Stats Unit 5?

AP Stats Unit 5 covers 8 topics on sampling distributions: the normal distribution, the Central Limit Theorem, biased and unbiased point estimates, sampling distributions for sample proportions, differences in sample proportions, sample means, and differences in sample means. The unit builds the statistical foundation you need for inference. Here's the full topic list:
- 5.1 Introducing Statistics: Why Is My Sample Not Like Yours?
- 5.2 The Normal Distribution, Revisited
- 5.3 The Central Limit Theorem
- 5.4 Biased and Unbiased Point Estimates
- 5.5 Sampling Distributions for Sample Proportions
- 5.6 Sampling Distributions for Differences in Sample Proportions
- 5.7 Sampling Distributions for Sample Means
- 5.8 Sampling Distributions for Differences in Sample Means See all the matched practice at [AP Stats Unit 5](/ap-stats/unit-5).

### How much of the AP Stats exam is Unit 5?

AP Stats Unit 5 makes up 7-12% of the AP exam. That weight covers sampling distributions, the normal distribution, and the Central Limit Theorem. These concepts are also the backbone of Units 6-9, so understanding them well pays off across a much larger portion of the exam than that percentage suggests.

### What's on the AP Stats Unit 5 progress check (MCQ and FRQ)?

The AP Stats Unit 5 progress check on AP Classroom includes both MCQ and FRQ parts drawn from all 8 topics in the unit. MCQ questions test the normal distribution, Central Limit Theorem, and sampling distributions for proportions and means. FRQ prompts typically ask you to identify, set up, and interpret a sampling distribution in context. The progress check pulls heavily from topics 5.2 through 5.8, so make sure you're comfortable calculating probabilities using the normal distribution and explaining why the Central Limit Theorem applies for a given sample size. For matched practice problems that mirror the progress check format, visit [AP Stats Unit 5](/ap-stats/unit-5).

### How do I practice AP Stats Unit 5 FRQs?

AP Stats Unit 5 FRQs most often come from the normal distribution, the Central Limit Theorem, and sampling distributions for sample proportions and means. A typical prompt gives you a real-world scenario and asks you to describe the shape, center, and spread of a sampling distribution, then calculate a probability or explain what the Central Limit Theorem guarantees. To practice effectively, work through each step out loud: state conditions, show the formula, calculate, and interpret in context. That last step, writing a sentence that ties your number back to the scenario, is where most points are lost. You'll find FRQ-style practice problems organized by topic at [AP Stats Unit 5](/ap-stats/unit-5).

### Where can I find AP Stats Unit 5 practice questions?

The best place to find AP Stats Unit 5 practice questions, including multiple-choice and practice test sets, is [AP Stats Unit 5](/ap-stats/unit-5). That page organizes MCQ and FRQ practice by topic, covering the normal distribution, Central Limit Theorem, and all four sampling distribution types (proportions, differences in proportions, means, and differences in means). For a focused practice test experience, work through topic-by-topic MCQs first to spot gaps, then move to full FRQ prompts. Targeting topics 5.3, 5.5, and 5.7 first gives you the highest return since those show up most on both the progress check and the actual AP exam.

### How should I study AP Stats Unit 5?

Start AP Stats Unit 5 by locking in the normal distribution (topic 5.2) before anything else, since every later topic builds on it. Then work through the Central Limit Theorem (5.3) carefully and practice explaining in plain English why a large enough sample size makes the sampling distribution approximately normal. Here's a practical study sequence:
1. Review the normal distribution and practice z-score probability calculations.
2. Study the Central Limit Theorem and know the conditions: random sample, independence, and large enough n.
3. Work topics 5.4-5.8 in order, sketching the sampling distribution (shape, mean, standard deviation) for each scenario before calculating.
4. For every practice problem, write a one-sentence interpretation of your answer in context. The most common mistake is skipping the conditions check. On the AP exam, stating and verifying conditions is worth points on its own. Visit [AP Stats Unit 5](/ap-stats/unit-5) for topic-by-topic practice to reinforce each step.

## Structured Data

```json
{"@context":"https://schema.org","@type":"FAQPage","inLanguage":"en","mainEntity":[{"@type":"Question","@id":"https://fiveable.me/ap-stats/unit-5#what-topics-are-covered-in-ap-stats-unit-5","name":"What topics are covered in AP Stats Unit 5?","acceptedAnswer":{"@type":"Answer","text":"AP Stats Unit 5 covers 8 topics on sampling distributions: the normal distribution, the Central Limit Theorem, biased and unbiased point estimates, sampling distributions for sample proportions, differences in sample proportions, sample means, and differences in sample means. The unit builds the statistical foundation you need for inference. Here's the full topic list:\n- 5.1 Introducing Statistics: Why Is My Sample Not Like Yours?\n- 5.2 The Normal Distribution, Revisited\n- 5.3 The Central Limit Theorem\n- 5.4 Biased and Unbiased Point Estimates\n- 5.5 Sampling Distributions for Sample Proportions\n- 5.6 Sampling Distributions for Differences in Sample Proportions\n- 5.7 Sampling Distributions for Sample Means\n- 5.8 Sampling Distributions for Differences in Sample Means See all the matched practice at <a href=\"/ap-stats/unit-5\">AP Stats Unit 5</a>."}},{"@type":"Question","@id":"https://fiveable.me/ap-stats/unit-5#how-much-of-the-ap-stats-exam-is-unit-5","name":"How much of the AP Stats exam is Unit 5?","acceptedAnswer":{"@type":"Answer","text":"AP Stats Unit 5 makes up 7-12% of the AP exam. That weight covers sampling distributions, the normal distribution, and the Central Limit Theorem. These concepts are also the backbone of Units 6-9, so understanding them well pays off across a much larger portion of the exam than that percentage suggests."}},{"@type":"Question","@id":"https://fiveable.me/ap-stats/unit-5#whats-on-the-ap-stats-unit-5-progress-check-mcq-and-frq","name":"What's on the AP Stats Unit 5 progress check (MCQ and FRQ)?","acceptedAnswer":{"@type":"Answer","text":"The AP Stats Unit 5 progress check on AP Classroom includes both MCQ and FRQ parts drawn from all 8 topics in the unit. MCQ questions test the normal distribution, Central Limit Theorem, and sampling distributions for proportions and means. FRQ prompts typically ask you to identify, set up, and interpret a sampling distribution in context. The progress check pulls heavily from topics 5.2 through 5.8, so make sure you're comfortable calculating probabilities using the normal distribution and explaining why the Central Limit Theorem applies for a given sample size. For matched practice problems that mirror the progress check format, visit <a href=\"/ap-stats/unit-5\">AP Stats Unit 5</a>."}},{"@type":"Question","@id":"https://fiveable.me/ap-stats/unit-5#how-do-i-practice-ap-stats-unit-5-frqs","name":"How do I practice AP Stats Unit 5 FRQs?","acceptedAnswer":{"@type":"Answer","text":"AP Stats Unit 5 FRQs most often come from the normal distribution, the Central Limit Theorem, and sampling distributions for sample proportions and means. A typical prompt gives you a real-world scenario and asks you to describe the shape, center, and spread of a sampling distribution, then calculate a probability or explain what the Central Limit Theorem guarantees. To practice effectively, work through each step out loud: state conditions, show the formula, calculate, and interpret in context. That last step, writing a sentence that ties your number back to the scenario, is where most points are lost. You'll find FRQ-style practice problems organized by topic at <a href=\"/ap-stats/unit-5\">AP Stats Unit 5</a>."}},{"@type":"Question","@id":"https://fiveable.me/ap-stats/unit-5#where-can-i-find-ap-stats-unit-5-practice-questions","name":"Where can I find AP Stats Unit 5 practice questions?","acceptedAnswer":{"@type":"Answer","text":"The best place to find AP Stats Unit 5 practice questions, including multiple-choice and practice test sets, is <a href=\"/ap-stats/unit-5\">AP Stats Unit 5</a>. That page organizes MCQ and FRQ practice by topic, covering the normal distribution, Central Limit Theorem, and all four sampling distribution types (proportions, differences in proportions, means, and differences in means). For a focused practice test experience, work through topic-by-topic MCQs first to spot gaps, then move to full FRQ prompts. Targeting topics 5.3, 5.5, and 5.7 first gives you the highest return since those show up most on both the progress check and the actual AP exam."}},{"@type":"Question","@id":"https://fiveable.me/ap-stats/unit-5#how-should-i-study-ap-stats-unit-5","name":"How should I study AP Stats Unit 5?","acceptedAnswer":{"@type":"Answer","text":"Start AP Stats Unit 5 by locking in the normal distribution (topic 5.2) before anything else, since every later topic builds on it. Then work through the Central Limit Theorem (5.3) carefully and practice explaining in plain English why a large enough sample size makes the sampling distribution approximately normal. Here's a practical study sequence:\n1. Review the normal distribution and practice z-score probability calculations.\n2. Study the Central Limit Theorem and know the conditions: random sample, independence, and large enough n.\n3. Work topics 5.4-5.8 in order, sketching the sampling distribution (shape, mean, standard deviation) for each scenario before calculating.\n4. For every practice problem, write a one-sentence interpretation of your answer in context. The most common mistake is skipping the conditions check. On the AP exam, stating and verifying conditions is worth points on its own. Visit <a href=\"/ap-stats/unit-5\">AP Stats Unit 5</a> for topic-by-topic practice to reinforce each step."}}]}
```