A sampling distribution describes how a statistic varies across many possible samples. In the revised AP Statistics course, this topic connects simulation-based thinking to the Central Limit Theorem and explains why sample means often have an approximately normal sampling distribution.

Why This Matters for the AP Statistics Exam

This topic is the bridge from probability models into inference. Later confidence intervals and hypothesis tests for means depend on understanding that a sample statistic does not stay fixed from sample to sample. Instead, it has its own distribution, center, and spread.

On the exam, you may need to describe a sampling distribution, explain what a simulation is estimating, or justify why the sampling distribution of a sample mean is approximately normal.

more resources to help you study

practice multiple choice FRQ practice & scoring cheatsheets score calculator key terms

Key Takeaways

A sampling distribution is the distribution of a statistic over all possible samples of a fixed size from a population.
You can approximate a sampling distribution by simulation, repeatedly taking random samples and recording the statistic.
A randomization distribution is built by repeatedly reallocating or reassigning observed outcomes under a chance model.
The Central Limit Theorem says the sampling distribution of a sample mean becomes approximately normal as sample size increases.
Larger samples produce less variable sample means, so the sampling distribution gets tighter around the population mean.

Sampling Distributions

Suppose you repeatedly take random samples of the same size from one population and calculate the same statistic each time. The distribution of those statistic values is the sampling distribution.

That is different from:

the population distribution, which describes individual data values in the population
the sample distribution, which describes the values in one specific sample

The revised course wants you to keep those three ideas separate.

Approximating a Sampling Distribution with Simulation

In practice, you usually do not list all possible samples by hand. Instead, you approximate the sampling distribution by simulation:

Assume a population or parameter value.
Repeatedly generate many random samples of the same size.
Compute the statistic for each sample.
Graph the resulting statistic values.

That simulated graph approximates the sampling distribution.

This idea matters because it lets you study the behavior of a statistic before jumping into formal inference formulas.

Randomization Distributions

A randomization distribution is closely related, but the process is slightly different. Instead of sampling again from a population, you repeatedly shuffle, reassign, or reallocate outcomes under a chance model and record the statistic each time.

In the revised course, randomization distributions help support simulation-based reasoning about what kinds of statistic values would be expected by chance alone.

The Central Limit Theorem

The Central Limit Theorem (CLT) says that the sampling distribution of a sample mean has a shape that can be approximated by a normal distribution when the sample size is large enough.

Two ideas matter here:

the sampled values should be independent
the larger the sample, the better the normal approximation tends to be

If the population itself is normal, the sampling distribution of the sample mean is normal for any sample size. If the population is not normal, a sufficiently large sample size still makes the sampling distribution of the sample mean approximately normal.

What Gets More Stable as Sample Size Grows

As sample size increases:

the sampling distribution of the sample mean becomes less spread out
sample means cluster more tightly around the population mean
the normal approximation becomes more reliable

This is why larger samples usually produce more stable estimates.

How to Use This on the AP Statistics Exam

MCQ

Distinguish between a population distribution, a sample distribution, and a sampling distribution.
Identify when a simulation is describing a sampling distribution versus a randomization distribution.
Recognize when the CLT is about sample means rather than about individual observations.

Free Response

If asked to justify a normal model for a sample mean, cite the CLT, independence, and the large sample size.
If asked what a simulation is doing, state what statistic is being recorded over repeated samples or reallocations.
Keep the statistic, parameter, and population clearly tied to context.

Common Trap

The CLT does not say the population becomes normal. It says the sampling distribution of the sample mean becomes approximately normal.

Common Misconceptions

A large sample makes the original data normal. It does not.
A sampling distribution is just the distribution of one sample. It is not; it is the distribution of a statistic across many samples.
Randomization distributions and sampling distributions are identical. They are related but built in different ways.
A larger sample removes all variability. It reduces variability in the sampling distribution, but sample statistics still vary.

Vocabulary

The following words are mentioned explicitly in the AP® course framework for this topic.

Term	Definition
approximately normal	A distribution that closely follows the shape of a normal distribution, allowing for the use of normal probability methods.
central limit theorem	A theorem stating that when the sample size is sufficiently large, the sampling distribution of the mean of a random variable will be approximately normally distributed.
independence	The condition that observations in a sample are not influenced by each other, typically ensured through random sampling or randomized experiments.
null distribution	The probability distribution of the test statistic under the assumption that the null hypothesis is true.
random sample	A sample selected from a population in such a way that every member has an equal chance of being chosen, reducing bias and allowing for valid statistical inference.
sample size	The number of observations or data points collected in a sample, denoted as n.
sampling distribution	The probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population.
simulation	A method of modeling random events so that simulated outcomes closely match real-world outcomes, used to estimate probabilities.
statistic	Numerical summaries or measures calculated from sample data, such as mean, median, or standard deviation.

Frequently Asked Questions

What is the CLT in AP Stats?

The Central Limit Theorem says that when sample values are independent and the sample size is sufficiently large, the sampling distribution of the sample mean is approximately normal.

When can I use the Central Limit Theorem in AP Statistics?

Use the CLT for sample means when the observations are independent and n is large enough. On AP Stats, n greater than or equal to 30 is a common rule of thumb for non-normal populations.

Does the CLT make the population normal?

No. The CLT does not change the population distribution or individual data values. It describes the shape of the sampling distribution of sample means.

What conditions do I need for the CLT?

You need independent sample values and a sufficiently large sample size. If the population is already normal, the sampling distribution of the mean can be normal even for smaller samples.

What formula goes with the Central Limit Theorem?

After justifying a normal model, sample means are often standardized with z = (x̄ - μ)/(σ/√n). The standard deviation of the sampling distribution is σ/√n.

How should I write a CLT justification on an FRQ?

State that the sampling distribution of x̄ is approximately normal because the sample values are independent and the sample size is large enough, then connect that to the context of the problem.