Fiveable

๐ŸŽฒIntro to Statistics Unit 7 Review

QR code for Intro to Statistics practice questions

7.3 Using the Central Limit Theorem

7.3 Using the Central Limit Theorem

Written by the Fiveable Content Team โ€ข Last updated August 2025
Written by the Fiveable Content Team โ€ข Last updated August 2025
๐ŸŽฒIntro to Statistics
Unit & Topic Study Guides

Central Limit Theorem

Applying the Central Limit Theorem

The central limit theorem (CLT) says that the sampling distribution of the sample mean will be approximately normal as long as certain conditions are met. This holds true regardless of the shape of the original population distribution, which is what makes it so useful.

Conditions for the CLT to apply:

  • Sufficiently large sample size: Typically nโ‰ฅ30n \geq 30. If the population is already normal, smaller samples work fine. If the population is heavily skewed, you may need a larger nn.
  • Independence: Each observation in the sample must be independent of the others. In practice, this means using random sampling and, for sampling without replacement, the sample should be less than 10% of the population.
  • Population not extremely skewed: With very skewed distributions, you need a larger sample size before the CLT kicks in.

Properties of the sampling distribution (when CLT applies):

  • Mean of the sampling distribution equals the population mean: ฮผxห‰=ฮผ\mu_{\bar{x}} = \mu
  • Standard deviation of the sampling distribution (called the standard error) is: ฯƒxห‰=ฯƒn\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}

Notice that the standard error shrinks as nn increases. Larger samples produce less variability in sample means.

To calculate probabilities for sample means, follow these steps:

  1. Verify the CLT conditions are met.

  2. Identify ฮผ\mu, ฯƒ\sigma, and nn.

  3. Calculate the standard error: ฯƒxห‰=ฯƒn\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}

  4. Convert the sample mean to a z-score: Z=xห‰โˆ’ฮผฯƒxห‰Z = \frac{\bar{x} - \mu}{\sigma_{\bar{x}}}

  5. Use a standard normal table or calculator to find the probability associated with that z-score.

The CLT also applies to sums of independent random variables. When working with sums instead of means:

  • Mean of the sum: ฮผฮฃx=nโ‹…ฮผ\mu_{\Sigma x} = n \cdot \mu
  • Standard deviation of the sum: ฯƒฮฃx=nโ‹…ฯƒ\sigma_{\Sigma x} = \sqrt{n} \cdot \sigma

The sum's standard deviation grows with n\sqrt{n}, which is the opposite direction from the standard error of the mean.

Law of Large Numbers Connection

The law of large numbers says that as sample size increases, the sample mean converges to the population mean. In notation: as nโ†’โˆžn \to \infty, xห‰โ†’ฮผ\bar{x} \to \mu.

The CLT builds on this idea but goes further. The law of large numbers tells you where the sample mean is heading (toward ฮผ\mu). The CLT tells you the shape of the sampling distribution along the way (approximately normal) and how spread out it is (via the standard error).

Both concepts share the same core insight: larger samples give you more accurate and predictable estimates of population parameters, because sampling variability decreases as nn grows.

Applying central limit theorem, Teorema del lรญmite central - Central limit theorem - xcv.wiki

Central Limit Theorem for Means vs. Sums

The CLT works for both sample means and sample sums, but you need to recognize which one a problem is asking about.

Use the CLT for means when:

  • The question asks about a sample mean or average (e.g., mean height, average income)
  • You need a probability like P(80<xห‰<85)P(80 < \bar{x} < 85)
  • You standardize using ฯƒxห‰=ฯƒn\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}

Use the CLT for sums when:

  • The question asks about a total (e.g., total sales, combined weight)
  • You need a probability like P(1000<ฮฃx<1500)P(1000 < \Sigma x < 1500)
  • You standardize using ฯƒฮฃx=nโ‹…ฯƒ\sigma_{\Sigma x} = \sqrt{n} \cdot \sigma

A quick way to tell: if the problem says "total" or "sum," use the sum version. If it says "mean" or "average," use the mean version. In both cases, check that the CLT conditions (large nn, independence, not extremely skewed) are satisfied before proceeding.

Statistical Inference and the Central Limit Theorem

The CLT is what makes most of statistical inference possible. Because the sampling distribution is approximately normal, you can use it to:

  • Construct confidence intervals: Estimate a population parameter within a range of plausible values, using the standard error to determine the margin of error.
  • Perform hypothesis tests: Assess whether observed sample data are consistent with a claimed population parameter by calculating how likely the result would be under the null hypothesis.
  • Quantify sampling error: The difference between a sample statistic and the true population parameter. The CLT tells you this error decreases as sample size increases, at a rate proportional to 1n\frac{1}{\sqrt{n}}.

These techniques all rely on knowing the shape and spread of the sampling distribution, which is exactly what the CLT provides.