Fiveable

📊Honors Statistics Unit 8 Review

QR code for Honors Statistics practice questions

8.1 A Single Population Mean Using the Normal Distribution

8.1 A Single Population Mean Using the Normal Distribution

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
📊Honors Statistics
Unit & Topic Study Guides
Pep mascot

Confidence Intervals for a Single Population Mean

Pep mascot
more resources to help you study

Confidence intervals for population means

A confidence interval gives you a range of values that likely contains the true population mean, based on what you observed in a sample. The idea is straightforward: since a single sample mean probably won't land exactly on the population mean, you build a buffer around it using known information about sampling variability.

Before you can use the normal distribution to build this interval, two conditions must be met:

  • The sample is a simple random sample from the population, ensuring the data is representative and unbiased.
  • The population is normally distributed, or the sample size is large (n30n \geq 30). When n30n \geq 30, the Central Limit Theorem kicks in and guarantees the sampling distribution of xˉ\bar{x} is approximately normal regardless of the population's shape.

The formula for a confidence interval is:

xˉ±zσn\bar{x} \pm z^* \frac{\sigma}{\sqrt{n}}

where:

  • xˉ\bar{x} is the sample mean, your point estimate and the center of the interval
  • zz^* is the critical value from the standard normal distribution, determined by your confidence level (e.g., z=1.645z^* = 1.645 for 90%, z=1.96z^* = 1.96 for 95%, z=2.576z^* = 2.576 for 99%)
  • σ\sigma is the population standard deviation, which measures variability in the population (this formula assumes σ\sigma is known)
  • nn is the sample size

The quantity σn\frac{\sigma}{\sqrt{n}} is called the standard error. It measures how much the sample mean tends to vary from sample to sample. Notice that as nn increases, the standard error shrinks, which makes the interval narrower. That's why larger samples give more precise estimates.

Steps to calculate a confidence interval:

  1. Verify that both conditions (random sample and normality/large nn) are met.
  2. Identify xˉ\bar{x}, σ\sigma, and nn from the problem.
  3. Determine the critical value zz^* based on the desired confidence level.
  4. Calculate the margin of error: zσnz^* \frac{\sigma}{\sqrt{n}}.
  5. Compute the lower bound (xˉmargin of error\bar{x} - \text{margin of error}) and upper bound (xˉ+margin of error\bar{x} + \text{margin of error}).

For example, suppose a random sample of n=50n = 50 students has a mean test score of xˉ=78\bar{x} = 78, and the population standard deviation is σ=10\sigma = 10. For a 95% confidence interval:

  • Standard error: 10501.414\frac{10}{\sqrt{50}} \approx 1.414
  • Margin of error: 1.96×1.4142.771.96 \times 1.414 \approx 2.77
  • Interval: (782.77, 78+2.77)=(75.23, 80.77)(78 - 2.77,\ 78 + 2.77) = (75.23,\ 80.77)
Confidence intervals for population means, 8.1 A Single Population Mean using the Normal Distribution | Introduction to Statistics

Interpretation of confidence intervals

Getting the interpretation right matters a lot on exams. A confidence interval provides a range of plausible values for the population mean based on your sample data.

The correct interpretation follows this template: "We are [confidence level]% confident that the true population mean of [context] is between [lower bound] and [upper bound]."

For the example above: "We are 95% confident that the true mean test score of all students is between 75.23 and 80.77."

What the confidence level actually means: if you repeated the sampling process many times and built a 95% confidence interval each time, about 95% of those intervals would contain the true population mean. The confidence level describes the long-run success rate of the method, not any single interval.

Common misinterpretations to avoid:

  • "There is a 95% probability that the population mean is in this interval." This is wrong. The population mean is a fixed number; it's either in the interval or it isn't. The 95% refers to the method's reliability across many samples.
  • "95% of the data falls within this interval." The interval estimates the mean, not the spread of individual observations.
Confidence intervals for population means, A Single Population Mean using the Normal Distribution · Statistics

Sample size for margin of error

Sometimes you need to plan ahead: how large a sample do you need to achieve a specific margin of error? You can solve for nn algebraically from the confidence interval formula.

The margin of error (EE) is the maximum expected difference between xˉ\bar{x} and the true population mean. The formula for the required sample size is:

n=(zσE)2n = \left(\frac{z^* \sigma}{E}\right)^2

Steps to determine sample size:

  1. Choose your desired confidence level and find the corresponding zz^*.
  2. Identify σ\sigma (the population standard deviation). If unknown, use a reasonable estimate from prior studies or a pilot sample.
  3. Decide on your desired margin of error EE.
  4. Plug into the formula and compute nn.
  5. Always round up to the next whole number. Rounding down would give you a margin of error slightly larger than what you wanted.

For example, if you want a 95% confidence interval (z=1.96z^* = 1.96) with a margin of error of E=2E = 2 and σ=10\sigma = 10:

n=(1.96×102)2=(9.8)2=96.04n = \left(\frac{1.96 \times 10}{2}\right)^2 = (9.8)^2 = 96.04

You'd round up to n=97n = 97.

Notice the trade-offs built into this formula: a higher confidence level (larger zz^*) or a smaller margin of error (smaller EE) both require a larger sample. More precision and more confidence cost more data.

Statistical Inference and Hypothesis Testing

Confidence intervals are one tool within the broader framework of statistical inference, which uses sample data to draw conclusions about populations.

The sampling distribution of xˉ\bar{x} describes how the sample mean varies across all possible samples of size nn. The confidence interval formula relies directly on the shape and spread of this distribution.

Hypothesis testing is a related technique. While a confidence interval estimates a parameter by giving a range of plausible values, a hypothesis test asks whether the data provide enough evidence to reject a specific claim about the parameter. The two approaches are connected: if a hypothesized value falls outside your confidence interval, you'd reject it at the corresponding significance level.

Note that this section of the course uses the normal distribution (zz) because σ\sigma is assumed known. When σ\sigma is unknown and estimated by the sample standard deviation ss, you'll switch to the tt-distribution, where degrees of freedom (df=n1df = n - 1) affect the shape of the distribution and the critical values you use.