Confidence intervals are the backbone of statistical inference on the AP Statistics exam. They show up in Units 6, 7, and 9, and you'll encounter them in both multiple-choice questions and FRQs. The College Board wants you to understand that we use intervals (not single values) to estimate population parameters because sample statistics vary from sample to sample. Every confidence interval you construct reflects this fundamental truth: you're acknowledging uncertainty while still making useful claims about populations.
Here's what you're really being tested on: knowing which interval procedure fits which situation, verifying the conditions that make each formula valid, and interpreting your results correctly. The formulas themselves follow a consistent structure: point estimate ยฑ (critical value)(standard error). But the details change depending on whether you're estimating proportions vs. means, one sample vs. two samples, or categorical vs. quantitative relationships. Don't just memorize formulas; know what type of data and research question each one addresses.
One-Sample Intervals for Proportions
When you have categorical data from a single sample and want to estimate the true population proportion, you'll use the one-sample z-interval. The sampling distribution of p^โ is approximately normal when sample sizes are large enough, which is why we can use the standard normal (z) distribution here.
One-Sample Z-Interval for a Proportion
Formula:p^โยฑzโnp^โ(1โp^โ)โโ where p^โ is the sample proportion and zโ is the critical value (e.g., zโ=1.96 for 95% confidence)
Success-failure condition: both np^โโฅ10 and n(1โp^โ)โฅ10 must be satisfied. This ensures the sampling distribution is approximately normal so that using zโ is valid.
Independence conditions: data must come from a random sample or randomized experiment, and the 10% condition (nโค0.10N) applies when sampling without replacement. Without independence, the standard error formula breaks down.
One-Sample Intervals for Means
When estimating a population mean from quantitative data, the choice between z and t depends on whether you know the population standard deviation. In practice, you almost never know ฯ, so the t-interval dominates AP Statistics.
Z-Interval for a Mean (Known ฯ)
Formula:xหยฑzโ(nโฯโ) using the known population standard deviation ฯ in the standard error
Rarely used in practice because knowing ฯ while not knowing ฮผ is an unusual situation. This formula appears mainly in theoretical or textbook problems.
Normality required: either the population is normally distributed, or nโฅ30 so the Central Limit Theorem kicks in
T-Interval for a Mean (Unknown ฯ)
Formula:xหยฑtโ(nโsโ) where s replaces ฯ, with degrees of freedom df=nโ1
The t-distribution is wider than the z-distribution, especially for small samples. Those heavier tails account for the extra uncertainty that comes from estimating ฯ with s.
Conditions: random sample, independence (10% condition), and population approximately normal OR large sample size (nโฅ30)
Compare: Z-interval vs. T-interval for means: both estimate ฮผ, but the t-interval uses s instead of ฯ and has heavier tails. On the AP exam, if ฯ isn't explicitly given, use the t-interval. This is the default for quantitative data.
Two-Sample Intervals for Comparing Groups
Comparing two populations is where inference gets more interesting. The key question: are the samples independent (two separate groups) or paired (same subjects measured twice)?
Two-Sample T-Interval for Difference of Means
Formula:(xห1โโxห2โ)ยฑtโn1โs12โโ+n2โs22โโโ where the standard error combines variability from both samples
Degrees of freedom: use your calculator's "2-SampTInt" function, which applies the Welch approximation. Don't try to compute df by hand on the AP exam.
Conditions: two independent random samples, 10% condition for each group, and both populations approximately normal OR both sample sizes large
Two-Sample Z-Interval for Difference of Proportions
Formula:(p^โ1โโp^โ2โ)ยฑzโn1โp^โ1โ(1โp^โ1โ)โ+n2โp^โ2โ(1โp^โ2โ)โโ Note that you use each sample's p^โ separately. There's no pooling for confidence intervals (pooling only happens in hypothesis tests for proportions).
Interpretation: if the interval contains zero, you don't have convincing evidence of a difference. The sign of the values tells you which group has the larger proportion.
Success-failure condition: check all four values: n1โp^โ1โ, n1โ(1โp^โ1โ), n2โp^โ2โ, and n2โ(1โp^โ2โ). Each must be โฅ10.
Compare: Two-sample means vs. two-sample proportions: both compare independent groups, but means use the t-distribution while proportions use z. If an FRQ asks you to compare two treatments with a binary outcome (yes/no, success/failure), you need the two-proportion z-interval.
Paired T-Interval for Mean Difference
Formula:dหยฑtโ(nโsdโโ) where dห is the mean of the differences and sdโ is the standard deviation of the differences
When to use: matched pairs designs, before-and-after studies, or any situation where each observation in one sample is linked to a specific observation in the other
The core idea: you're reducing a two-sample problem to a one-sample problem by computing the differences first, then analyzing those differences. Here df=nโ1 where n is the number of pairs.
Compare: Two-sample t-interval vs. paired t-interval: the paired approach controls for individual variability and often produces narrower intervals. Watch for FRQ setups where subjects are measured twice or matched by characteristics. That's your cue to use paired procedures.
Inference for Regression Slopes
Unit 9 extends confidence intervals to linear regression. Here you're estimating the true population slope ฮฒ based on your sample slope b.
T-Interval for the Slope of a Regression Line
Formula:bยฑtโโ SEbโ where SEbโ=โ(xiโโxห)2โsโ and s is the residual standard deviation. In practice, your calculator or computer output provides SEbโ directly.
Degrees of freedom:df=nโ2. You lose two degrees of freedom because you're estimating both the slope and the intercept.
Conditions (LINE):
Linear relationship (check the residual plot for no curved pattern)
Independence (random sample, 10% condition)
Normal residuals (check with a histogram or Normal probability plot of residuals)
Equal variance (residuals show constant spread across all x-values, no "fan" shape)
Compare: T-interval for slope vs. t-interval for a mean: both use the t-distribution, but slope inference has df=nโ2 instead of df=nโ1, and the conditions focus on residual behavior rather than the raw data. If you see regression output on an FRQ, look for the standard error of the slope coefficient in the table.
Advanced Intervals (Beyond Core AP Content)
These formulas occasionally appear in enrichment contexts but are not central to the AP Statistics exam. Know they exist, but prioritize the intervals above.
Confidence Interval for Population Variance
Formula:(ฯฮฑ/22โ(nโ1)s2โ,ย ฯ1โฮฑ/22โ(nโ1)s2โ) using the chi-squared distribution, which is right-skewed
Not symmetric: unlike z and t intervals, this interval is asymmetric around the point estimate
Strong normality assumption: the population must be normally distributed. This procedure is sensitive to departures from normality.
Confidence Interval for Ratio of Two Variances
Formula:(s22โs12โโโ Fฮฑ/2โ1โ,ย s22โs12โโโ Fฮฑ/2โ) using the F-distribution with df1โ=n1โโ1 and df2โ=n2โโ1
Application: testing whether two populations have equal variances before running a pooled two-sample t-test
Requires normality in both populations. Rarely tested on AP Statistics but useful for understanding ANOVA assumptions.
Quick Reference Table
Situation
Procedure to Use
Estimating a single proportion
One-sample z-interval for p
Estimating a single mean
T-interval for ฮผ (use z only if ฯ is known)
Comparing two independent proportions
Two-sample z-interval for p1โโp2โ
Comparing two independent means
Two-sample t-interval for ฮผ1โโฮผ2โ
Comparing paired/matched data
Paired t-interval for ฮผdโ
Estimating a regression slope
T-interval for ฮฒ with df=nโ2
Intervals using z-distribution
One-proportion, two-proportion (large samples)
Intervals using t-distribution
One-mean, two-means, paired, regression slope
Self-Check Questions
What conditions must you verify before constructing a one-sample z-interval for a proportion, and why does each condition matter?
Compare the t-interval for a single mean and the paired t-interval: what do they have in common, and when would you choose one over the other?
If a confidence interval for p1โโp2โ is (0.03,0.15), what can you conclude about the relationship between the two population proportions?
An FRQ gives you regression output including b=2.4 and SEbโ=0.6 with n=22. What critical value would you use for a 95% confidence interval, and what are the degrees of freedom?
A researcher wants to determine whether a new teaching method improves test scores. Students are tested before and after the intervention. Which confidence interval procedure is appropriate, and why would using a two-sample t-interval be incorrect here?