Confidence intervals are the backbone of statistical inference on the AP Statistics examโthey show up in Units 6, 7, and 9, and you'll encounter them in both multiple-choice questions and FRQs. The College Board wants you to understand that we use intervals (not single values) to estimate population parameters because sample statistics vary from sample to sample. Every confidence interval you construct reflects this fundamental truth: we're acknowledging uncertainty while still making useful claims about populations.
Here's what you're really being tested on: knowing which interval procedure fits which situation, verifying the conditions that make each formula valid, and interpreting your results correctly. The formulas themselves follow a consistent structureโpoint estimate ยฑ (critical value)(standard error)โbut the details change depending on whether you're estimating proportions vs. means, one sample vs. two samples, or categorical vs. quantitative relationships. Don't just memorize formulas; know what type of data and research question each one addresses.
One-Sample Intervals for Proportions
When you have categorical data with a single sample and want to estimate the true population proportion, you'll use the one-sample z-interval. The sampling distribution of p^โ is approximately normal when sample sizes are large enough, which is why we can use the standard normal (z) distribution.
One-Sample Z-Interval for a Proportion
Formula: p^โยฑzโnp^โ(1โp^โ)โโโwhere p^โ is the sample proportion and zโ is the critical value (e.g., zโ=1.96 for 95% confidence)
Success-failure condition: both np^โโฅ10 and n(1โp^โ)โฅ10 must be satisfied to ensure the sampling distribution is approximately normal
Independence conditions: data must come from a random sample or randomized experiment, and the 10% condition (nโค0.10N) applies when sampling without replacement
One-Sample Intervals for Means
When estimating a population mean from quantitative data, the choice between z and t depends on whether you know the population standard deviation. Spoiler: you almost never know ฯ in real applications, so the t-interval dominates AP Statistics.
Z-Interval for a Mean (Known ฯ)
Formula: xหยฑzโ(nโฯโ)โuses the known population standard deviation ฯ in the standard error
Rarely used in practice because knowing ฯ while not knowing ฮผ is an unusual situation; this formula appears mainly in theoretical problems
Normal distribution required: either the population is normally distributed, or nโฅ30 for the Central Limit Theorem to apply
T-Interval for a Mean (Unknown ฯ)
Formula: xหยฑtโ(nโsโ)โsubstitutes sample standard deviation s for ฯ, with degrees of freedom df=nโ1
T-distribution is wider than the z-distribution, especially for small samples; this accounts for the extra uncertainty from estimating ฯ
Conditions: random sample, independence (10% condition), and population approximately normal OR large sample size (nโฅ30)
Compare: Z-interval vs. T-interval for meansโboth estimate ฮผ, but the t-interval uses s instead of ฯ and has heavier tails. On the AP exam, if ฯ isn't explicitly given, use the t-interval. This is the default for quantitative data.
Two-Sample Intervals for Comparing Groups
Comparing two populations is where inference gets interesting. The key question: are the samples independent (two separate groups) or paired (same subjects measured twice)?
Two-Sample T-Interval for Difference of Means
Formula: (xห1โโxห2โ)ยฑtโn1โs12โโ+n2โs22โโโโthe standard error combines variability from both samples
Degrees of freedom: use calculator's "2-SampTInt" function, which applies the Welch approximation; don't try to compute df by hand on the AP exam
Conditions: two independent random samples, 10% condition for each group, and both populations approximately normal OR both sample sizes large
Two-Sample Z-Interval for Difference of Proportions
Formula: (p^โ1โโp^โ2โ)ยฑzโn1โp^โ1โ(1โp^โ1โ)โ+n2โp^โ2โ(1โp^โ2โ)โโโnote that we use each sample's p^โ separately (no pooling for confidence intervals)
Interpretation: if the interval contains zero, there's no statistically significant difference; the direction of the difference tells you which group has the larger proportion
Success-failure condition: check all four valuesโn1โp^โ1โ, n1โ(1โp^โ1โ), n2โp^โ2โ, n2โ(1โp^โ2โ)โeach must be โฅ10
Compare: Two-sample means vs. two-sample proportionsโboth compare independent groups, but means use the t-distribution while proportions use z. If an FRQ asks you to compare two treatments with a binary outcome, you need the two-proportion z-interval.
Paired T-Interval for Mean Difference
Formula: dหยฑtโ(nโsdโโ)โwhere dห is the mean of the differences and sdโ is the standard deviation of the differences
When to use: matched pairs designs, before-and-after studies, or any situation where each observation in one sample is linked to a specific observation in the other
Key insight: you're reducing a two-sample problem to a one-sample problem by analyzing the differences; df=nโ1 where n is the number of pairs
Compare: Two-sample t-interval vs. paired t-intervalโthe paired approach controls for individual variability and often produces narrower intervals. Watch for FRQ setups where subjects are measured twice or matched by characteristics; that's your cue to use paired procedures.
Inference for Regression Slopes
Unit 9 extends confidence intervals to linear regression. Here you're estimating the true population slope ฮฒ based on your sample slope b.
T-Interval for the Slope of a Regression Line
Formula: bยฑtโโ SEbโโwhere SEbโ=โ(xiโโxห)2โsโ and s is the residual standard deviation; your calculator or computer output provides SEbโ directly
Degrees of freedom: df=nโ2โyou lose two degrees of freedom because you're estimating both the slope and intercept
Conditions (LINE):Linear relationship (check residual plot), Independence (random sample, 10% condition), Normal residuals (check histogram or Normal probability plot), Equal variance (residuals show constant spread across x-values)
Compare: T-interval for slope vs. t-interval for meanโboth use the t-distribution, but slope inference has df=nโ2 instead of df=nโ1, and the conditions focus on residual behavior rather than the original data. If you see regression output on an FRQ, look for the standard error of the slope coefficient.
Advanced Intervals (Beyond Core AP Content)
These formulas occasionally appear in enrichment contexts but are not central to the AP Statistics exam. Know they exist, but prioritize the intervals above.
Confidence Interval for Population Variance
Formula: (ฯฮฑ/22โ(nโ1)s2โ,ฯ1โฮฑ/22โ(nโ1)s2โ)โuses the chi-squared distribution, which is right-skewed
Not symmetric: unlike z and t intervals, this interval is asymmetric around the point estimate
Strong normality assumption: the population must be normally distributed; this procedure is sensitive to departures from normality
Confidence Interval for Ratio of Two Variances
Formula: (s22โs12โโโ Fฮฑ/2โ1โ,s22โs12โโโ Fฮฑ/2โ)โuses the F-distribution with df1โ=n1โโ1 and df2โ=n2โโ1
Application: testing whether two populations have equal variances before running a pooled two-sample t-test
Requires normality in both populations; rarely tested on AP Statistics but useful for understanding ANOVA assumptions
Quick Reference Table
Concept
Best Examples
Estimating a single proportion
One-sample z-interval for p
Estimating a single mean
T-interval for ฮผ (use z only if ฯ is known)
Comparing two independent proportions
Two-sample z-interval for p1โโp2โ
Comparing two independent means
Two-sample t-interval for ฮผ1โโฮผ2โ
Comparing paired/matched data
Paired t-interval for ฮผdโ
Estimating a regression slope
T-interval for ฮฒ with df=nโ2
Intervals using z-distribution
One-proportion, two-proportion (large samples)
Intervals using t-distribution
One-mean, two-means, paired, regression slope
Self-Check Questions
What conditions must you verify before constructing a one-sample z-interval for a proportion, and why does each condition matter?
Compare the t-interval for a single mean and the paired t-interval: what do they have in common, and when would you choose one over the other?
If a confidence interval for p1โโp2โ is (0.03,0.15), what can you conclude about the relationship between the two population proportions?
An FRQ gives you regression output including b=2.4 and SEbโ=0.6 with n=22. What critical value would you use for a 95% confidence interval, and what are the degrees of freedom?
A researcher wants to determine whether a new teaching method improves test scores. Students are tested before and after the intervention. Which confidence interval procedure is appropriate, and why would using a two-sample t-interval be incorrect here?