Q: What's the formula for the standard deviation of x̄₁ - x̄₂?

If you’re sampling independently from two populations, the mean of x̄1 − x̄2 is μ1 − μ2 and the standard deviation (standard error) is σ(x̄1 − x̄2) = sqrt( σ1²/n1 + σ2²/n2 ). That’s the CED formula (UNC-3.T.1). If you sample without replacement and each sample is >10% of its population, use the finite-population correction (the true SD is a bit smaller). The sampling distribution can be treated as approximately normal if the populations are normal or if both n1 and n2 ≥ 30 (CLT; UNC-3.U.1-.2). This is the formula you’ll use on AP problems comparing two means; for practice and a short study guide, see the Topic 5.8 page ( https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT ).

Q: I don't understand why we subtract the means but add the variances in the formula.

Think of x̄1 and x̄2 as two random variables. By linearity of expectation, the mean of their difference is the difference of the means: μ(x̄1 − x̄2) = μ1 − μ2. For variability, use the rule Var(A − B) = Var(A) + Var(B) when A and B are independent. Each sample mean has variance σ1²/n1 and σ2²/n2, so Var(x̄1 − x̄2) = σ1²/n1 + σ2²/n2. We take the square root to get the standard deviation (standard error): σ(x̄1 − x̄2) = sqrt(σ1²/n1 + σ2²/n2). So we subtract means (because expectation is linear) but add variances (because variances of independent terms add even when you subtract). Remember the without-replacement finite population correction makes the SE slightly smaller if sample >10% of pop (CED UNC-3.T.2). For a focused review see the Topic 5.8 study guide ( https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT ) and try practice problems ( https://library.fiveable.me/practice/ap-statistics ).

Question 1

How do I find the mean and standard deviation for the difference between two sample means?

Accepted Answer

You use the population means and standard deviations (or their sample estimates) and the sample sizes. For two independent samples: - Mean of the difference: μ(x̄1 − x̄2) = μ1 − μ2. - Standard deviation (standard error) of the difference: σ(x̄1 − x̄2) = sqrt(σ1^2/n1 + σ2^2/n2). So if μ1 = 50, μ2 = 45, σ1 = 10, σ2 = 8, n1 = 25, n2 = 36: - Mean = 50 − 45 = 5 - SE = sqrt(10^2/25 + 8^2/36) = sqrt(100/25 + 64/36) = sqrt(4 + 1.778) ≈ sqrt(5.778) ≈ 2.40 Conditions: samples must be independent and random. If sampling without replacement from finite populations, use the 10% rule (if n < 10% of population, the formula is fine). For normal approximation: the sampling distribution is approx normal if each population is normal or both n ≥ 30 (CLT). This is exactly what the AP CED gives for Topic 5.8. For a quick review see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT) and more practice problems at (https://library.fiveable.me/practice/ap-statistics).

Question 2

What's the formula for the standard deviation of x̄₁ - x̄₂?

Accepted Answer

If you’re sampling independently from two populations, the mean of x̄1 − x̄2 is μ1 − μ2 and the standard deviation (standard error) is σ(x̄1 − x̄2) = sqrt( σ1²/n1 + σ2²/n2 ). That’s the CED formula (UNC-3.T.1). If you sample without replacement and each sample is >10% of its population, use the finite-population correction (the true SD is a bit smaller). The sampling distribution can be treated as approximately normal if the populations are normal or if both n1 and n2 ≥ 30 (CLT; UNC-3.U.1-.2). This is the formula you’ll use on AP problems comparing two means; for practice and a short study guide, see the Topic 5.8 page (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT).

Question 3

When can I use the normal distribution for differences in sample means?

Accepted Answer

You can model x̄1 − x̄2 with a normal distribution when these conditions hold: - Means and SE: the sampling distribution has mean μ1 − μ2 and standard error √(σ1²/n1 + σ2²/n2) (or use s's if σ’s unknown). - Shape (normal/approximate normal): either each population is approximately normal, or if not, both sample sizes are large (the AP large-sample rule: n1 ≥ 30 and n2 ≥ 30) so the CLT makes the difference approximately normal (UNC-3.U.1 and UNC-3.U.2). - Independence & sampling: samples must be independent and random; if sampling without replacement, each n should be ≤ 10% of its population so the standard error formula is ok (finite population correction negligible; UNC-3.T.2). On the AP exam you should always state/check these conditions before using z or t methods for two-sample inference. For a quick refresher see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT) and try practice questions (https://library.fiveable.me/practice/ap-statistics).

Question 4

I'm confused about when to use √(σ₁²/n₁ + σ₂²/n₂) - is this always the formula?

Accepted Answer

Short answer: use σ(x̄1 − x̄2) = √(σ1²/n1 + σ2²/n2) when the two samples are independent and you know (or can assume) the population standard deviations σ1 and σ2. That formula gives the true standard deviation of the sampling distribution of x̄1 − x̄2 (CED UNC-3.T.1). When it’s not appropriate: - If σ1 and/or σ2 are unknown (most real problems), replace them with s1 and s2 to estimate the standard error: SE ≈ √(s1²/n1 + s2²/n2) and use the appropriate t procedures or large-sample z approximations. - If you sample without replacement from finite populations, the true SD is smaller; use the finite population correction or note the 10% rule: if each sample is https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT) and practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 5

What's the difference between sampling with replacement and without replacement for difference in means?

Accepted Answer

When you sample with replacement (or from two independent populations), the sampling distribution of x̄1 − x̄2 has mean μ1 − μ2 and standard error √(σ1²/n1 + σ2²/n2) (UNC-3.T.1). If you sample without replacement from finite populations, samples are slightly less variable because you’re removing items—so the true standard deviation of x̄1 − x̄2 is smaller than that formula predicts (use the finite population correction when needed). Practically, the CED says if each sample size is less than 10% of its population, the FPC is negligible and you can use √(σ1²/n1 + σ2²/n2) (UNC-3.T.2). Normal approximation rules are the same: sampling distribution is approx normal if populations are normal or both n ≥ 30 (UNC-3.U). For AP-style review, see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT) and more unit review/practice at (https://library.fiveable.me/ap-statistics/unit-5) and (https://library.fiveable.me/practice/ap-statistics).

Question 6

Can someone explain step by step how to solve a problem about comparing two population means?

Accepted Answer

Step-by-step: compare two independent means (x̄1 − x̄2) 1. Identify: populations, sample means x̄1, x̄2, sample sizes n1, n2, and whether population σ1, σ2 are known. For AP problems σ’s are usually unknown—use t methods. 2. Conditions (AP CED): samples independent and random, sampling https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT). For more practice, check Unit 5 or the 1000+ AP practice problems (https://library.fiveable.me/ap-statistics/unit-5) and (https://library.fiveable.me/practice/ap-statistics).

Question 7

When do I need both sample sizes to be greater than 30 vs when the populations need to be normal?

Accepted Answer

Short answer: Use the population normality rule when at least one sample is small; use the “both n ≥ 30” rule when you don’t know the population shapes. Why: The CED says the sampling distribution of x̄1 − x̄2 is approximately normal if each population distribution is normal (UNC-3.U.1)—that’s true even for small n. If the populations are not normal or you don’t know their shape, rely on the CLT: the sampling distribution is approximately normal when both sample sizes are large (n1 ≥ 30 and n2 ≥ 30; UNC-3.U.2). Also make sure samples are independent and random (UNC-3.T, finite-population correction if without replacement). On the exam, always state which condition you’re using (population normal vs. CLT large-sample rule) and check independence. For a quick review, see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT), the Unit 5 overview (https://library.fiveable.me/ap-statistics/unit-5), and grab practice problems at (https://library.fiveable.me/practice/ap-statistics).

Question 8

How do I know if the 10% condition is satisfied when comparing two samples?

Accepted Answer

Check the 10% condition separately for each sample: when sampling WITHOUT replacement from a finite population, you want n1 < 0.10·N1 and n2 < 0.10·N2. If both sample sizes are less than 10% of their respective population sizes, the finite-population correction is negligible and the standard-error formula √(σ1²/n1 + σ2²/n2) applies (CED UNC-3.T.2). If you’re sampling WITH replacement (or from effectively infinite populations such as large national populations), you don’t need the 10% check. If you can’t verify population sizes, give a quick reality check: a sample of 50 from a population of 1,000 is 5% (OK); 50 from 300 is ~16.7% (fails). For AP problems they usually tell you population sizes or state “random sample from a large population”; follow that and the CLT/large-sample rules (n ≥ 30) for normal approximation (CED UNC-3.U). More on this topic: Fiveable’s Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT).

Question 9

I don't understand why we subtract the means but add the variances in the formula.

Accepted Answer

Think of x̄1 and x̄2 as two random variables. By linearity of expectation, the mean of their difference is the difference of the means: μ(x̄1 − x̄2) = μ1 − μ2. For variability, use the rule Var(A − B) = Var(A) + Var(B) when A and B are independent. Each sample mean has variance σ1²/n1 and σ2²/n2, so Var(x̄1 − x̄2) = σ1²/n1 + σ2²/n2. We take the square root to get the standard deviation (standard error): σ(x̄1 − x̄2) = sqrt(σ1²/n1 + σ2²/n2). So we subtract means (because expectation is linear) but add variances (because variances of independent terms add even when you subtract). Remember the without-replacement finite population correction makes the SE slightly smaller if sample >10% of pop (CED UNC-3.T.2). For a focused review see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT) and try practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 10

What does it mean that the sampling distribution of x̄₁ - x̄₂ has mean μ₁ - μ₂?

Accepted Answer

It means x̄1 − x̄2 is an unbiased estimator of the true difference between the two population means. If you repeatedly take independent random samples (with replacement) of sizes n1 and n2, the average value of those sample differences will converge to μ1 − μ2—the sampling distribution is centered at μ1 − μ2. Practically: one observed x̄1 − x̄2 is just one draw from that distribution; on average you’d expect the difference you estimate to equal the actual population difference (in the same units). The spread around that center is the standard error √(σ1²/n1 + σ2²/n2), and if conditions for normal approximation hold (populations normal or n1,n2 ≥ 30) you can use a normal model. For AP exam framing, always state parameters and interpret them in context (CED UNC-3.V.1). For a quick review, see the Fiveable topic study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT).

Question 11

How do I calculate the probability that one sample mean is greater than another sample mean?

Accepted Answer

Compute the sampling distribution of D = x̄1 − x̄2 first. Its mean is μD = μ1 − μ2 and its standard error is SE = sqrt(σ1^2/n1 + σ2^2/n2) (or use s1,s2 if you’re estimating). If the sampling distribution is (approximately) normal (populations normal or n1,n2 ≥ 30), then D ~ N(μD, SE). To get P(x̄1 > x̄2) = P(D > 0): 1. Standardize: z = (0 − μD)/SE. 2. Use the standard normal: P(D > 0) = 1 − Φ(z). Example: if μ1=10, μ2=8, σ1=4, σ2=3, n1=n2=36 then μD=2, SE = sqrt(16/36 + 9/36)=sqrt(25/36)=5/6 ≈0.833. z = (0−2)/0.833 ≈ −2.40 so P(D>0)=1−Φ(−2.40)=Φ(2.40)≈0.992. Remember AP expectations: state parameters (UNC-3.T), check normal-approx conditions (UNC-3.U), and interpret probability in context (UNC-3.V). For a focused review see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT) and practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 12

Why is the standard deviation smaller when sampling without replacement?

Accepted Answer

When you sample without replacement you remove items from the population, so successive draws are slightly less variable—each pick gives you more information about what’s left. Formally, the variance of a sample mean from a finite population is multiplied by the finite-population correction (FPC): σ(x̄) = (σ/√n) · √((N − n)/(N − 1)), and for a difference of means you apply that FPC to each group's standard deviation. The factor √((N − n)/(N − 1)) is < 1 when n > 1, so the standard error is smaller than the with-replacement formula √(σ1²/n1 + σ2²/n2). On the AP exam you won’t usually need the full FPC number; CED says if each sample is less than 10% of its population, the FPC is negligible and you can use the simpler formula. For a clear writeup and examples, see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT).

Question 13

I missed class - what's the difference between this topic and just regular sampling distributions?

Accepted Answer

Short answer: regular sampling distributions usually describe one sample statistic (like x̄ or p̂). Topic 5.8 is about the sampling distribution of a difference between two independent sample means (x̄1 − x̄2). Key differences you should know from the CED: - Parameter formulas: mean μ(x̄1−x̄2) = μ1 − μ2 and standard error σ(x̄1−x̄2) = √(σ1²/n1 + σ2²/n2). If you sample without replacement and n is >10% of the population, use the finite-population correction (CED UNC-3.T.1–T.2). - Normality: the difference is approximately normal if both populations are normal or if both samples are large (n ≥ 30) by CLT (UNC-3.U). - Conditions: samples must be independent (not paired); interpret probabilities and parameters with correct units and populations (UNC-3.V). For exam practice and worked examples, check the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT), the full Unit 5 overview (https://library.fiveable.me/ap-statistics/unit-5), and tons of practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 14

How do I interpret the results when comparing average test scores from two different schools?

Accepted Answer

First find the sampling distribution parameters: the mean of x̄1 − x̄2 equals μ1 − μ2 (so your point estimate is x̄1 − x̄2) and the standard error is SE = sqrt(σ1²/n1 + σ2²/n2) (or use s1,s2 if σ’s unknown). Make sure the two samples are independent and either the populations are approx. normal or both n1,n2 ≥ 30 (CLT) so the normal/t approximation applies (CED UNC-3.T, UNC-3.U). Interpretation rules: - The sign and value of x̄1 − x̄2 (in test-score points) tell you how much higher school 1’s average is than school 2’s. E.g., x̄1 − x̄2 = 5 points means school 1’s sample mean is 5 points higher. - Use SE to get a test statistic or CI. If a 95% CI for μ1 − μ2 does not contain 0, there’s statistically significant difference at α=0.05; if a p-value < α, reject H0: μ1 = μ2 (CED UNC-3.V). - Always state results in context and units: “There is/ isn’t convincing evidence that the average test score at School A is higher/lower than School B by about X points.” For help with calculations and practice, see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT), the Unit 5 overview (https://library.fiveable.me/ap-statistics/unit-5), and lots of practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 15

What calculator functions do I use to find probabilities for differences in sample means?

Accepted Answer

Use the normal / t calculator commands—you’ll compute the difference’s mean μx̄1−x̄2 = μ1 − μ2 and its SE = sqrt(σ1²/n1 + σ2²/n2) (or replace σ with s when using sample SDs). Then: - For TI-83/84 (common on the AP): compute the z (or t) statistic by hand: z = (observed difference − μdifference)/SE. Then use normalcdf(lower, upper, μ, σ) to get probabilities (e.g., normalcdf(z, 1E99, 0, 1) for right tail after standardizing). Use invNorm for critical z-values. - If you need a t-curve (small n, unknown σ and conditions met), use tcdf(lower, upper, df) on TI-84 after computing the t statistic. - For built-in inference routines (easier): STAT → TESTS → 2-SampZTest or 2-SampTTest (and 2-SampZInt / 2-SampTInt for CIs) to get p-values or intervals directly. Remember AP rules: show you checked conditions (independence, CLT or normal populations) and report units and context. For a quick refresher, see the Topic 5.8 study guide (https://library.fiveable.me/ap-statistics/unit-5/sampling-distributions-for-differences-sample-means/study-guide/hPyIdIhuKKF731eU2qOT).

Term	Definition
difference in sample means	The result of subtracting one sample mean from another sample mean, calculated as x̄₁ - x̄₂.
independent populations	Two populations from which samples are drawn such that the selection from one population does not affect the selection from the other.
normal distribution	A probability distribution that is mound-shaped and symmetric, characterized by a population mean (μ) and population standard deviation (σ).
parameter	A numerical summary that describes a characteristic of an entire population.
population distribution	The distribution of all values of a variable across the entire population.
population mean	The average of all values in an entire population, denoted as μ.
population means	The average values of two distinct populations being compared, denoted as μ₁ and μ₂.
population standard deviation	A measure of the spread or dispersion of all values in a population, denoted by σ, which is a parameter of the normal distribution.
probability	The likelihood or chance that a particular outcome or event will occur, expressed as a value between 0 and 1.
sample mean	The average of all values in a sample, denoted as x̄, used as an estimate of the population mean.
sample size	The number of observations or data points collected in a sample, denoted as n.
sampling distribution	The probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population.
sampling with replacement	A sampling method in which an item selected from a population can be selected again in subsequent draws.
sampling without replacement	A sampling method in which an item selected from a population cannot be selected again in subsequent draws.
standard error	The standard deviation of a sampling distribution, which measures the variability of a sample statistic across repeated samples.

📊AP Statistics Unit 5 Review

5.8 Sampling Distributions for Differences in Sample Means

📊AP Statistics
Unit 5 Review

5.8 Sampling Distributions for Differences in Sample Means

Unit & Topic Study Guides

Formulas

Normal Condition: Central Limit Theorem

Practice Problem

Answer

Vocabulary

Frequently Asked Questions

history

social science

english & capstone

arts

science

math & computer science

world languages

high school exams

honors classes

college classes

hs classes

Study Content & Tools

Company

Resources