📊AP Statistics Unit 7 Review

7.9 Carrying Out a Test for the Difference of Two Population Means

Q: What test is used for the difference of two population means?

Use a two-sample t-test when comparing the means of two populations using independent random samples or a randomized experiment and quantitative data.

Q: What is the two-sample t-test statistic?

The test statistic compares the difference in sample means to the hypothesized difference, usually zero, divided by the standard error based on the two sample standard deviations and sample sizes.

Q: Do I need to memorize the two-sample t-test formula for AP Statistics?

No. The AP Statistics CED notes that test statistic formulas do not need to be memorized because they can be built from the general test statistic structure and formula sheet information.

Q: How do I find degrees of freedom for a two-sample t-test?

Use technology for degrees of freedom when available. The degrees of freedom fall between the smaller of n1 - 1 and n2 - 1 and the value n1 + n2 - 2.

Q: How do I interpret the p-value for a two-sample t-test?

The p-value is computed assuming the null hypothesis is true, usually that the two population means are equal. It gives the probability of getting a test statistic as extreme as the observed one by random chance.

Q: How do I write the conclusion for a two-sample t-test?

Compare the p-value to alpha, reject or fail to reject the null hypothesis, and state the result in context of the two populations and the research question.

Written by the Fiveable Content Team • Last updated June 2026

Verified for the 2027 exam

Verified for the 2027 exam•Written by the Fiveable Content Team • Last updated June 2026

📊AP Statistics

Unit & Topic Study Guides

AP Statistics Exam

Multiple-Choice Questions (MCQ)

FRQ 6 – Investigative Task

FRQs 1-5 – Free Response

Unit 1 – Exploring One–Variable Data

Unit 1 Overview: Exploring One-Variable Data

1.1 Introducing Statistics: What Can We Learn from Data?

1.2 The Language of Variation: Variables

1.3 Representing a Categorical Variable with Tables

1.4 Representing a Categorical Variable with Graphs

1.5 Representing a Quantitative Variable with Graphs

1.6 Describing the Distribution of a Quantitative Variable

1.7 Summary Statistics for a Quantitative Variable

1.8 Graphical Representations of Summary Statistics

1.9 Comparing Distributions of a Quantitative Variable

1.10 The Normal Distribution

Unit 2 – Exploring Two–Variable Data

Unit 2 Overview: Exploring Two-Variable Data

2.1 Introducing Statistics: Are Variables Related?

2.2 Representing Two Categorical Variables

2.3 Statistics for Two Categorical Variables

2.4 Representing the Relationship Between Two Quantitative Variables

2.5 Correlation

2.6 Linear Regression Models

2.7 Residuals

2.8 Least Squares Regression

2.9 Analyzing Departures from Linearity

Unit 3 – Collecting Data

Unit 3 Overview: Collecting Data

3.1 Introducing Statistics: Do the Data We Collected Tell the Truth?

3.2 Introduction to Planning a Study

3.3 Random Sampling and Data Collection

3.4 Potential Problems with Sampling

3.5 Introduction to Experimental Design

3.6 Selecting an Experimental Design

3.7 Inference and Experiments

Unit 4 – Probability, Random Variables, and Probability Distributions

Unit 4 Overview: Probability, Random Variables, and Probability Distributions

4.1 Introducing Statistics: Random and Non-Random Patterns?

4.2 Estimating Probabilities Using Simulation

4.3 Introduction to Probability

4.4 Mutually Exclusive Events

4.5 Conditional Probability

4.6 Independent Events and Unions of Events

4.7 Introduction to Random Variables and Probability Distributions

4.8 Mean and Standard Deviation of Random Variables

4.9 Combining Random Variables

4.10 Introduction to the Binomial Distribution

4.11 Parameters for a Binomial Distribution

4.12 The Geometric Distribution

Unit 5 – Sampling Distributions

Unit 5 Overview: Sampling Distributions

5.1 Introducing Statistics: Why Is My Sample Not Like Yours?

5.2 The Normal Distribution, Revisited

5.3 The Central Limit Theorem

5.4 Biased and Unbiased Point Estimates

5.5 Sampling Distributions for Sample Proportions

5.6 Sampling Distributions for Differences in Sample Proportions

5.7 Sampling Distributions for Sample Means

5.8 Sampling Distributions for Differences in Sample Means

Unit 6 – Proportions

Unit 6 Overview: Inference for Categorical Data: Proportions

6.1 Introducing Statistics: Why Be Normal?

6.2 Constructing a Confidence Interval for a Population Proportion

6.3 Justifying a Claim Based on a Confidence Interval for a Population Proportion

6.4 Setting Up a Test for a Population Proportion

6.5 Interpreting p-Values

6.6 Concluding a Test for a Population Proportion

6.7 Potential Errors When Performing Tests

6.8 Confidence Intervals for the Difference of Two Proportions

6.9 Justifying a Claim Based on a Confidence Interval for a Difference of Population Proportions

6.10 Setting Up a Test for the Difference of Two Population Proportions

6.11 Carrying Out a Test for the Difference of Two Population Proportions

Unit 7 – Means

Unit 7 Overview: Means

7.1 Introducing Statistics: Should I Worry About Error?

7.2 Constructing a Confidence Interval for a Population Mean

7.3 Justifying a Claim About a Population Mean Based on a Confidence Interval

7.4 Setting Up a Test for a Population Mean

7.5 Carrying Out a Test for a Population Mean

7.6 Confidence Intervals for the Difference of Two Means

7.7 Justifying a Claim About the Difference of Two Means Based on a Confidence Interval

7.8 Setting Up a Test for the Difference of Two Population Means

7.9 Carrying Out a Test for the Difference of Two Population Means

7.10 Skills Focus: Selecting, Implementing, and Communicating Inference Procedures

Unit 8 – Chi–Squares

Unit 8 Overview: Chi Square

8.1 Introducing Statistics: Are My Results Unexpected?

8.2 Setting Up a Chi Square Goodness of Fit Test

8.3 Carrying Out a Chi Square Goodness of Fit Test

8.4 Expected Counts in Two Way Tables

8.5 Setting Up a Chi-Square Test for Homogeneity or Independence

8.6 Carrying Out a Chi-Square Test for Homogeneity or Independence

8.7 Skills Focus: Selecting an Appropriate Inference Procedure for Categorical Data

Unit 9 – Slopes

Unit 9 Overview: Slopes

9.1 Introducing Statistics: Do Those Points Align?

9.2 Confidence Intervals for the Slope of a Regression Model

9.3 Justifying a Claim About the Slope of a Regression Model Based on a Confidence Interval

9.4 Setting Up a Test for the Slope of a Regression Model

9.5 Carrying Out a Test for the Slope of a Regression Model

9.6 Skills Focus: Selecting an Appropriate Inference Procedure

Frequently Asked Questions

How Do I Self-Study AP Statistics?

What Are the Best Quizlet Decks for AP Statistics?

What Are the Best AP Statistics Textbooks and Prep Books?

Is AP Statistics Hard? Is AP Statistics Worth Taking?

How Can I Get a 5 in AP Statistics?

What is bias?

Previous Exam Prep

Analyzing Categorical Data - Slides

Analyzing Categorical Data

Sampling Methods and Sources of Bias - Slides

Sampling Methods and Sources of Bias

Sampling Methods and Sources of Bias - Slides

Displaying Quantitative Data with Graphs

Experiments and Observational Studies - Slides

Experiments and Observational Studies

Advanced Linear Regression

Describing Location in a Distribution

Describing Location in a Distribution - Slides

Advanced Linear Regression

Normal Curve and Normal Calculations

Advanced Linear Regression

Powerpoint slides

Randomness, Probability and Simulation

Probability: Two-Way Tables, Conditional, Independence, Tree Diagrams, etc

Probability Review: Random Variables, Binomial/Geometric Distributions - Slides

Probability: Random Variables, Binomial/Geometric Distributions

Sampling Distributions for Proportions - Slides

Sampling Distributions for Proportions

Sampling Distributions for Means

Sampling Distributions for Means - Slides

Inference: Confidence Intervals for Proportions - Slides

Inference: Confidence Intervals for Proportions

Inference: Confidence Intervals for Means - Slides

Inference: Confidence Intervals for Means

Inference: Hypothesis Tests for Proportions

Presentation Slides, Inference: Hypothesis test for proportions

Presentation Slides, Hypothesis Tests for Means

Inference: Hypothesis Tests for Means

Presentation Slides, z and t procedures

Review of Inference: z and t Procedures

Stats Q & A Slides

Q and A

Inference: Errors & Power of Tests Slides, Errors and Power

Inference: Errors & Power of Tests

Advanced Linear Regression

Normal Distributions

Presentation Slides, Normal Distributions

Presentation Slides, Conditional Probability

Conditional Probability and Independence

Presentation Slides, Sampling Distributions

Sampling Distributions for Differences of means and proportions

Presentation Slides, Combining Random Vairables

Combining Random Variables

Late Medieval Review (Units 1-2)

Late Medieval Review (Units 1-2) - 5/7 - Slides

How to Get The Most Out of The Formula Sheet

Presentation Slides, Formula Sheet

2020 CB Mock Question with Q and A

2020 CB Mock Question with Q and A - Stream Slides

Slides for Presentation

Final Review and Open Q and A

AP Stats Unit 1 Review

Advanced Linear Regression

Study Tools

Download AP Statistics Cheat Sheet PDF Cram Chart

AP Statistics Cheat Sheet PDF & Formula Review Chart

Exam Skills

Score Higher on AP Statistics: MCQ Tips from Students

Score Higher on AP Statistics: FRQ Tips from Students

AP Statistics Free Response Questions

AP Statistics Free Response Help - FRQ

AP Stats Mixed Units Practice FRQ #4 & Feedback

AP Stats Unit 7 FRQ Practice Prompt (#1) Answers & Feedback

AP Stats Unit 1 Practice FRQ Prompt Answers & Feedback

AP Stats Mixed Units Practice FRQ #3 & Feedback

AP Stats Mixed Units Practice FRQ #2 & Feedback

AP Stats Unit 4 FRQ Practice Prompt Answers & Feedback

AP Stats Unit 4 Practice FRQ #2

AP Stats FRQ Practice Prompt Answers & Feedback (Unit 2)

AP Stats Mixed Units Practice FRQ #1 & Feedback

AP Stats FRQ Practice Prompt Samples & Feedback (Unit 5)

AP Stats Practice FRQ Responses & Feedback (Unit 6)

AP Stats Practice FRQ Responses & Feedback (Unit 4)

AP Stats Unit 3 FRQ Practice Prompt Answers & Feedback

2017 FRQ Review

2017 FRQ Review - Stream Slides

Presentation Slides, 2018 FRQ

2018 FRQ Review

2019 FRQ Review - Stream Slides

2019 FRQ Review

Presentation Slides, FRQ Collecting Data

FRQ Collecting Data

Q&A Student Study Session

🎉NMSI AP Reader Chat: Statistics

Inference - Stream Slides

Inference

Unit 1 FRQ Review and Check In

AP Stats FRQ Practice

AP Cram Sessions 2021

AP Statistics Cram Unit 1: Exploring One Variable Data

🌶️ AP Stats Cram Review: Unit 1: Exploring One Variable Data

AP Statistics Cram Unit 2: Exploring Two Variable Data

🌶️ AP Stats Cram Review: Unit 2: Exploring Two Variable Data

AP Statistics Cram Unit 3: Collecting Data

🌶️ AP Stats Cram Review: Unit 3: Collecting Data

AP Statistics Cram Unit 4: Probability, Random Variables and Probability Distributions

🌶️ AP Stats Cram Review: Unit 4: Probability, Random Variables and Probability Distributions

🌶️ AP Stats Cram Review: Unit 5: Sampling Distributions

AP Statistics Cram Unit 5: Sampling Distributions

AP Statistics Cram Unit 6: Inference for Categorical Data: Proportions Confidence Intervals

🌶️ AP Stats Cram Review: Unit 6: Inference for Categorical Data: Proportions Confidence Intervals

AP Statistics Cram Unit 6: Inference for Categorical Data: Proportions Hypothesis Tests

🌶️ AP Stats Cram Review: Unit 6: Inference for Categorical Data: Proportions Hypothesis Tests

AP Statistics Cram Unit 7: Inference for Quantitative Data: Means Confidence Intervals

🌶️ AP Stats Cram Review Unit 7 Inference for Quantitative Data Means Confidence Intervals

AP Statistics Cram Unit 7: Inference for Quantitative Data: Means Hypothesis Tests

🌶️ AP Stats Cram Review: Unit 7: Inference for Quantitative Data: Means Hypothesis Tests

🌶️ AP Stats Cram Review Units 8 and 9 (Inference for Categorical Data Chi Square and Inference for Quantitative Data Slopes)

AP Statistics Cram Units 8 and 9 (Inference for Categorical Data: Chi Square and Inference for Quantitative Data: Slopes)

AP Statistics Cram Free Response Tips and Tricks

🌶️ AP Stats Cram Review Free Response Tips and Tricks

🌶️ AP Statistics Finale May 16, 2021

AP Statistics Finale

🌶️ AP Statistics Finale Watch Party Admin 2

AP Statistics Finale

🌶️ AP Statistics Finale Watch Party Admin 3

To carry out a two-sample t-test for the difference of two population means, calculate the t statistic by dividing the difference in sample means by the standard error, find the degrees of freedom with technology, get the p-value, and compare it to your significance level. If the p-value is at or below alpha, reject the null hypothesis and state your conclusion in context.

Why This Matters for the AP Statistics Exam

This topic is the calculation and conclusion stage of comparing two means, which shows up often in free-response questions that ask whether data give convincing evidence of a difference. When a question asks for convincing evidence, it is asking for a significance test, not just a description of the numbers. You need to identify the correct parameter and hypotheses, check conditions, calculate the test statistic and p-value, and then write a conclusion that links the p-value to the decision in context. Being precise with notation and showing your work clearly is important for strong exam responses on both multiple-choice and free-response questions.

more resources to help you study

practice multiple choice FRQ practice & scoring cheatsheets score calculator key terms

Key Takeaways

The test statistic is t = ((x̄₁-x̄₂)-(μ₁-μ₂))/√(s₁²/n₁+s₂²/n₂), and under the null the (μ₁-μ₂) term is 0.
The standard error of the difference is √(s₁²/n₁+s₂²/n₂).
Degrees of freedom fall between the smaller of n1-1 and n2-1 and n1+n2-2; technology gives a precise value.
The p-value is computed by assuming the null is true, meaning the two population means are equal.
Compare the p-value to alpha: if p ≤ alpha, reject H0; if p > alpha, fail to reject H0.
Write down the t statistic, degrees of freedom, and p-value, and state your conclusion in context.

Calculating the Test Statistic

Once you have confirmed the conditions for a two-sample t-test are met, you can calculate the test statistic and p-value to decide whether the difference between the two means is statistically significant.

You are comparing one quantitative variable across two independent samples. The first step is finding the difference between the two sample means and dividing it by the standard error of the difference.

Degrees of Freedom

Calculating by hand: take the smaller of the two sample sizes and subtract 1. This is the conservative approach, similar to what you did with a single sample in Unit 7.5.
Using technology such as a graphing calculator: the degrees of freedom come with the output, and the value falls between the smaller of n1-1 and n2-1 and n1+n2-2.

t Statistic

The general test statistic formula is:

$\frac{observed-expected}{\sigma}$

For the difference of two population means, this becomes:

$\frac{\bar{x}_{1}-\bar{x}_{2}}{\sqrt{\frac{{s^2}_{1}}{n_1}+\frac{{s^2}_{2}}{n_2}}}$

The full form includes the hypothesized difference (μ₁-μ₂) in the numerator, but since the null hypothesis usually sets that difference to 0, the term drops out. You can build this from the general test statistic formula and the standard error formulas on the Formula Sheet, so you do not need to memorize it.

Calculating the P-Value

With your degrees of freedom and t statistic, you can use the table on the Formula Sheet. Find the row for your degrees of freedom, look across to find the t value closest to yours, and use the tail probability that matches.

A more exact approach is to run a two-sample t-test using technology such as a graphing calculator. You can either type in the summary statistics or enter the raw data into a list. The output gives you the t statistic, degrees of freedom, and p-value.

Remember that the p-value is computed by assuming the null hypothesis is true, which means assuming the two population means are equal.

On a free-response question, write down the t statistic, the degrees of freedom, and the p-value so your work is complete and clear.

Making the Decision and Stating a Conclusion

Once you have your p-value, compare it to the significance level (often 0.05) to evaluate the null hypothesis.

If the p-value is at or below alpha, reject H0. You have convincing evidence for the alternative hypothesis.
If the p-value is greater than alpha, fail to reject H0. You do not have convincing evidence for the alternative.

Always state your conclusion in context. For a study comparing the mean number of green beans picked from two fields, a conclusion might read:

Since the p-value is essentially 0 and less than 0.05, we reject H0. We have convincing evidence that the true mean number of green beans picked from Field A differs from the true mean picked from Field B.

That conclusion works because it compares the p-value to the significance level, states the decision about H0, connects to the alternative hypothesis, and stays in the context of the problem.

Worked Example: Comparing Recovery Times

Here is how the pieces fit together in a real comparison. In a study comparing mean recovery times for two surgical procedures to repair a torn ACL, one group had a sample size of 110 and the other had 100. The degrees of freedom fall between 100 (the smaller of 110 and 100) and 208 (110 + 100 − 2). Using technology, the degrees of freedom came out to about 207.18. With a test statistic of t ≈ 7.13, the p-value is the area greater than 7.13 for a t-distribution with df = 207.18. That very large t statistic gives a tiny p-value, which would lead you to reject the null and conclude there is convincing evidence of a difference in mean recovery times.

How to Use This on the AP Statistics Exam

Free Response

State hypotheses in terms of population parameters: H₀: μ₁-μ₂=0 (or μ₁=μ₂) and an alternative that matches the question.
Name the procedure (two-sample t-test for a difference of means) and check conditions before calculating.
Show the test statistic, degrees of freedom, and p-value.
Compare the p-value to alpha with a numerical reference, such as "Because p < 0.05, we reject H0."
End with a conclusion in context that connects back to the alternative hypothesis.

MCQ

Be ready to identify the correct standard error, √(s₁²/n₁+s₂²/n₂), and the correct test statistic.
Know that under the null, the (μ₁-μ₂) term equals 0.
Recognize that degrees of freedom from technology fall between the smaller of n1-1 and n2-1 and n1+n2-2.

Common Trap

"Convincing evidence" signals a significance test, not just a description of the data.
Rejecting H0 is not the same as proving the alternative; it means the data are unlikely under the null.

Common Misconceptions

The two-sample t-test uses two independent samples. Do not confuse it with a matched-pairs setup, where you analyze differences as a single sample.
The p-value is not the probability that the null hypothesis is true. It is computed assuming the null is true.
Failing to reject H0 does not prove the means are equal. It only means you lacked convincing evidence of a difference.
The hypotheses must be written with population parameters (μ₁ and μ₂), not sample statistics.
A formal decision compares the p-value to alpha directly. Vague statements like "the p-value is small" without a comparison are not enough.
Degrees of freedom for two samples are not simply n1-1. By hand you use the smaller sample size minus 1 as a conservative value, while technology gives a more exact number.

Vocabulary

The following words are mentioned explicitly in the College Board Course and Exam Description for this topic.

Term	Definition
degrees of freedom	A parameter of the t-distribution that affects its shape; as degrees of freedom increase, the t-distribution approaches the normal distribution.
difference in sample means	The result of subtracting one sample mean from another sample mean, calculated as x̄₁ - x̄₂.
difference of population means	The difference between the mean values of two distinct populations, calculated as μ₁ - μ₂.
normal distribution	A probability distribution that is mound-shaped and symmetric, characterized by a population mean (μ) and population standard deviation (σ).
null hypothesis	The initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
p-value	The probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true.
population means	The average values of two distinct populations being compared, denoted as μ₁ and μ₂.
quantitative variable	A variable that is measured numerically and can take on a range of values, allowing for mathematical operations and statistical analysis.
randomized experiment	A study design where subjects are randomly assigned to treatment groups to establish cause-and-effect relationships.
reject the null hypothesis	The decision made when the p-value is less than or equal to the significance level, indicating sufficient evidence against the null hypothesis.
sampling distribution	The probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population.
significance level	The threshold probability (α) used to determine whether to reject the null hypothesis in a significance test.
significance test	A statistical procedure used to determine whether there is sufficient evidence to reject the null hypothesis based on sample data.
simple random sample	A sample selected from a population such that every possible sample of the same size has an equal chance of being chosen.
standard error	The standard deviation of a sampling distribution, which measures the variability of a sample statistic across repeated samples.
statistical reasoning	The logical process of using sample data and significance test results to draw conclusions about populations and answer research questions.
t-distribution	A probability distribution used when the population standard deviation is unknown and the sample standard deviation is used instead, characterized by heavier tails than the normal distribution.
test statistic	A calculated value used to determine whether to reject the null hypothesis in a hypothesis test, computed from sample data.
two-sample test	A significance test used to compare the means of two different populations based on sample data from each population.

Frequently Asked Questions

What test is used for the difference of two population means?

Use a two-sample t-test when comparing the means of two populations using independent random samples or a randomized experiment and quantitative data.

What is the two-sample t-test statistic?

The test statistic compares the difference in sample means to the hypothesized difference, usually zero, divided by the standard error based on the two sample standard deviations and sample sizes.

Do I need to memorize the two-sample t-test formula for AP Statistics?

No. The AP Statistics CED notes that test statistic formulas do not need to be memorized because they can be built from the general test statistic structure and formula sheet information.

How do I find degrees of freedom for a two-sample t-test?

Use technology for degrees of freedom when available. The degrees of freedom fall between the smaller of n1 - 1 and n2 - 1 and the value n1 + n2 - 2.

How do I interpret the p-value for a two-sample t-test?

The p-value is computed assuming the null hypothesis is true, usually that the two population means are equal. It gives the probability of getting a test statistic as extreme as the observed one by random chance.

How do I write the conclusion for a two-sample t-test?

Compare the p-value to alpha, reject or fail to reject the null hypothesis, and state the result in context of the two populations and the research question.