Setting up a two-sample z-test for a difference of two proportions means writing your hypotheses, defining what each proportion represents, and checking that your conditions are met before you calculate anything. The null hypothesis says the two population proportions are equal, and the big change here is that you check normality using a pooled (combined) proportion instead of separate ones.
Why This Matters for the AP Statistics Exam
This topic is part of Unit 6, which carries a good share of the exam. Comparing two proportions shows up when a question asks whether two groups truly differ on some categorical outcome, like the proportion of two populations that succeed, agree, or respond a certain way.
On the exam, you may need to:
- Recognize that a comparison of two groups on a categorical variable calls for a two-sample z-test for a difference of proportions.
- Write correct hypotheses and define parameters clearly.
- Verify independence and normality conditions, including the pooled proportion check.
Getting the setup right matters because the rest of the test (calculating the statistic, finding the p-value, and concluding) depends on it. Carrying out the full test comes next in 6.11.

Key Takeaways
- The null hypothesis is always Hโ: pโ = pโ (equivalently pโ - pโ = 0). The alternative is one-sided (pโ > pโ or pโ < pโ) or two-sided (pโ โ pโ).
- Define pโ and pโ in context so the reader knows exactly which populations you are comparing.
- The correct procedure is a two-sample z-test for a difference of two population proportions.
- Check independence: two independent random samples (or random assignment in an experiment), plus the 10% condition when sampling without replacement.
- Check normality with a pooled proportion pฬc, then confirm nโpฬc, nโ(1-pฬc), nโpฬc, and nโ(1-pฬc) are all large enough (commonly at least 10).
- Use meaningful subscripts so it is clear which proportion goes with which group.
Hypotheses and Parameters
The first step is writing your hypotheses. The null hypothesis always sets the two population proportions equal, and the alternative says one is greater than, less than, or not equal to the other.
- Null: Hโ: pโ = pโ, or equivalently Hโ: pโ - pโ = 0
- One-sided alternative: Hโ: pโ > pโ or Hโ: pโ < pโ
- Two-sided alternative: Hโ: pโ โ pโ
Just as important, define what pโ and pโ represent. State the populations in words so the reader knows exactly what you are comparing. The null hypothesis sets the difference to 0, which is the "no difference or effect" situation.
Conditions
You also have to check the conditions for inference. The three checks are similar to the ones for the confidence interval, with one change in the normal check.
(1) Random
Both samples need to come from random samples (or random assignment in an experiment). Without randomization, your results suffer from bias and you cannot generalize to the populations.
(2) Independence
Check that the data come from two independent random samples or a randomized experiment. When sampling without replacement, also confirm the 10% condition: nโ โค 10% Nโ and nโ โค 10% Nโ. In a randomized experiment, random assignment of treatments is what makes the groups independent.
(3) Normal
For proportions, you check normality using the Large Counts idea: expected successes and failures should be large enough (commonly at least 10). The change for a two-sample test is that you assume Hโ is true (pโ = pโ), so you combine both samples into one pooled proportion:
pฬc = (nโpฬโ + nโpฬโ)/(nโ + nโ)
Then check that all four of these are large enough (commonly at least 10):
- nโpฬc
- nโ(1-pฬc)
- nโpฬc
- nโ(1-pฬc)
You pool because the null hypothesis assumes the two proportions are equal. Under that assumption, combining the samples gives a single best estimate of the common proportion, which you use to check the shape condition.
Example
Suppose MJ made 836 of 1623 shots and Lebron made 622 of 1493 shots, and you want to test whether their shooting proportions differ. Here is how the setup looks for a two-sample z-test.
Hypotheses and Parameters
- Hโ: p_MJ = p_L (their population shot-making proportions are equal)
- Hโ: p_MJ โ p_L (their population shot-making proportions are different)
Let p_MJ be the true proportion of shots MJ makes and p_L be the true proportion of shots Lebron makes. Using meaningful subscripts like MJ and L keeps it clear which proportion matches which group.
Conditions
- Random: The problem does not state the shots were randomly sampled, so this is an assumption you would have to make and note as a limitation.
- Independent: It is reasonable that each player took far more than 10 times the number of shots in the sample, so the 10% condition is satisfied and the two samples are independent of each other.
- Normal: First find the pooled proportion.
pฬc = (836 + 622)/(1623 + 1493) โ 0.468
Now check Large Counts using pฬc โ 0.468 (so 1 - pฬc โ 0.532):
- 1623(0.468) โ 760 โฅ 10 โ๏ธ
- 1623(0.532) โ 863 โฅ 10 โ๏ธ
- 1493(0.468) โ 699 โฅ 10 โ๏ธ
- 1493(0.532) โ 794 โฅ 10 โ๏ธ
All conditions check out, so the setup is complete and you are ready to calculate the test statistic and find the p-value (that is the next topic, 6.11).
How to Use This on the AP Statistics Exam
Free Response
- Name the procedure: a two-sample z-test for a difference of two population proportions.
- Write Hโ: pโ = pโ and the correct alternative based on the question.
- Define pโ and pโ in context, with clear subscripts.
- Show all three conditions: random, independent (including the 10% check), and normal with the pooled proportion.
- Clear notation and labeled work are important for communicating your setup, even before you calculate anything.
MCQ
- Be ready to pick the correct hypotheses or the correct procedure from a list.
- Watch for the pooled proportion in the normal check; that is the detail that separates one-sample and two-sample proportion tests.
Common Trap
- Forgetting to pool. For the difference of two proportions, the normality check uses the combined pฬc, not separate pฬโ and pฬโ.
Common Misconceptions
- The null hypothesis is not about a specific value like 0.5 for each group. It only says the two proportions are equal, which is the same as saying their difference is 0.
- "Failing to reject" does not prove the proportions are equal. A test can only give evidence for a difference, not prove there is none.
- The 10% condition is about independence within each sample (sampling without replacement), not about the normal condition. Keep those two checks separate.
- The pooled proportion is only for the test setup under Hโ. The confidence interval for a difference of proportions does not pool; it uses the separate sample proportions.
- A small p-value will come later, but it shows the difference would be unusual if Hโ were true. It does not, by itself, measure how big the difference is.
Related AP Statistics Guides
- Unit 6 Overview: Inference for Categorical Data: Proportions
- 6.2 Constructing a Confidence Interval for a Population Proportion
- 6.1 Introducing Statistics: Why Be Normal?
- 6.4 Setting Up a Test for a Population Proportion
- 6.3 Justifying a Claim Based on a Confidence Interval for a Population Proportion
- 6.5 Interpreting p-Values
Vocabulary
The following words are mentioned explicitly in the College Board Course and Exam Description for this topic.Term | Definition |
|---|---|
alternative hypothesis | The claim that contradicts the null hypothesis, representing what the researcher is trying to find evidence for. |
approximately normal | A distribution that closely follows the shape of a normal distribution, allowing for the use of normal probability methods. |
categorical variable | A variable that takes on values that are category names or group labels rather than numerical values. |
difference of two population proportions | The comparison between two population proportions, expressed as pโ - pโ, to determine if they differ significantly. |
independence | The condition that observations in a sample are not influenced by each other, typically ensured through random sampling or randomized experiments. |
null hypothesis | The initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference. |
one-sided alternative hypothesis | An alternative hypothesis that specifies the direction of the difference, either pโ < pโ or pโ > pโ. |
pooled proportion | A combined estimate of the population proportion calculated from both samples when assuming the null hypothesis is true: pฬc = (nโpฬโ + nโpฬโ)/(nโ + nโ). |
population proportion | The true proportion or percentage of a characteristic in an entire population, typically denoted as p. |
randomized experiment | A study design where subjects are randomly assigned to treatment groups to establish cause-and-effect relationships. |
sampling distribution | The probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population. |
sampling without replacement | A sampling method in which an item selected from a population cannot be selected again in subsequent draws. |
simple random sample | A sample selected from a population such that every possible sample of the same size has an equal chance of being chosen. |
statistical inference | The process of drawing conclusions about a population based on data collected from a sample. |
two-sample z-test | A hypothesis test used to compare the difference between two population proportions using the standard normal distribution. |
two-sided alternative hypothesis | An alternative hypothesis that specifies the difference could be in either direction, stated as pโ โ pโ. |
Frequently Asked Questions
When do you use a two-sample z-test for proportions?
Use a two-sample z-test for proportions when you are comparing two population proportions for one categorical variable, such as whether two groups differ in the proportion that supports, succeeds, or responds a certain way.
What are the hypotheses for a difference of two population proportions?
The null hypothesis is H0: p1 = p2, or H0: p1 - p2 = 0. The alternative can be one-sided, such as Ha: p1 > p2 or Ha: p1 < p2, or two-sided, Ha: p1 != p2.
What is p-hat combined?
The combined or pooled proportion, p-hat combined, estimates the common proportion assuming the null hypothesis is true. It is found by combining successes from both samples over the total sample size.
When do you use p-hat combined in AP Statistics?
Use p-hat combined when checking the normal condition for a two-sample z-test for a difference of proportions. You do not use it for a confidence interval for a difference of proportions.
What conditions do you check for a two-proportion z-test?
Check randomization, independence, and normality. For independence, use the 10% condition when sampling without replacement. For normality, check large counts using the pooled proportion.
How is setting up a two-proportion test scored on the AP Statistics exam?
AP Statistics scoring rewards naming the correct procedure, defining parameters in context, writing correct hypotheses, and verifying conditions clearly before calculating a test statistic or p-value.