Question 1

How do I calculate expected counts for a chi-square goodness of fit test?

Accepted Answer

Expected counts are just the counts you’d expect if the null hypothesis about proportions is true. For each category: expected count = n × (null proportion for that category). Example: if n = 200 and H0 says p = 0.30 for Category A, expected = 200 × 0.30 = 60. Use those expected counts in the chi-square statistic χ² = Σ (observed − expected)² / expected (formula appears on the AP formula sheet). Quick checks for AP exam: make sure data come from a random sample (or randomized experiment) and that when sampling without replacement n ≤ 10% of the population. Also check the large-counts rule: a conservative rule is all expected counts > 5. For more on setting up goodness-of-fit tests see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd).
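
As a rough illustration, the expected-count rule can be sketched in Python. The function name `expected_counts` and the two extra categories (null proportions 0.50 and 0.20) are assumptions made up for this sketch; only n = 200 and p = 0.30 for Category A come from the example above.

```python
def expected_counts(n, null_proportions):
    """Return the expected count n * p for each category under H0."""
    if abs(sum(null_proportions) - 1.0) > 1e-9:
        raise ValueError("null proportions must sum to 1")
    return [n * p for p in null_proportions]


# Category A uses p = 0.30 from the example; the other two proportions are hypothetical.
counts = expected_counts(200, [0.30, 0.50, 0.20])
print([round(c, 2) for c in counts])
```

Category A comes out to 60, matching the hand calculation, and the expected counts always add up to n.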

Question 2

What's the difference between chi-square goodness of fit and chi-square independence tests?

Accepted Answer

Goodness-of-fit tests whether one categorical variable follows a specified distribution (null gives proportions for each category). You compute expected counts as n·p for each category, χ² = Σ((Obs − Exp)²/Exp), and df = k − 1 (k = # categories). Chi-square independence (or homogeneity) uses a two-way table to test whether two categorical variables are related (independence) or whether several groups have the same distribution (homogeneity). Expected counts come from row×column totals / grand total, df = (r − 1)(c − 1). Common CED checks apply to both: data from a random sample or randomized experiment, 10% condition when sampling without replacement, and all expected counts ≳ 5 for the χ² approximation to be reliable. On the AP exam, pick goodness-of-fit for one categorical variable (Topic 8.2) and independence/homogeneity for two-way tables (Topic 8.5); see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd) and Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).
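
To make the contrast concrete, here is a minimal sketch of the two-way-table version of expected counts, (row total × column total) / grand total, with df = (r − 1)(c − 1). The function name and the 2 × 3 table are hypothetical, chosen only for illustration.

```python
def two_way_expected(table):
    """Expected counts for a two-way table: (row total * column total) / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return expected, df


# Hypothetical 2 x 3 table of observed counts (rows = groups, columns = categories).
observed = [[20, 30, 50],
            [30, 30, 40]]
expected, df = two_way_expected(observed)
print(expected, df)
```

Notice that no null proportions are supplied here: in a two-way table the expected counts come from the table's own totals, whereas goodness-of-fit takes them from H0.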

Question 3

When do I use a chi-square test instead of a z-test or t-test?

Accepted Answer

Use a chi-square test when your data are counts in the categories of a categorical variable. Choose goodness-of-fit when you have one categorical variable and want to test whether its distribution matches some specified proportions; choose independence or homogeneity when you have a two-way table and want to test whether two variables are associated or whether several groups share the same distribution. That matches the CED: “chi-square test for goodness of fit” for one categorical variable (VAR-8.C.1). Compute expected counts as n·p (VAR-8.D.1), then χ² = Σ((obs − exp)²/exp) (VAR-8.A.2). Check conditions: random sample, n ≤ 10% of population if without replacement, and all expected counts > 5 for accuracy (VAR-8.E.1). Use z- or t-tests when the parameter is a single proportion, a difference of two proportions, a mean, or a difference of means (including the mean difference for paired data) and the sampling distribution of the statistic is approximately normal (z for proportions or for means with known σ; t for means with unknown σ). For more on chi-square setup and examples, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-of-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd) and Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).
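
One way to see how the tests relate: with only two categories, the goodness-of-fit statistic equals the square of the one-proportion z statistic, so the chi-square test agrees with the two-sided z-test, and chi-square is what extends to three or more categories. The sketch below checks this numerically; the function names and the data (60 successes in 100 trials, H0: p = 0.5) are hypothetical.

```python
import math


def one_proportion_z(successes, n, p0):
    """z statistic for a one-proportion z-test of H0: p = p0."""
    p_hat = successes / n
    return (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)


def chi_square_statistic(observed, expected):
    """Sum of (observed - expected)^2 / expected over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 60 successes in n = 100 trials, H0: p = 0.5.
n, successes, p0 = 100, 60, 0.5
z = one_proportion_z(successes, n, p0)
chi2 = chi_square_statistic([successes, n - successes], [n * p0, n * (1 - p0)])
print(z ** 2, chi2)  # the two values agree
```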

Question 4

What's the formula for the chi-square test statistic and how do I use it?

Accepted Answer

Formula: Pearson’s chi-square statistic is χ² = Σ (observed − expected)² / expected, where “expected” = n · p (the count your null hypothesis predicts for each category).

How to use it (quick steps):
1. State H0 with null proportions for each category and Ha: at least one proportion differs.
2. Compute expected counts = n·p for every category (CED: VAR-8.D.1).
3. Check conditions: random sample, n ≤ 10% of population if sampling without replacement, and all expected counts > 5 (conservative) (CED: VAR-8.E.1).
4. Calculate χ² = Σ (O−E)²/E across categories (CED: VAR-8.A.2; this measures the distance between observed and expected counts).
5. Degrees of freedom = k − 1 (k = number of categories).
6. Find the p-value from the chi-square distribution (right-skewed; larger χ² → smaller p). Compare p to α and conclude (reject H0 if p < α).

For AP-aligned help, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd), the Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8), and tons of practice problems (https://library.fiveable.me/practice/ap-statistics).
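
The steps above can be sketched in Python using only the standard library. This is an illustration under stated assumptions, not an official AP procedure: the function names are made up, the observed counts are hypothetical, and the upper-tail probability is computed from the closed forms for df = 1 and df = 2 plus a standard recurrence for the incomplete gamma function (a calculator or statistics package does the same job).

```python
import math


def chi_square_sf(x, df):
    """Upper-tail probability P(X >= x) for a chi-square distribution, integer df >= 1.

    Starts from the closed forms for df = 1 and df = 2 and steps df up by 2
    using the recurrence for the regularized upper incomplete gamma function.
    """
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 1:
        prob, a = math.erfc(math.sqrt(half)), 0.5   # df = 1
    else:
        prob, a = math.exp(-half), 1.0              # df = 2
    while 2 * a < df:
        prob += half ** a * math.exp(-half) / math.gamma(a + 1)
        a += 1.0
    return prob


def goodness_of_fit(observed, null_proportions):
    """Return (chi-square statistic, df, p-value) for a goodness-of-fit test."""
    n = sum(observed)
    expected = [n * p for p in null_proportions]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    df = len(observed) - 1
    return chi2, df, chi_square_sf(chi2, df)


# Hypothetical observed counts for four categories, H0: every proportion is 0.25.
chi2, df, p_value = goodness_of_fit([30, 20, 25, 25], [0.25, 0.25, 0.25, 0.25])
print(round(chi2, 3), df, round(p_value, 3))
```

For these made-up counts χ² = 2.0 with df = 3 and the p-value is roughly 0.57, so this sample would give no convincing evidence against H0 at the usual significance levels.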

Question 5

I'm confused about null and alternative hypotheses for goodness of fit - can someone explain?

Accepted Answer

Think of a goodness-of-fit test as checking whether the observed counts match a specific distribution. Your null hypothesis (H0) gives the specified proportion for every category, e.g., H0: p1 = p2 = p3 = p4 = 0.25 (or generally p1 = p1,0; p2 = p2,0; …). Expected counts = n · p (CED VAR-8.D.1). The alternative (Ha) is not a specific list of different proportions; it’s the broad statement that at least one category’s true proportion differs from what H0 says (CED VAR-8.B.1). When you set up the test, also check conditions: a random sample or randomized experiment, the 10% rule when sampling without replacement, and all expected counts > 5 for accuracy (CED VAR-8.E.1). The test statistic is χ² = Σ[(observed − expected)²/expected] (CED VAR-8.A.2), with df = number of categories − 1. For more examples and a guided walkthrough, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd).

Question 6

How do I know if my expected counts are large enough to do the chi-square test?

Accepted Answer

Check the independence conditions first: your data must come from a random sample or randomized experiment, and if sampling without replacement check n ≤ 10% of N. Then check the large-counts condition from the CED: a conservative rule is that every expected count = n·p (for each category) should be greater than 5. If all expected counts > 5, the chi-square approximation is considered accurate enough to get the p-value from the χ² distribution. If one or more expected counts ≤ 5, don’t just run the test: combine similar categories to raise expected counts, or use a simulation/randomization approach (or an exact method) so your inference is valid. For more guided steps and examples, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd) and the Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).
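
A small sketch of the large-counts check and of combining categories, assuming a hypothetical sample of n = 40 and made-up null proportions; the helper names are illustrative only.

```python
def large_counts_ok(n, null_proportions, threshold=5):
    """True if every expected count n * p is strictly greater than the threshold."""
    return all(n * p > threshold for p in null_proportions)


def merge_categories(values, i, j):
    """Combine categories i and j by adding their values (counts or proportions)."""
    merged = [v for k, v in enumerate(values) if k not in (i, j)]
    merged.append(values[i] + values[j])
    return merged


# Hypothetical setup: n = 40 with one rare category (expected count 40 * 0.05 = 2).
n = 40
proportions = [0.50, 0.30, 0.15, 0.05]
print(large_counts_ok(n, proportions))   # False, because 40 * 0.05 = 2

# Merge the last two categories so their combined expected count is 40 * 0.20 = 8.
merged = merge_categories(proportions, 2, 3)
print(large_counts_ok(n, merged))        # True
```

If you merge null proportions this way, the observed counts for the same categories have to be merged too, and df drops because k is smaller.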

Question 7

What does it mean when they say "at least one proportion is not as specified" in the alternative hypothesis?

Accepted Answer

When the alternative hypothesis says “at least one proportion is not as specified,” it means the null gives specific proportions for every category (p1, p2, …), and the alternative just says those proportions aren’t all correct—at least one category’s true proportion differs from the null. It doesn’t tell you which category or how many are different; the chi-square goodness-of-fit test only detects whether the observed counts deviate enough overall from the expected counts (expected = n·p under H0). Remember: the chi-square statistic sums (observed − expected)^2/expected across categories, and the test uses df = k − 1. Before you infer, check AP conditions: random sample, 10% rule when sampling without replacement, and all expected counts > 5 for the large-counts rule (CED VAR-8.*). For extra review, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd) and practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 8

Can someone walk me through the steps of setting up a chi-square goodness of fit test?

Accepted Answer

Start by stating hypotheses: H0 gives the null proportions for each category (p1, p2, …, pk); Ha says at least one proportion differs. Calculate expected counts: expected = n·(null proportion) for each category (CED VAR-8.D.1). Check conditions: data from a random sample or randomized experiment; if sampling without replacement, n ≤ 10% of N; and all expected counts should be > 5 (conservative rule) so the chi-square approximation is valid (VAR-8.E.1). Compute the chi-square statistic: χ² = Σ (observed − expected)² / expected (VAR-8.A.2). Degrees of freedom = k − 1. Use the χ² distribution (right-skewed) to find the p-value and compare it to α: if p < α, reject H0 (VAR-8.A.3, VAR-8.B.1). On the AP exam you’ll get the p-value from your calculator or the chi-square table; the formulas and tables are provided. For a quick refresher, see the Fiveable study guide for Topic 8.2 (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd).

Question 9

Why do all my expected counts have to be greater than 5?

Accepted Answer

Short answer: the test statistic only approximately follows a chi-square distribution, and that approximation is good only when counts are “large enough.” The usual conservative rule the CED gives is that every expected count > 5. If expected counts are too small the chi-square approximation is poor, so the p-value and conclusions can be misleading. Why that matters: the sampling distribution of χ² = Σ((obs − exp)²/exp) gets closer to the chi-square curve as expected counts grow (the chi-square distribution itself is positive and right-skewed, and it becomes less skewed with more degrees of freedom; VAR-8.A.3 and VAR-8.E.1.b). The CED therefore asks you to compute expected = n·p and confirm that all expected counts > 5 before using the chi-square test (Topics 8.2 and 8.3). If some expected counts ≤ 5: combine similar categories if that makes sense, collect more data, or use a simulation/Monte Carlo or exact (multinomial) method instead. For quick review, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd) and practice questions (https://library.fiveable.me/practice/ap-statistics).
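
The simulation option mentioned above can be sketched as follows. This is one possible Monte Carlo approach, not a method prescribed by the CED: it draws many samples of size n from the null proportions and counts how often the simulated χ² is at least as large as the one actually observed. The function names, data, number of trials, and seed are all hypothetical choices.

```python
import random


def chi_square_statistic(observed, expected):
    """Sum of (observed - expected)^2 / expected over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def simulated_p_value(observed, null_proportions, trials=10_000, seed=0):
    """Estimate the p-value by simulating samples of size n under H0."""
    rng = random.Random(seed)
    n = sum(observed)
    k = len(observed)
    expected = [n * p for p in null_proportions]
    actual = chi_square_statistic(observed, expected)
    at_least_as_extreme = 0
    for _ in range(trials):
        # Draw n category labels using the null proportions as weights.
        draws = rng.choices(range(k), weights=null_proportions, k=n)
        counts = [0] * k
        for category in draws:
            counts[category] += 1
        if chi_square_statistic(counts, expected) >= actual:
            at_least_as_extreme += 1
    return at_least_as_extreme / trials


# Hypothetical small sample: n = 12, so the expected counts are 6, 3.6, and 2.4.
print(simulated_p_value([9, 2, 1], [0.5, 0.3, 0.2]))
```

Because the p-value comes from simulated samples rather than the chi-square curve, it does not depend on the large-counts condition; it does vary slightly with the seed and the number of trials.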

Question 10

What's the 10% rule for chi-square tests and when do I need to check it?

Accepted Answer

The 10% rule is part of the independence condition for chi-square inference: when your sample is taken without replacement from a finite population, you should check that n ≤ 0.10·N so observations can be treated as (approximately) independent. You need to check it whenever you sample without replacement from a finite population; it does not apply to sampling with replacement or to random assignment in an experiment. If the population is very large relative to the sample, the rule holds easily. Also remember the other chi-square condition: all expected counts should be > 5 (a conservative “large counts” check) before using the chi-square goodness-of-fit test (VAR-8.E in the CED). For a quick refresher on setting up these checks and calculating expected counts (n·p), see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd).

Question 11

How do I find degrees of freedom for a goodness of fit test?

Accepted Answer

For a chi-square goodness-of-fit test, degrees of freedom tell you which chi-square curve to use. The basic rule is df = (number of categories) − 1, so if you have 4 categories, df = 4 − 1 = 3. If you estimated any parameters from the data (for example, you fit a distribution and estimated one probability), subtract the number of estimated parameters: df = k − 1 − (number estimated). Why it matters: the df affect the shape of the chi-square distribution (less skew as df grows) and which critical value or p-value you read from the chi-square table. Don’t forget the conditions: expected counts should be large (conservative rule: all expected > 5) and data must be from a random sample (CED VAR-8.E). For a quick refresher, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd) and practice problems (https://library.fiveable.me/practice/ap-statistics).
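
The df rule is short enough to write as a tiny helper. The function name is made up, and the six-category case with one estimated parameter is a hypothetical example; only the four-category case comes from the answer above.

```python
def goodness_of_fit_df(num_categories, num_estimated_parameters=0):
    """df = k - 1 - (number of parameters estimated from the data)."""
    df = num_categories - 1 - num_estimated_parameters
    if df < 1:
        raise ValueError("need at least one degree of freedom")
    return df


print(goodness_of_fit_df(4))      # 3, the four-category case from the text
print(goodness_of_fit_df(6, 1))   # 4, a hypothetical fit with one estimated parameter
```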

Question 12

I don't understand how to write the hypotheses when there are multiple categories - help?

Accepted Answer

Think of the null as a full set of specified proportions and the alternative as “one or more are different.” Write H0 by listing the null proportion for every category; write Ha as the statement that at least one pi is not as specified.

Short template:
- H0: p1 = p1,0, p2 = p2,0, …, pk = pk,0 (these are the null proportions)
- Ha: At least one pi ≠ pi,0

Example (4 flavors):
- H0: p(vanilla)=0.25, p(chocolate)=0.35, p(strawberry)=0.20, p(mint)=0.20.
- Ha: At least one proportion differs from the values in H0.

Use these H0 proportions to compute expected counts = n · p (CED VAR-8.A.1 and VAR-8.D.1). Check conditions: random sample, 10% rule if without replacement, and all expected counts > 5 (CED VAR-8.E.1). For more on wording, examples, and practice problems, see Fiveable’s Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd) and Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8). Practice problems: (https://library.fiveable.me/practice/ap-statistics).

Question 13

What's the difference between observed counts and expected counts in chi-square?

Accepted Answer

Observed counts are the actual numbers you collect in each category—what you saw in the sample. Expected counts are what you would expect to see in each category if the null hypothesis were true: expected = n · (null proportion) for each category (CED VAR-8.D.1). Chi-square looks at how far observed and expected differ: χ² = Σ((observed − expected)² / expected) (CED VAR-8.A.2). Large differences make χ² big and suggest the null proportions aren’t right (CED VAR-8.B.1). Before relying on the test, check conditions: data from a random sample, 10% rule if sampling without replacement, and the conservative large-counts rule that all expected counts > 5 (CED VAR-8.E.1). For a quick refresher on setting up expected counts and examples, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd). For more practice, try problems at (https://library.fiveable.me/practice/ap-statistics).
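
To see observed and expected counts side by side, the sketch below prints each category's contribution (O − E)²/E to the statistic. The function name, the observed counts, and the null proportions are hypothetical.

```python
def contributions(observed, null_proportions):
    """Return (expected, contribution) pairs, where contribution = (O - E)^2 / E."""
    n = sum(observed)
    rows = []
    for o, p in zip(observed, null_proportions):
        e = n * p
        rows.append((e, (o - e) ** 2 / e))
    return rows


# Hypothetical observed counts and null proportions for three categories.
observed = [45, 35, 20]
null_proportions = [0.5, 0.3, 0.2]
rows = contributions(observed, null_proportions)
for o, (e, c) in zip(observed, rows):
    print(f"observed={o:3d}  expected={e:6.2f}  contribution={c:.3f}")
print("chi-square =", round(sum(c for _, c in rows), 3))
```

Categories with the largest contributions are the ones where the observed count is furthest from what H0 predicts, relative to the expected count.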

Question 14

How do I use my calculator to find the chi-square test statistic?

Accepted Answer

Short answer: enter observed and expected counts into lists, compute (O−E)^2/E for each category, then sum those values to get χ². Use df = k − 1 (k = # categories) and get the p-value with the chi-square CDF on your calculator.

TI-83/84 steps (works on most models):
1. Stat → Edit → put Observed counts in L1 and Expected counts in L2 (expected = n·p for each category).
2. Highlight the L3 header and enter the formula (L1−L2)^2/L2: press (, 2nd → L1, −, 2nd → L2, ), x², ÷, 2nd → L2, ENTER.
3. Stat → Calc → 1-Var Stats on L3 and read Σx (or just 2nd → LIST → MATH → sum( L3 )) to get χ² = sum(L3).
4. Degrees of freedom df = k − 1. For the p-value use 2nd → DISTR → chi2cdf(lower, upper, df). Upper tail p = chi2cdf(χ², 1E99, df) (or p = 1 − chi2cdf(0, χ², df)).

Don’t forget AP checks: data must be random, n ≤ 10% N if applicable, and all expected counts > 5 for the test to be reliable (CED VAR-8.E). For a step-by-step review of these ideas, see the Topic 8.2 study guide (https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd).

Question 15

Why are chi-square distributions always skewed right and positive?

Accepted Answer

Because the chi-square statistic is a sum of squared terms, its values can never be negative. Each term is (observed − expected)^2 / expected, so every piece is ≥ 0 and the whole distribution lives on the nonnegative side. The values are bounded below by 0 but have no upper bound: when H0 is true most samples give small deviations and a small χ², while an occasional sample gives a much larger one, which produces a long right tail (right skew). Degrees of freedom control how much tail you get: with small df the distribution is very skewed right; as df increases the shape becomes more symmetric and looks more like a normal. That’s why the CED says chi-square distributions are positive and skewed right, and why the skew lessens with increasing df (see Topic 8.2 study guide for more: https://library.fiveable.me/ap-statistics/unit-8/setting-up-chi-square-goodness-fit-test/study-guide/2W5HT2MbSAG4Ty1buvCd). For practice, try problems on the Unit 8 page (https://library.fiveable.me/ap-statistics/unit-8) or the practice bank (https://library.fiveable.me/practice/ap-statistics).
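
The shrinking skew can be put in numbers. A standard result (not stated in the answer above, and beyond what the AP exam requires) is that a chi-square distribution with a given df has skewness sqrt(8/df). The short sketch below, with a made-up function name, evaluates it for a few df values.

```python
import math


def chi_square_skewness(df):
    """Skewness of a chi-square distribution with the given degrees of freedom."""
    return math.sqrt(8 / df)


for df in (1, 3, 10, 50):
    print(df, round(chi_square_skewness(df), 3))
```

The skewness is always positive (right skew) and decreases toward 0 as df grows, which is the "less skewed with more df" behavior described above.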

Term	Definition
alternative hypothesis	The claim that contradicts the null hypothesis, representing what the researcher is trying to find evidence for.
categorical data	Data that represents categories or groups rather than numerical measurements, such as colors, types, or classifications.
chi-square distributions	Probability distributions used to test the goodness of fit between observed and expected categorical data, characterized by positive values and right skewness.
chi-square statistic	A test statistic that measures the distance between observed and expected counts relative to the expected counts.
chi-square test	A statistical test used to determine whether observed frequencies of categorical data match expected frequencies based on a hypothesized distribution.
degrees of freedom	A parameter of the chi-square distribution that determines its shape; for a goodness-of-fit test, df = number of categories − 1, and the distribution becomes less right-skewed as degrees of freedom increase.
distribution of proportions	The way in which proportions are spread across the categories of a categorical variable.
expected count	The count expected in a category (or in a cell of a two-way table) if the null hypothesis were true; for a goodness-of-fit test, expected count = n × null proportion.
goodness of fit	A statistical test that determines how well observed data match the expected distribution specified by a hypothesis.
independence	The condition that observations in a sample are not influenced by each other, typically ensured through random sampling or randomized experiments.
null hypothesis	The initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
null proportion	The hypothesized proportion for each category under the null hypothesis in a chi-square goodness of fit test.
observed count	The actual number of observations in a category (or in a cell of a two-way table) in the collected data.
proportion	A part or share of a whole, expressed as a fraction, decimal, or percentage.
random sample	A sample selected from a population in such a way that every member has an equal chance of being chosen, reducing bias and allowing for valid statistical inference.
randomized experiment	A study design where subjects are randomly assigned to treatment groups to establish cause-and-effect relationships.
sample size	The number of observations or data points collected in a sample, denoted as n.
sampling without replacement	A sampling method in which an item selected from a population cannot be selected again in subsequent draws.
statistical inference	The process of drawing conclusions about a population based on data collected from a sample.

📊AP Statistics Unit 8 Review

8.2 Setting Up a Chi Square Goodness of Fit Test
