Q: I don't understand what "goodness of fit" actually means - can someone explain?

“Goodness of fit” asks: do the counts you observed in different categories match what you’d expect under a specified model? In an AP chi-square goodness-of-fit test your H0 is “the population follows the given distribution” (a specific set of category probabilities); Ha is “it doesn’t.” You compute χ² = Σ(Observed − Expected)²/Expected (VAR-8.F.1), with df = #categories − 1, and find a p-value from the chi-square distribution (or a randomization distribution) (VAR-8.F.2, VAR-8.G.1). The p-value is the probability, assuming H0 is true, of getting a χ² as large or larger than the observed (DAT-3.I.1). Compare p to α to decide to reject or fail to reject H0 (DAT-3.J). Check expected-count conditions (all expected counts reasonably large) and report context. For a quick review of steps and examples, see the Topic 8.3 study guide ( https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG )—and practice more problems on the Unit 8 page ( https://library.fiveable.me/ap-statistics/unit-8 ) or the AP practice bank ( https://library.fiveable.me/practice/ap-statistics ).

Q: Do I need to check conditions before doing a chi-square test and what are they?

Yes—you should check conditions before doing a chi-square goodness-of-fit test. AP expects you to verify that the test’s assumptions are reasonable so the chi-square distribution (or a randomization distribution) is valid. Key conditions: - Randomness/independence: data come from a random sample or randomized experiment and observations are independent (or sample ≤10% of the population if sampling without replacement). - Expected count condition: each expected count (under H0) should be at least 5 (AP often phrases this as “expected count condition”—if many expected counts are https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG ) and the Unit 8 overview ( https://library.fiveable.me/ap-statistics/unit-8 ).

Q: Why do we square the differences in the chi-square formula?

We square the differences (Observed − Expected) in χ² = Σ(Obs − Exp)²/Exp for two reasons: math and meaning. Math first—squaring makes all differences positive so large deviations don’t cancel out (positive vs. negative). It also emphasizes bigger discrepancies (a difference of 6 counts contributes 36, much larger than two differences of 3). Meaning-wise—dividing by the expected count standardizes each squared difference so categories with bigger expected counts don’t automatically dominate the total. The result is a single measure of how far the observed counts are from the null-model counts across all categories (the Pearson chi-square statistic in the CED VAR-8.F.1). That statistic’s null distribution (the χ² distribution or a randomization distribution) lets you get a p-value (CED VAR-8.G and DAT-3.I) to decide whether the pattern is unlikely under H0. For more worked examples and practice problems on carrying out the goodness-of-fit test, see the Fiveable study guide ( https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG ) and the Unit 8 overview ( https://library.fiveable.me/ap-statistics/unit-8 ).

Question 1

What's the formula for chi-square goodness of fit test?

Accepted Answer

The chi-square goodness-of-fit test statistic is χ² = Σ (Observed − Expected)² / Expected, summing over all categories. Degrees of freedom = number of categories − 1. Under H0 (the specified probability model), χ² has a chi-square distribution (or you can use a randomization distribution if you didn’t assume a model). The p-value is the probability, assuming H0 is true, of getting a χ² as large or larger than the observed value—compare that p-value to α to decide whether to reject H0. Check that all expected counts are large enough (the AP expected-count condition) before using the chi-square table or calculator. The χ² formula and table appear on the AP formula sheet; for a focused walk-through and practice problems, see the Fiveable Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and the Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).

Question 2

How do I calculate expected counts for a chi-square test?

Accepted Answer

For a goodness-of-fit test, the expected count for each category is what you’d expect under the null probability model: expected = (total sample size) × (null probability for that category). - If the null says specific proportions p1, p2, …, pk, then Ei = n × pi for each category i. - If the null says “all categories equally likely,” pi = 1/k so Ei = n/k. You use those Ei in the Pearson chi-square: χ² = Σ (Observed − Expected)² / Expected (CED VAR-8.F.1). Check the expected-count condition before using the chi-square table: typically every Ei should be at least 5 (or combine small categories). That affects whether the chi-square approximation is valid (CED: large-sample approximation). For more examples and steps, see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and try practice problems at (https://library.fiveable.me/practice/ap-statistics).

Question 3

When do I use chi-square goodness of fit vs chi-square independence test?

Accepted Answer

Use chi-square goodness-of-fit when you’re testing whether one categorical variable follows a specific distribution (one variable, k categories). Null: the population proportions equal the specified values. Test statistic: χ² = Σ (obs − exp)²/exp with df = k − 1. Use the chi-square independence test when you have two categorical variables in a two-way table and you want to know whether they’re associated (or independent). There you compute expected counts from the table margins and use χ² with df = (rows − 1)(cols − 1). Always check conditions: counts are counts (not percentages), and the expected count condition for the large-sample chi-square approximation (AP wants you to verify expected counts are large enough). Find the p-value from the chi-square distribution (or randomization if model isn’t assumed) and compare to α to reject/fail to reject. For more AP-aligned practice and worked examples, see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).

Question 4

I'm confused about degrees of freedom - is it always n-1 for goodness of fit?

Accepted Answer

Short answer: usually yes—but with an important caveat. CED-level rule (VAR-8.F.1): for a goodness-of-fit chi-square test, the default degrees of freedom = number of categories − 1. That’s what you’ll use when the null specifies the category probabilities completely (e.g., H0: categorical distribution is 20%, 30%, 50%). The caveat: if you estimate parameters of the null distribution from the data (for example, you fit a Poisson or normal and you estimate its mean from the sample), you lose extra degrees of freedom. Then df = (number of categories − 1) − (number of parameters estimated). Also remember AP allows using a randomization distribution instead of the chi-square model when appropriate (VAR-8.F.2). For more examples and practice, see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and the practice question bank (https://library.fiveable.me/practice/ap-statistics).

Question 5

Step by step how do you do a chi-square goodness of fit test?

Accepted Answer

Step-by-step (short) for a chi-square goodness-of-fit test: 1. State hypotheses. H0: the population follows the specified distribution (give probabilities); Ha: it does not. 2. Check conditions. Data are counts from a random sample (or randomization) and each expected count = n × model probability is at least 5 (large-sample approx). 3. Compute expected counts for each category: E_i = n·p_i. 4. Calculate the test statistic: χ² = Σ (Observed_i − Expected_i)² / Expected_i (CED VAR-8.F.1). Degrees of freedom = k − 1. 5. Find the p-value from the chi-square distribution (table or software) with df = k−1 (CED VAR-8.G.1). If you used a randomization distribution instead, use that null distribution. 6. Conclusion: compare p to α. If p ≤ α, reject H0 (evidence the distribution differs); if p > α, fail to reject H0 (not enough evidence)—interpret in context (CED DAT-3.I, DAT-3.J). 7. (Optional) Check standardized residuals (Obs−Exp)/√Exp to see which categories contribute most. For practice and AP-style guidance, see the Fiveable Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).

Question 6

What does it mean when my chi-square statistic is really big?

Accepted Answer

A really big chi-square statistic means the observed counts deviate a lot from the expected counts under your null model. By the formula χ² = Σ(Observed − Expected)² / Expected, big squared differences add up, so large χ² values indicate larger discrepancies across categories. Under the null (the specified distribution is true) the null distribution of χ² (with df = number of categories − 1) makes large values unlikely, so a large χ² usually gives a small p-value—meaning you’d reject H0 at typical α levels and conclude the data are inconsistent with the claimed distribution. Remember to check conditions first (expected count condition) and report df and how you found the p-value (table or software). For practice and exam-style guidance on carrying out the test and interpreting p-values, see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and the Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).

Question 7

How do I find the p-value using the chi-square table?

Accepted Answer

First compute χ² = Σ(Observed − Expected)²/Expected and df = number of categories − 1 (CED VAR-8.F). The chi-square table (AP formula sheet) gives χ² critical values for various tail probabilities p with a specified df—remember the test is an upper-tail test, so p = P(Χ² ≥ observed). To get the p-value from the table: find the row for your df, then locate two adjacent critical values that bracket your observed χ². If your χ² lies between the table entries for p = 0.10 and p = 0.05, your p-value is between 0.10 and 0.05. If χ² is larger than the largest table value listed, p is smaller than the smallest tail probability shown (e.g., p < 0.005). Compare that p-value range to α to reject/fail to reject (CED DAT-3.J). For step-by-step examples see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and more practice (https://library.fiveable.me/practice/ap-statistics).

Question 8

I don't understand what "goodness of fit" actually means - can someone explain?

Accepted Answer

“Goodness of fit” asks: do the counts you observed in different categories match what you’d expect under a specified model? In an AP chi-square goodness-of-fit test your H0 is “the population follows the given distribution” (a specific set of category probabilities); Ha is “it doesn’t.” You compute χ² = Σ(Observed − Expected)²/Expected (VAR-8.F.1), with df = #categories − 1, and find a p-value from the chi-square distribution (or a randomization distribution) (VAR-8.F.2, VAR-8.G.1). The p-value is the probability, assuming H0 is true, of getting a χ² as large or larger than the observed (DAT-3.I.1). Compare p to α to decide to reject or fail to reject H0 (DAT-3.J). Check expected-count conditions (all expected counts reasonably large) and report context. For a quick review of steps and examples, see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG)—and practice more problems on the Unit 8 page (https://library.fiveable.me/ap-statistics/unit-8) or the AP practice bank (https://library.fiveable.me/practice/ap-statistics).

Question 9

Do I need to check conditions before doing a chi-square test and what are they?

Accepted Answer

Yes—you should check conditions before doing a chi-square goodness-of-fit test. AP expects you to verify that the test’s assumptions are reasonable so the chi-square distribution (or a randomization distribution) is valid. Key conditions: - Randomness/independence: data come from a random sample or randomized experiment and observations are independent (or sample ≤10% of the population if sampling without replacement). - Expected count condition: each expected count (under H0) should be at least 5 (AP often phrases this as “expected count condition”—if many expected counts are https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and the Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).

Question 10

Why do we square the differences in the chi-square formula?

Accepted Answer

We square the differences (Observed − Expected) in χ² = Σ(Obs − Exp)²/Exp for two reasons: math and meaning. Math first—squaring makes all differences positive so large deviations don’t cancel out (positive vs. negative). It also emphasizes bigger discrepancies (a difference of 6 counts contributes 36, much larger than two differences of 3). Meaning-wise—dividing by the expected count standardizes each squared difference so categories with bigger expected counts don’t automatically dominate the total. The result is a single measure of how far the observed counts are from the null-model counts across all categories (the Pearson chi-square statistic in the CED VAR-8.F.1). That statistic’s null distribution (the χ² distribution or a randomization distribution) lets you get a p-value (CED VAR-8.G and DAT-3.I) to decide whether the pattern is unlikely under H0. For more worked examples and practice problems on carrying out the goodness-of-fit test, see the Fiveable study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and the Unit 8 overview (https://library.fiveable.me/ap-statistics/unit-8).

Question 11

How do I interpret the p-value in a chi-square goodness of fit test?

Accepted Answer

The p-value in a chi-square goodness-of-fit test is the probability, assuming the null hypothesis and the probability model are true, of getting a chi-square statistic as large (or larger) than the one you observed (CED: DAT-3.I.1). Practically: a small p-value (e.g., 0.03) means the observed counts are unlikely under H0, so you reject H0 at α = 0.05 and conclude the distribution probably differs from the specified model. A large p-value (e.g., 0.40) means the data are consistent with H0, so you fail to reject H0—you don’t “accept” H0, you just don’t have convincing evidence against it (CED: DAT-3.J.1). Remember to find p using a chi-square table or software (VAR-8.G.1), check df = categories − 1, and verify expected-count conditions before trusting the p-value. For a quick review of steps and practice, see Fiveable’s Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and lots of practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 12

When do I reject the null hypothesis in a chi-square test?

Accepted Answer

You reject the null in a chi-square goodness-of-fit when the data are too unlikely under H0—that is, when the p-value is less than your significance level α (usually 0.05). Calculate χ² = Σ(Observed − Expected)² / Expected with df = (number of categories − 1). Then either (a) find the p-value from the χ² distribution and reject H0 if p < α, or (b) compare the test statistic to the chi-square critical value: reject if χ² > χ²α,df. Also check conditions first: expected counts should be large enough for the chi-square approximation (each expected count typically ≥ 5) or use a randomization distribution otherwise. The AP exam provides the χ² table and calculator use is allowed—be ready to report χ², df, p-value, and a context sentence. For a quick topic review, see the Fiveable study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and more practice at (https://library.fiveable.me/practice/ap-statistics).

Question 13

What's the difference between observed and expected frequencies?

Accepted Answer

Observed counts are the actual numbers you get from your sample—what you actually saw in each category. Expected counts are the counts you would expect in each category if the null hypothesis (the specified distribution) were true. For goodness-of-fit, expected = (total sample size) × (proposed category probability). You use both in the chi-square statistic: χ² = Σ (Observed − Expected)² / Expected. Large differences between observed and expected increase χ² and can lead to a small p-value, meaning the data are unlikely under the null. Don’t forget the expected-count condition: each expected count should be at least about 5 for the chi-square approximation to be reliable (or use a randomization approach). Degrees of freedom = number of categories − 1. For a clear walkthrough and examples, check Fiveable’s Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG).

Question 14

Can I use my calculator to find chi-square p-values or do I have to use the table?

Accepted Answer

Yes—you can (and should) use your calculator on the AP exam to get chi-square p-values when it has the right functions. The CED allows p-values from a table or computer-generated output (VAR-8.G.1), and AP instructs you to bring a graphing calculator with statistical capabilities. Most calculators (TI-84, TI-89, Casio, etc.) have a chi-square cdf or a built-in chi-square test that returns the p-value. For example, on a TI-84 you can use chi2cdf(χ², 1E99, df) to get the upper-tail p (or 1-chi2cdf(0, χ², df)). If you don’t have that, the provided chi-square table (in the formula/tables) can give a range for the p-value—that’s acceptable but less precise. Practice using your calculator before the exam so you can compute p quickly and correctly. For a refresher on carrying out the test and examples, see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and more practice at (https://library.fiveable.me/practice/ap-statistics).

Question 15

I keep getting negative expected counts - what am I doing wrong?

Accepted Answer

You can’t get a negative expected count—if you do, you’ve made an arithmetic or setup mistake. Expected counts are counts predicted by the null model and must be ≥ 0. Check these common errors: - Wrong formula: For goodness-of-fit use Expected = n × p_i (or n/k for equal categories). For two-way tables use Expected = (row total × column total) / grand total. - Entering negative probabilities or proportions: p_i must be between 0 and 1 and sum to 1. If one p_i is negative (or you used observed − expected by mistake) you’ll get negative values. - Calculator/data-entry slip: confirm you input totals, probabilities, or margins correctly (no stray minus signs). - Mixing up standardized residuals or (Observed − Expected) with Expected itself—standardized residuals can be negative, expected counts cannot. Also remember AP CED requirements: use χ² = Σ(Observed − Expected)²/Expected and check the expected-count condition for the chi-square approximation. If you want step-by-step examples, see the Topic 8.3 study guide (https://library.fiveable.me/ap-statistics/unit-8/carrying-out-chi-square-goodness-fit-test/study-guide/XmvvVf9spR7e6xT6TPEG) and practice problems (https://library.fiveable.me/practice/ap-statistics).

Term	Definition
chi-square distribution	A probability distribution used in chi-square tests, characterized by degrees of freedom and used to determine p-values for test statistics.
chi-square test	A statistical test used to determine whether observed frequencies of categorical data match expected frequencies based on a hypothesized distribution.
degrees of freedom	A parameter of the t-distribution that affects its shape; as degrees of freedom increase, the t-distribution approaches the normal distribution.
expected count	The theoretical frequency in each cell of a contingency table that would be expected if the null hypothesis of independence or homogeneity were true.
null distribution	The probability distribution of the test statistic under the assumption that the null hypothesis is true.
null hypothesis	The initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
observed count	The actual frequency or number of observations in each cell of a contingency table from the collected data.
p-value	The probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true.
probability model	A mathematical framework that describes the probability distribution of outcomes under specified assumptions.
reject the null hypothesis	The decision made when the p-value is less than or equal to the significance level, indicating sufficient evidence against the null hypothesis.
significance level	The threshold probability (α) used to determine whether to reject the null hypothesis in a significance test.
significance test	A statistical procedure used to determine whether there is sufficient evidence to reject the null hypothesis based on sample data.
test statistic	A calculated value used to determine whether to reject the null hypothesis in a hypothesis test, computed from sample data.
theoretical distribution	A probability distribution based on a mathematical model, such as the normal distribution, used to approximate the distribution of a test statistic.

📊AP Statistics Unit 8 Review

8.3 Carrying Out a Chi Square Goodness of Fit Test

8.3 Carrying Out a Chi Square Goodness of Fit Test

Unit & Topic Study Guides

Doing The Test!

Test Statistic

Example

Degrees of Freedom

P-Value

Example

Conclusion

Vocabulary

Frequently Asked Questions

history

social science

english & capstone

arts

science

math & computer science

world languages

high school exams

honors classes

college classes

hs classes

Study Content & Tools

Company

Resources