---
title: "Chi-Square Tests — AP Stats Definition & Exam Guide"
description: "Chi-square tests check whether categorical data fit expected counts, testing homogeneity or independence in two-way tables. Core to AP Stats Unit 8 inference."
canonical: "https://fiveable.me/ap-stats/key-terms/chi-square-tests"
type: "key-term"
subject: "AP Statistics"
---

# Chi-Square Tests — AP Stats Definition & Exam Guide

## Definition

Chi-Square Tests are statistical methods used to determine whether there is a significant association between categorical variables. These tests help researchers assess how well observed data fit expected frequencies, allowing for conclusions about homogeneity or independence among different groups.

## Related Study Guides

- [Unit 8 Overview: Chi Square](/ap-stats/unit-8/review/study-guide/mqGqXgXbv88w7n8XuQfx)

## Review

### Related Terms

- [Categorical Variables](/ap-stats/key-terms/categorical-variables): Variables that can be divided into distinct groups or categories, such as gender, race, or yes/no responses.
- Expected Frequencies: The theoretical frequency of occurrences in each category, based on a specific hypothesis or model.
- [Degrees of Freedom](/ap-stats/key-terms/degrees-of-freedom): A parameter used in statistics that represents the number of independent values or quantities that can vary in a statistical calculation.

### Key Facts

- Chi-Square Tests can be classified into two main types: the Chi-Square Test for Homogeneity, which assesses whether different populations have the same distribution of a categorical variable, and the Chi-Square Test for Independence, which examines if two categorical variables are independent of each other.
- The test statistic is calculated using the formula $$\chi^2 = \sum \frac{(O - E)^2}{E}$$, where O represents observed frequencies and E represents expected frequencies.
- A higher Chi-Square statistic indicates a greater difference between observed and expected frequencies, suggesting a potential association between the variables being studied.
- Chi-Square Tests require a minimum sample size to ensure validity; typically, at least 5 expected observations per category is recommended to avoid unreliable results.
- When interpreting the results, a p-value less than a significance level (usually 0.05) leads to rejecting the null hypothesis, indicating that an association likely exists between the categorical variables.

### How do you determine whether to use a Chi-Square Test for Homogeneity or a Chi-Square Test for Independence?

To decide between a Chi-Square Test for Homogeneity and a Chi-Square Test for Independence, consider the nature of your data. Use the Test for Homogeneity when comparing distributions across multiple groups or populations to see if they are similar. On the other hand, choose the Test for Independence when you want to examine whether two categorical variables within one population are related or independent.

### What steps are involved in setting up and performing a Chi-Square Test, including calculating the test statistic?

Setting up a Chi-Square Test involves several steps: first, collect data and create a contingency table to display observed frequencies. Then calculate the expected frequencies based on the null hypothesis. The next step is to apply the formula $$\chi^2 = \sum \frac{(O - E)^2}{E}$$ to find the test statistic. Finally, compare the calculated Chi-Square value to critical values from the Chi-Square distribution table using the appropriate degrees of freedom to draw conclusions about your hypothesis.

### Evaluate how sample size and expected frequencies influence the validity of results from Chi-Square Tests.

Sample size and expected frequencies are critical for the validity of Chi-Square Tests. A larger sample size increases the reliability of the results, reducing variability and leading to more accurate estimates of expected frequencies. If expected frequencies in any cell of the contingency table fall below 5, it can result in misleading conclusions due to violating test assumptions. Therefore, ensuring an adequate sample size and checking expected frequencies before conducting the test is essential for producing trustworthy findings.
