# 2.3 Statistics for Two Categorical Variables

Written by Peter Cao

Let's look back at the two-way table from Unit 2.2. Courtesy of Starnes, Daren S. and Tabor, Josh. The Practice of Statistics for the AP Exam, 5th Edition. Cengage Publishing.

A two-way table gives us more than joint relative frequencies: we can also read off marginal relative frequencies and conditional relative frequencies.

### Marginal Relative Frequency

A marginal relative frequency is the proportion of all respondents who fall in one category of a single variable, ignoring the other variable. For example, the marginal relative frequency of "50-50 chance" is 1416/4826 ≈ 0.293: the right margin of the table shows that 1416 of the 4826 respondents gave that response.

### Conditional Relative Frequency

A conditional relative frequency, on the other hand, is the proportion of subjects in a particular category given that we already know they belong to a category of the other variable. The category we know is called the given, or independent, category, and the other is called the dependent category, just like independent and dependent variables on graphs. For example, the conditional relative frequency of "50-50 chance given male" is 720/2459 ≈ 0.293, because 720 of the 2459 males who responded said "50-50 chance." Note that the denominator of a conditional relative frequency is the total for the given category (here, 2459 males), not the overall total (4826), so it is usually considerably smaller.
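To make the difference in denominators concrete, here is a minimal Python sketch that computes both kinds of relative frequency from the four counts quoted in the text. The variable names are illustrative choices, and the rest of the table is not reproduced.

```python
# Counts quoted in the text; the remaining cells of the table are not needed here.
fifty_fifty_all = 1416    # everyone who answered "50-50 chance"
grand_total = 4826        # all respondents
fifty_fifty_male = 720    # males who answered "50-50 chance"
male_total = 2459         # all male respondents

# Marginal: divide by the overall total.
marginal = fifty_fifty_all / grand_total

# Conditional: divide by the total of the given category (males).
conditional_given_male = fifty_fifty_male / male_total

print(f"Marginal relative frequency of '50-50 chance':     {marginal:.4f}")
print(f"Conditional relative frequency given male:         {conditional_given_male:.4f}")
print(f"Denominators: {grand_total} (marginal) vs. {male_total} (conditional)")
```

The only thing that changes between the two calculations is which total goes in the denominator.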

## Determining Associations from a Two-Way Table

From a two-way table, we can use marginal and conditional relative frequencies to decide whether two categorical variables are associated. The variables are associated if the conditional relative frequency of some dependent category differs across the given categories. Equivalently, they are associated if a conditional relative frequency differs from the marginal relative frequency of that dependent category. When that happens, some values of the independent variable are more likely to yield a certain result than others, so knowing which category an individual falls in helps us predict their response. If the conditional relative frequencies match the marginal relative frequencies for every category, the variables are not associated (independent).
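The comparison described above can be sketched in Python. This is only an illustration under stated assumptions: the function names are hypothetical, the table is represented as a dictionary mapping each given category to its response counts, and the counts are made up for demonstration (they are not the table from the text).

```python
def marginal_distribution(table):
    """Relative frequency of each response, pooled over all given categories."""
    pooled = {}
    for counts in table.values():
        for response, n in counts.items():
            pooled[response] = pooled.get(response, 0) + n
    grand_total = sum(pooled.values())
    return {response: n / grand_total for response, n in pooled.items()}


def conditional_distributions(table):
    """Relative frequency of each response within each given category."""
    result = {}
    for given, counts in table.items():
        total = sum(counts.values())
        result[given] = {response: n / total for response, n in counts.items()}
    return result


def largest_gap(table):
    """Largest absolute difference between a conditional and its marginal."""
    marginal = marginal_distribution(table)
    conditionals = conditional_distributions(table)
    return max(
        abs(dist.get(response, 0.0) - marginal[response])
        for dist in conditionals.values()
        for response in marginal
    )


# Made-up counts for illustration only; NOT the table from the text.
associated = {
    "group A": {"yes": 30, "no": 70},
    "group B": {"yes": 60, "no": 40},
}
not_associated = {
    "group A": {"yes": 20, "no": 80},
    "group B": {"yes": 10, "no": 40},
}

print(largest_gap(associated))      # clearly above zero: conditionals differ
print(largest_gap(not_associated))  # zero: every conditional equals the marginal
```

A gap near zero for every cell suggests no association. With real survey data the gap is rarely exactly zero, so deciding how small counts as "roughly equal" is a judgment call at this stage of the course.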

### Example

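Using the two-way table above, compare the marginal relative frequency of "50-50 chance" (1416/4826 ≈ 0.293) with the conditional relative frequency of "50-50 chance given male" (720/2459 ≈ 0.293). The two are roughly equal, so knowing that a respondent is male tells us almost nothing about whether they answered "50-50 chance." For this response, "gender" and "opinion" look independent, or not associated. To conclude that the two variables are independent overall, the same comparison would have to hold for every other response category as well.

🎥 Watch: AP Stats - Probability: Two Way Tables, Independence, Tree Diagrams, etc.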
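As a quick numerical check of the example above, the following sketch measures how far apart the two relative frequencies are. The helper name `percentage_point_gap` is a hypothetical choice, and only the four counts quoted in the text are used.

```python
def percentage_point_gap(count_a, total_a, count_b, total_b):
    """Absolute difference between two relative frequencies, in percentage points."""
    return abs(count_a / total_a - count_b / total_b) * 100


# Marginal: 1416 of 4826 respondents; conditional: 720 of 2459 males.
gap = percentage_point_gap(1416, 4826, 720, 2459)
print(f"Gap for '50-50 chance': {gap:.2f} percentage points")
```

The gap is well under one percentage point, which is why the two frequencies are described as roughly equal. This checks a single response category only.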