Bivariate Data in AP Statistics

Bivariate data is data that records two variables for each individual, letting you analyze the relationship between them. In AP Statistics, that means two-way tables and conditional relative frequencies for two categorical variables, or scatterplots and correlation for two quantitative variables.

Verified for the 2027 AP Statistics exam•Last updated June 2026

What is Bivariate Data?

Bivariate data is exactly what the name says. "Bi" means two, so bivariate data measures two variables on each individual in your dataset. Instead of asking "what does this one variable look like?" (that was Unit 1), you're now asking "how do these two variables relate to each other?" Does knowing one variable's value tell you anything about the other?

The analysis tools depend on what kind of variables you have. Two categorical variables (like grade level and favorite subject) go into a contingency table, where you compute marginal and conditional relative frequencies to look for association. Two quantitative variables (like hours studied and exam score) go on a scatterplot, where you describe direction, form, and strength and quantify the relationship with the correlation coefficient. Either way, the core question is the same. You're hunting for an association, a pattern where the value of one variable is connected to the value of the other.

Why Bivariate Data matters in AP® Statistics

Bivariate data is the entire premise of Unit 2: Exploring Two-Variable Data. For the categorical side, it directly supports learning objectives 2.3.A (calculate statistics for two categorical variables, meaning marginal and conditional relative frequencies from a two-way table) and 2.3.B (compare those statistics to decide whether the two variables are associated). If conditional distributions differ across groups, the variables are associated; if they're roughly the same, they're not. That single move, comparing conditional distributions, is one of the most frequently tested skills in the unit. And bivariate thinking doesn't stop in Unit 2. It comes back with inference machinery in Unit 8 (chi-square tests) and in regression inference, so the intuition you build here pays off all year.

Keep studying AP® Statistics Unit 2

Visual cheatsheet

view gallery

Practice questions for this topic

Unit 2 study guide

Full AP® Statistics practice exam

How Bivariate Data connects across the course

Contingency Table (Unit 2)

A contingency table (two-way table) is how you actually display bivariate categorical data. Rows are one variable, columns are the other, and the cells show how often each combination occurs. Every marginal and conditional relative frequency calculation starts here.

Conditional Relative Frequency (Unit 2)

Conditional relative frequencies are the test for association in bivariate categorical data. You divide a cell by its row or column total, then compare across groups. If the percentages differ noticeably between groups, the two variables are associated.

Correlation Coefficient (Unit 2)

When both variables are quantitative instead of categorical, bivariate analysis switches tools. You graph a scatterplot and use r to measure the strength and direction of a linear relationship. Same big question, different variable types.

Chi-Square Test (Unit 8)

Unit 2 lets you describe an association in a sample. The chi-square test for independence in Unit 8 lets you decide whether that association in your bivariate categorical data is statistically significant or just sampling noise. It's the inference upgrade of Topic 2.3.

Is Bivariate Data on the AP® Statistics exam?

You won't see a question that just asks "define bivariate data." Instead, the exam hands you bivariate data and tests what you do with it. For two categorical variables, expect a two-way table where you calculate marginal or conditional relative frequencies, then compare conditional distributions to judge whether the variables are associated (that's 2.3.A and 2.3.B in action). Classic MCQ traps include dividing a cell by the grand total when the question asks for a conditional proportion, which needs the row or column total instead. On FRQs, two-variable setups show up constantly, whether it's interpreting a table, describing a scatterplot, or (later in the course) running a chi-square test. When you conclude there's an association, remember the catchphrase that association is not causation. Graders look for that distinction.

Bivariate Data vs Univariate Data

Univariate data measures one variable per individual (Unit 1 territory, like a histogram of test scores). Bivariate data measures two variables per individual so you can study the relationship between them. Quick check: if the question is about a relationship, association, or one variable predicting another, you're in bivariate land.

Key things to remember about Bivariate Data

Bivariate data records two variables for each individual, and the goal is to describe the relationship between those variables.
Two categorical variables get analyzed with a two-way table using marginal and conditional relative frequencies (LO 2.3.A).
Two variables are associated if the conditional relative frequencies differ across groups; if the conditional distributions look the same, there's no association (LO 2.3.B).
Two quantitative variables get analyzed with a scatterplot and the correlation coefficient instead of a table.
Finding an association in bivariate data never proves causation; only a well-designed experiment can do that.
The descriptive bivariate tools from Unit 2 return as inference procedures later, with chi-square tests for categorical pairs in Unit 8.

Frequently asked questions about Bivariate Data

What is bivariate data in AP Stats?

Bivariate data is data where two variables are measured for each individual, like recording both class year and sport for every student. In AP Stats Unit 2, you analyze it with two-way tables (categorical) or scatterplots (quantitative) to look for an association.

What's the difference between bivariate and univariate data?

Univariate data measures one variable per individual and describes a single distribution (Unit 1). Bivariate data measures two variables per individual and asks whether they're related (Unit 2).

Does an association in bivariate data mean one variable causes the other?

No. Association just means the conditional distributions differ or the variables move together. Causation requires a randomized experiment, which is why "correlation does not imply causation" is a phrase graders expect you to know.

Is bivariate data always two quantitative variables?

No. Bivariate data can be two categorical variables (analyzed with contingency tables, Topic 2.3), two quantitative variables (scatterplots and correlation), or one of each. The variable types determine which tool you use, not whether the data counts as bivariate.

How do you tell if two categorical variables are associated?

Compute conditional relative frequencies, like the percent in each category within each row, and compare across rows. If the percentages differ meaningfully between groups, the variables are associated; if they're nearly identical, they're not.

Keep studying AP Statistics

Connect this key term to the AP exam workflow: review the course, practice questions, and check related study tools.

AP Statistics hub

Review units, study guides, and course resources.

AP-style practice

Check this vocabulary in multiple-choice context.

FRQ practice

Apply key concepts in written AP responses.

score calculator

Estimate the exam score you are working toward.

cheatsheets

Review the highest-yield facts before practice.

practice exam

Put the full course together before test day.