Bivariate data is data that records two variables for each individual, letting you analyze the relationship between them. In AP Statistics, that means two-way tables and conditional relative frequencies for two categorical variables, or scatterplots and correlation for two quantitative variables.
Bivariate data is exactly what the name says. "Bi" means two, so bivariate data measures two variables on each individual in your dataset. Instead of asking "what does this one variable look like?" (that was Unit 1), you're now asking "how do these two variables relate to each other?" Does knowing one variable's value tell you anything about the other?
The analysis tools depend on what kind of variables you have. Two categorical variables (like grade level and favorite subject) go into a contingency table, where you compute marginal and conditional relative frequencies to look for association. Two quantitative variables (like hours studied and exam score) go on a scatterplot, where you describe direction, form, and strength and quantify the relationship with the correlation coefficient. Either way, the core question is the same. You're hunting for an association, a pattern where the value of one variable is connected to the value of the other.
Bivariate data is the entire premise of Unit 2: Exploring Two-Variable Data. For the categorical side, it directly supports learning objectives 2.3.A (calculate statistics for two categorical variables, meaning marginal and conditional relative frequencies from a two-way table) and 2.3.B (compare those statistics to decide whether the two variables are associated). If conditional distributions differ across groups, the variables are associated; if they're roughly the same, they're not. That single move, comparing conditional distributions, is one of the most frequently tested skills in the unit. And bivariate thinking doesn't stop in Unit 2. It comes back with inference machinery in Unit 8 (chi-square tests) and in regression inference, so the intuition you build here pays off all year.
Keep studying AP Statistics Unit LZt3oAfYhsf8qaSW
Contingency Table (Unit 2)
A contingency table (two-way table) is how you actually display bivariate categorical data. Rows are one variable, columns are the other, and the cells show how often each combination occurs. Every marginal and conditional relative frequency calculation starts here.
Conditional Relative Frequency (Unit 2)
Conditional relative frequencies are the test for association in bivariate categorical data. You divide a cell by its row or column total, then compare across groups. If the percentages differ noticeably between groups, the two variables are associated.
Correlation Coefficient (Unit 2)
When both variables are quantitative instead of categorical, bivariate analysis switches tools. You graph a scatterplot and use r to measure the strength and direction of a linear relationship. Same big question, different variable types.
Chi-Square Test (Unit 8)
Unit 2 lets you describe an association in a sample. The chi-square test for independence in Unit 8 lets you decide whether that association in your bivariate categorical data is statistically significant or just sampling noise. It's the inference upgrade of Topic 2.3.
You won't see a question that just asks "define bivariate data." Instead, the exam hands you bivariate data and tests what you do with it. For two categorical variables, expect a two-way table where you calculate marginal or conditional relative frequencies, then compare conditional distributions to judge whether the variables are associated (that's 2.3.A and 2.3.B in action). Classic MCQ traps include dividing a cell by the grand total when the question asks for a conditional proportion, which needs the row or column total instead. On FRQs, two-variable setups show up constantly, whether it's interpreting a table, describing a scatterplot, or (later in the course) running a chi-square test. When you conclude there's an association, remember the catchphrase that association is not causation. Graders look for that distinction.
Univariate data measures one variable per individual (Unit 1 territory, like a histogram of test scores). Bivariate data measures two variables per individual so you can study the relationship between them. Quick check: if the question is about a relationship, association, or one variable predicting another, you're in bivariate land.
Bivariate data records two variables for each individual, and the goal is to describe the relationship between those variables.
Two categorical variables get analyzed with a two-way table using marginal and conditional relative frequencies (LO 2.3.A).
Two variables are associated if the conditional relative frequencies differ across groups; if the conditional distributions look the same, there's no association (LO 2.3.B).
Two quantitative variables get analyzed with a scatterplot and the correlation coefficient instead of a table.
Finding an association in bivariate data never proves causation; only a well-designed experiment can do that.
The descriptive bivariate tools from Unit 2 return as inference procedures later, with chi-square tests for categorical pairs in Unit 8.
Bivariate data is data where two variables are measured for each individual, like recording both class year and sport for every student. In AP Stats Unit 2, you analyze it with two-way tables (categorical) or scatterplots (quantitative) to look for an association.
Univariate data measures one variable per individual and describes a single distribution (Unit 1). Bivariate data measures two variables per individual and asks whether they're related (Unit 2).
No. Association just means the conditional distributions differ or the variables move together. Causation requires a randomized experiment, which is why "correlation does not imply causation" is a phrase graders expect you to know.
No. Bivariate data can be two categorical variables (analyzed with contingency tables, Topic 2.3), two quantitative variables (scatterplots and correlation), or one of each. The variable types determine which tool you use, not whether the data counts as bivariate.
Compute conditional relative frequencies, like the percent in each category within each row, and compare across rows. If the percentages differ meaningfully between groups, the variables are associated; if they're nearly identical, they're not.