Confounding Variables

A confounding variable is a variable related to both the explanatory variable and the response variable, so its effects get tangled up with the treatment's effects. On the AP Stats exam, random assignment is the tool that balances confounding variables and lets you make a causal claim.

Verified for the 2027 AP Statistics examLast updated June 2026

What is Confounding Variables?

A confounding variable is a third variable that is related to both the explanatory variable and the response variable in a study. Because its effect is mixed together with the treatment's effect, you can't tell which one actually caused the change in the response. Classic example: students who choose to use flashcards also tend to study more hours overall. If flashcard users score higher, is it the flashcards or the extra hours? Study time is confounded with study method.

In the AP Stats CED (Topic 3.5), confounding is the reason a well-designed experiment requires random assignment of treatments. Random assignment tends to balance the effects of uncontrolled variables across treatment groups, so any difference in responses can be attributed to the treatments themselves. That's the entire logic behind why experiments can support causal conclusions and observational studies usually can't. The confounder doesn't disappear; it just gets spread roughly evenly across groups so it can't favor one treatment over another.

Why Confounding Variables matters in AP Statistics

Confounding variables live in Unit 3 (Collecting Data), specifically Topic 3.5, Introduction to Experimental Design. Three learning objectives lean on this idea. AP Stats 3.5.A asks you to identify the components of an experiment, including potential confounding variables. AP Stats 3.5.B says a well-designed experiment must control potential confounding variables where appropriate. AP Stats 3.5.C explains that in a completely randomized design, random assignment tends to balance the effects of uncontrolled (confounding) variables. The concept also echoes forward into inference. Topics 7.1 and 9.1 ask you to think about whether observed patterns reflect a real effect or something else, and 'something else' often means a confounder hiding in a study without random assignment. When you write a conclusion for a significance test on means (Unit 7) or slopes (Unit 9), how the data were collected determines whether you're allowed to say 'caused.'

How Confounding Variables connects across the course

Randomization (Unit 3)

Random assignment is the antidote to confounding. It doesn't eliminate confounders, it spreads them roughly evenly across treatment groups, so differences in the response can be attributed to the treatments. This is the single most-tested link on the exam.

Control Group (Unit 3)

A control group gives you a baseline for comparison, which is one of the four elements of a well-designed experiment alongside random assignment, replication, and control of confounding variables. Without a comparison group, you can't separate the treatment's effect from everything else going on.

Causal Relationships (Units 3, 7, 9)

Confounding is the reason 'correlation is not causation' exists. An observational study can show a strong association in a scatterplot (Unit 9) or a big difference in means (Unit 7), but a confounder could be producing it. Only random assignment earns you a causal conclusion.

Double Blind Experiment (Unit 3)

Blinding handles a specific kind of confounding that comes from expectations. If subjects or researchers know who got which treatment, their beliefs can influence the response and get tangled with the treatment's real effect.

Is Confounding Variables on the AP Statistics exam?

On multiple choice, expect stems describing a study and asking why random assignment matters, or asking you to spot a design flaw. Practice questions in this style hand you a scenario (three teaching methods, 60 students, prior knowledge varies) and ask which design controls for the confounder, which is where blocking on the confounding variable comes in. On FRQs, experimental design is a regular Question 2 topic. The 2023 FRQ about adding fibers to concrete for driveway paving tested exactly this skill, designing a study so that differences in cracking can be attributed to the fibers and not to something else like soil or weather conditions. When you answer, you have to do two things in writing. First, name a plausible confounding variable specific to the context (not just 'other factors'). Second, explain the mechanism, meaning how that variable is linked to both the explanatory variable and the response. Vague answers lose the point.

Confounding Variables vs Lurking Variable

Both are third variables messing with your conclusion, but a confounding variable's effect is specifically mixed up with the explanatory variable, so the two effects can't be separated. A lurking variable is the broader idea of any unmeasured variable influencing results. On the exam, use 'confounding' when the third variable is associated with the treatment groups themselves, which is the situation random assignment is designed to fix.

Key things to remember about Confounding Variables

  • A confounding variable is related to both the explanatory variable and the response variable, so its effect cannot be separated from the treatment's effect.

  • Random assignment tends to balance confounding variables across treatment groups, which is exactly why experiments can support causal conclusions.

  • Observational studies cannot establish causation because subjects choose their own groups, leaving confounders free to drive the results.

  • When an FRQ asks for a confounding variable, name a specific variable from the context and explain how it connects to both the treatment and the response.

  • A well-designed experiment includes comparison of at least two groups, random assignment, replication, and control of potential confounding variables.

  • Blocking on a known confounder, like prior knowledge in a teaching-methods study, removes its variation from the comparison instead of just hoping randomization balances it.

Frequently asked questions about Confounding Variables

What is a confounding variable in AP Stats?

It's a variable related to both the explanatory variable and the response variable, so its effects get mixed up with the treatment's effects. Example: study method confounded with hours studied, where you can't tell which one raised the test scores.

Does random assignment get rid of confounding variables?

No, and this is a common scoring trap. Random assignment doesn't eliminate confounders; it tends to balance their effects across treatment groups so they can't systematically favor one treatment. Saying 'eliminates' instead of 'balances' can cost you on an FRQ.

What's the difference between a confounding variable and a lurking variable?

A confounding variable's effect is specifically entangled with the explanatory variable, while a lurking variable is any unmeasured third variable affecting the results. Confounding is the term the CED uses in experimental design (Topic 3.5).

Can an experiment with random assignment still have confounding?

Mostly no, that's the point of random assignment. But expectation effects can still sneak in if subjects or researchers know who got which treatment, which is why well-designed experiments often add single or double blinding on top of randomization.

How do I describe a confounding variable on an AP Stats FRQ?

Name a specific variable from the scenario, then explain its link to both the explanatory and response variables. For a driveway concrete experiment like 2023 FRQ Q2, soil or weather conditions could affect cracking severity regardless of whether fibers were added.