Confounding in AP Statistics

Confounding occurs when the effects of two or more variables on a response are mixed together, so you cannot tell which variable actually caused the observed effect. In AP Stats, confounding is the core reason observational studies cannot establish causation, while randomized experiments can.

Verified for the 2027 AP Statistics exam•Last updated June 2026

What is Confounding?

Confounding happens when an explanatory variable and some other variable are tangled together, and both could explain the response you observed. Because their effects are mixed, you can't separate them. Picture the classic example. Coffee drinkers live longer than non-drinkers. Does coffee cause longevity? Maybe. But maybe coffee drinkers also exercise more, earn more, or smoke less. Those other variables are confounders, and they make the coffee-longevity link impossible to interpret as cause and effect.

This is exactly why the CED draws a hard line between observational studies and experiments (Topic 3.2). In an observational study, no treatments are imposed, so confounding variables run free. The groups you're comparing (coffee drinkers vs. non-drinkers) may differ in dozens of ways besides the variable you care about. In an experiment, random assignment to treatments spreads those other variables roughly evenly across groups, which is what lets you draw a causal conclusion. Confounding isn't a math error or a sampling mistake. It's a built-in feature of how the data were collected.

Why Confounding matters in AP® Statistics

Confounding lives in Unit 3 (Collecting Data), Topic 3.2, and it directly supports learning objectives 3.2.A (identify the type of a study) and 3.2.B (identify appropriate generalizations and determinations). The essential knowledge for 3.2.B states it flat out: it is not possible to determine causal relationships between variables using data collected in an observational study. Confounding is the reason why. This idea echoes through the rest of the course too. Every time you write a conclusion for inference in Units 6-9, the scope of that conclusion (causation vs. association) traces back to whether the study design controlled for confounding through random assignment. If you can name a plausible confounder and explain how it muddies the relationship, you've mastered one of the most-tested skills in Unit 3.

Keep studying AP® Statistics Unit 3

Visual cheatsheet

view gallery

Practice questions for this topic

Unit 3 study guide

Full AP® Statistics practice exam

How Confounding connects across the course

Confounder (Unit 3)

A confounder is the specific variable doing the damage, while confounding is the problem it creates. On the exam, you're often asked to name a plausible confounder and explain how it's linked to both the explanatory variable and the response.

Randomization (Unit 3)

Random assignment is the cure for confounding. By randomly placing subjects into treatment groups, an experiment balances out confounding variables (even ones nobody thought of), which is what earns the right to say 'causes' instead of 'is associated with.'

Observational Study (Unit 3)

Observational studies are where confounding thrives, because subjects sort themselves into groups instead of being assigned. That self-sorting is exactly how confounders sneak in, and it's why these studies top out at association.

Confidence Interval (Units 6-8)

When you interpret inference results later in the course, the study design from Unit 3 still controls your wording. A confidence interval from observational data supports an association claim, never a causal one, because confounding was never ruled out.

Is Confounding on the AP® Statistics exam?

Confounding shows up two ways. In multiple choice, you get a study summary (coffee drinkers live longer, students who study with music score higher, 7-8 hours of sleep links to less heart disease) and you have to spot the design flaw or pick the most plausible confounding variable. The trap answers usually let you blame sampling when the real issue is the lack of random assignment. On FRQs, confounding drives study-design questions like the 2021 walking-and-cholesterol investigation and the 2023 fiber-concrete experiment, where you justify why random assignment matters or why a conclusion can't be causal. The 2022 allergy-clinics FRQ shows the comparison version, where a third variable distorts the comparison between two groups. To earn full credit, don't just say 'there could be confounding.' Name a specific variable and explain how it connects to both the explanatory variable and the response.

Confounding vs Bias

Bias and confounding are different diseases. Bias is a flaw in how data are collected (bad sampling, leading questions, nonresponse) that makes your estimate systematically wrong. Confounding is a flaw in interpretation, where a third variable makes it impossible to tell what's causing what. A study can have a perfectly random, unbiased sample and still be riddled with confounding because it's observational. Fixing one doesn't fix the other. Random selection fights bias and lets you generalize; random assignment fights confounding and lets you infer causation.

Key things to remember about Confounding

Confounding occurs when the effects of two variables on a response are mixed together, so you can't tell which one is actually responsible.
Confounding is the reason observational studies cannot establish cause and effect, no matter how large or well-sampled they are.
Random assignment in an experiment controls confounding by spreading other variables roughly evenly across treatment groups.
A good confounder explanation names a specific variable and links it to both the explanatory variable and the response, not just one of them.
Confounding and bias are different problems; random selection reduces bias, while random assignment reduces confounding.
When writing conclusions, say 'is associated with' for observational studies and reserve 'causes' for well-designed randomized experiments.

Frequently asked questions about Confounding

What is confounding in AP Stats?

Confounding is when the effects of two or more variables get mixed together, so you can't tell which variable is causing the observed outcome. It's the central reason Topic 3.2 says observational studies can't determine causal relationships.

Can an observational study ever prove causation?

No. The CED is explicit that you cannot determine causal relationships from observational data, because confounding variables are never controlled. Only a randomized experiment, where treatments are imposed and randomly assigned, supports a causal conclusion.

How is confounding different from bias?

Bias makes your estimate systematically off-target due to flawed data collection, like undercoverage or leading questions. Confounding tangles up cause and effect because a third variable affects both groups being compared. A study can be unbiased and still confounded.

Is a confounding variable the same as a lurking variable?

They're close. Both are 'hidden' third variables, but a confounder specifically influences both the explanatory variable and the response, which is what makes their effects impossible to separate. On the AP exam, name the variable and explain both links.

How do you control for confounding in an experiment?

Random assignment of subjects to treatments. Randomization balances confounding variables, including ones the researcher never measured, across the groups. That's why the 2021 walking-and-cholesterol FRQ and the 2023 concrete-fibers FRQ center on study design choices.

Keep studying AP Statistics

Connect this key term to the AP exam workflow: review the course, practice questions, and check related study tools.

AP Statistics hub

Review units, study guides, and course resources.

AP-style practice

Check this vocabulary in multiple-choice context.

FRQ practice

Apply key concepts in written AP responses.

score calculator

Estimate the exam score you are working toward.

cheatsheets

Review the highest-yield facts before practice.

practice exam

Put the full course together before test day.