Observational Study

In AP Statistics, an observational study is a study where investigators collect data on a sample without imposing treatments, either by looking back at existing data (retrospective) or following subjects forward (prospective). It can reveal associations but cannot establish cause and effect.

Verified for the 2027 AP Statistics examLast updated June 2026

What is Observational Study?

An observational study is exactly what it sounds like. Researchers watch and record. They do not assign anyone to a treatment, change anyone's behavior, or impose conditions. The CED splits observational studies into two flavors. A retrospective study looks backward at data that already exists, like pulling 500 medical records to compare aspirin users and non-users. A prospective study identifies a sample now and follows it into the future, like tracking 500 adults' sleep habits and heart disease rates for 10 years. A sample survey is also a type of observational study, since asking people questions doesn't impose a treatment on them.

The defining limitation is built right into the design. Because nobody assigned the groups, the people in them may differ in lots of ways besides the variable being studied. Those lurking differences are confounding variables, and they're the reason an observational study can show that two variables are associated but can never prove one causes the other. If you want causation, you need an experiment with random assignment.

Why Observational Study matters in AP Statistics

Observational studies live in Topic 3.2 (Introduction to Planning a Study) in Unit 3: Collecting Data, supporting two learning objectives. AP Stats 3.2.A asks you to identify the type of a study, which means classifying a scenario as observational vs. experimental and, within observational, retrospective vs. prospective. AP Stats 3.2.B asks you to identify appropriate generalizations and determinations, and this is where points are won or lost. Two rules do all the work. First, you can only generalize to a population if the sample was randomly selected (or otherwise representative) from that population. Second, you can never determine a causal relationship from observational data, no matter how strong the association looks. These two ideas come back in every inference unit, because the scope of any conclusion you draw later depends on how the data was collected in the first place. Get the full design picture in the Topic 3.2 study guide.

How Observational Study connects across the course

Experiment (Unit 3)

The experiment is the observational study's mirror image. In an experiment, researchers impose treatments and use random assignment, which balances out confounding variables and earns the right to conclude causation. One question on study design is really asking which of these two you're looking at.

Confounding Variable (Unit 3)

Confounding is the reason observational studies can't prove causation. If adults who sleep 7-8 hours also exercise more, you can't tell whether sleep or exercise is lowering heart disease rates. Exam questions love asking you to name a plausible confounder for an observational result.

Random Sampling (Unit 3)

Random sampling and random assignment do different jobs, and observational studies only get the first one. Random sampling lets you generalize from the sample to the population. It does nothing to fix the causation problem, because the groups still weren't assigned by the researcher.

Confidence Interval (Units 6-9)

When you build confidence intervals later in the course, the scope of your conclusion traces back to study design. Data from an observational study with a random sample supports a population estimate, but the interpretation still can't claim one variable causes another.

Is Observational Study on the AP Statistics exam?

Multiple-choice questions usually hand you a scenario and ask you to classify it. The 'researchers examined medical records of 500 heart attack patients' setup is a classic retrospective observational study stem. Another common angle asks which design is LEAST appropriate for establishing causality, and the answer is always the observational option. A third type describes an observational finding and asks which confounding variable most threatens the conclusion. On FRQs, study design shows up regularly. The 2021 FRQ on walking and cholesterol and the 2024 FRQ on car mileage both required reasoning about how data was collected and what conclusions that collection method allows. The two sentences that earn credit are versions of the same scope-of-inference logic: random selection lets you generalize to the population sampled, and lack of random assignment means you can describe an association but cannot claim causation.

Observational Study vs Experiment

The one-question test is whether the researchers imposed a treatment. If they assigned subjects to conditions (this group walks daily, that group doesn't), it's an experiment. If they just measured what people were already doing (recording who happened to drink coffee), it's observational. Watching is not the same as assigning, and only assigning supports a cause-and-effect conclusion. A common trap answer describes researchers 'comparing two groups' as if that makes it an experiment. It doesn't. What matters is who created the groups.

Key things to remember about Observational Study

  • In an observational study, treatments are not imposed; researchers only collect and examine data on a sample.

  • Retrospective studies look backward at existing data, while prospective studies follow a sample forward in time, and a sample survey is also a type of observational study.

  • It is not possible to determine causal relationships from data collected in an observational study, because confounding variables can explain the association.

  • You can only generalize results to a population if the sample was randomly selected from that population, and only to that specific population.

  • Random sampling answers 'who can we generalize to,' while random assignment answers 'can we claim causation,' and observational studies can have the first but never the second.

Frequently asked questions about Observational Study

What is an observational study in AP Stats?

It's a study where researchers collect data without imposing any treatment on subjects, either retrospectively (examining past data like medical records) or prospectively (following a sample into the future). It falls under Topic 3.2 in Unit 3 and can show association but not causation.

Can an observational study prove causation if the sample is random?

No. Random sampling lets you generalize to the population, but causation requires random assignment to treatments, which observational studies never have. The CED states it directly: you cannot determine causal relationships from observational data.

How is an observational study different from an experiment?

An experiment imposes treatments and uses random assignment to create comparable groups, which supports cause-and-effect conclusions. An observational study just records what's already happening, so confounding variables can always offer an alternative explanation for any association.

What's the difference between retrospective and prospective observational studies?

Retrospective studies examine data that already exists, like reviewing 500 patients' medical records for aspirin use after heart attacks. Prospective studies recruit a sample now and follow it forward, like tracking adults' sleep habits and heart disease for 10 years.

Is a survey an observational study?

Yes. A sample survey collects data from a sample to learn about a population without imposing any treatment, so the CED classifies it as a type of observational study.