AP Statistics Unit 3 ReviewCollecting Data

Verified for the 2027 examCompiled by AP educators~12–15% of the exam
Pep mascot
Upgrade your Fiveable account to print any study guide

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Click below to go to billing portal → update your plan → choose Yearly→ and select "Fiveable Share Plan". Only pay the difference

Plan is open to all students, teachers, parents, etc
Pep mascot
Upgrade your Fiveable account to export vocabulary

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Plan is open to all students, teachers, parents, etc

AP Statistics Unit 3, Collecting Data, covers experimental design and random sampling across 7 topics, making up 12-15% of the AP exam, with a focus on how data collection methods determine whether conclusions are actually valid. You'll work through the difference between observational studies and experiments, why random sampling matters for reducing bias, and how to spot problems like confounding variables or undercoverage. AP Stats gets real here: topics include selecting experimental designs, inference from experiments, and what it actually means when data "tells the truth."

unit 3 review

AP Statistics Unit 3, Collecting Data, is about how you gather data and what that lets you conclude. The single biggest idea is that the method of collection determines the validity of the conclusion. Random sampling lets you generalize to a population, and random assignment lets you claim cause and effect. This unit makes up 12-15% of the AP exam, and it is the one unit where you can earn full credit with zero calculations, just careful reasoning about design.

What this unit covers

Studies, samples, and what you can conclude

  • A population is everyone or everything you care about. A sample is the subset you actually measure. Almost everything in this course is about using a sample to learn about a population.
  • In an observational study, you watch and record without imposing anything. It can be retrospective (looking back at existing data) or prospective (following individuals forward in time). A sample survey is an observational study that collects data from a sample to learn about a population.
  • In an experiment, you deliberately impose treatments on experimental units and measure a response.
  • The two big payoffs are separate. Random selection lets you generalize from sample to population. Random assignment lets you conclude causation. An observational study can never establish a causal relationship, no matter how big the sample, because variables you didn't control could explain the association.

Random sampling methods

  • A simple random sample (SRS) gives every group of a given size an equal chance of being chosen. You can get one by numbering individuals and using a random number generator, a table of random values, or drawing slips from a hat. Sampling can be with replacement (an individual can be picked more than once) or without replacement (each individual picked at most once).
  • Stratified random sampling splits the population into strata (groups that are similar within, like grade levels), then takes an SRS from every stratum. This guarantees representation of each group and tends to reduce variability.
  • Cluster sampling splits the population into clusters (groups that are ideally mini versions of the whole population, like classrooms), then randomly selects entire clusters and measures everyone in them. It is convenient when the population is spread out.
  • Systematic sampling picks every kth individual from an ordered list after a random starting point.
  • Each method has tradeoffs based on the question, the population, and the resources available. Stratified is great when groups differ from each other; cluster is great when traveling to every individual would be impractical.

Bias, the ways sampling goes wrong

  • Bias means certain responses are systematically favored over others. Bigger samples do not fix bias; a biased method just gives you a precise wrong answer.
  • Voluntary response bias happens when the sample is made of people who chose to participate. People with strong opinions opt in, so the sample misrepresents the population.
  • Undercoverage bias happens when part of the population has a reduced chance of being included, like a phone survey missing people without phones.
  • Nonresponse bias happens when individuals chosen for the sample can't be reached or refuse to answer, and those people differ from the ones who do respond.
  • Response bias comes from the measurement itself, like leading question wording, sensitive topics that push people to lie, or interviewers influencing answers.
  • A key distinction worth practicing is that nonresponse means you selected them but got nothing, while voluntary response means they selected themselves.

Designing experiments

  • The vocabulary matters. Experimental units are the individuals assigned treatments (called subjects or participants when they're people). The explanatory variable (factor) is what you manipulate, its levels or combinations of levels are the treatments, and the response variable is the outcome you measure.
  • A well-designed experiment has four pillars. Comparison of at least two treatment groups (one can be a control group). Random assignment of treatments to units. Replication, meaning more than one unit per treatment group. Control of potential confounding variables where appropriate.
  • A confounding variable is related to the explanatory variable and influences the response, so you can't tell which one caused the effect. Random assignment tends to balance confounders across groups, which is exactly why it earns you causal conclusions.
  • In a completely randomized design, treatments go to units completely at random. A randomized block design groups similar units into blocks first, then randomizes within each block. A matched pairs design is blocking with blocks of size two, or one subject receiving both treatments in random order.
  • A placebo is an inactive treatment that controls for the placebo effect. In a single-blind experiment, either the subjects or the evaluators don't know who got which treatment; in a double-blind experiment, neither group knows.

Reading experimental results

  • Statistical inference attributes conclusions about the data to the process that generated them. Random assignment lets researchers ask whether an observed difference is too large to have happened just by the luck of the random assignment.
  • A difference that is unlikely to occur by chance alone is statistically significant. Statistically significant differences between treatment groups are evidence that the treatments caused the difference, provided the design was sound.

Unit 3, Collecting Data at a glance

Method or designHow it worksStrengthWatch out for
Simple random sampleEvery group of size n equally likely; use a random number generator or slipsUnbiased, the baseline for inferenceNeeds a complete list of the population
Stratified samplingSplit into similar strata, SRS from eachGuarantees representation, less variabilityStrata must differ from each other to help
Cluster samplingRandomly pick whole clusters, measure everyone insideCheap and practical for spread-out populationsClusters should each look like the population
Systematic samplingEvery kth person after a random startEasy with an ordered listHidden patterns in the list cause bias
Convenience/voluntaryTake whoever is easy or whoever opts inNone, statisticallyStrongly biased, never generalizable
Completely randomized designTreatments assigned to all units at randomRandom assignment balances confoundersNeeds replication and a comparison group
Randomized block designGroup similar units, randomize within blocksRemoves variability from the blocking variableBlock on a variable related to the response
Matched pairs designBlocks of two, or each subject gets both treatmentsEach subject is their own comparisonRandomize treatment order or pair assignment

Why Unit 3, Collecting Data matters in AP Stats

Every inference procedure you run later in the course assumes the data were collected properly. Unit 3 is where you earn the right to make claims at all, because a confidence interval built on a voluntary response sample is just confidently wrong.

  • The course's central theme of variation runs through this unit. Random sampling introduces chance variation on purpose, which is the only kind of variation probability can model.
  • Scope of inference is a recurring exam idea. Random selection answers "who can I generalize to," random assignment answers "can I say it caused the effect," and you need both questions answered for every study you ever read.
  • Confounding explains why "correlation is not causation" is true, and this unit gives you the precise mechanism (a lurking variable tangled up with the explanatory variable) instead of just the slogan.

How this unit connects across the course

  • The association vs. causation warning from scatterplots and regression (Unit 2) gets its full explanation here. Confounding is why an observed association in observational data can't prove cause and effect.
  • Random sampling is what makes probability models apply to data. The chance behavior you study next (Unit 4) only describes samples that were actually collected by chance.
  • Sampling distributions (Unit 5) describe how statistics vary across repeated random samples, which only makes sense if the sampling was random in the first place.
  • Every inference procedure for proportions, means, chi-square, and slopes (Units 6-9) includes a "random" condition you must check, and that condition points straight back to this unit. Your conclusions in those units inherit their scope from how the data were collected.

Key formulas and procedures

This unit is design and reasoning, not computation, so the "formulas" are really procedures you must describe precisely.

  • Selecting an SRS: number the individuals 1 to N, use a random number generator (or table) to pick n distinct numbers, ignoring repeats, and sample those individuals. On FRQs, you must describe a process specific enough that someone else could replicate it.
  • Random assignment in a completely randomized design: number the units, randomly select half (or the appropriate fraction) for treatment A and the rest for treatment B, using a generator, table, or slips drawn without replacement.
  • Building a stratified sample: identify the strata, then carry out a separate SRS within each stratum.
  • Setting up a block design: form blocks of units similar on a variable expected to affect the response, then randomly assign treatments within each block.
  • Scope of inference check: random selection present means generalize to the population; random assignment present means conclude causation; neither means you can only describe the sample.

Unit 3, Collecting Data on the AP exam

This unit carries 12-15% of the exam weight, the same range as Units 1 and 2, and it shows up in both multiple choice and free response. Multiple-choice questions give you a study description and ask you to identify the sampling method, name the most likely source of bias, spot the treatments and response variable, or pick the correct scope of inference. The classic free-response question on this unit asks you to design an experiment or sampling plan from scratch, like describing how to randomly assign 40 plants to two fertilizer treatments. Full credit requires naming a concrete randomization mechanism, not just writing "randomly assign." Other FRQs hand you a flawed study and ask you to identify the flaw, explain why it biases results in a particular direction, or explain why causation can't be concluded. Vague answers lose points fast here; "the sample is biased" earns nothing without saying which bias, how it arises in this context, and which direction it likely pushes the estimate.

Essential questions

  • Why does the way data are collected matter more than how much data are collected?
  • What allows researchers to conclude that one variable causes a change in another?
  • When is it legitimate to generalize from a sample to a larger population?
  • How can randomness, which feels like the opposite of control, make conclusions more trustworthy?

Key terms to know

  • Population: the entire group of items or subjects of interest in a study.
  • Sample: the subset of the population actually selected and measured.
  • Observational study: a study where treatments are not imposed; investigators only record data, so causation can't be concluded.
  • Experiment: a study where treatments are deliberately assigned to experimental units to measure their effect on a response.
  • Simple random sample (SRS): a sample where every group of a given size has an equal chance of being chosen.
  • Strata: subgroups of a population that are similar within themselves, each sampled separately in stratified sampling.
  • Confounding variable: a variable related to the explanatory variable that also affects the response, making it impossible to separate their effects.
  • Treatment: a level or combination of levels of the explanatory variable(s) assigned to experimental units.
  • Control group: a treatment group used as a baseline for comparison, sometimes receiving a placebo.
  • Placebo: an inactive treatment given so that the psychological effect of receiving treatment is the same across groups.
  • Double-blind: a design where neither the subjects nor the people measuring the response know who received which treatment.
  • Voluntary response bias: bias from a sample made of people who chose to participate, typically those with strong opinions.
  • Undercoverage bias: bias from part of the population having a reduced chance of being included in the sample.
  • Statistically significant: describes an observed difference so large it is unlikely to have occurred by chance alone.

Common mix-ups

  • Stratified vs. cluster sampling: with strata you sample some individuals from every group; with clusters you sample every individual from some groups. Strata should be similar within and different between; clusters should each resemble the whole population.
  • Nonresponse vs. voluntary response bias: nonresponse means the researcher selected people who then couldn't be reached or refused; voluntary response means people selected themselves into the sample.
  • Random selection vs. random assignment: selection is about who gets into the study and controls generalization; assignment is about who gets which treatment and controls causal conclusions. An experiment on volunteers can prove causation but only for people like the volunteers.
  • Blocking vs. stratifying: same idea (grouping similar individuals), different setting. Stratifying happens when sampling; blocking happens when assigning treatments in an experiment.

Frequently Asked Questions

What topics are covered in AP Stats Unit 3?

AP Stats Unit 3 covers 7 topics focused on collecting data and experimental design: 3.1 Do the Data We Collected Tell the Truth, 3.2 Introduction to Planning a Study, 3.3 Random Sampling and Data Collection, 3.4 Potential Problems with Sampling, 3.5 Introduction to Experimental Design, 3.6 Selecting an Experimental Design, and 3.7 Inference and Experiments. The big ideas are how to design studies that produce trustworthy data and how to tell the difference between observational studies and experiments. See AP Stats Unit 3 for practice on all seven topics.

How much of the AP Stats exam is Unit 3?

AP Stats Unit 3 makes up 12-15% of the AP exam, making it one of the more heavily tested units. It covers collecting data through experimental design and random sampling, including how to identify bias, choose a sampling method, and draw valid conclusions from well-designed studies.

What's on the AP Stats Unit 3 progress check (MCQ and FRQ)?

The AP Stats Unit 3 progress check includes both MCQ and FRQ parts drawn from all seven unit topics, with heavy emphasis on experimental design, random sampling methods, and identifying sources of bias. MCQ questions test whether you can recognize study types and spot flaws in data collection. FRQ questions typically ask you to design a study or explain why a method does or does not support a causal conclusion. Topics like 3.4 Potential Problems with Sampling and 3.6 Selecting an Experimental Design show up most often. Practice progress check-style questions at AP Stats Unit 3.

How do I practice AP Stats Unit 3 FRQs?

AP Stats Unit 3 FRQs most often ask you to design an experiment or evaluate a sampling method, drawing on topics like 3.5 Introduction to Experimental Design, 3.6 Selecting an Experimental Design, and 3.7 Inference and Experiments. A typical question gives you a scenario and asks you to describe a completely randomized design or a block design, explain how random sampling reduces bias, or state whether a causal conclusion is justified. To practice, write out full responses and check that you name the treatment groups, explain randomization, and address potential confounding variables. You can find FRQ practice aligned to these topics at AP Stats Unit 3.

Where can I find AP Stats Unit 3 practice questions?

For AP Stats Unit 3 practice questions, including MCQ and practice test sets, head to AP Stats Unit 3. You'll find multiple-choice questions covering experimental design, random sampling, and bias, plus free-response practice across all 7 topics in the unit. When you work through MCQs, focus on questions that ask you to identify study types and spot problems with data collection methods, since those show up most on the actual exam.

How should I study AP Stats Unit 3?

Start AP Stats Unit 3 by building a clear mental map of the difference between observational studies and experiments, since that distinction drives most of the unit. From there, work through these steps: 1. Learn the sampling methods in 3.3 (simple random, stratified, cluster, systematic) and practice explaining why random sampling reduces bias. 2. Study 3.4 Potential Problems with Sampling so you can name and explain undercoverage, nonresponse, and response bias. 3. Work through 3.5 and 3.6 to understand completely randomized designs, block designs, and matched pairs, then sketch out each design type by hand. 4. Finish with 3.7 Inference and Experiments to understand when you can and cannot claim causation. For each topic, write out at least one FRQ-style explanation in your own words. Experimental design questions reward precise vocabulary, so practice using terms like control group, random assignment, and confounding variable correctly. Find practice sets at AP Stats Unit 3.