Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Sampling methods form the backbone of statistical inference—and the AP Statistics exam tests whether you understand why certain methods produce valid conclusions while others don't. You're not just being asked to identify "stratified" versus "cluster" sampling; you're being tested on whether you can explain how each method affects bias, variability, and the validity of inference. Every confidence interval, hypothesis test, and chi-square procedure you'll encounter assumes data was collected properly, so understanding sampling is prerequisite knowledge for Units 5–9.
The key distinction that drives this entire topic is probability sampling versus non-probability sampling. Probability methods give every member of the population a known, non-zero chance of selection—this is what allows us to make legitimate generalizations. When you see questions about the 10% condition, independence assumptions, or why a study's conclusions might be flawed, you're applying sampling concepts. Don't just memorize definitions—know what makes each method statistically valid (or not) and when each is most appropriate.
These methods give every population member a known chance of selection, which is the fundamental requirement for valid statistical inference. Without probability sampling, confidence intervals and hypothesis tests lose their mathematical foundation.
Compare: Stratified vs. Cluster sampling—both divide populations into groups, but stratified samples from every group while cluster samples entire groups. Stratified reduces variability; cluster often increases it. If an FRQ describes dividing by geography and sampling whole areas, that's cluster. If it describes ensuring representation of subgroups, that's stratified.
These methods do not give every population member a chance of selection, which means results cannot be generalized to the population through statistical inference. The AP exam frequently tests whether you can identify these as sources of bias.
Compare: Stratified sampling vs. Quota sampling—both aim for proportional representation of subgroups, but stratified uses random selection within strata while quota uses researcher judgment. Only stratified supports valid statistical inference.
Real-world studies often combine methods to balance statistical validity with practical constraints. The AP exam expects you to recognize these hybrid approaches.
Compare: Simple cluster vs. Multistage sampling—cluster samples everyone in selected clusters, while multistage adds another random selection step within clusters. Multistage typically produces more precise estimates but requires more complex analysis.
This conceptual division determines whether your inference is mathematically valid. Every inference procedure in Units 6–9 assumes probability sampling.
Compare: Probability vs. Non-probability sampling—the distinction isn't about sample size or effort; it's about whether randomization determines selection. A carefully designed convenience sample of 10,000 is still biased, while a proper SRS of 100 supports valid inference.
| Concept | Best Examples |
|---|---|
| Equal probability of selection | Simple Random Sample |
| Reducing variability through subgroups | Stratified Random Sample |
| Cost-effective for spread-out populations | Cluster Sample, Multistage Sampling |
| Required for chi-square homogeneity test | Stratified Random Sample |
| Sources of selection bias | Convenience Sample, Voluntary Response Sample |
| Appears representative but isn't random | Quota Sample |
| Supports valid statistical inference | SRS, Stratified, Cluster, Systematic (all probability methods) |
| Risk from periodic patterns in lists | Systematic Random Sample |
A researcher divides a city into neighborhoods, randomly selects 5 neighborhoods, and surveys every household in those neighborhoods. A second researcher divides residents by income level and randomly selects participants from each income group. Which method will likely produce estimates with lower variability, and why?
An online poll asks visitors to a news website to vote on a political issue and receives 50,000 responses. Why can't we construct a valid 95% confidence interval for the population proportion from this data, despite the large sample size?
Both stratified and quota sampling aim to ensure representation of subgroups. What specific feature distinguishes them, and how does this affect the validity of inference?
A study uses systematic sampling with from an alphabetized list of employees. Under what circumstance might this method introduce bias, and how could the researchers check for this problem?
For a chi-square test of homogeneity comparing customer satisfaction across three store locations, the CED specifies that data should be collected using a stratified random sample or randomized experiment. Explain why a cluster sample (randomly selecting entire stores) would be inappropriate for this inference.