Question 1

What's the difference between sampling with replacement and without replacement?

Accepted Answer

Sampling with replacement vs. without replacement is simple but important (CED DAT-2.C.1). - With replacement: once you pick an item (or person), you put it back so it can be picked again. Each draw is independent and has the same probabilities every time. This is often used in probability models and some simulations. - Without replacement: once an item is picked, it can’t be picked again. Draws are dependent because the composition of the population changes after each pick. Many real-world surveys (like drawing names from a hat or selecting people for an SRS) use without replacement. Why it matters for AP Stats: an SRS requires every group of a given size to have equal chance (DAT-2.C.2)—you can get an SRS either by ignoring repeats from a random-number list (effectively without replacement) or by using replacement when the problem specifies it. For calculations, with replacement → independent trials (use binomial models); without replacement → dependent trials (use hypergeometric unless n is ≤10% of the population, then dependence is negligible). Practice problems and more on sampling are in the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew), the Unit 3 overview (https://library.fiveable.me/ap-statistics/unit-3), and lots of AP-style practice (https://library.fiveable.me/practice/ap-statistics).

Question 2

How do I know if a study is using simple random sampling or not?

Accepted Answer

Simple random sampling (SRS) means every possible group of the chosen size has an equal chance of being selected (CED DAT-2.C.2). To tell if a study used an SRS, ask these quick checks: - Did they number every individual in the population and use a random number generator or table to pick numbers? (Yes → likely SRS.) - Did they pick individuals one at a time ignoring repeats (sampling without replacement)? That’s consistent with SRS. - If they picked entire groups (regions, classrooms) at random and surveyed everyone in those groups, that’s cluster sampling, not SRS (CED DAT-2.C.4). - If they split the population into homogeneous strata and then sampled randomly within each stratum, that’s stratified sampling (CED DAT-2.C.3). - If they used a fixed interval (every 10th name after a random start), that’s systematic sampling (CED DAT-2.C.5). - If they surveyed everyone, it’s a census (CED DAT-2.C.6). So look for language about “every individual numbered and chosen by random number/table” or explicit statements of cluster/strata/systematic methods. For more examples and practice, check the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew), the Unit 3 overview (https://library.fiveable.me/ap-statistics/unit-3), and practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 3

When do I use stratified sampling vs cluster sampling?

Accepted Answer

Use stratified sampling when you want guaranteed, precise representation of known subgroups (strata). Strata are formed so members are similar (homogeneous) on a key characteristic (e.g., grade level, gender). Then take an SRS from each stratum and combine. Stratified reduces variability and improves estimates for those subgroups—use it when you care about subgroup estimates or when a subgroup is rare. Use cluster sampling for practicality and cost when the population is naturally grouped into clusters that each look like the population (heterogeneous within, similar across clusters)—for example, schools or city blocks. Randomly select a few clusters, then collect data from all units in those clusters. Cluster is efficient for widespread populations but usually has larger sampling error than stratified. On the AP exam you should identify which method fits a description (CED DAT-2.C: stratified = homogeneous strata + SRS within strata; cluster = SRS of clusters + all units). For a quick review see the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and try practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 4

What's the formula for systematic random sampling and how do I calculate the interval?

Accepted Answer

Systematic random sampling: pick a random starting position r (1 ≤ r ≤ k) and then select every k-th item: r, r + k, r + 2k, … until you have n items. The interval k is k = N / n (rounded appropriately). If N/n isn’t an integer, use k = floor(N/n) or round and make sure your random start r is chosen from 1 to k so every unit has an (approximately) equal chance. This is sampling without replacement (you don’t pick the same item twice). Example: population N = 1,000, want n = 50 ⇒ k = 1000/50 = 20. Choose a random start r from 1–20 (say r = 7), then pick 7, 27, 47, …, 987. On the AP CED this is DAT-2.C.5: you must show a random start and fixed periodic interval. For more review and examples see the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and extra practice (https://library.fiveable.me/practice/ap-statistics).

Question 5

I'm confused about strata vs clusters - can someone explain the difference?

Accepted Answer

Short version: strata are groups you form to make your sample more precise by sampling within each group (homogeneous within); clusters are groups you form to make sampling easier by randomly selecting whole groups and measuring everyone in them (heterogeneous within). Details tied to the CED: - Stratified random sample (DAT-2.C.3): divide the population into strata that share a key attribute (e.g., grade level). Then do an SRS inside each stratum and combine those results. Strata are chosen because units within a stratum are similar (low variability)—this reduces sampling error. - Cluster sample (DAT-2.C.4): split the population into clusters that each look like the population (e.g., city blocks). Randomly select some clusters, then measure every unit in those clusters. Clusters should be heterogeneous within and similar to each other. This saves cost/time. When to use which: use stratified sampling when you want more precise estimates for known subgroups; use cluster sampling when a full list of individuals is hard to get but clusters are easy to sample. For review, see the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and more unit resources (https://library.fiveable.me/ap-statistics/unit-3). For practice, try problems at (https://library.fiveable.me/practice/ap-statistics).

Question 6

How do I identify which sampling method was used in a word problem?

Accepted Answer

Start by looking for keywords and how the population is broken up or chosen—that tells you the method. Quick checklist (use CED terms): - Simple random sample (SRS): every group of size n has equal chance; look for “randomly select individuals,” “random number generator,” or “numbered list” (DAT-2.C.2). - Stratified: population split into strata (homogeneous groups) and you SRS within each stratum (look for “by grade,” “by gender,” etc.) (DAT-2.C.3). - Cluster: population split into clusters that are mini-populations (heterogeneous within); you randomly pick whole clusters and sample everyone in them (look for “select 5 schools and survey all students”) (DAT-2.C.4). - Systematic: “random start” then every k-th item (e.g., every 10th name) (DAT-2.C.5). - Census: everyone is measured. - Convenience: volunteers or whoever’s easiest. Also note replacement vs without replacement: does an item/person get picked more than once? (DAT-2.C.1). On the exam, explicitly tie wording to these definitions (DAT-2.C). For a quick refresher and examples, check the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and more unit review (https://library.fiveable.me/ap-statistics/unit-3).

Question 7

What are the advantages and disadvantages of each sampling method for the AP exam?

Accepted Answer

Short answer: each method trades bias, precision, and practicality—know those trade-offs and how the CED definitions apply so you can identify or justify methods on the exam. - Simple random sample (SRS): every group of size n equally likely. - Advantages: unbiased, easy to justify for inference (foundation for many procedures). - Disadvantages: can be costly or impractical for large/population-wide lists. - Stratified random sample (strata = homogeneous groups): SRS within strata. - Advantages: more precise estimates when strata differ (reduces variability), ensures representation of key subgroups. - Disadvantages: requires good strata info and correct allocation; mis-stratifying can hurt. - Cluster sample (clusters = mini-populations, heterogeneous within): - Advantages: efficient and cheaper for geographically spread populations (sample all units in selected clusters). - Disadvantages: larger sampling variability than SRS if clusters differ; risk of undercoverage. - Systematic sample (random start + fixed interval): - Advantages: easy to implement; roughly like SRS if no periodicity. - Disadvantages: can be biased if list has periodic patterns. - Census: - Advantages: measures whole population (no sampling error). - Disadvantages: costly, slow, nonresponse and measurement errors still matter. - With vs without replacement: - Without replacement is usual in surveys (no duplicate units); with replacement simpler theoretically but rarely used in practice. On the AP exam you’ll be asked to identify methods and explain appropriateness (CED DAT-2.C, DAT-2.D). For quick prep, review the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and practice many questions (https://library.fiveable.me/practice/ap-statistics).

Question 8

When would you use a census instead of sampling?

Accepted Answer

Use a census when you need data from every unit in the population rather than an estimate from a sample. The CED defines a census as selecting all items/subjects in a population (DAT-2.C.6). A census is appropriate when the population is small and reachable (e.g., all 25 students in a classroom), when legal/administrative reasons require full counts (like a national census), or when you need complete accuracy and can’t tolerate sampling variability. Remember trade-offs: a census removes sampling variability but can be expensive, slow, and still vulnerable to measurement error or nonresponse. For large or hard-to-reach populations, a properly designed sample (SRS, stratified, cluster, or systematic) is usually more practical and aligns with AP objectives about choosing methods and justifying them (DAT-2.D.1, DAT-2.C.*). For a quick refresher, check the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and try practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 9

How do I solve problems about random number generators and simple random samples?

Accepted Answer

Think of random-number-generator (RNG) problems as a 3-step checklist: label, generate, decide. 1. Label the population (CED: SRS basis). Give each unit a unique number with the same number of digits. If you’re sampling without replacement, ignore repeats; with replacement, keep repeats. State which you’re doing—that matters for probabilities (DAT-2.C.1). 2. Use the RNG or table to pick numbers. Start at a random place and read blocks of digits equal to your labels. If a number is outside the label range or a repeat you’re ignoring, skip it and continue (DAT-2.C.2; systematic/random-start ideas are related in DAT-2.C.5). 3. Check SRS conditions: every group of the given size must be equally likely. If you stratify, cluster, or use systematic selection, explain why that’s not an SRS (DAT-2.C.3–5). For probabilities, treat selections as binomial when sampling with replacement or when n is small relative to population and sampling is effectively independent (use n p for expected counts). For practice, work through the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and try many problems on Fiveable’s practice page (https://library.fiveable.me/practice/ap-statistics). Those mirror AP exam wording and conditions you’ll be tested on.

Question 10

What's the step-by-step process for stratified random sampling?

Accepted Answer

Step-by-step for stratified random sampling (AP Stats style): 1. Define the population and the variable(s) you care about. 2. Identify relevant strata—groups that are homogeneous with respect to the key characteristic(s) (e.g., grade level, gender, region). Explain why strata reduce variability. 3. Decide sample size n and whether you’ll sample with or without replacement (CED notes without replacement is common). 4. Determine how many from each stratum: proportional allocation (ni = n·(size of stratum / population)) or equal/other allocation if justified. 5. Within each stratum, create a sampling frame (list individuals) and use a simple random sample (SRS) to pick ni units (random number generator, table, etc.). 6. Combine the selected units from all strata to form the full sample. 7. Check for potential bias and document method (helps you justify appropriateness on the exam: identify sampling method, cite homogeneity within strata, and SRS within strata). For more practice and AP-aligned reminders, see the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and Unit 3 overview (https://library.fiveable.me/ap-statistics/unit-3). For lots of practice problems, use (https://library.fiveable.me/practice/ap-statistics).

Question 11

I don't understand when cluster sampling is appropriate vs when it's not good to use

Accepted Answer

Use cluster sampling when it's practical and the population naturally falls into groups (clusters) that look like mini–populations—and each cluster is internally diverse (heterogeneous) while clusters are similar to each other. Example: dividing a city into neighborhoods, randomly selecting some neighborhoods, then surveying every household in those selected neighborhoods. That matches DAT-2.C.4: pick clusters at random and collect from all units in chosen clusters. Don't use cluster sampling when clusters are very different from one another or when clusters are internally homogeneous for the characteristic you care about—that can give biased, unrepresentative results and inflate sampling error. Also avoid it when you can easily do a simple random or stratified sample (those give better precision if you care about subgroups). Quick tip for the AP: identify cluster sampling by “randomly choose groups, then sample everyone in chosen groups.” See the Topic 3.3 study guide for examples (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and practice more problems at (https://library.fiveable.me/practice/ap-statistics).

Question 12

How do I know if the groups in a problem are homogeneous or heterogeneous?

Accepted Answer

Homogeneous groups have members who share the same key characteristic(s); heterogeneous groups contain a mix of different types. In AP terms: stratified sampling splits the population into strata that are homogeneous (e.g., males vs. females, grade levels) and then you take an SRS from each stratum. Cluster sampling splits the population into clusters that should be heterogeneous within and similar to one another (each cluster is like a mini-population), and you sample whole clusters. Quick checklist to decide: - Were groups formed by a shared attribute that’s relevant to the question? → homogeneous → stratified. - Are groups natural, intact mini-populations (each contains a mix like the whole pop) and clusters look alike? → heterogeneous within clusters → cluster sampling. - If unsure, ask: “Does the grouping reduce within-group variability for the thing I care about?” If yes, it’s homogeneous. This matches DAT-2.C (strata = homogeneous, clusters = heterogeneous). For more examples and practice, see the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and extra practice (https://library.fiveable.me/practice/ap-statistics).

Question 13

What's the difference between systematic sampling and simple random sampling?

Accepted Answer

Simple random sample (SRS): every possible group of the chosen size has an equal chance of being selected. Practically, you number the population and pick numbers with a random number generator or table (sampling usually without replacement). That guarantees no systematic pattern and is the basis for many inference procedures on the AP exam (CED DAT-2.C.2). Systematic random sample: pick a random starting point, then select every k-th unit (random start + fixed interval)—e.g., every 10th name after a random start (CED DAT-2.C.5). It’s easier to carry out but can introduce bias if the list has a periodic pattern that lines up with your interval. Key difference: SRS gives equal chance to every group; systematic uses a random start plus a regular interval. For more examples and AP-style practice, see the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 14

Can someone explain why you would divide a population into strata vs clusters?

Accepted Answer

You divide into strata when you want to reduce variability and make estimates more precise: strata are homogeneous groups (same characteristic, e.g., grade level), you take a simple random sample from each stratum, then combine results (CED: stratified random sample, DAT-2.C.3). Use strata when a characteristic is strongly related to the outcome so sampling within each group gives better representation. You divide into clusters when you need practicality or lower cost: clusters are mini-populations that should be heterogeneous internally and similar to each other (CED: cluster sample, DAT-2.C.4). You randomly pick whole clusters and survey all units in them—good when the population is geographically spread out, but it usually gives less precision than stratified sampling. On the AP exam you’ll be asked to identify which method fits a description (DAT-2.C and DAT-2.D). For a quick review see the Topic 3.3 study guide (https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew) and practice lots of examples (https://library.fiveable.me/practice/ap-statistics).

Question 15

How do I determine if a sampling method is appropriate for a given situation on the FRQ?

Accepted Answer

Ask three quick questions for the situation: Who is the population? What exactly are you trying to estimate/compare? What resources or constraints matter (time, cost, access)? Then match a sampling method to those answers using CED ideas. Steps to decide on the FRQ: - Check representativeness and bias: Will the method give every relevant unit a chance? If not, it’s likely biased (avoid convenience or voluntary samples unless justified). - Match method to goal/structure: If every small group should be represented, use stratified (homogeneous strata, DAT-2.C.3). If natural groups exist and you’ll survey whole groups, cluster is OK (heterogeneous within clusters, DAT-2.C.4). If every k-th name is chosen after a random start, that’s systematic (DAT-2.C.5). If every possible sample of size n must be equally likely, say SRS (DAT-2.C.2). - Consider practicality: census (DAT-2.C.6) is best but often infeasible; clusters can save cost but increase variability. - State replacement: note if sampling with or without replacement matters (DAT-2.C.1). - Justify: in your answer say why the method reduces bias or fits logistics, mention any trade-offs per DAT-2.D.1. Use AP wording (identify method, justify why appropriate/ not) and check practice FRQs on Fiveable (study guide: https://library.fiveable.me/ap-statistics/unit-3/random-sampling-data-collection/study-guide/nQz8XwRMmIKKBS59qrew; unit overview: https://library.fiveable.me/ap-statistics/unit-3).

Term	Definition
census	A data collection method that selects all items or subjects in a population.
cluster	Concentrations of data usually separated by gaps in a distribution.
cluster sample	A sampling method in which a population is divided into smaller groups called clusters, and a simple random sample of clusters is selected, with data collected from all observations in the selected clusters.
population	The entire group of individuals or items from which a sample is drawn and about which conclusions are to be made.
random number generator	A tool or method used to randomly select items from a population for inclusion in a simple random sample.
sample	A subset of individuals or items selected from a population for the purpose of data collection and analysis.
sampling method	A specific procedure or technique used to select a subset of individuals from a population for data collection and analysis.
sampling with replacement	A sampling method in which an item selected from a population can be selected again in subsequent draws.
sampling without replacement	A sampling method in which an item selected from a population cannot be selected again in subsequent draws.
strata	Separate groups within a population created by dividing it based on shared attributes or characteristics for stratified sampling.
stratified random sample	A sampling method in which a population is divided into separate groups called strata based on shared characteristics, and a simple random sample is selected from each stratum.
systematic random sample	A sampling method in which sample members are selected from a population according to a random starting point and a fixed, periodic interval.

📊AP Statistics Unit 3 Review

3.3 Random Sampling and Data Collection

3.3 Random Sampling and Data Collection

Unit & Topic Study Guides

Types of Non-Biased Sampling Methods

Simple Random Sample (SRS)

Choosing an SRS using a TI-84 Calculator:

Stratified Random Sample

Example

Cluster Sample

Example

Systematic Random Sample

Example

Practice Problem

Answer

Vocabulary

Frequently Asked Questions

history

social science

english & capstone

arts

science

math & computer science

world languages

high school exams

honors classes

college classes

hs classes