Question 1

How do I compare two histograms for my AP Stats exam?

Accepted Answer

Look at the two histograms and describe the same five features for each, then compare them directly: shape, center, spread (variability), outliers/gaps, and any clusters or modes. - Shape: note symmetry vs. skew (e.g., “both right-skewed; B is more strongly skewed”). - Center: give a measure (median or mean) and say which histogram has the larger center (e.g., “median around 120 ppm for B vs. ~75 ppm for A”). - Spread: compare range or IQR and mention which is more variable (e.g., “A has a wider range; B is tighter”). - Outliers/gaps: point out any isolated bars or empty intervals. - Modes/clusters: note unimodal/bimodal or clusters. Finish with a one-sentence conclusion in context (what that difference means for the variable). Use side-by-side boxplots or report means/SDs if asked for numerical comparison on the exam (Skill 2.D). For a quick Topic 1.9 review, see the Fiveable study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and more practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 2

What's the difference between side-by-side boxplots and regular boxplots?

Accepted Answer

A regular (single) boxplot shows one quantitative distribution with its five-number summary: min, Q1, median, Q3, max (and any outliers). A side-by-side boxplot just draws two or more boxplots on the same scale so you can compare independent samples directly. Use side-by-side boxplots when you want to compare center (medians), spread (IQRs), skewness, gaps/clusters, and outliers across groups—exactly what UNC-1.N in the CED expects (compare two or more independent samples). Tip: always line them up on the same axis so medians and IQRs are easy to read; look for differences in medians (center), IQR/whisker length (variability), and isolated points (outliers).

Question 3

When do I use mean vs median to compare two data sets?

Accepted Answer

Use the mean when the distribution is roughly symmetric with no strong outliers—the mean (with SD) summarizes center and spread well and is what inference about means uses on the exam. Use the median when the distribution is skewed or has outliers—the median (with IQR or range) gives a resistant measure of center. Quick rule of thumb: symmetric → mean ± SD; skewed/outliers → median and IQR. When comparing two samples on the AP exam, describe shape, center, and spread (and mention outliers/clusters/gaps)—e.g., “Sample A is right-skewed with a larger median; Sample B is roughly symmetric with a higher mean.” If you’re unsure or distributions differ, report both mean and median and justify which you prefer. For practice comparing graphs and summaries (histograms, side-by-side boxplots, mean/median, SD/IQR) see the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and the Unit 1 overview (https://library.fiveable.me/ap-statistics/unit-1). For more problems, try the practice set (https://library.fiveable.me/practice/ap-statistics).

Question 4

I'm confused about how to describe the shape, center, and spread when comparing distributions - can someone help?

Accepted Answer

Think of comparing distributions as describing three things for each group, then stating how they differ. 1) Shape—note symmetry vs. skewness (right/left), modality (uni/bi), clusters or gaps, and outliers. e.g., “Group A is right-skewed with one cluster; Group B is roughly symmetric.” 2) Center—pick median for skewed data (resistant) or mean for roughly symmetric. Give a number or range: “Median A ≈ 40 ppm; median B ≈ 120 ppm.” 3) Spread—use IQR (and boxplots) for skewed data or SD/range for symmetric. Say which has more variability: “A has larger IQR and wider range than B.” When you compare, always mention context and at least one of shape/center/spread (CED UNC-1.N.1, UNC-1.O.1). Good phrasing: “Compared to B, A is more skewed right, has a lower center (median ≈ 50 vs. 120), and greater spread (IQR larger; more extreme values).” For practice, use the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz), the Unit 1 overview (https://library.fiveable.me/ap-statistics/unit-1), and try problems at (https://library.fiveable.me/practice/ap-statistics).

Question 5

How do I know which graph to use when comparing multiple groups of data?

Accepted Answer

Pick the graph that highlights the features you need to compare (center, spread, shape, outliers, clusters/gaps) and that fits your sample sizes. - Side-by-side boxplots—best for quick comparisons of median, IQR, range, and outliers across multiple groups (use when you have moderate-to-large samples). (CED: UNC-1.N.1) - Histograms—use when you want to see shape and modality (skewness, bimodality) for each group; compare bins and use the same bin widths/scale. - Dotplots or stemplots—use for small samples (show individual values and clusters/gaps). - Overlaid kernel density plots or smoothed histograms—good for shape comparison if sample sizes are big and you can keep the same scale. Always use the same axis scales, label axes and units, and pair graphs with numerical summaries (mean/median, s or IQR, range, % outliers). The AP exam expects you to compare distributions (Skill 2.D in Unit 1), so be ready to describe shape, center, variability, and context. For a quick review, see the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz).

Question 6

What's the step-by-step process for comparing two distributions using summary statistics?

Accepted Answer

Step-by-step: 1. Pick appropriate displays first (side-by-side boxplots, histograms, dotplots) to spot shape, clusters, gaps, and outliers (CED UNC-1.N.1). 2. State shape for each (symmetric, right/left skewed, bimodal)—shape affects whether to use mean/sd or median/IQR. 3. Compare center: report medians (and/or means if roughly symmetric). Give actual numbers (e.g., medianA = 45, medianB = 52). 4. Compare spread: report IQRs for skewed data and standard deviations for roughly symmetric data; also say ranges. 5. Note outliers and gaps explicitly and say which group they’re in. 6. Summarize differences clearly in context: which group tends to be larger, which is more variable, and any practical significance. 7. If needed for AP tasks, connect to inference: mention sample sizes and that comparisons using means/sds require roughly symmetric shape or large n (AP exam checks conditions). Use CED keywords (mean, median, sd, IQR, outliers, skewness). For a quick refresher, see the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 7

Can someone explain how to identify outliers when comparing boxplots side by side?

Accepted Answer

Look at each boxplot’s five-number summary (Q1, median, Q3, min, max) and use the IQR method. Calculate IQR = Q3 − Q1, then fences: lower = Q1 − 1.5·IQR and upper = Q3 + 1.5·IQR. Any points (or plotted dots) outside those fences are outliers; points beyond Q1 − 3·IQR or Q3 + 3·IQR are often called extreme outliers. On side-by-side boxplots, mark which groups have outliers, how many, and how far they lie from the box—that helps you compare variability and influence on center. Say, “Group A has two mild outliers above the upper fence; Group B has one extreme low outlier,” then comment on how those outliers affect medians and IQRs. This aligns with Topic 1.9 (compare center, variability, outliers) in the CED (UNC-1.N). For a quick refresher, check the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and try practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 8

I don't understand how to write about variability differences between two data sets - what should I look for?

Accepted Answer

Focus on the spread you see and pick the right measure to compare it. Look at graphs (side-by-side boxplots, histograms, dotplots): note which distribution is more “spread out” (longer whiskers, wider histogram). Then quantify with numbers: range and SD show overall spread (SD is best for roughly symmetric data), IQR and box lengths show spread of the middle 50% and are better when distributions are skewed or have outliers. Point out outliers and gaps—a few extreme values can make SD large while IQR stays small. Always mention context and which measure you chose and why (e.g., “Region A has larger SD (12 vs. 5) but similar IQRs, likely because of a few large outliers”). AP graders expect both graphical and numerical comparison (use side-by-side boxplots and report IQR/SD as appropriate). For more examples and practice, check the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and browse lots of practice questions (https://library.fiveable.me/practice/ap-statistics).

Question 9

How do I calculate and compare standard deviations for multiple samples?

Accepted Answer

Calculate each sample’s standard deviation with the AP formula for a sample: s = sqrt[ Σ(xi − x̄)² / (n − 1) ]. Steps: find each sample’s mean x̄, subtract the mean from each observation, square and sum those deviations, divide by n−1, then take the square root. Do this separately for each sample to get s1, s2, … When you compare sample standard deviations: - Compare the numeric s values directly—larger s means more spread in the original units. Give the context (e.g., “Sample A s = 12 minutes, Sample B s = 5 minutes—A is more variable”). - Check shape and outliers first (skewness or extreme values inflate s). If distributions are skewed or have outliers, also report IQR or use side-by-side boxplots as CED Topic 1.9 recommends. - Note sample size: small n makes s less reliable. On the exam, you’ll often pair these numeric comparisons with graphs (histograms/boxplots) and mention center, variability, and outliers (CED UNC-1.N / UNC-1.O). For a quick refresh, see the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and practice lots of problems (https://library.fiveable.me/practice/ap-statistics).

Question 10

When comparing distributions, do I always need to mention center, spread, and shape?

Accepted Answer

Short answer: usually, yes—but only include the pieces that matter for the question. What AP expects: the CED says when you compare quantitative distributions you should look at center, variability (spread), and shape—and also mention clusters, gaps, and outliers when they’re relevant (UNC-1.N.1, UNC-1.O.1). That means in many prompts you should explicitly compare center (mean/median), spread (SD/IQR/range), and shape (skewness, modality) plus any obvious outliers or gaps. That said: focus on what affects the comparison. If two distributions have the same shape and no outliers, a short comparison of centers and variability is enough. If one is skewed or has an outlier, call that out because it changes which center/measure you emphasize (median/IQR vs mean/SD). Always give context (what the variable is) and use numbers when available. Want more practice and examples that match the CED? Check the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and try problems at Fiveable’s practice page (https://library.fiveable.me/practice/ap-statistics).

Question 11

What does it mean when one distribution has a higher IQR than another?

Accepted Answer

IQR = Q3 − Q1, so it measures the spread of the middle 50% of the data (the central box in a boxplot). If one distribution has a higher IQR than another, its middle 50% of values are more spread out—the typical/central observations are more variable and less tightly clustered. That tells you the distribution has greater variability (in the center) even if the medians are the same. Remember: IQR is robust to outliers, so a larger IQR usually reflects genuine increased spread of typical values, not just a few extreme points. On the AP exam you’ll often compare distributions using IQR (and boxplots) to describe variability per UNC-1.N (see the Topic 1.9 study guide) (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz). For more practice comparing spread, check the Unit 1 review (https://library.fiveable.me/ap-statistics/unit-1) and practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 12

How do I describe clusters and gaps when I have multiple histograms to compare?

Accepted Answer

When you compare clusters and gaps across histograms, be specific and tie descriptions to context (CED: UNC-1.N). Say where clusters occur (values or bins) and how many, and point out any gaps or empty ranges. Short checklist and sentence templates: - Identify clusters: “Histogram A has a strong cluster around 40–55 (most bars high there), while Histogram B’s main cluster is 70–85.” - Note multiple clusters (bimodal): “Histogram C is bimodal with peaks near 20 and 60.” - Describe gaps: “Histogram A shows a clear gap from 56–69 (no observations), whereas B has no such gap.” - Compare frequency/strength: “A’s cluster around 40–55 is taller (higher relative frequency) than B’s cluster near 70.” - Mention context and overall features: “In terms of lead concentration, Region A clusters low while Region B clusters higher, and A has a wide gap in the midrange.” On the AP exam you’ll be expected to compare features (clusters, gaps, center, spread, outliers) concisely and in context—use exact ranges/bins and relative language (“larger”, “smaller”, “more pronounced”). For more examples and practice wording, check the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and try practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 13

I keep mixing up which measures go with which type of distribution - how do I remember when to use mean/std dev vs median/IQR?

Accepted Answer

Use mean and standard deviation when the distribution is roughly symmetric with no strong outliers—those stats summarize center and spread well because the mean uses every value and s measures typical distance from that mean. Use median and IQR when the distribution is skewed or has outliers—the median/IQR resist extreme values and better describe a “typical” center and the middle 50% spread. Quick checklist for AP comparisons (Topic 1.9 / UNC-1.O & UNC-1.N): - Look at the graph first (histogram, boxplot, dotplot). If roughly symmetric → report mean ± SD. - If skewed or outliers → report median and IQR (or use side-by-side boxplots to compare IQRs). - Always describe shape, center, variability, and mention outliers or gaps (CED keywords). Want practice: review the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and try problems at (https://library.fiveable.me/practice/ap-statistics).

Question 14

What's the formula for comparing relative frequencies between different sample sizes?

Accepted Answer

Turn raw counts into comparable proportions: relative frequency (or sample proportion) = x/n (or % = 100·x/n). If you want to compare two samples formally, use the sampling distribution for the difference of proportions. For two independent samples with sample proportions p̂1 = x1/n1 and p̂2 = x2/n2: - Standard error (general) for p̂1 − p̂2: SE = sqrt[ p̂1(1 − p̂1)/n1 + p̂2(1 − p̂2)/n2 ]. - Test statistic for H0: p1 = p2 (use pooled proportion p̂c = (x1 + x2)/(n1 + n2)): SE_pooled = sqrt[ p̂c(1 − p̂c)(1/n1 + 1/n2) ], z = (p̂1 − p̂2) / SE_pooled. These formulas (and the one-sample percent/relative-frequency idea) are what AP expects when comparing relative frequencies—see the sampling distributions & standard error entries on the AP formula sheet. For a quick topic refresher check the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and try practice problems (https://library.fiveable.me/practice/ap-statistics).

Question 15

How do I write a good comparison paragraph for an FRQ about two data sets?

Accepted Answer

Start with one clear comparative sentence (overall which sample is larger or more spread out). Then hit the five CED pieces: shape, center, variability, outliers/gaps, and context. For example: - Overall: “Sample B tends to have higher lead concentrations than Sample A.” - Shape: “Both distributions are right-skewed (long right tail).” - Center: “The median for B is higher than A (say median B > median A), so typical values in B are larger.” - Variability/outliers: “A shows a larger range and more extreme high values, so A is more variable and has outliers.” - Conclude in context: “So, region B has higher typical lead levels, but region A shows more extreme contamination.” Always reference the graph or summary stats you used (median, mean, IQR, SD, range) and compare pints numerically when given. AP graders look for shape, center, spread, at least one comparison, and context—practice writing these with the Topic 1.9 study guide (https://library.fiveable.me/ap-statistics/unit-1/comparing-distributions-quantitative-variable/study-guide/2j5wKJg84ZKKN1T5CEmz) and try extra FRQs at (https://library.fiveable.me/practice/ap-statistics).

Term	Definition
center	A measure indicating the middle or typical value of a distribution.
cluster	Concentrations of data usually separated by gaps in a distribution.
gap	Regions of a distribution between two data values where there are no observed data.
graphical representations	Visual displays such as bar charts, pie charts, or other graphs used to present data in a visual format.
histogram	A graph where the height of each bar represents the number or proportion of observations within an interval, with the ability to alter interval widths to change the appearance.
independent samples	Two or more separate groups of data where the values in one group do not influence or depend on the values in another group.
mean	The average value of a dataset, represented by μ in the context of a population.
outlier	Data points that are unusually small or large relative to the rest of the data.
relative frequency	The proportion of observations in a category, expressed as a decimal, fraction, or percentage of the total.
side-by-side boxplots	A graphical representation that displays multiple boxplots arranged next to each other to compare the distributions of different groups or samples.
standard deviation	A measure of how spread out data values are from the mean, represented by σ in the context of a population.
summary statistics	Numerical measures that describe key features of a dataset, such as center, spread, and shape.
variability	The spread or dispersion of data values in a distribution.

📊AP Statistics Unit 1 Review

1.9 Comparing Distributions of a Quantitative Variable

1.9 Comparing Distributions of a Quantitative Variable

Unit & Topic Study Guides

Comparing Groups with Stem-and-Leaf Plots: Warm Up

Comparing Groups with Histograms: Practice AP-Style Problem

Comparing Groups with Box Plots: Practice AP-Style Problem

Vocabulary

Frequently Asked Questions

history

social science

english & capstone

arts

science

math & computer science

world languages

high school exams

honors classes

college classes

hs classes

Study Content & Tools

Company

Resources