Fiveable

๐Ÿ“ŠAP Statistics Unit 1 Review

QR code for AP Statistics practice questions

1.9 Comparing Distributions of a Quantitative Variable

1.9 Comparing Distributions of a Quantitative Variable

Written by the Fiveable Content Team โ€ข Last updated June 2026
Verified for the 2027 exam
Verified for the 2027 examโ€ขWritten by the Fiveable Content Team โ€ข Last updated June 2026
๐Ÿ“ŠAP Statistics
Unit & Topic Study Guides

Previous Exam Prep

AP Cram Sessions 2021

Pep mascot

Comparing distributions means describing two or more groups of quantitative data using the same features and the same vocabulary. Always compare shape, center, and variability, plus any unusual features like outliers, gaps, or clusters, and always do it in context with units.

Why This Matters for the AP Statistics Exam

This skill shows up whenever you see two or more groups shown with side-by-side boxplots, comparative histograms, stacked dotplots, or back-to-back stem plots. You will need to read the graphs or summary statistics and write a clear comparison in context.

The most common mistake here is describing each group on its own instead of comparing them. AP questions in this topic want you to connect the groups directly: which one has a higher center, which one has more spread, which one is more skewed. On free response, writing in context and using true comparison words is important for clear, complete work.

Key Takeaways

  • Compare distributions on shape, center, and variability, plus unusual features (outliers, gaps, clusters, multiple peaks).
  • Use comparison words like "greater than," "less than," or "about the same." Do not just describe each group separately.
  • Any graph type works for comparing groups: histograms, side-by-side boxplots, dotplots, or stem plots, as long as the groups use the same scale.
  • Any numerical summary works too: mean, median, standard deviation, IQR, range, or relative frequency.
  • Use resistant measures (median, IQR) when a distribution is skewed or has outliers, and report center and spread together.
  • Always include context and units. Numbers without context are incomplete answers.

How to Use This on the AP Statistics Exam

Free Response

When a prompt says "compare the distributions," hit all of these:

  • Shape: Is each group symmetric, skewed right, or skewed left? Unimodal or bimodal?
  • Center: Which group has a higher median or mean? Or are they about the same?
  • Variability: Which group is more spread out? Compare range, IQR, or standard deviation.
  • Unusual features: Point out outliers, gaps, or clusters in either group.

Phrase every point as a comparison. "Group A is skewed right while Group B is roughly symmetric" earns more than two separate descriptions.

Choosing Summary Statistics

  • For a skewed distribution or one with outliers, the median and IQR describe center and spread better because they are resistant.
  • For a roughly symmetric distribution, the mean and standard deviation work well.
  • If one group is skewed and the other is symmetric, expect the mean to sit toward the long tail in the skewed group. That can shift how the means compare even when medians look similar.

Common Trap

Reporting only the center is incomplete. A response that compares medians but ignores shape and spread will lose credit. Hit every feature.

Comparing Groups with Stem-and-Leaf Plots: Warm Up

Try a quick warm up using a familiar graph: the stem plot.

Question: The weights of two groups of eight animals, Group M and Group N, are recorded and the data are shown in the stem plots below (each stem and leaf represents weight in kg). Use the stem plots to compare the weights of the animals in the two groups.

Group M:

1 | 4

2 | 3 4 8

3 | 2 6 8

4 |

5 | 0

Group N:

1 | 0

2 | 3 6

3 | 5

4 | 1

5 | 4 7

6 | 2

To compare, look at center, spread, and shape for both groups. Group M has weights from 14 to 50 kg, and Group N has weights from 10 to 62 kg, so Group N has a wider range. Group N has heavier animals at the top than anything in Group M.

Look at how the values cluster too. Group M has most of its values bunched in the 20s and 30s, while Group N spreads more evenly across the whole range. That means Group M's animals are more similar in weight to each other, while Group N's weights are more varied.

Overall, Group N has greater variability than Group M, with a more spread-out distribution.

Comparing Groups with Histograms: Practice AP-Style Problem

Each state in the United States keeps records on the number of pupils enrolled in public schools and the number of teachers employed by public schools for each school year. From these records, the ratio of pupils to teachers (P-T ratio) can be calculated for each state. The histograms below show the P-T ratio for every state during the 2001-2002 school year. The histogram on the left displays the ratios for the 24 states west of the Mississippi River, and the histogram on the right displays the ratios for the 26 states east of the Mississippi River.

Source: The College Board (via AP Classroom)

a. Estimate the median P-T ratio for each group.

The question asks you to estimate the median, not compute it. For states west of the Mississippi (n = 24), the median falls between the 12th and 13th value in the ordered list, and both fall in the interval 15-16. For states east of the Mississippi (n = 26), the median falls between the 13th and 14th value, and both also fall in the interval 15-16. So both groups have a median of about 15 to 16 students per teacher.

b. Write a few sentences comparing the distributions of P-T ratios for states in the two groups (west and east) during the 2001-2002 school year.

Compare shape, center, and variability one at a time. Start with shape. The two histograms look different: West is unimodal and skewed to the right, while East is unimodal and nearly symmetric.

For center, part (a) showed the medians are about the same, between 15 and 16 for both groups.

For variability, look at how spread out the values are around the center. West values vary more than East values. Even though the data are grouped, you can still approximate the range. The range for West is at most 22 - 12 = 10, and the range for East is at most 19 - 12 = 7. So East has less variability than West.

c. Using your answers in parts (a) and (b), explain how you think the mean P-T ratio will compare for the two groups (west and east).

The two histograms have different shapes. Since West is skewed to the right, its mean will be pulled toward the high values, so the mean will be greater than the median. For East, since it is fairly symmetric, the mean will be close to the median. Putting that together, the mean for the West group will probably be greater than the mean for the East group.

Comparing Groups with Box Plots: Practice AP-Style Problem

A team of psychologists studied visualization in basketball, where players visualize making a basket before shooting. They ran an experiment in which 20 basketball players with similar abilities were randomly assigned to two groups. The 10 players in group 1 received visualization training, and the 10 players in group 2 did not.

Each player stood 22 feet from the basket at the same location on the court. Each player attempted to make the basket until two consecutive baskets were made. Players who received visualization training were instructed to use visualization techniques before each attempt. The total number of attempts, including the last two, was recorded for each player.

The total number of attempts for each of the 20 players is summarized in the following box plots.

Source: The College Board

There are two groups, with 10 players randomly assigned to each. Group 1 received visualization training and group 2 did not. Compare the box plots feature by feature.

Both groups have the same minimum number of attempts, but every other measure is different. At the first quartile, group 1 reached two consecutive baskets in 3 attempts, while group 2 took 4 attempts.

Now look at the median. The median is much lower for group 1 than for group 2. Group 1 has an outlier, but it is still less than the maximum of group 2. Across all five number summary values, group 1 is lower than group 2, which suggests the training had an effect. Note that this is a description of the sample, not yet a generalization about all players.

To answer the question, it is enough to compare medians. Because the median number of attempts for players who received visualization training (4) is less than the median for players who did not (7), players who received training tended to need fewer attempts to make two consecutive baskets.

Common Misconceptions

  • Describing instead of comparing. Listing features of Group A, then separately listing features of Group B, does not count as comparing. Connect them with words like "higher," "lower," or "about the same."
  • Reporting only the center. Comparing medians or means alone is incomplete. You also need shape and variability, plus unusual features.
  • Forgetting context and units. A comparison without the variable and its units (like P-T ratio or number of attempts) is not a full answer.
  • Mixing up skew and the mean's direction. In a right-skewed distribution, the mean is pulled toward the larger values (to the right of the median), not toward the peak.
  • Using nonresistant measures with outliers. The mean, range, and standard deviation get pulled by outliers. When data are skewed or have outliers, lean on the median and IQR.
  • Assuming different boxes with the same min are basically equal. Two groups can share a minimum and still differ a lot in median, quartiles, and spread. Check every part of the five number summary.

Vocabulary

The following words are mentioned explicitly in the College Board Course and Exam Description for this topic.

Term

Definition

center

A measure indicating the middle or typical value of a distribution.

cluster

Concentrations of data usually separated by gaps in a distribution.

gap

Regions of a distribution between two data values where there are no observed data.

graphical representations

Visual displays such as bar charts, pie charts, or other graphs used to present data in a visual format.

histogram

A graph where the height of each bar represents the number or proportion of observations within an interval, with the ability to alter interval widths to change the appearance.

independent samples

Two or more separate groups of data where the values in one group do not influence or depend on the values in another group.

mean

The average value of a dataset, represented by ฮผ in the context of a population.

outlier

Data points that are unusually small or large relative to the rest of the data.

relative frequency

The proportion of observations in a category, expressed as a decimal, fraction, or percentage of the total.

side-by-side boxplots

A graphical representation that displays multiple boxplots arranged next to each other to compare the distributions of different groups or samples.

standard deviation

A measure of how spread out data values are from the mean, represented by ฯƒ in the context of a population.

summary statistics

Numerical measures that describe key features of a dataset, such as center, spread, and shape.

variability

The spread or dispersion of data values in a distribution.

Frequently Asked Questions

How do you compare distributions in AP Statistics?

Compare shape, center, variability, and unusual features, then write in context with units. Use direct comparison language such as higher, lower, more spread out, less variable, more skewed, or about the same.

What features should I compare for two quantitative distributions?

Compare shape, center, variability, clusters, gaps, outliers, and other unusual features. For center, compare mean or median. For variability, compare standard deviation, IQR, range, or the spread shown in graphs like boxplots and histograms.

How do I compare two histograms?

Use the same scale, then compare the overall shape, where the data are centered, how spread out the values are, and any gaps, clusters, or outliers. Do not describe one histogram fully and then the other; connect the groups directly in each sentence.

How do I compare two boxplots?

Compare medians for center, IQRs and ranges for variability, and any outliers or differences in skew. A boxplot comparison should say which group has the higher median, which group is more variable, and whether the groups have unusual features.

When should I compare mean vs median?

Use the median when distributions are skewed or have outliers because it is resistant. Use the mean for roughly symmetric distributions without strong outliers. If a distribution is right-skewed, the mean is usually pulled above the median; if left-skewed, it is usually pulled below the median.

What is the biggest free-response mistake when comparing distributions?

The biggest mistake is describing each distribution separately instead of comparing them. AP Statistics prompts expect comparison language and context. A strong response directly compares shape, center, variability, and unusual features using the variable name and units when available.

Pep mascot
Upgrade your Fiveable account to print any study guide

Download study guides as beautiful PDFs โ†’ See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Click below to go to billing portal โ†’ update your plan โ†’ choose Yearlyโ†’ and select "Fiveable Share Plan". Only pay the difference

Plan is open to all students, teachers, parents, etc
Pep mascot
Upgrade your Fiveable account to export vocabulary

Download study guides as beautiful PDFs โ†’ See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Plan is open to all students, teachers, parents, etc
report an error
description

screenshots help us find and fix the issue faster (optional)

add screenshot