Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Probability distributions are mathematical models that let statisticians predict outcomes, quantify uncertainty, and make inferences about populations. On the AP Statistics exam, you need to recognize which distribution applies to a given scenario, understand the conditions required for each model, and calculate probabilities using their specific parameters. These distributions connect directly to Units 4, 5, and beyond, from basic probability calculations to sampling distributions to inference procedures like confidence intervals and hypothesis tests.
Don't just memorize formulas and shapes. Know why each distribution exists: What real-world process does it model? What conditions must be met? How does it connect to the Central Limit Theorem or chi-square tests? When you understand the underlying mechanism, you can tackle any FRQ scenario the exam throws at you, whether it's identifying the right distribution, checking conditions, or interpreting results in context.
These distributions model situations where you're counting discrete outcomes: how many successes occur, how many trials until success, or how many events happen in a given interval. Each has specific conditions that determine when it's the appropriate model.
A Bernoulli distribution models a single trial with exactly two outcomes: success (1) or failure (0). It's the simplest probability distribution and the building block for several others.
The binomial distribution counts the number of successes in a fixed number of independent trials, where each trial has the same probability of success. A helpful mnemonic for checking conditions is BINS: Binary outcomes, Independent trials, Number of trials fixed, Same probability on each trial.
For example, if you flip a fair coin 40 times and count heads, that's binomial with and . The expected number of heads is , and the standard deviation is .
The Poisson distribution models the count of events occurring in a fixed interval of time or space, when those events happen independently at a constant average rate.
Compare: Binomial vs. Poisson: both count discrete events, but binomial requires a fixed number of trials while Poisson models events in continuous time or space with no fixed upper limit on the count. If an FRQ describes "the number of customers arriving per hour," think Poisson. If it's "the number of defective items in a sample of 50," think binomial.
The geometric distribution counts how many trials it takes to get the first success in repeated independent Bernoulli trials. Where the binomial asks "how many successes in trials?", the geometric asks "how many trials until the first success?"
Compare: Binomial vs. Geometric: binomial fixes the number of trials and counts successes; geometric fixes the number of successes (at one) and counts trials. Both require independent trials with constant .
Continuous distributions model variables that can take any value within an interval: time, height, test scores, or any measurement on a continuous scale. A key distinction from discrete distributions: probability is found as area under the density curve, not at individual points. The probability that a continuous variable equals any single exact value is 0.
The uniform distribution applies when all outcomes are equally likely within a defined range from to .
Finding probabilities with a continuous uniform is straightforward: just calculate the proportion of the interval. If wait times are uniformly distributed between 0 and 10 minutes, the probability of waiting between 3 and 7 minutes is .
The normal distribution is the most important distribution in AP Statistics. It's a symmetric, bell-shaped curve defined by two parameters: mean (center) and standard deviation (spread).
The reason the normal distribution shows up everywhere is the Central Limit Theorem. Even if the underlying population isn't normal, the distribution of sample means will be approximately normal for large enough samples (typically as a rough guideline).
The exponential distribution models waiting time between events when those events occur at a constant rate .
Compare: Poisson vs. Exponential: Poisson counts how many events occur in a fixed time period; exponential measures how long between events. They're two sides of the same process, both using rate parameter .
These distributions arise specifically in statistical inference. They describe how test statistics behave under the null hypothesis or how estimators vary across samples. You won't typically model raw data with these; instead, they tell you what to expect from your calculated statistics.
The t-distribution looks like the normal distribution but has heavier tails, meaning extreme values are more likely. Those heavier tails account for the extra uncertainty introduced when you estimate the population standard deviation from sample data.
The chi-square () distribution models the sum of squared standardized values, so it can only take positive values and is right-skewed.
Compare: t-distribution vs. Chi-square: both depend on degrees of freedom, but t is symmetric around zero (for testing means) while chi-square is always positive and skewed (for testing categorical relationships). The t approaches normal quickly; chi-square approaches normal only with very large df.
The F-distribution is the ratio of two independent chi-square distributions, each divided by their degrees of freedom. Like chi-square, it's always positive and right-skewed.
| Concept | Best Examples |
|---|---|
| Counting successes in fixed trials | Binomial, Bernoulli |
| Counting events in continuous interval | Poisson |
| Waiting time/trials until success | Geometric, Exponential |
| Symmetric continuous data | Normal, Uniform |
| Inference with unknown | Student's t |
| Categorical data analysis | Chi-Square |
| Comparing variances/ANOVA | F-Distribution |
| Memoryless property | Geometric (discrete), Exponential (continuous) |
A quality control inspector examines 100 items and records how many are defective. Which distribution models this scenario, and what conditions must be verified?
Compare the geometric and exponential distributions: What do they have in common, and how do their applications differ?
Why does the AP Statistics curriculum emphasize the normal distribution so heavily? Connect your answer to the Central Limit Theorem and inference procedures.
An FRQ presents a chi-square test for independence. Explain why the chi-square distribution (rather than the normal or t) is the appropriate sampling distribution for the test statistic.
Both the Poisson and binomial distributions count discrete events. Under what conditions can you use the Poisson as an approximation for the binomial, and why might you want to?