Probability Distribution

A probability distribution is a table, graph, or function that shows every possible value of a random variable along with the probability of each value; for a discrete random variable, all probabilities must be between 0 and 1 and must sum to exactly 1.

Verified for the 2027 AP Statistics exam. Last updated June 2026.

What is Probability Distribution?

A probability distribution is the complete map of a random variable's behavior. It pairs every possible value the variable can take with the probability of getting that value. You can write it as a table (the most common format on the AP exam), a graph like a histogram, or a function like the binomial formula. Per the Course and Exam Description (CED, 4.7.A), the rules for a discrete random variable are strict and simple. The variable takes a countable number of values, each value gets a probability between 0 and 1, and those probabilities must add up to 1.

Here's the useful way to think about it. A regular data distribution describes what did happen in a sample. A probability distribution describes what could happen, and how likely each outcome is, before you ever collect data. Suppose you roll two dice and let X = the sum. The distribution tells you P(X = 7) = 6/36 without rolling anything. Once you have a distribution, you can describe its shape, center, and spread (4.7.B), which means you can compute an expected value and standard deviation for a process, not just a dataset. Special named distributions like the binomial (Topic 4.10) and geometric (Topic 4.12) are just probability distributions with formulas that generate the probabilities for you.
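
If it helps to see the two-dice example built out, here is a minimal Python sketch. It assumes two fair six-sided dice, and the names counts and distribution are only illustrative. It lists all 36 equally likely outcomes, counts how many give each sum, and checks that the probabilities add to 1.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Enumerate all 36 equally likely (die 1, die 2) outcomes and count each sum.
counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))

# Exact probabilities: P(X = x) = (number of ways to roll x) / 36.
distribution = {x: Fraction(ways, 36) for x, ways in counts.items()}

for x, ways in sorted(counts.items()):
    print(f"P(X = {x:2d}) = {ways}/36")

# A valid distribution must sum to exactly 1.
total = sum(distribution.values())
print("Sum of probabilities:", total)
```

Using Fraction keeps the arithmetic exact, so the sum-to-1 check is not affected by decimal rounding.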

Why Probability Distribution matters in AP Statistics

This is the backbone concept of Unit 4 (Probability, Random Variables, and Probability Distributions). Learning objective 4.7.A asks you to represent a probability distribution for a discrete random variable, and 4.7.B asks you to interpret one by describing shape, center, and spread in context. Everything else in the unit builds on top of that. The binomial distribution (4.10.A and 4.10.B) and the geometric distribution (4.12.A through 4.12.C) are specific probability distributions where formulas replace the table. The CED also points out that you can construct a distribution from probability rules or estimate one with a simulation using random number generators, and simulation-based questions about distributions show up regularly in multiple choice. Beyond Unit 4, this idea is the on-ramp to sampling distributions in Unit 5 and every inference procedure after that.

How Probability Distribution connects across the course

Random Variable (Unit 4)

A probability distribution doesn't exist without a random variable to describe. The random variable supplies the numerical outcomes, and the distribution attaches a probability to each one. They're two halves of the same object, which is why the AP exam almost always introduces them together as 'Let X = ...' followed by a table.

Binomial Random Variable (Unit 4)

The binomial distribution is a probability distribution you don't have to build by hand. When X counts successes in n independent trials with success probability p, the binomial formula generates every probability in the table automatically. Same concept, just with a shortcut formula attached.
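
As a rough sketch of what "the formula generates the table" means, the Python below applies the binomial probability formula P(X = k) = C(n, k) p^k (1 - p)^(n - k) to every k from 0 to n. The function name binomial_pmf and the choice of n = 8 fair coin flips are assumptions made for illustration.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k) when X counts successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Illustrative setting: X = number of heads in 8 fair coin flips.
n, p = 8, 0.5
table = {k: binomial_pmf(k, n, p) for k in range(n + 1)}

for k, prob in table.items():
    print(f"P(X = {k}) = {prob:.4f}")

print("Sum of probabilities:", sum(table.values()))
```

On the exam you would normally get these values from a calculator's binomial functions; the code just shows that the table and the formula are the same object.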

Expected Value (Unit 4)

Expected value is the mean of a probability distribution. You multiply each value by its probability and add everything up. So the distribution isn't just a list. It's the input for finding the center and spread of a random process, which is exactly what 4.7.B asks you to interpret.
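
Here is a small Python sketch of that calculation. The table is hypothetical (made up for this example, not taken from any exam), and the variable names are illustrative. It computes the mean as the sum of value times probability, then the standard deviation as the square root of the probability-weighted squared deviations from the mean.

```python
from math import sqrt

# Hypothetical table for illustration: X = number of items a customer buys.
values = [0, 1, 2, 3]
probs = [0.1, 0.3, 0.4, 0.2]

# Expected value: multiply each value by its probability and add.
mean = sum(x * p for x, p in zip(values, probs))

# Variance: weight each squared deviation from the mean by its probability.
variance = sum((x - mean) ** 2 * p for x, p in zip(values, probs))
std_dev = sqrt(variance)

print(f"mean = {mean:.2f}, standard deviation = {std_dev:.2f}")
```

For this made-up table the mean works out to 1.7 and the standard deviation to 0.9, so an interpretation in context would read something like "in the long run, customers buy about 1.7 items on average."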

Independent Trials (Unit 4)

Both the binomial and geometric probability functions only work when trials are independent with a constant probability of success. If independence breaks, those named distributions no longer apply, and that condition check is a classic point of attack in multiple-choice stems.

Is Probability Distribution on the AP Statistics exam?

Multiple-choice questions hit this term in two main ways. First, simulation questions ask you to estimate a probability distribution using random number generators, then judge whether the simulation design is valid or interpret a result like 'P(X = 6) ≈ 0.22 from 500 trials' as an estimate of the true probability, not the exact answer. Second, calculation questions hand you a scenario (like a 15% defect rate with 20 items inspected) and expect you to recognize the binomial setting and compute P(X ≤ 2) using the binomial probability function. On the free-response side, probability distributions anchor the Unit 4 FRQ almost every year. The 2023 FRQ (bath fizzies with cash prizes) and the 2024 FRQ (geode crystals in an online game) both required working with a random variable's distribution, finding probabilities, and interpreting expected values in context. The graders want you to define the random variable, verify probabilities sum to 1 when you build a table, and write interpretations using the units and context of the problem.
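
To make the defect-rate scenario concrete, the sketch below computes P(X ≤ 2) for a binomial random variable with n = 20 and p = 0.15 by adding the probabilities for 0, 1, and 2 defects. It assumes the 20 inspected items are independent with the same 15% defect chance, and the function name binomial_cdf is illustrative.

```python
from math import comb

def binomial_cdf(k, n, p):
    """P(X <= k): add the binomial probabilities for 0, 1, ..., k."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))

# X = number of defective items among 20 inspected, with a 15% defect rate.
prob = binomial_cdf(2, 20, 0.15)
print(f"P(X <= 2) = {prob:.4f}")
```

The result is about 0.405, which is what a calculator's cumulative binomial function should return for the same inputs.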

Probability Distribution vs Sampling Distribution

A probability distribution describes the possible values of one random variable, like the number of heads in 8 coin flips. A sampling distribution (Unit 5) is a specific kind of probability distribution that describes a statistic, like a sample mean or sample proportion, across all possible samples. Every sampling distribution is a probability distribution, but the reverse isn't true. If the question is about x̄ or p̂ varying from sample to sample, you're in sampling distribution territory.

Key things to remember about Probability Distribution

  • A probability distribution shows every possible value of a random variable and the probability of each value, as a table, graph, or function.

  • For a discrete random variable, every probability must be between 0 and 1, and all the probabilities must sum to exactly 1.

  • You can build a probability distribution using probability rules, or estimate it with a simulation using random number generators.

  • Interpreting a probability distribution means describing its shape, center, and spread in the context of the situation.

  • The binomial distribution (counts successes in n trials) and the geometric distribution (counts trials up to and including the first success) are named probability distributions with their own formulas.

  • Expected value and standard deviation of a random variable are calculated directly from its probability distribution, so the table is your starting point for every calculation.

Frequently asked questions about Probability Distribution

What is a probability distribution in AP Stats?

It's a table, graph, or function that lists every possible value of a random variable with the probability of each value. For a discrete random variable, the probabilities must each be between 0 and 1 and must sum to 1.

Do the probabilities in a probability distribution always have to add up to 1?

Yes, always, for a discrete random variable. This is one of the fastest checks on the exam. If a table's probabilities sum to 0.95 or 1.05, it's not a valid probability distribution, and FRQ graders expect you to verify this when you build one yourself.
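
The check itself is simple enough to sketch in a few lines of Python. The function name is_valid_distribution and the small tolerance for decimal rounding are assumptions for illustration.

```python
from math import isclose

def is_valid_distribution(probs, tol=1e-9):
    """True if every probability is in [0, 1] and they sum to 1."""
    in_range = all(0 <= p <= 1 for p in probs)
    # Allow a tiny tolerance because decimals are stored approximately.
    return in_range and isclose(sum(probs), 1.0, abs_tol=tol)

print(is_valid_distribution([0.2, 0.5, 0.3]))   # sums to 1
print(is_valid_distribution([0.2, 0.5, 0.25]))  # sums to 0.95
print(is_valid_distribution([1.2, -0.2]))       # sums to 1, but values out of range
```

The third case is worth noticing: summing to 1 is not enough on its own, because each probability also has to be between 0 and 1.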

What's the difference between a probability distribution and a frequency distribution?

A frequency distribution shows how often outcomes actually occurred in collected data. A probability distribution shows the theoretical likelihood of each outcome before you collect anything. A simulation bridges the two, since relative frequencies from many trials estimate the true probabilities.
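
A quick simulation shows that bridge in action. This is a sketch under stated assumptions: it uses two fair six-sided dice, Python's built-in random number generator, and an arbitrary seed so the run is repeatable. The function name simulate_dice_sums is illustrative.

```python
import random
from collections import Counter

def simulate_dice_sums(trials, seed=2027):
    """Estimate the distribution of X = sum of two dice from repeated rolls."""
    rng = random.Random(seed)  # seeded so the run can be repeated
    counts = Counter(rng.randint(1, 6) + rng.randint(1, 6) for _ in range(trials))
    # Relative frequency = count / number of trials.
    return {x: counts[x] / trials for x in range(2, 13)}

estimated = simulate_dice_sums(100_000)

for x, rel_freq in estimated.items():
    ways = 6 - abs(x - 7)  # number of ways to roll a sum of x
    print(f"X = {x:2d}: simulated {rel_freq:.4f}, theoretical {ways / 36:.4f}")
```

The simulated column is an estimate, so it will land near the theoretical values without matching them exactly, and fewer trials would give rougher estimates.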

Is a probability distribution the same as a binomial distribution?

Not exactly. The binomial distribution is one specific type of probability distribution, used when you count successes in n independent trials with constant success probability p. Probability distribution is the umbrella term that also covers geometric, uniform, normal, and any custom table a problem gives you.

How do you find the mean of a probability distribution?

Multiply each value of the random variable by its probability, then add the products. That sum is the expected value, μ. For named distributions there are shortcuts, like μ = 1/p for a geometric random variable.
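
To see that the shortcut agrees with the multiply-and-add definition, the sketch below sums value times probability for a geometric random variable and compares it with 1/p. The success probability p = 0.2 is an arbitrary choice, the name geometric_pmf is illustrative, and the sum stops at 1,000 terms because the remaining probabilities are negligibly small.

```python
def geometric_pmf(k, p):
    """P(X = k): first success happens on trial k (k = 1, 2, 3, ...)."""
    return (1 - p) ** (k - 1) * p

p = 0.2  # arbitrary success probability for illustration

# Expected value by definition, truncated after 1,000 terms.
approx_mean = sum(k * geometric_pmf(k, p) for k in range(1, 1001))

print(f"sum of k * P(X = k) = {approx_mean:.6f}")
print(f"shortcut 1/p        = {1 / p:.6f}")
```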