Cumulative distribution function (CDF)

Q: What is cumulative distribution function (CDF) in Intro to Statistics?

It is the function that gives the probability a random variable is less than or equal to a value, written P(X ≤ x). In Intro to Statistics, you use it to find probabilities for continuous distributions, especially waiting-time models like the exponential distribution.

Q: How do you find probability from a CDF?

Use the CDF value at the upper endpoint and subtract the CDF value at the lower endpoint. For a continuous random variable, P(a < X ≤ b) = F(b) - F(a). That works because the CDF counts all probability up to each cutoff.

Q: What is the difference between a CDF and a PDF?

A PDF describes how probability is spread out, but it does not directly give interval probability by itself. A CDF gives the accumulated probability up to a point, which makes it easier to answer questions like 'less than' or 'between' values.

Q: How does the CDF work for the exponential distribution?

For an exponential random variable with rate λ, the CDF is F(x) = 1 - e^{-λx} for x ≥ 0. That formula gives the chance that the waiting time is at most x, which is why it shows up in arrival and failure-time problems.

A cumulative distribution function, or CDF, gives the probability that a random variable is less than or equal to a value, P(X ≤ x). In Intro to Statistics, you use it to read distribution shapes and find probabilities on intervals.

Last updated July 2026

What is cumulative distribution function (CDF)?

A cumulative distribution function (CDF) is the rule that tells you how much probability has piled up by a given value of a random variable. For a continuous random variable X, the CDF is written as F(x) = P(X ≤ x). That means if you pick a point on the number line, the CDF gives the probability that X lands at or to the left of that point.

In Intro to Statistics, the CDF turns a distribution into a running total. Small x values have CDF values near 0 because there is little probability to the left. As x increases, the CDF never goes down, since you are only adding more probability. That is why every CDF is non-decreasing, and for continuous distributions it is also right-continuous.

A good way to picture it is as an accumulation curve. Instead of asking, “How tall is the density here?” you ask, “How much area has been collected up to this point?” For a continuous distribution, the exact value at a single point is not a probability by itself. The CDF works because probabilities come from intervals, and intervals have area.

That is why CDFs are useful for finding probabilities between two values. If you want P(a < X ≤ b), you subtract: F(b) - F(a). The first value gives the total area to the left of b, and the second gives the total area to the left of a. The difference is the probability in between.

The exponential distribution is a common place where you see a CDF in action. If X is exponential with rate λ, then F(x) = 1 - e^{-λx} for x ≥ 0. This formula matches the idea of waiting time: at x = 0, the CDF is 0, and as x gets larger, the probability of having seen the event by then rises toward 1.

Why cumulative distribution function (CDF) matters in Intro to Statistics

The CDF is one of the cleanest ways to move from a random variable to an actual probability answer in Intro to Statistics. If a question asks for the chance that a wait time is below 5 minutes, between 2 and 7 minutes, or at most some cutoff, the CDF is the tool that turns the distribution into that probability.

It also helps you read continuous distributions without getting stuck on the PDF. A density curve tells you where values are more or less concentrated, but the CDF tells you the accumulated probability up to any point. That makes it easier to compare cutoffs, interpret percentile-style questions, and check whether a value looks unusually small or large within the model.

The CDF is especially useful in the exponential distribution unit because that distribution models waiting time and event timing. If you are working with light bulbs, arrivals, breakdowns, or any process with a constant rate, the CDF gives the probability the event has already happened by time t. That is a practical move in homework and quizzes because it lets you answer real context questions with one formula.

It also sets up later ideas like the memoryless property and hazard function. Once you know how probability accumulates over time, it is easier to see why exponential waiting times behave the way they do and why the distribution is so common in Poisson process and queuing theory examples.

Keep studying Intro to Statistics Unit 5

Visual cheatsheet

view gallery

Unit 5 study guide

How cumulative distribution function (CDF) connects across the course

Probability Density Function (PDF)

The PDF shows how probability is spread out across values, while the CDF shows how much probability has accumulated up to a point. In practice, you often read the PDF to understand the shape and then use the CDF to get actual probabilities on intervals.

Rate Parameter (λ)

In the exponential distribution, the rate parameter controls how quickly the CDF rises toward 1. A larger λ means events happen sooner on average, so the cumulative probability builds up faster over time.

Memoryless Property

The memoryless property is easier to see once you use the CDF for waiting times. It says that for an exponential random variable, the probability of waiting longer does not depend on how long you have already waited.

Interarrival Time

Interarrival time is the waiting time between events, and the CDF gives the probability that this wait is at most a certain length. That is why CDF questions often show up in arrival or service-time problems.

Is cumulative distribution function (CDF) on the Intro to Statistics exam?

A quiz or problem set question will often give you an exponential distribution and ask for a probability like P(X ≤ 4) or P(2 < X ≤ 6). Your move is to use the CDF, either by plugging into the formula F(x) = 1 - e^{-λx} or by subtracting two CDF values for an interval. If the question is asking for a waiting-time interpretation, translate the math back into context, such as the chance a customer arrives within 5 minutes or a machine fails before a deadline.

You may also be asked to sketch or interpret the shape of the CDF. The graph should start near 0, rise as x increases, and level off near 1. If a solution uses the PDF instead, check whether the prompt wants accumulated probability rather than height on the density curve.