Fiveable

📊Honors Statistics Unit 4 Review

QR code for Honors Statistics practice questions

4.2 Mean or Expected Value and Standard Deviation

4.2 Mean or Expected Value and Standard Deviation

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
📊Honors Statistics
Unit & Topic Study Guides
Pep mascot

Expected Value and Standard Deviation

Expected value and standard deviation let you summarize a discrete random variable with just two numbers: where the distribution centers and how spread out it is. These tools are essential for predicting outcomes and comparing different probability distributions.

Pep mascot
more resources to help you study

Expected Value of Discrete Variables

The expected value of a discrete random variable XX, written E(X)E(X) or μ\mu, is the long-run average you'd observe if you repeated the random process many times. You calculate it by multiplying each possible value by its probability, then adding up all those products.

μ=E(X)=xP(X=x)\mu = E(X) = \sum x \cdot P(X = x)

For a fair six-sided die:

E(X)=116+216+316+416+516+616=3.5E(X) = 1 \cdot \frac{1}{6} + 2 \cdot \frac{1}{6} + 3 \cdot \frac{1}{6} + 4 \cdot \frac{1}{6} + 5 \cdot \frac{1}{6} + 6 \cdot \frac{1}{6} = 3.5

Notice that 3.5 isn't even a value you can roll. The expected value doesn't have to be a possible outcome; it's the balancing point of the distribution. Think of it as where the probability distribution would balance if you placed it on a number line.

The probability mass function (PMF) is what gives you the probability for each possible value of a discrete random variable. It's the table or formula you plug into the expected value calculation.

Expected value of discrete variables, Discrete Random Variables (3 of 5) | Concepts in Statistics

Law of Large Numbers Interpretation

The law of large numbers says that as you repeat a random process more and more times, the average of your observed results gets closer and closer to the expected value.

  • Flip a fair coin 10 times and you might get 70% heads. Flip it 10,000 times and the proportion of heads will be very close to 0.5.
  • Roll a die 5 times and your average might be 4.2. Roll it 5,000 times and the average will hover near 3.5.

This is why E(X)E(X) is called the "expected" value: it's what you expect to see on average over many trials. It also explains why larger sample sizes give more reliable estimates of population parameters.

Expected value of discrete variables, data visualization - Plot the probability mass function - Cross Validated

Variance and Standard Deviation Computation

Expected value tells you the center, but two distributions can share the same mean and look completely different. Variance and standard deviation capture how spread out the values are around that center.

Variance Var(X)Var(X) is the average squared deviation from the mean:

Var(X)=σ2=(xμ)2P(X=x)Var(X) = \sigma^2 = \sum (x - \mu)^2 \cdot P(X = x)

Standard deviation σ\sigma is the square root of the variance, which brings the units back to the same scale as XX:

σ=Var(X)\sigma = \sqrt{Var(X)}

Here's how to compute them step by step:

  1. Find the mean μ=E(X)\mu = E(X) using the method above.

  2. Subtract the mean from each possible value to get the deviation: (xμ)(x - \mu).

  3. Square each deviation: (xμ)2(x - \mu)^2.

  4. Multiply each squared deviation by its probability: (xμ)2P(X=x)(x - \mu)^2 \cdot P(X = x).

  5. Sum all the products to get the variance.

  6. Take the square root of the variance to get the standard deviation.

Worked example: Suppose XX has this distribution: P(X=1)=0.2P(X=1) = 0.2, P(X=2)=0.5P(X=2) = 0.5, P(X=3)=0.3P(X=3) = 0.3.

First, find the mean:

μ=1(0.2)+2(0.5)+3(0.3)=0.2+1.0+0.9=2.1\mu = 1(0.2) + 2(0.5) + 3(0.3) = 0.2 + 1.0 + 0.9 = 2.1

Then compute the variance:

Var(X)=(12.1)2(0.2)+(22.1)2(0.5)+(32.1)2(0.3)Var(X) = (1 - 2.1)^2(0.2) + (2 - 2.1)^2(0.5) + (3 - 2.1)^2(0.3)

=(1.21)(0.2)+(0.01)(0.5)+(0.81)(0.3)= (1.21)(0.2) + (0.01)(0.5) + (0.81)(0.3) =0.242+0.005+0.243=0.49= 0.242 + 0.005 + 0.243 = 0.49

σ=0.49=0.7\sigma = \sqrt{0.49} = 0.7

A larger standard deviation means the values are more spread out from the mean; a smaller one means they cluster tightly around it.

Population and Sample Statistics

  • Population parameters (like μ\mu and σ\sigma) describe the entire population. When you know the full probability distribution, you can calculate these exactly using the formulas above.
  • Sample statistics (like xˉ\bar{x} and ss) are calculated from a subset of data and serve as estimates of the population parameters.

In this unit, you're typically working with known probability distributions, so you're computing population parameters directly. Later in the course, you'll work more with sample statistics and the challenge of estimating parameters from incomplete data.