In physical sciences, raw data means nothing until you can describe it. Whether you're analyzing radioactive decay counts, measuring reaction times, or comparing material strengths, statistical measures give you the language to communicate what your data actually shows. You're being tested on your ability to choose the right measure for a given situation—knowing when the mean misleads you, why standard deviation matters more than range, and how distribution shape affects your conclusions.
These measures fall into two fundamental categories: central tendency (where's the middle?) and dispersion (how spread out is it?). A third category—distribution shape—tells you whether your data is symmetric or skewed, which determines which other measures you can trust. Don't just memorize formulas; know when each measure breaks down and what it reveals about your experimental results.
Measures of Central Tendency
These statistics answer the question: what's a typical value in my data set? Each measure captures "typical" differently, and choosing the wrong one can misrepresent your results entirely.
Mean
Calculated as the sum of all values divided by the count—mathematically, xˉ=n1∑i=1nxi
Sensitive to outliers, meaning a single extreme measurement can drag the mean away from where most data actually sits
Best used when data is symmetric and free of extreme values; the go-to measure for normally distributed experimental results
Median
The middle value when data is ordered—for even-numbered sets, average the two central values
Robust against outliers, making it the preferred central tendency measure for skewed distributions
Reveals when mean is misleading; if median differs significantly from mean, your data is likely skewed
Mode
The most frequently occurring value in a data set—the only central tendency measure applicable to categorical data
Can be unimodal, bimodal, or multimodal, with multiple modes often indicating distinct subpopulations in your sample
Useful for identifying clustering in discrete measurements like photon counts or particle detection events
Compare: Mean vs. Median—both locate the "center" of data, but the mean uses all values mathematically while the median uses position only. If an FRQ gives you a skewed distribution and asks for the best central tendency measure, choose median and explain why outliers would distort the mean.
Measures of Spread (Dispersion)
Knowing the center isn't enough—you need to quantify how much variation exists. These measures describe whether your data clusters tightly or spreads widely, which directly impacts how confident you can be in your results.
Range
Simply the maximum minus minimum value—the quickest but crudest measure of spread
Completely determined by two extreme points, ignoring how the rest of your data is distributed
Useful for quick assessment but inadequate for serious analysis; always supplement with other dispersion measures
Variance
The average of squared deviations from the mean—calculated as σ2=n1∑i=1n(xi−xˉ)2 for populations (use n−1 for samples)
Squaring eliminates negative deviations and weights larger deviations more heavily than small ones
Units are squared, making variance difficult to interpret directly but essential for mathematical derivations
Standard Deviation
The square root of variance, σ=σ2, returning dispersion to the original measurement units
Quantifies uncertainty in measurements—the standard way to report experimental precision in physical sciences
For normal distributions, approximately 68% of data falls within ±1σ of the mean, 95% within ±2σ
Compare: Range vs. Standard Deviation—range uses only two data points while standard deviation incorporates every measurement. Standard deviation is almost always preferred because it reflects the actual distribution of your data, not just its extremes.
Quartiles
Divide ordered data into four equal parts—Q1 (25th percentile), Q2 (median, 50th percentile), and Q3 (75th percentile)
Provide distribution landmarks without assuming any particular shape, making them useful for any data set
Essential for box plots and for identifying potential outliers using the 1.5×IQR rule
Interquartile Range (IQR)
Calculated as IQR=Q3−Q1, capturing the spread of the middle 50% of data
Robust against outliers because it ignores the extreme 25% on each end of the distribution
Preferred dispersion measure for skewed data when standard deviation would be misleading
Compare: Standard Deviation vs. IQR—standard deviation assumes roughly symmetric data and is sensitive to outliers, while IQR makes no distributional assumptions and ignores extremes. Use IQR when your data is skewed; use standard deviation when it's approximately normal.
Measures of Distribution Shape
These statistics describe how your data is distributed, not just where it centers or how much it spreads. Distribution shape determines which other statistical measures and tests are appropriate.
Skewness
Measures asymmetry around the mean—positive skewness means a longer right tail, negative skewness means a longer left tail
Indicates mean-median relationship: positively skewed data has mean > median; negatively skewed has mean < median
Affects which central tendency measure to trust; high skewness is a red flag that the mean may misrepresent typical values
Kurtosis
Measures "tailedness" or peak sharpness relative to a normal distribution—not simply how peaked the center is
High kurtosis (leptokurtic) indicates heavy tails with more outliers than expected; low kurtosis (platykurtic) indicates light tails
Critical for risk assessment in physical sciences when extreme values have significant consequences (equipment failure, safety limits)
Compare: Skewness vs. Kurtosis—skewness tells you about asymmetry (left-right balance), while kurtosis tells you about tail weight (outlier frequency). Both describe distribution shape, but they capture completely different features. A distribution can be symmetric (zero skewness) but still have unusual kurtosis.
Quick Reference Table
Concept
Best Examples
Central tendency (symmetric data)
Mean
Central tendency (skewed data)
Median, Mode
Basic spread
Range
Spread with units matching data
Standard Deviation
Spread for mathematical work
Variance
Robust spread measure
IQR, Quartiles
Distribution asymmetry
Skewness
Outlier tendency
Kurtosis
Self-Check Questions
You measure the masses of 20 samples, but one sample was contaminated and has an extremely high mass. Which central tendency measure should you report, and why?
Compare and contrast variance and standard deviation. Why do physical scientists typically report standard deviation rather than variance when describing measurement uncertainty?
A data set has a mean of 45 and a median of 52. What does this tell you about the skewness of the distribution, and which measure better represents a "typical" value?
Which two measures of spread are most robust against outliers? Explain why this robustness matters when analyzing experimental data with potential measurement errors.
If an FRQ asks you to describe both the center and spread of a non-symmetric data set, which four statistics would you calculate and report? Justify your choices.