Fiveable

🎲Intro to Statistics Unit 10 Review

QR code for Intro to Statistics practice questions

10.4 Matched or Paired Samples

10.4 Matched or Paired Samples

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
🎲Intro to Statistics
Unit & Topic Study Guides

Matched or Paired Samples in Hypothesis Testing

Most hypothesis tests you've seen so far compare two separate, unrelated groups. But what happens when the two measurements come from the same subjects or naturally linked pairs? That's where matched (or paired) samples come in. Instead of comparing two group means directly, you calculate the difference within each pair and then test whether those differences are significantly different from zero.

Characteristics of Matched Samples

Matched pairs involve two related samples where each observation in one sample has a direct counterpart in the other. Common examples include:

  • Before-and-after measurements on the same person (weight before and after a diet program)
  • Twin studies comparing a trait between identical twins raised in different environments
  • Matched case-control studies where each case is paired with a similar control based on age, sex, or other factors

Because the observations are linked, the samples are dependent. This is the defining feature that separates matched pairs from independent samples.

The goal is to determine whether the mean difference between paired observations is significantly different from zero.

  • Null hypothesis (H0H_0): The population mean difference equals zero (μd=0\mu_d = 0). In other words, there's no real change or difference between the paired conditions.
  • Alternative hypothesis (HaH_a):
    • Two-sided: μd0\mu_d \neq 0
    • One-sided: μd>0\mu_d > 0 or μd<0\mu_d < 0
Characteristics of matched samples, Hypothesis Testing: Two Samples | Boundless Statistics

Test Statistic for Matched Pairs

The first step is to compute a difference score for every pair. If you measured 15 people before and after a treatment, you'd get 15 difference values. Then you treat those differences as a single sample and run a one-sample t-test on them.

The formula for the test statistic is:

t=dˉμdsd/nt = \frac{\bar{d} - \mu_d}{s_d / \sqrt{n}}

  • dˉ\bar{d} = mean of the paired differences
  • μd\mu_d = hypothesized mean difference (almost always 0 under H0H_0)
  • sds_d = standard deviation of the differences
  • nn = number of pairs (not the total number of individual observations)

This test statistic follows a t-distribution with n1n - 1 degrees of freedom.

Steps to Conduct a Paired t-Test

  1. Calculate the difference did_i for each pair (e.g., "after" minus "before").

  2. Find the mean of those differences (dˉ\bar{d}) and their standard deviation (sds_d).

  3. Plug into the formula above to get your t-statistic.

  4. Determine your critical value from the t-table using n1n - 1 degrees of freedom, or calculate the p-value.

  5. Decision rule: If the p-value is less than your significance level (commonly 0.05), reject H0H_0. If the p-value is greater, fail to reject H0H_0.

This procedure is also called a dependent t-test or paired t-test.

Characteristics of matched samples, 8.1 Inference for Two Dependent Samples (Matched Pairs) – Significant Statistics

Matched Pairs vs. Independent Samples

Choosing the wrong test here is a common mistake on exams. The distinction comes down to whether the data points are linked.

FeatureMatched PairsIndependent Samples
Relationship between samplesDependent (paired observations)Unrelated (no pairing)
What you analyzeDifferences within each pairMeans of two separate groups
Degrees of freedomn1n - 1 (where nn = number of pairs)Based on both sample sizes and variances
Typical scenarioSame subjects measured twice, twins, matched controlsComparing two distinct populations

When to use which: If there's a natural pairing or the same subjects appear in both samples, use matched pairs. If the two groups have no connection to each other, use independent samples.

Why Matched Pairs Can Be More Powerful

A matched pairs design controls for variability between subjects. Since each person serves as their own control, individual differences (like baseline fitness, age, or genetics) get canceled out when you take the difference. This reduces noise in your data, which makes it easier to detect a real effect if one exists. That's why paired designs often have greater statistical power than independent samples designs for the same number of subjects.

The correlation between paired measurements reflects how tightly linked the two sets of scores are. Higher correlation means more of that between-subject variability gets removed, which further increases the precision of your test.