Matched or Paired Samples in Hypothesis Testing
Most hypothesis tests you've seen so far compare two separate, unrelated groups. But what happens when the two measurements come from the same subjects or naturally linked pairs? That's where matched (or paired) samples come in. Instead of comparing two group means directly, you calculate the difference within each pair and then test whether those differences are significantly different from zero.
Characteristics of Matched Samples
Matched pairs involve two related samples where each observation in one sample has a direct counterpart in the other. Common examples include:
- Before-and-after measurements on the same person (weight before and after a diet program)
- Twin studies comparing a trait between identical twins raised in different environments
- Matched case-control studies where each case is paired with a similar control based on age, sex, or other factors
Because the observations are linked, the samples are dependent. This is the defining feature that separates matched pairs from independent samples.
The goal is to determine whether the mean difference between paired observations is significantly different from zero.
- Null hypothesis (): The population mean difference equals zero (). In other words, there's no real change or difference between the paired conditions.
- Alternative hypothesis ():
- Two-sided:
- One-sided: or

Test Statistic for Matched Pairs
The first step is to compute a difference score for every pair. If you measured 15 people before and after a treatment, you'd get 15 difference values. Then you treat those differences as a single sample and run a one-sample t-test on them.
The formula for the test statistic is:
- = mean of the paired differences
- = hypothesized mean difference (almost always 0 under )
- = standard deviation of the differences
- = number of pairs (not the total number of individual observations)
This test statistic follows a t-distribution with degrees of freedom.
Steps to Conduct a Paired t-Test
-
Calculate the difference for each pair (e.g., "after" minus "before").
-
Find the mean of those differences () and their standard deviation ().
-
Plug into the formula above to get your t-statistic.
-
Determine your critical value from the t-table using degrees of freedom, or calculate the p-value.
-
Decision rule: If the p-value is less than your significance level (commonly 0.05), reject . If the p-value is greater, fail to reject .
This procedure is also called a dependent t-test or paired t-test.

Matched Pairs vs. Independent Samples
Choosing the wrong test here is a common mistake on exams. The distinction comes down to whether the data points are linked.
| Feature | Matched Pairs | Independent Samples |
|---|---|---|
| Relationship between samples | Dependent (paired observations) | Unrelated (no pairing) |
| What you analyze | Differences within each pair | Means of two separate groups |
| Degrees of freedom | (where = number of pairs) | Based on both sample sizes and variances |
| Typical scenario | Same subjects measured twice, twins, matched controls | Comparing two distinct populations |
When to use which: If there's a natural pairing or the same subjects appear in both samples, use matched pairs. If the two groups have no connection to each other, use independent samples.
Why Matched Pairs Can Be More Powerful
A matched pairs design controls for variability between subjects. Since each person serves as their own control, individual differences (like baseline fitness, age, or genetics) get canceled out when you take the difference. This reduces noise in your data, which makes it easier to detect a real effect if one exists. That's why paired designs often have greater statistical power than independent samples designs for the same number of subjects.
The correlation between paired measurements reflects how tightly linked the two sets of scores are. Higher correlation means more of that between-subject variability gets removed, which further increases the precision of your test.