---
title: "Residual Analysis — AP Stats Definition & Exam Guide"
description: "Residual analysis means checking residuals (observed minus predicted y) to verify linearity and constant variance before regression inference in AP Stats Unit 9."
canonical: "https://fiveable.me/ap-stats/key-terms/residual-analysis"
type: "key-term"
subject: "AP Statistics"
unit: "Unit 9"
---

# Residual Analysis — AP Stats Definition & Exam Guide

## Definition

Residual analysis is the process of examining residuals (observed y minus predicted ŷ) with plots and summaries to check the conditions for regression inference, especially that the true relationship is linear and that the standard deviation of y stays roughly constant across all values of x.

## What It Is

A [residual](/ap-stats/unit-5/residuals/study-guide/zdTJQZw0UVGswyK6kkEF "fv-autolink") is the leftover error for one data point. It's the observed y-value minus the value your [regression line](/ap-stats/key-terms/regression-line "fv-autolink") predicted, written yi − ŷi. Residual analysis means looking at all those leftovers together, usually in a residual plot, to judge whether a linear model is actually trustworthy.

In Unit 9, residual analysis gets a specific job. Before you build a confidence interval for the [slope](/ap-stats/unit-5/linear-regression-models/study-guide/PSt5cfDuvB5nu60DHulR "fv-autolink") of a regression line, the CED says you have to verify conditions, and residuals are your evidence for two of them. First, the residual plot should show no curved pattern, which supports the condition that the true relationship between x and y is linear. Second, the vertical spread of the residuals should look roughly the same from left to right, which supports the condition that the standard deviation of y does not vary with x (constant variance). If the residual plot fans out, funnels in, or bends, the t-interval for the slope isn't valid.

## Why It Matters

Residual analysis lives in Topic 9.2 ([Confidence Intervals](/ap-stats/key-terms/confidence-interval "fv-autolink") for the Slope of a Regression Model) and directly supports learning objective [AP Stats](/ap-stats "fv-autolink") 9.2.B, verifying the conditions to calculate a confidence interval for the slope. The essential knowledge spells it out twice. Analysis of residuals may be used to verify linearity, and analysis of residuals may be used to check for approximately equal standard deviations for all x. In other words, when an inference problem hands you a residual plot, it's not decoration. It's the tool you cite to justify that the interval b ± t*(SEb) from AP Stats 9.2.D is legitimate. The same logic carries into significance tests for slope in Topics 9.3-9.6, so this one skill pays off across all of Unit 9.

## Connections

### Least-Squares Regression and Residuals (Unit 2)

You first met residuals in [Unit 2](/ap-stats/unit-2 "fv-autolink"), where a patternless residual plot told you a line was an appropriate model. Unit 9 recycles that exact skill, but now the stakes are higher. A bad residual plot doesn't just mean a poor fit, it means your confidence interval for the slope is invalid.

### [Constant Variance (Unit 9)](/ap-stats/key-terms/constant-variance)

Equal [spread](/ap-stats/key-terms/spread "fv-autolink") is the condition residual analysis is built to catch. If the residual plot fans out as x increases, the standard deviation of y is changing with x, and the constant variance condition fails. Look for a band of points with roughly even vertical width across the whole plot.

### SEb, the Standard Error of the Slope (Unit 9)

SEb is computed using s, an estimate of the common [standard deviation](/ap-stats/unit-1/investigative-question-revisited-data-collection/study-guide/f842Kr6YNnYX4G0dtAC8 "fv-autolink") of y around the line. That formula only makes sense if there IS one common standard deviation, which is exactly what residual analysis verifies. The conditions and the calculation are two halves of the same procedure.

### [Random Sample (Unit 3)](/ap-stats/key-terms/random-sample)

Residual plots can't check everything. Independence comes from how the data were collected (a random sample or randomized experiment, plus the 10% condition when sampling without replacement). On the exam, check residuals for linearity and equal spread, but check the study design for independence.

## On the AP Exam

Multiple-choice questions often show you a residual plot and ask which condition it supports or violates, or describe a study setup and ask you to assess conditions, like a researcher sampling 500 customers from 2,000 without replacement (that one fails the n ≤ 10% of N check, since 500 is 25% of the population). On FRQs, regression inference questions typically award credit for naming the conditions and pointing to specific evidence, so write things like "the residual plot shows no curved pattern, so the linearity condition is met" rather than just "conditions are satisfied." Vague condition-checking is one of the most common ways students lose FRQ points in Unit 9.

## residual analysis vs Residual plots for model fit (Unit 2)

Same plot, different purpose. In Unit 2, you read a residual plot to decide whether a linear model fits the sample data well. In Unit 9, residual analysis is a formal condition check for inference. You're verifying that the true relationship is linear AND that the spread of residuals is roughly constant across x, so the t-procedures for the slope are valid. Unit 2 asks "is a line a good description?" Unit 9 asks "can I trust this interval?"

## Key Takeaways

- A residual is observed minus predicted, yi − ŷi, and residual analysis means examining all the residuals to check whether regression inference is justified.
- A residual plot with no curved pattern supports the linearity condition for a confidence interval for the slope.
- Roughly equal vertical spread of residuals across all x-values supports the constant variance condition, that the standard deviation of y does not vary with x.
- Residual analysis cannot verify independence. That condition is checked through random sampling or random assignment plus the 10% condition.
- On FRQs, cite specific features of the residual plot when checking conditions, because writing 'conditions are met' with no evidence earns no credit.
- A fan or funnel shape in the residual plot signals changing variability, which invalidates the b ± t*(SEb) interval.

## FAQs

### What is residual analysis in AP Stats?

It's examining the residuals (observed y minus predicted ŷ) from a regression line, usually with a residual plot, to verify the conditions for inference about the slope. A patternless plot with even spread supports the linearity and constant variance conditions in Topic 9.2.

### Does a good residual plot prove the conditions for regression inference are met?

No. A patternless, evenly-spread residual plot supports linearity and constant variance, but independence depends on the data collection (random sample or randomized experiment, and n ≤ 10% of N when sampling without replacement). You can't see independence in a residual plot.

### How is residual analysis different from just making a residual plot in Unit 2?

The plot is the same, but in Unit 2 you used it to decide whether a line fits the data well. In Unit 9, residual analysis is a required condition check that determines whether your confidence interval b ± t*(SEb) is even valid.

### What does a fan shape in a residual plot mean?

It means the spread of the residuals changes as x changes, so the standard deviation of y varies with x. That violates the constant variance condition, and a t-interval for the slope would not be appropriate.

### Which conditions can residual analysis actually check?

Two of them. A lack of curved pattern checks linearity, and roughly equal vertical spread checks constant variance. Independence comes from study design, and approximate normality of responses at each x is a separate condition you often assess from the residuals' distribution or assume from the problem setup.

## Related Study Guides

- [Legacy AP Statistics Topic: Confidence Interval for Regression Slope](/ap-stats/unit-9/confidence-intervals-for-slope-regression-model/study-guide/YsvXWrndemJrI2kBF3Wn)

## Structured Data

```json
{"@context":"https://schema.org","@graph":[{"@type":"LearningResource","@id":"https://fiveable.me/ap-stats/key-terms/residual-analysis#resource","name":"Residual Analysis — AP Stats Definition & Exam Guide","url":"https://fiveable.me/ap-stats/key-terms/residual-analysis","learningResourceType":"Concept explainer","educationalLevel":"AP® / High School","about":{"@id":"https://fiveable.me/ap-stats/key-terms/residual-analysis#term"},"audience":{"@type":"EducationalAudience","educationalRole":"student"},"dateModified":"2026-06-11T05:22:54.942Z","isPartOf":{"@type":"Collection","name":"AP Statistics Key Terms","url":"https://fiveable.me/ap-stats/key-terms"},"publisher":{"@type":"Organization","name":"Fiveable","url":"https://fiveable.me"}},{"@type":"DefinedTerm","@id":"https://fiveable.me/ap-stats/key-terms/residual-analysis#term","name":"residual analysis","description":"Residual analysis is the process of examining residuals (observed y minus predicted ŷ) with plots and summaries to check the conditions for regression inference, especially that the true relationship is linear and that the standard deviation of y stays roughly constant across all values of x.","url":"https://fiveable.me/ap-stats/key-terms/residual-analysis","inDefinedTermSet":{"@type":"DefinedTermSet","name":"AP Statistics Key Terms","url":"https://fiveable.me/ap-stats/key-terms"}},{"@type":"FAQPage","mainEntity":[{"@type":"Question","name":"What is residual analysis in AP Stats?","acceptedAnswer":{"@type":"Answer","text":"It's examining the residuals (observed y minus predicted ŷ) from a regression line, usually with a residual plot, to verify the conditions for inference about the slope. A patternless plot with even spread supports the linearity and constant variance conditions in Topic 9.2."}},{"@type":"Question","name":"Does a good residual plot prove the conditions for regression inference are met?","acceptedAnswer":{"@type":"Answer","text":"No. A patternless, evenly-spread residual plot supports linearity and constant variance, but independence depends on the data collection (random sample or randomized experiment, and n ≤ 10% of N when sampling without replacement). You can't see independence in a residual plot."}},{"@type":"Question","name":"How is residual analysis different from just making a residual plot in Unit 2?","acceptedAnswer":{"@type":"Answer","text":"The plot is the same, but in Unit 2 you used it to decide whether a line fits the data well. In Unit 9, residual analysis is a required condition check that determines whether your confidence interval b ± t*(SEb) is even valid."}},{"@type":"Question","name":"What does a fan shape in a residual plot mean?","acceptedAnswer":{"@type":"Answer","text":"It means the spread of the residuals changes as x changes, so the standard deviation of y varies with x. That violates the constant variance condition, and a t-interval for the slope would not be appropriate."}},{"@type":"Question","name":"Which conditions can residual analysis actually check?","acceptedAnswer":{"@type":"Answer","text":"Two of them. A lack of curved pattern checks linearity, and roughly equal vertical spread checks constant variance. Independence comes from study design, and approximate normality of responses at each x is a separate condition you often assess from the residuals' distribution or assume from the problem setup."}}]},{"@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"name":"AP Statistics","item":"https://fiveable.me/ap-stats"},{"@type":"ListItem","position":2,"name":"Key Terms","item":"https://fiveable.me/ap-stats/key-terms"},{"@type":"ListItem","position":3,"name":"Unit 9","item":"https://fiveable.me/ap-stats/unit-9"},{"@type":"ListItem","position":4,"name":"residual analysis"}]}]}
```
