Confidence Intervals for Home Costs

Confidence intervals for population means
A confidence interval gives you a range of plausible values for the true population mean home cost, based on what you observe in a sample. The core idea: since we almost never have data on every home in a population, we use sample statistics plus a margin of error to build an interval that likely captures the true mean.
When the population standard deviation () is known, use the z-formula:
- : sample mean home cost
- : critical value from the standard normal distribution (1.96 for 95% confidence)
- : known population standard deviation
- : sample size
When the population standard deviation is unknown (which is almost always the case in practice), use the t-formula:
- : sample standard deviation (used as an estimate of )
- : critical value from the t-distribution with degrees of freedom
The t-distribution has heavier tails than the z-distribution, which makes the interval slightly wider to account for the extra uncertainty of estimating with . As grows large, the t-distribution approaches the standard normal.
Steps to calculate a confidence interval for home costs:
- Choose your confidence level and find the corresponding critical value. For 95% confidence with a large sample, . If using the t-distribution with, say, 99 degrees of freedom, .
- Calculate the sample mean and (if is unknown) the sample standard deviation from your data.
- Compute the standard error: or .
- Plug into the formula. For example, with , , and :
- State the interval and interpret it in context: "We are 95% confident that the true mean home cost in this population is between and ."
Note that this procedure assumes the sampling distribution of is approximately normal. That's satisfied when the population itself is roughly normal, or when the sample size is large enough for the Central Limit Theorem to apply (typically ).

Interpretation of confidence intervals
Getting the interpretation right matters a lot on exams. A 95% confidence interval does not mean there's a 95% probability the true mean falls in this particular interval. The true mean is a fixed value; it's either in the interval or it isn't.
The correct interpretation: if you repeated the sampling process many times and built a 95% confidence interval each time, about 95% of those intervals would contain the true population mean.
Width of the interval tells you about precision. A narrow interval like to is a much more useful estimate than a wide one like to .
Three factors control the width:
- Sample size (): Larger samples produce narrower intervals because the standard error shrinks. Going from to cuts the standard error by roughly a factor of .
- Variability in the data: More spread in home prices (larger ) means wider intervals. A market with produces a much wider interval than one with .
- Confidence level: Higher confidence requires a larger critical value, which widens the interval. A 99% interval is wider than a 90% interval from the same data. You're trading precision for greater confidence.

Applications in housing markets
Comparing two markets: If the confidence intervals for mean home prices in two cities don't overlap at all (City A: to ; City B: to ), that's strong informal evidence of a real difference in population means. However, overlapping intervals (City A: to ; City B: to ) do not automatically mean the means are equal. Two intervals can overlap and a formal hypothesis test can still find a significant difference. Comparing individual confidence intervals is a rough tool, not a substitute for a proper two-sample test.
Evaluating individual properties: If a home is listed at and the 95% confidence interval for the area's mean price is to , that listing sits above the estimated average range. This doesn't mean the price is "wrong," but it does suggest the home is priced above what's typical for the area.
Limitations to keep in mind:
- A confidence interval is only as good as the sample it's built from. A biased or non-representative sample (e.g., only sampling luxury homes) will produce a misleading interval no matter how large is.
- The method assumes observations are independent and that the normality condition is met (either a normal population or a large enough sample).
- Even at 95% confidence, 5% of intervals constructed this way will miss the true mean. That's not a flaw; it's built into the method.
Statistical Inference and Hypothesis Testing
Confidence intervals are one of the two main tools of statistical inference (drawing conclusions about a population from sample data). The other is hypothesis testing.
Both methods rely on the sampling distribution of and its standard error ( or ). A confidence interval estimates where the parameter is. A hypothesis test evaluates whether the parameter equals a specific claimed value. They're closely connected: if a hypothesized value falls outside your 95% confidence interval, you'd reject that value at the significance level.