Fiveable

📉Statistical Methods for Data Science Unit 5 Review

QR code for Statistical Methods for Data Science practice questions

5.1 Fundamentals of Hypothesis Testing

5.1 Fundamentals of Hypothesis Testing

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
📉Statistical Methods for Data Science
Unit & Topic Study Guides

Hypothesis testing is a crucial statistical method for making decisions about populations based on sample data. It involves formulating null and alternative hypotheses, choosing between one-tailed and two-tailed tests, and setting significance levels to evaluate evidence.

The process includes calculating test statistics, determining rejection regions, and using p-values to make decisions. Understanding these fundamentals helps researchers draw meaningful conclusions from data and assess the strength of evidence against null hypotheses.

Hypothesis Formulation

Null and Alternative Hypotheses

  • Null hypothesis (H0H_0) assumes no significant difference or effect exists between populations or variables
  • Alternative hypothesis (HaH_a or H1H_1) proposes a significant difference or effect exists, contradicting the null hypothesis
  • Hypotheses are mutually exclusive and exhaustive, covering all possible outcomes
  • Formulating hypotheses involves clearly defining the parameter of interest and the direction of the alternative hypothesis

One-Tailed and Two-Tailed Tests

  • One-tailed test specifies the direction of the alternative hypothesis (greater than or less than the null value)
    • Upper-tailed test: Ha:μ>μ0H_a: \mu > \mu_0
    • Lower-tailed test: Ha:μ<μ0H_a: \mu < \mu_0
  • Two-tailed test allows for the alternative hypothesis to be either greater than or less than the null value (Ha:μμ0H_a: \mu \neq \mu_0)
  • Choice between one-tailed and two-tailed tests depends on the research question and prior knowledge about the direction of the effect
  • One-tailed tests are more powerful but less conservative compared to two-tailed tests
Null and Alternative Hypotheses, File:P-value in statistical significance testing.svg - Wikimedia Commons

Test Components

Significance Level and Critical Region

  • Level of significance (α\alpha) is the probability of rejecting the null hypothesis when it is true (Type I error)
    • Commonly used levels: 0.01, 0.05, and 0.10
  • Critical region (or rejection region) is the range of test statistic values that lead to rejecting the null hypothesis
  • Critical values are the boundaries of the critical region, determined by the level of significance and the distribution of the test statistic
  • Choosing an appropriate significance level balances the risks of Type I and Type II errors
Null and Alternative Hypotheses, hypothesis testing - Distribution of test statistic under null and alternative - Cross Validated

Test Statistic and Rejection Region

  • Test statistic is a value calculated from the sample data used to make a decision about the null hypothesis
    • Examples: z-statistic, t-statistic, F-statistic, chi-square statistic
  • Test statistic follows a known distribution under the null hypothesis (e.g., standard normal, t-distribution, F-distribution)
  • Rejection region is the range of test statistic values that exceed the critical value(s)
  • If the test statistic falls within the rejection region, the null hypothesis is rejected in favor of the alternative hypothesis

Decision Making

P-Value and Decision Rule

  • P-value is the probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true
  • P-value provides a measure of the strength of evidence against the null hypothesis
  • Smaller p-values indicate stronger evidence against the null hypothesis
  • Decision rule: Reject the null hypothesis if the p-value is less than or equal to the level of significance (α\alpha)
    • If p-value α\leq \alpha, reject H0H_0 and conclude that there is significant evidence to support HaH_a
    • If p-value >α> \alpha, fail to reject H0H_0 and conclude that there is insufficient evidence to support HaH_a

Making Decisions and Interpreting Results

  • Decision making in hypothesis testing is based on comparing the p-value to the pre-specified level of significance
  • Rejecting the null hypothesis suggests that the observed effect or difference is statistically significant
  • Failing to reject the null hypothesis does not prove that the null hypothesis is true, but rather that there is insufficient evidence to support the alternative hypothesis
  • Interpreting results should consider the context of the research question, sample size, and practical significance of the findings
  • Statistical significance does not always imply practical or clinical significance, and the magnitude of the effect should be considered alongside the p-value
Pep mascot
Upgrade your Fiveable account to print any study guide

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Click below to go to billing portal → update your plan → choose Yearly → and select "Fiveable Share Plan". Only pay the difference

Plan is open to all students, teachers, parents, etc
Pep mascot
Upgrade your Fiveable account to export vocabulary

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Plan is open to all students, teachers, parents, etc
report an error
description

screenshots help us find and fix the issue faster (optional)

add screenshot

2,589 studying →