Fiveable

👩‍💻Foundations of Data Science Unit 7 Review

QR code for Foundations of Data Science practice questions

7.2 Hypothesis Testing Fundamentals

7.2 Hypothesis Testing Fundamentals

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
👩‍💻Foundations of Data Science
Unit & Topic Study Guides

Hypothesis testing is a crucial tool in data science, allowing researchers to make informed decisions based on statistical evidence. It involves formulating null and alternative hypotheses, understanding error types, and interpreting p-values to draw meaningful conclusions.

Significance levels play a key role in hypothesis testing, determining when to reject the null hypothesis. Researchers must carefully choose appropriate levels, considering factors like error risks and field conventions, to ensure reliable and meaningful results in their data analysis.

Hypothesis Testing Fundamentals

Formulation of research hypotheses

  • Null hypothesis (H0) represents default assumption of no effect or difference typically includes equality (=, ≤, or ≥)
  • Alternative hypothesis (Ha or H1) expresses researcher's claim or suspicion contradicts null hypothesis includes inequality (≠, <, or >)
  • Steps to formulate hypotheses:
    1. Identify research question
    2. Determine parameter of interest
    3. State null hypothesis as no effect or difference
    4. Express alternative hypothesis as opposite of null
  • Hypothesis pairs:
    • Mean: H0: μ = μ0 vs Ha: μ ≠ μ0
    • Proportion: H0: p ≤ p0 vs Ha: p > p0
    • Comparing groups: H0: μ1 = μ2 vs Ha: μ1 > μ2
Formulation of research hypotheses, Comparing two means – Learning Statistics with R

Types of statistical errors

  • Type I error (false positive) occurs when rejecting true null hypothesis probability denoted by α (alpha) also called significance level
  • Type II error (false negative) happens when failing to reject false null hypothesis probability denoted by β (beta)
  • Inverse relationship between Type I and II errors decreasing one increases the other
  • Power of test calculated as 1 - β measures probability of correctly rejecting false null hypothesis
  • Error implications:
    • Type I leads to false conclusions or unnecessary actions (approving ineffective drug)
    • Type II results in missing important effects or differences (failing to detect environmental hazard)
Formulation of research hypotheses, Introduction to Hypothesis Testing | Concepts in Statistics

P-values in hypothesis testing

  • P-value represents probability of obtaining test results at least as extreme as observed assuming null hypothesis is true
  • Calculation depends on test statistic and distribution:
    • Z-test: p-value = P(Z ≥ |z|) for two-tailed test
    • T-test: p-value = P(T ≥ |t|) for two-tailed test
  • Small p-value (< 0.05) suggests strong evidence against null hypothesis large p-value indicates weak evidence
  • Reject H0 if p-value < significance level fail to reject H0 if p-value ≥ significance level
  • Limitations: do not measure effect size or importance can be affected by sample size

Significance levels for tests

  • Common levels: 0.05 (5%) standard in many fields 0.01 (1%) more stringent for scientific research 0.10 (10%) less stringent for exploratory research
  • Factors influencing choice: consequences of Type I error field-specific conventions sample size and power considerations
  • Lower significance level reduces Type I error risk but increases Type II error risk
  • Multiple testing problem requires adjusting significance level (Bonferroni correction)
  • Reporting results: state chosen significance level before test report exact p-values rather than just "significant" or "not significant"
Pep mascot
Upgrade your Fiveable account to print any study guide

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Click below to go to billing portal → update your plan → choose Yearly → and select "Fiveable Share Plan". Only pay the difference

Plan is open to all students, teachers, parents, etc
Pep mascot
Upgrade your Fiveable account to export vocabulary

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Plan is open to all students, teachers, parents, etc
report an error
description

screenshots help us find and fix the issue faster (optional)

add screenshot

2,589 studying →