9.1 Null and Alternative Hypotheses

3 min readjune 27, 2024

is a powerful tool for making decisions based on data. It involves setting up null and alternative hypotheses, then using statistical methods to determine which is more likely to be true.

The process includes formulating hypotheses, interpreting symbols, and making decisions based on test statistics or p-values. Understanding this process is crucial for drawing meaningful conclusions from data in various fields.

Hypothesis Testing

Formulation of statistical hypotheses

Top images from around the web for Formulation of statistical hypotheses
Top images from around the web for Formulation of statistical hypotheses
  • (H0H_0) asserts no significant difference or effect exists
    • Represents the default or status quo position (no change from current understanding)
    • Often includes an equality symbol (=, ≤, or ≥) (population mean μ=100\mu = 100)
  • (HaH_a or H1H_1) asserts a significant difference or effect exists
    • Represents the research question or claim being tested (new drug is effective)
    • Often includes an inequality symbol (≠, >, or <) (population proportion p>0.5p > 0.5)
  • Examples:
    • Testing a new teaching method: H0H_0: No improvement in test scores, HaH_a: Improvement in test scores
    • Comparing product defect rates: H0H_0: Defect rates are equal (p1=p2p_1 = p_2), HaH_a: Defect rates differ (p1p2p_1 \neq p_2)

Interpretation of hypothesis symbols

  • H0H_0 denotes the null hypothesis
  • HaH_a or H1H_1 denotes the alternative hypothesis
  • Greek letters represent population parameters:
    • μ\mu represents the population mean (average value)
    • pp represents the population proportion (fraction or percentage)
    • σ\sigma represents the population standard deviation (measure of variability)
  • Subscripts differentiate between specific populations or groups (μ1\mu_1 vs. μ2\mu_2)
  • Equality symbols in H0H_0 suggest no difference or effect (= for exactly equal, ≤ or ≥ for at most or at least)
  • Inequality symbols in HaH_a suggest a difference or effect (≠ for not equal, > or < for greater than or less than)

Decision-making in hypothesis testing

  1. Gather sample data relevant to the hypotheses being tested
  2. Compute the appropriate based on the data and hypothesis (z-score for normal data, t-score for small samples, chi-square for categorical data)
  3. Determine the critical value(s) using the chosen (α\alpha, often 0.05)
    • If the test statistic is in the (more extreme than the critical value), H0H_0
    • If the test statistic is outside the critical region (less extreme than the critical value), H0H_0
  4. Alternatively, calculate the (probability of observing the sample data or more extreme results, assuming H0H_0 is true)
    • If the p-value is less than α\alpha, reject H0H_0 (statistically significant result)
    • If the p-value is greater than or equal to α\alpha, fail to reject H0H_0 (not statistically significant)
  5. Draw conclusions based on the decision regarding H0H_0
    • Rejecting H0H_0 suggests the sample data supports HaH_a (evidence for a significant difference or effect)
    • Failing to reject H0H_0 suggests insufficient evidence to support HaH_a (cannot conclude a significant difference or effect exists)

Statistical Inference and Interpretation

  • involves drawing conclusions about populations based on sample data
  • Hypothesis testing is a key method in statistical inference for making decisions about population parameters
  • indicates the likelihood that an observed effect is not due to chance
  • Confidence intervals provide a range of plausible values for population parameters, complementing hypothesis tests
  • measures the magnitude of the observed difference or relationship, providing context for statistical significance

Key Terms to Review (25)

Alternative Hypothesis: The alternative hypothesis, denoted as H1 or Ha, is a statement that contradicts the null hypothesis and suggests that the observed difference or relationship in a study is statistically significant and not due to chance. It represents the researcher's belief about the population parameter or the relationship between variables.
Chi-Square Test: The chi-square test is a statistical hypothesis test used to determine if there is a significant difference between observed and expected frequencies or proportions in one or more categories. It is a versatile test that can be applied in various contexts, including contingency tables, discrete distributions, and tests of independence or variance.
Confidence Interval: A confidence interval is a range of values that is likely to contain an unknown population parameter, such as a mean or proportion, with a specified level of confidence. It provides a way to quantify the uncertainty associated with estimating a population characteristic from a sample.
Critical Region: The critical region, also known as the rejection region, is a range of values for a test statistic that leads to the rejection of the null hypothesis in a statistical hypothesis test. It represents the set of outcomes that are considered statistically significant and unlikely to have occurred by chance under the assumption that the null hypothesis is true.
Effect Size: Effect size is a quantitative measure that indicates the magnitude or strength of the relationship between two variables or the difference between two groups. It provides information about the practical significance of a statistical finding, beyond just the statistical significance.
Fail to Reject: Fail to reject refers to the outcome of a hypothesis test where the null hypothesis is not rejected, indicating that the observed data does not provide sufficient evidence to conclude that the null hypothesis is false. This term is crucial in the context of 9.1 Null and Alternative Hypotheses, as it represents one of the possible conclusions in a hypothesis test.
H0: H0, or the null hypothesis, is a statistical hypothesis that represents the default or status quo position that there is no significant difference or relationship between the variables being studied. It is the hypothesis that is assumed to be true until the evidence strongly suggests otherwise.
H1: H1, or the alternative hypothesis, is a statistical hypothesis that proposes a specific relationship or difference between two or more variables. It is the hypothesis that the researcher believes to be true and wants to provide evidence for, in contrast to the null hypothesis (H0).
Ha: In the context of hypothesis testing, Ha, or the alternative hypothesis, represents the statement that the researcher believes to be true. It is the complement of the null hypothesis (H0) and is the hypothesis that the researcher aims to provide evidence for through statistical analysis.
Hypothesis Testing: Hypothesis testing is a statistical method used to determine whether a particular claim or hypothesis about a population parameter is likely to be true or false based on sample data. It involves formulating null and alternative hypotheses, collecting and analyzing sample data, and making a decision to either reject or fail to reject the null hypothesis.
Normal Distribution: The normal distribution, also known as the Gaussian distribution, is a continuous probability distribution that is symmetrical and bell-shaped. It is a fundamental concept in statistics and probability theory, with widespread applications across various fields, including the topics covered in this course.
Null Hypothesis: The null hypothesis, denoted as H0, is a statistical hypothesis that states there is no significant difference or relationship between the variables being studied. It represents the default or initial position that a researcher takes before conducting an analysis or experiment.
One-Tailed Hypothesis: A one-tailed hypothesis is a type of statistical hypothesis test where the alternative hypothesis specifies the direction of the expected difference or relationship between two variables. It is used when the researcher has a clear prediction about the direction of the effect, rather than just expecting a difference in any direction.
P-value: The p-value is a statistical measure that represents the probability of obtaining a test statistic that is at least as extreme as the observed value, given that the null hypothesis is true. It is a crucial component in hypothesis testing, as it helps determine the strength of evidence against the null hypothesis and guides the decision-making process in statistical analysis across a wide range of topics in statistics.
Reject: In the context of statistical hypothesis testing, to reject a null hypothesis means that the observed data provides sufficient evidence to conclude that the null hypothesis is false. The decision to reject the null hypothesis is made based on the results of a statistical test, which determines the likelihood of obtaining the observed data if the null hypothesis is true.
Significance Level: The significance level, denoted as α, is the probability of rejecting the null hypothesis when it is true. It represents the maximum acceptable probability of making a Type I error, which is the error of concluding that an effect exists when it does not. The significance level is a critical component in hypothesis testing, as it sets the threshold for determining the statistical significance of the observed results.
Statistical Inference: Statistical inference is the process of using data analysis to infer properties about a population from a sample. It involves drawing conclusions and making predictions based on the information gathered from a subset of a larger group or dataset.
Statistical Significance: Statistical significance is a statistical measure that determines the probability of an observed effect or relationship occurring by chance alone. It is a crucial concept in hypothesis testing, experimental design, and data analysis, as it helps researchers distinguish between findings that are likely due to random chance and those that are likely to represent a true effect or relationship in the population.
T-distribution: The t-distribution, also known as the Student's t-distribution, is a probability distribution used to make statistical inferences about the mean of a population when the sample size is small and the population standard deviation is unknown. It is a bell-shaped, symmetric distribution that is similar to the normal distribution but has heavier tails, accounting for the increased uncertainty associated with small sample sizes.
T-test: The t-test is a statistical hypothesis test that is used to determine if the mean of a population is significantly different from a hypothesized value or if the means of two populations are significantly different from each other. It is commonly used in scenarios where the population standard deviation is unknown, and the sample size is small.
Test Statistic: A test statistic is a numerical value calculated from a sample data that is used to determine whether to reject or fail to reject the null hypothesis in a hypothesis test. It is a crucial component in various statistical analyses, as it provides the basis for making inferences about population parameters.
Two-Tailed Hypothesis: A two-tailed hypothesis is a statistical hypothesis test in which the critical region is two-sided, meaning it is split into two parts, one in each of the tails of the probability distribution. This type of hypothesis test is used when the researcher is interested in determining if the population parameter is different from a specified value, without specifying the direction of the difference.
Type I Error: A Type I error, also known as a false positive, occurs when the null hypothesis is true, but the test incorrectly rejects it. In other words, it is the error of concluding that a difference exists when, in reality, there is no actual difference between the populations or treatments being studied.
Type II Error: A type II error, also known as a false negative, occurs when the null hypothesis is true, but the statistical test fails to reject it. In other words, the test concludes that there is no significant difference or effect when, in reality, there is one.
Z-test: The z-test is a statistical hypothesis test that uses the standard normal distribution to determine if the mean of a population is equal to a specified value. It is commonly used when the population standard deviation is known or when the sample size is large enough to assume a normal distribution.
© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.