Intro to Statistics Unit 9 ReviewHypothesis Testing: Single Sample

Pep mascot
Upgrade your Fiveable account to print any study guide

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Click below to go to billing portal → update your plan → choose Yearly→ and select "Fiveable Share Plan". Only pay the difference

Plan is open to all students, teachers, parents, etc
Pep mascot
Upgrade your Fiveable account to export vocabulary

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Plan is open to all students, teachers, parents, etc

Hypothesis testing is a powerful statistical method used to evaluate claims about population parameters based on sample data. It provides a structured approach for making data-driven decisions across various fields, from psychology to quality control. The process involves formulating null and alternative hypotheses, selecting an appropriate test statistic, and comparing the calculated p-value to a predetermined significance level. This framework allows researchers to assess the validity of their assumptions and draw meaningful conclusions from their data.

unit 9 review

What's Hypothesis Testing?

  • Statistical method used to determine whether a claim or hypothesis about a population parameter is reasonable based on sample data
  • Involves comparing a sample statistic to a hypothesized population parameter to assess the validity of the claim
  • Helps researchers and analysts make data-driven decisions by providing a framework for testing assumptions and drawing conclusions
  • Relies on the concept of statistical significance, which quantifies the likelihood of observing a sample result if the null hypothesis is true
  • Commonly used in fields such as psychology, biology, marketing, and quality control to test theories, evaluate interventions, and make predictions
    • For example, a psychologist might use hypothesis testing to determine if a new therapy is effective in reducing anxiety symptoms compared to a placebo
  • Requires specifying a null hypothesis (H0) and an alternative hypothesis (Ha) that represent competing claims about the population parameter
  • The outcome of a hypothesis test is either rejecting the null hypothesis in favor of the alternative or failing to reject the null hypothesis due to insufficient evidence

Types of Hypotheses

  • Null hypothesis (H0) represents the default or status quo claim, typically stating that there is no significant difference or relationship between variables
    • For example, H0: The mean weight of a population is equal to 150 pounds
  • Alternative hypothesis (Ha) represents the claim the researcher is trying to support, suggesting a significant difference or relationship exists
    • For example, Ha: The mean weight of a population is not equal to 150 pounds
  • One-tailed (directional) alternative hypotheses specify the direction of the difference or relationship
    • Left-tailed: Ha states that the population parameter is less than the hypothesized value
    • Right-tailed: Ha states that the population parameter is greater than the hypothesized value
  • Two-tailed (non-directional) alternative hypotheses do not specify the direction of the difference or relationship
    • Ha simply states that the population parameter is different from the hypothesized value
  • The choice between a one-tailed or two-tailed test depends on the research question and prior knowledge about the direction of the effect
  • Hypothesis tests are designed to control the Type I error rate (rejecting a true null hypothesis) while maximizing power to detect a true alternative hypothesis

Steps in Hypothesis Testing

  1. State the null and alternative hypotheses
    • Clearly define the population parameter of interest and the hypothesized value
    • Specify the direction of the alternative hypothesis (one-tailed or two-tailed)
  2. Choose the appropriate test statistic and distribution
    • Select a test statistic that measures the difference between the sample statistic and the hypothesized value (e.g., z-score, t-score, chi-square)
    • Identify the sampling distribution of the test statistic under the null hypothesis (e.g., standard normal, t-distribution, chi-square distribution)
  3. Set the significance level (α)
    • Determine the acceptable Type I error rate, typically 0.05 or 0.01
    • The significance level represents the probability of rejecting a true null hypothesis
  4. Calculate the test statistic and p-value
    • Compute the test statistic using the sample data and the hypothesized value
    • Find the p-value, which is the probability of observing a test statistic as extreme as or more extreme than the one calculated, assuming the null hypothesis is true
  5. Make a decision and interpret the results
    • Compare the p-value to the significance level
    • If the p-value is less than the significance level, reject the null hypothesis in favor of the alternative hypothesis
    • If the p-value is greater than or equal to the significance level, fail to reject the null hypothesis
    • Interpret the results in the context of the research question and consider the practical significance of the findings

Test Statistics and Distributions

  • Test statistics are standardized values that measure the difference between a sample statistic and a hypothesized population parameter
  • The choice of test statistic depends on the type of data, sample size, and assumptions about the population distribution
  • Common test statistics for single sample tests include:
    • z-score: Used when the population standard deviation is known and the sample size is large (n ≥ 30) or the population is normally distributed
      • z=xˉμσ/nz = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}, where xˉ\bar{x} is the sample mean, μ\mu is the hypothesized population mean, σ\sigma is the population standard deviation, and nn is the sample size
    • t-score: Used when the population standard deviation is unknown and the sample size is small (n < 30), assuming the population is normally distributed
      • t=xˉμs/nt = \frac{\bar{x} - \mu}{s / \sqrt{n}}, where ss is the sample standard deviation
    • Chi-square (χ2\chi^2): Used for goodness-of-fit tests to compare observed frequencies to expected frequencies based on a hypothesized distribution
      • χ2=(OE)2E\chi^2 = \sum \frac{(O - E)^2}{E}, where OO is the observed frequency and EE is the expected frequency
  • The sampling distribution of the test statistic under the null hypothesis determines the critical values and p-values for the test
    • For example, the z-score follows a standard normal distribution (mean = 0, standard deviation = 1) under the null hypothesis
  • The shape and parameters of the sampling distribution depend on the sample size and the population distribution
  • As the sample size increases, the sampling distribution becomes more normal due to the Central Limit Theorem

Significance Levels and p-values

  • The significance level (α) is the probability of rejecting a true null hypothesis (Type I error)
    • Commonly used significance levels are 0.05 and 0.01, which correspond to a 5% and 1% chance of making a Type I error, respectively
  • The significance level is set by the researcher before conducting the hypothesis test and represents the maximum acceptable risk of making a Type I error
  • The p-value is the probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true
    • For example, if the p-value is 0.03, there is a 3% chance of observing a test statistic as extreme or more extreme if the null hypothesis is true
  • The p-value is calculated based on the test statistic and the sampling distribution under the null hypothesis
  • A small p-value (typically less than the significance level) provides evidence against the null hypothesis and suggests that the alternative hypothesis may be true
  • The p-value is used to make a decision about rejecting or failing to reject the null hypothesis
    • If the p-value is less than the significance level, the null hypothesis is rejected in favor of the alternative hypothesis
    • If the p-value is greater than or equal to the significance level, there is insufficient evidence to reject the null hypothesis
  • The p-value is a measure of the strength of evidence against the null hypothesis, but it does not provide information about the size or practical importance of the effect

Making Decisions: Reject or Fail to Reject

  • The decision to reject or fail to reject the null hypothesis is based on the comparison of the p-value to the significance level (α)
  • If the p-value is less than the significance level, the null hypothesis is rejected in favor of the alternative hypothesis
    • This means that the sample evidence is strong enough to conclude that the population parameter is different from the hypothesized value
    • Rejecting the null hypothesis suggests that the observed difference or relationship is statistically significant and unlikely to have occurred by chance alone
  • If the p-value is greater than or equal to the significance level, there is insufficient evidence to reject the null hypothesis
    • This means that the sample evidence is not strong enough to conclude that the population parameter is different from the hypothesized value
    • Failing to reject the null hypothesis does not prove that the null hypothesis is true, but rather that there is not enough evidence to support the alternative hypothesis
  • The decision to reject or fail to reject the null hypothesis is a binary outcome based on the chosen significance level
    • However, the p-value provides more information about the strength of evidence against the null hypothesis
    • A smaller p-value indicates stronger evidence against the null hypothesis, even if it is not below the significance level
  • It is important to consider the practical significance of the results in addition to the statistical significance
    • A statistically significant result may not be practically meaningful if the effect size is small or the consequences of the decision are minor
  • The choice of significance level and the interpretation of the results should be based on the context of the research question and the potential implications of making a Type I or Type II error

Common Single Sample Tests

  • One-sample z-test: Used to test a hypothesis about a population mean when the population standard deviation is known and the sample size is large (n ≥ 30) or the population is normally distributed
    • Null hypothesis: H0:μ=μ0H_0: \mu = \mu_0, where μ0\mu_0 is the hypothesized population mean
    • Alternative hypothesis: Ha:μμ0H_a: \mu \neq \mu_0 (two-tailed), Ha:μ<μ0H_a: \mu < \mu_0 (left-tailed), or Ha:μ>μ0H_a: \mu > \mu_0 (right-tailed)
    • Test statistic: z=xˉμ0σ/nz = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}
  • One-sample t-test: Used to test a hypothesis about a population mean when the population standard deviation is unknown and the sample size is small (n < 30), assuming the population is normally distributed
    • Null hypothesis: H0:μ=μ0H_0: \mu = \mu_0
    • Alternative hypothesis: Ha:μμ0H_a: \mu \neq \mu_0 (two-tailed), Ha:μ<μ0H_a: \mu < \mu_0 (left-tailed), or Ha:μ>μ0H_a: \mu > \mu_0 (right-tailed)
    • Test statistic: t=xˉμ0s/nt = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}
  • One-sample proportion test: Used to test a hypothesis about a population proportion when the sample size is large enough (np ≥ 10 and n(1-p) ≥ 10) and the population is at least 10 times larger than the sample
    • Null hypothesis: H0:p=p0H_0: p = p_0, where p0p_0 is the hypothesized population proportion
    • Alternative hypothesis: Ha:pp0H_a: p \neq p_0 (two-tailed), Ha:p<p0H_a: p < p_0 (left-tailed), or Ha:p>p0H_a: p > p_0 (right-tailed)
    • Test statistic: z=p^p0p0(1p0)/nz = \frac{\hat{p} - p_0}{\sqrt{p_0(1-p_0) / n}}, where p^\hat{p} is the sample proportion
  • Chi-square goodness-of-fit test: Used to test whether a sample of categorical data comes from a population with a specified distribution
    • Null hypothesis: H0H_0: The sample data follow the specified distribution
    • Alternative hypothesis: HaH_a: The sample data do not follow the specified distribution
    • Test statistic: χ2=(OE)2E\chi^2 = \sum \frac{(O - E)^2}{E}, where OO is the observed frequency and EE is the expected frequency based on the specified distribution
  • These tests can be performed using statistical software or by calculating the test statistic and p-value manually using the appropriate formulas and tables

Real-World Applications

  • Quality control: Hypothesis testing is used to monitor the quality of products or processes in manufacturing settings
    • For example, a company might test whether the mean weight of a product is within the specified tolerance limits
  • Medical research: Hypothesis testing is used to evaluate the effectiveness of new drugs, treatments, or interventions
    • For example, a clinical trial might test whether a new medication reduces blood pressure more than a placebo
  • Psychology: Hypothesis testing is used to study human behavior, cognition, and development
    • For example, a researcher might test whether a specific therapy reduces symptoms of depression compared to a control group
  • Market research: Hypothesis testing is used to assess consumer preferences, brand awareness, and the effectiveness of advertising campaigns
    • For example, a company might test whether a new product feature increases customer satisfaction compared to the existing product
  • Environmental science: Hypothesis testing is used to investigate the impact of human activities on natural systems and to evaluate conservation efforts
    • For example, a scientist might test whether a particular pollutant concentration exceeds a regulatory threshold in a water sample
  • Education: Hypothesis testing is used to evaluate the effectiveness of teaching methods, curricula, and educational interventions
    • For example, a study might test whether a new instructional approach improves student performance compared to traditional methods
  • Finance: Hypothesis testing is used to analyze market trends, assess investment strategies, and evaluate the performance of financial models
    • For example, an analyst might test whether a particular stock's returns are significantly different from the market average
  • These examples illustrate the wide range of fields and problems where hypothesis testing is applied to make data-driven decisions and draw meaningful conclusions from sample data