intro to statistics unit 9 study guides

hypothesis testing: single sample

9.1

Null and Alternative Hypotheses

9.2

Outcomes and the Type I and Type II Errors

9.3

Probability Distribution Needed for Hypothesis Testing

9.4

Rare Events, the Sample, Decision and Conclusion

9.5

Additional Information and Full Hypothesis Test Examples

9.6

Hypothesis Testing of a Single Mean and Single Proportion

unit 9 review

Hypothesis testing is a powerful statistical method used to evaluate claims about population parameters based on sample data. It provides a structured approach for making data-driven decisions across various fields, from psychology to quality control. The process involves formulating null and alternative hypotheses, selecting an appropriate test statistic, and comparing the calculated p-value to a predetermined significance level. This framework allows researchers to assess the validity of their assumptions and draw meaningful conclusions from their data.

What's Hypothesis Testing?

Statistical method used to determine whether a claim or hypothesis about a population parameter is reasonable based on sample data
Involves comparing a sample statistic to a hypothesized population parameter to assess the validity of the claim
Helps researchers and analysts make data-driven decisions by providing a framework for testing assumptions and drawing conclusions
Relies on the concept of statistical significance, which quantifies the likelihood of observing a sample result if the null hypothesis is true
Commonly used in fields such as psychology, biology, marketing, and quality control to test theories, evaluate interventions, and make predictions
- For example, a psychologist might use hypothesis testing to determine if a new therapy is effective in reducing anxiety symptoms compared to a placebo
Requires specifying a null hypothesis (H0) and an alternative hypothesis (Ha) that represent competing claims about the population parameter
The outcome of a hypothesis test is either rejecting the null hypothesis in favor of the alternative or failing to reject the null hypothesis due to insufficient evidence

Types of Hypotheses

Null hypothesis (H0) represents the default or status quo claim, typically stating that there is no significant difference or relationship between variables
- For example, H0: The mean weight of a population is equal to 150 pounds
Alternative hypothesis (Ha) represents the claim the researcher is trying to support, suggesting a significant difference or relationship exists
- For example, Ha: The mean weight of a population is not equal to 150 pounds
One-tailed (directional) alternative hypotheses specify the direction of the difference or relationship
- Left-tailed: Ha states that the population parameter is less than the hypothesized value
- Right-tailed: Ha states that the population parameter is greater than the hypothesized value
Two-tailed (non-directional) alternative hypotheses do not specify the direction of the difference or relationship
- Ha simply states that the population parameter is different from the hypothesized value
The choice between a one-tailed or two-tailed test depends on the research question and prior knowledge about the direction of the effect
Hypothesis tests are designed to control the Type I error rate (rejecting a true null hypothesis) while maximizing power to detect a true alternative hypothesis

Steps in Hypothesis Testing

State the null and alternative hypotheses
- Clearly define the population parameter of interest and the hypothesized value
- Specify the direction of the alternative hypothesis (one-tailed or two-tailed)
Choose the appropriate test statistic and distribution
- Select a test statistic that measures the difference between the sample statistic and the hypothesized value (e.g., z-score, t-score, chi-square)
- Identify the sampling distribution of the test statistic under the null hypothesis (e.g., standard normal, t-distribution, chi-square distribution)
Set the significance level (α)
- Determine the acceptable Type I error rate, typically 0.05 or 0.01
- The significance level represents the probability of rejecting a true null hypothesis
Calculate the test statistic and p-value
- Compute the test statistic using the sample data and the hypothesized value
- Find the p-value, which is the probability of observing a test statistic as extreme as or more extreme than the one calculated, assuming the null hypothesis is true
Make a decision and interpret the results
- Compare the p-value to the significance level
- If the p-value is less than the significance level, reject the null hypothesis in favor of the alternative hypothesis
- If the p-value is greater than or equal to the significance level, fail to reject the null hypothesis
- Interpret the results in the context of the research question and consider the practical significance of the findings

Test Statistics and Distributions

Test statistics are standardized values that measure the difference between a sample statistic and a hypothesized population parameter
The choice of test statistic depends on the type of data, sample size, and assumptions about the population distribution
Common test statistics for single sample tests include:
- z-score: Used when the population standard deviation is known and the sample size is large (n ≥ 30) or the population is normally distributed
  - $z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$, where $\bar{x}$ is the sample mean, $\mu$ is the hypothesized population mean, $\sigma$ is the population standard deviation, and $n$ is the sample size
- t-score: Used when the population standard deviation is unknown and the sample size is small (n < 30), assuming the population is normally distributed
  - $t = \frac{\bar{x} - \mu}{s / \sqrt{n}}$, where $s$ is the sample standard deviation
- Chi-square ($\chi^2$): Used for goodness-of-fit tests to compare observed frequencies to expected frequencies based on a hypothesized distribution
  - $\chi^2 = \sum \frac{(O - E)^2}{E}$, where $O$ is the observed frequency and $E$ is the expected frequency
The sampling distribution of the test statistic under the null hypothesis determines the critical values and p-values for the test
- For example, the z-score follows a standard normal distribution (mean = 0, standard deviation = 1) under the null hypothesis
The shape and parameters of the sampling distribution depend on the sample size and the population distribution
As the sample size increases, the sampling distribution becomes more normal due to the Central Limit Theorem

Significance Levels and p-values

The significance level (α) is the probability of rejecting a true null hypothesis (Type I error)
- Commonly used significance levels are 0.05 and 0.01, which correspond to a 5% and 1% chance of making a Type I error, respectively
The significance level is set by the researcher before conducting the hypothesis test and represents the maximum acceptable risk of making a Type I error
The p-value is the probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true
- For example, if the p-value is 0.03, there is a 3% chance of observing a test statistic as extreme or more extreme if the null hypothesis is true
The p-value is calculated based on the test statistic and the sampling distribution under the null hypothesis
A small p-value (typically less than the significance level) provides evidence against the null hypothesis and suggests that the alternative hypothesis may be true
The p-value is used to make a decision about rejecting or failing to reject the null hypothesis
- If the p-value is less than the significance level, the null hypothesis is rejected in favor of the alternative hypothesis
- If the p-value is greater than or equal to the significance level, there is insufficient evidence to reject the null hypothesis
The p-value is a measure of the strength of evidence against the null hypothesis, but it does not provide information about the size or practical importance of the effect

Making Decisions: Reject or Fail to Reject

The decision to reject or fail to reject the null hypothesis is based on the comparison of the p-value to the significance level (α)
If the p-value is less than the significance level, the null hypothesis is rejected in favor of the alternative hypothesis
- This means that the sample evidence is strong enough to conclude that the population parameter is different from the hypothesized value
- Rejecting the null hypothesis suggests that the observed difference or relationship is statistically significant and unlikely to have occurred by chance alone
If the p-value is greater than or equal to the significance level, there is insufficient evidence to reject the null hypothesis
- This means that the sample evidence is not strong enough to conclude that the population parameter is different from the hypothesized value
- Failing to reject the null hypothesis does not prove that the null hypothesis is true, but rather that there is not enough evidence to support the alternative hypothesis
The decision to reject or fail to reject the null hypothesis is a binary outcome based on the chosen significance level
- However, the p-value provides more information about the strength of evidence against the null hypothesis
- A smaller p-value indicates stronger evidence against the null hypothesis, even if it is not below the significance level
It is important to consider the practical significance of the results in addition to the statistical significance
- A statistically significant result may not be practically meaningful if the effect size is small or the consequences of the decision are minor
The choice of significance level and the interpretation of the results should be based on the context of the research question and the potential implications of making a Type I or Type II error

Common Single Sample Tests

One-sample z-test: Used to test a hypothesis about a population mean when the population standard deviation is known and the sample size is large (n ≥ 30) or the population is normally distributed
- Null hypothesis: $H_0: \mu = \mu_0$, where $\mu_0$ is the hypothesized population mean
- Alternative hypothesis: $H_a: \mu \neq \mu_0$ (two-tailed), $H_a: \mu < \mu_0$ (left-tailed), or $H_a: \mu > \mu_0$ (right-tailed)
- Test statistic: $z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}$
One-sample t-test: Used to test a hypothesis about a population mean when the population standard deviation is unknown and the sample size is small (n < 30), assuming the population is normally distributed
- Null hypothesis: $H_0: \mu = \mu_0$
- Alternative hypothesis: $H_a: \mu \neq \mu_0$ (two-tailed), $H_a: \mu < \mu_0$ (left-tailed), or $H_a: \mu > \mu_0$ (right-tailed)
- Test statistic: $t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}$
One-sample proportion test: Used to test a hypothesis about a population proportion when the sample size is large enough (np ≥ 10 and n(1-p) ≥ 10) and the population is at least 10 times larger than the sample
- Null hypothesis: $H_0: p = p_0$, where $p_0$ is the hypothesized population proportion
- Alternative hypothesis: $H_a: p \neq p_0$ (two-tailed), $H_a: p < p_0$ (left-tailed), or $H_a: p > p_0$ (right-tailed)
- Test statistic: $z = \frac{\hat{p} - p_0}{\sqrt{p_0(1-p_0) / n}}$, where $\hat{p}$ is the sample proportion
Chi-square goodness-of-fit test: Used to test whether a sample of categorical data comes from a population with a specified distribution
- Null hypothesis: $H_0$: The sample data follow the specified distribution
- Alternative hypothesis: $H_a$: The sample data do not follow the specified distribution
- Test statistic: $\chi^2 = \sum \frac{(O - E)^2}{E}$, where $O$ is the observed frequency and $E$ is the expected frequency based on the specified distribution
These tests can be performed using statistical software or by calculating the test statistic and p-value manually using the appropriate formulas and tables

Real-World Applications

Quality control: Hypothesis testing is used to monitor the quality of products or processes in manufacturing settings
- For example, a company might test whether the mean weight of a product is within the specified tolerance limits
Medical research: Hypothesis testing is used to evaluate the effectiveness of new drugs, treatments, or interventions
- For example, a clinical trial might test whether a new medication reduces blood pressure more than a placebo
Psychology: Hypothesis testing is used to study human behavior, cognition, and development
- For example, a researcher might test whether a specific therapy reduces symptoms of depression compared to a control group
Market research: Hypothesis testing is used to assess consumer preferences, brand awareness, and the effectiveness of advertising campaigns
- For example, a company might test whether a new product feature increases customer satisfaction compared to the existing product
Environmental science: Hypothesis testing is used to investigate the impact of human activities on natural systems and to evaluate conservation efforts
- For example, a scientist might test whether a particular pollutant concentration exceeds a regulatory threshold in a water sample
Education: Hypothesis testing is used to evaluate the effectiveness of teaching methods, curricula, and educational interventions
- For example, a study might test whether a new instructional approach improves student performance compared to traditional methods
Finance: Hypothesis testing is used to analyze market trends, assess investment strategies, and evaluate the performance of financial models
- For example, an analyst might test whether a particular stock's returns are significantly different from the market average
These examples illustrate the wide range of fields and problems where hypothesis testing is applied to make data-driven decisions and draw meaningful conclusions from sample data

intro to statistics unit 9 study guides

unit 9 review

What's Hypothesis Testing?

Types of Hypotheses

Steps in Hypothesis Testing

Test Statistics and Distributions

Significance Levels and p-values

Making Decisions: Reject or Fail to Reject

Common Single Sample Tests

Real-World Applications

history

social science

english & capstone

arts

science

math & computer science

world languages

high school exams

honors classes

college classes

hs classes

Study Content & Tools

Company

Resources