ap stats study guides

โœŒ๏ธย  Unit 2 - Exploring Two-Variable Data

โš–๏ธย  Unit 6 - Inference for Categorical Data: Proportions

๐Ÿ˜ผย  Unit 7 - Inference for Qualitative Data: Means

โœณ๏ธย  Unit 8 Inference for Categorical Data: Chi-Square

๐Ÿ“ˆย  Unit 9 - Inference for Quantitative Data: Slopes

๐Ÿงย  Multiple Choice Questions (MCQs)

7.8 Setting Up a Test for the Difference of Two Population Means

#statisticalinference

#twomeans

#ttest

โฑ๏ธย ย 4 min read

written by

Josh Argo

josh argo


https://cdn.pixabay.com/photo/2020/04/04/09/40/peas-5001525_960_720.jpg

image courtesy of: pixabay.com

What is a Test for Difference of Two Population Means

Have you ever looked at two different populations (maybe classes, brands of food, types of materials) and wondered, "Hmm..are these two things REALLY different?" This is exactly where a test for the difference of two population means is useful. As long as your data is quantitative, in other words we are testing means, this is the type of test we will use. If you are using some form of technology, you will select Two Sample T Test. In making note of what test you are performing on the AP Statistics exam, it is best to be more specific and state, "Two Sample T Test for Difference in Two Population Means".

Hypotheses

As with any statistical test, the first step necessary to perform the significance test is to write our hypotheses. We always have a null hypothesis, which is the hypothesis that the two populations are not different. Then we have our alternate hypothesis, which states that they are different in some way (either less than, greater than or simply not equal to).

When writing out your hypotheses, you should state them as follows:

Ho:ย  ๐žต1=๐žต2

Ha:ย  ๐žต1โ‰ ๐žต2, ๐žต1<๐žต2 or ๐žต1>๐žต2

Another way of writing them using differences is as follows:

Ho:ย  ๐žต1-๐žต2=0

Ha:ย  ๐žต1-๐žต2>0, ๐žต1-๐žต2<0, or ๐žต1-๐žต2โ‰ 0

The first option is more in line with the technology used in actually computing the test statistic and p value that we will cover in Unit 7.9.

Conditions

Also, as with any significance test, there are conditions for inference that we must check to assure that our test is accurately going to be able to draw conclusions about a said population.

โ˜‘๏ธRandom

When drawing our two samples to perform our t-test, it is absolutely imperative that our samples are random from the given populations. If they are not random, we cannot generalize our results to the two given populations, which renders our tests useless and there is no way to fix sampling bias with the numbers. If the test involves an experimental study, it is important to note that the treatments were randomly assigned. This allows us to make a causation conclusion.

โ˜‘๏ธIndependent

Since we are normally sampling without replacement, it is also important to make sure our sample is independently chosen. This can be assumed under the 10% condition which states that as long as our population is 10x our sample, we can assume independence. If dealing with an experimental study, independence is not necessary since we are randomizing treatments.

โ˜‘๏ธNormal

When calculating our p-value, we are going to make use of the t curve to see what the probability of obtaining our samples are. In order to ensure that we can use the t curve for our sampling distribution, we must check to be sure that either:

  1. Each of our samples are greater than or equal to 30. (Central Limit Theorem)

  2. Our populations are normally distributed. (Given in question prompt)

  3. A boxplot of our sample data shows no obvious skewness or apparent outliers. (Last resort)

Example

Mr. Fleck runs a green bean farm. He has two fields that he normally picks from. Every day, he goes out and picks green beans from both fields and has found that the two fields appear to be yielding different amount of crops. In order to test his theory, he randomly selects 120 days to pick from both fields. Field A yields an average of 580 beans with a standard deviation of 25, while Field B yields an average of 550 with a standard deviation of 12. Do the data give convincing evidence that the two fields yield different amount of beans?

Hypotheses:

Our null hypothesis is that the two fields yield equal amounts of beans, so our Ho: ๐žตA=๐žตB, where ๐žตA is the true mean number of beans that comes from field A everyday and ๐žตB is the true mean number of beans that comes from Field B everyday.

Our alternate hypothesis is that these two are different, so our Ha: ๐žตAโ‰ ๐žตB.

Conditions

  • Random: both samples are randomly selected by the day chosen

  • Independent: It is reasonable to believe that there are 1200 days that he could pick from the fields.

  • Normal: 120>30, so our sampling distribution of the difference of the two means will be approximately normal

๐ŸŽฅWatch: AP Stats - Review of Inference: z and t Procedures

continue learning

Slide 1 of 12
Fiveable

Join Our Community

Fiveable Community students are already meeting new friends, starting study groups, and sharing tons of opportunities for other high schoolers. Soon the Fiveable Community will be on a totally new platform where you can share, save, and organize your learning links and lead study groups among other students!๐ŸŽ‰

Fiveable Logo

2550 north lake drive
suite 2
milwaukee, wi 53211

92% of Fiveable students earned a 3 or higher on their 2020 AP Exams.

*apยฎ and advanced placementยฎ are registered trademarks of the college board, which was not involved in the production of, and does not endorse, this product.

ยฉ fiveable 2020 | all rights reserved.