upgrade
upgrade
⚖️AP Statistics Unit 6 Vocabulary

133 essential vocabulary terms and definitions for Unit 6 – Proportions

Study Unit 6
Practice Vocabulary
⚖️Unit 6 – Proportions
Topics

⚖️Unit 6 – Proportions

6.10 Setting Up a Test for the Difference of Two Population Proportions

TermDefinition
alternative hypothesisThe claim that contradicts the null hypothesis, representing what the researcher is trying to find evidence for.
approximately normalA distribution that closely follows the shape of a normal distribution, allowing for the use of normal probability methods.
categorical variableA variable that takes on values that are category names or group labels rather than numerical values.
difference of two population proportionsThe comparison between two population proportions, expressed as p₁ - p₂, to determine if they differ significantly.
independenceThe condition that observations in a sample are not influenced by each other, typically ensured through random sampling or randomized experiments.
null hypothesisThe initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
one-sided alternative hypothesisAn alternative hypothesis that specifies the direction of the difference, either p₁ < p₂ or p₁ > p₂.
pooled proportionA combined estimate of the population proportion calculated from both samples when assuming the null hypothesis is true: p̂c = (n₁p̂₁ + n₂p̂₂)/(n₁ + n₂).
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
randomized experimentA study design where subjects are randomly assigned to treatment groups to establish cause-and-effect relationships.
sampling distributionThe probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population.
sampling without replacementA sampling method in which an item selected from a population cannot be selected again in subsequent draws.
simple random sampleA sample selected from a population such that every possible sample of the same size has an equal chance of being chosen.
statistical inferenceThe process of drawing conclusions about a population based on data collected from a sample.
two-sample z-testA hypothesis test used to compare the difference between two population proportions using the standard normal distribution.
two-sided alternative hypothesisAn alternative hypothesis that specifies the difference could be in either direction, stated as p₁ ≠ p₂.

6.1 Introducing Statistics

TermDefinition
distributionThe pattern of how data values are spread or arranged across a range.
populationThe entire group of individuals or items from which a sample is drawn and about which conclusions are to be made.
sampleA subset of individuals or items selected from a population for the purpose of data collection and analysis.
variationDifferences in data that occur by chance due to the random nature of sampling, rather than from systematic causes.

6.11 Carrying Out a Test for the Difference of Two Population Proportions

TermDefinition
difference in sample proportionsThe difference between two sample proportions (p̂₁ - p̂₂) used to compare proportions from two different samples.
difference of two population proportionsThe comparison between two population proportions, expressed as p₁ - p₂, to determine if they differ significantly.
null hypothesisThe initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
p-valueThe probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true.
pooled proportionA combined estimate of the population proportion calculated from both samples when assuming the null hypothesis is true: p̂c = (n₁p̂₁ + n₂p̂₂)/(n₁ + n₂).
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
reject the null hypothesisThe decision made when the p-value is less than or equal to the significance level, indicating sufficient evidence against the null hypothesis.
significance levelThe threshold probability (α) used to determine whether to reject the null hypothesis in a significance test.
significance testA statistical procedure used to determine whether there is sufficient evidence to reject the null hypothesis based on sample data.
standard errorThe standard deviation of a sampling distribution, which measures the variability of a sample statistic across repeated samples.
test statisticA calculated value used to determine whether to reject the null hypothesis in a hypothesis test, computed from sample data.

6.2 Constructing a Confidence Interval for a Population Proportion

TermDefinition
approximately normalA distribution that closely follows the shape of a normal distribution, allowing for the use of normal probability methods.
categorical variableA variable that takes on values that are category names or group labels rather than numerical values.
confidence intervalA range of values, calculated from sample data, that is likely to contain the true population parameter with a specified level of confidence.
confidence levelThe probability that a confidence interval will contain the true population parameter, typically expressed as a percentage such as 90%, 95%, or 99%.
critical valueA value from the standard normal distribution used to determine the margin of error for a given confidence level.
independenceThe condition that observations in a sample are not influenced by each other, typically ensured through random sampling or randomized experiments.
margin of errorThe amount by which a sample statistic is likely to vary from the corresponding population parameter, calculated as the critical value times the standard error.
number of failuresThe count of unfavorable outcomes in a sample, denoted as n(1-p̂), used to verify the normality condition.
number of successesThe count of favorable outcomes in a sample, denoted as np̂, used to verify the normality condition.
one-sample z-interval for a proportionA confidence interval procedure used to estimate a population proportion based on a single sample, using the standard normal (z) distribution.
population parameterA numerical characteristic of an entire population, such as the mean, proportion, or standard deviation.
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
random sampleA sample selected from a population in such a way that every member has an equal chance of being chosen, reducing bias and allowing for valid statistical inference.
randomized experimentA study design where subjects are randomly assigned to treatment groups to establish cause-and-effect relationships.
sample proportionThe proportion of individuals in a sample that have a particular characteristic, denoted as p-hat (p̂).
sample sizeThe number of observations or data points collected in a sample, denoted as n.
sample statisticA numerical value calculated from sample data that is used to estimate the corresponding population parameter.
sampling distributionThe probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population.
sampling without replacementA sampling method in which an item selected from a population cannot be selected again in subsequent draws.
standard errorThe standard deviation of a sampling distribution, which measures the variability of a sample statistic across repeated samples.
standard normal distributionA normal distribution with mean 0 and standard deviation 1, used to determine critical values for confidence intervals.

6.3 Justifying a Claim Based on a Confidence Interval for a Population Proportion

TermDefinition
claimA statement or assertion about a population parameter that can be evaluated using statistical evidence.
confidence intervalA range of values, calculated from sample data, that is likely to contain the true population parameter with a specified level of confidence.
confidence levelThe probability that a confidence interval will contain the true population parameter, typically expressed as a percentage such as 90%, 95%, or 99%.
margin of errorThe amount by which a sample statistic is likely to vary from the corresponding population parameter, calculated as the critical value times the standard error.
one-sample proportionA confidence interval or hypothesis test that estimates or tests a single population proportion based on data from one sample.
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
random sampleA sample selected from a population in such a way that every member has an equal chance of being chosen, reducing bias and allowing for valid statistical inference.
sample sizeThe number of observations or data points collected in a sample, denoted as n.
width of a confidence intervalThe range or span of a confidence interval, calculated as the difference between the upper and lower bounds of the interval.

6.4 Setting Up a Test for a Population Proportion

TermDefinition
10% conditionThe requirement that sample size n is at most 10% of the population size N to ensure independence when sampling without replacement.
alternative hypothesisThe claim that contradicts the null hypothesis, representing what the researcher is trying to find evidence for.
approximately normalA distribution that closely follows the shape of a normal distribution, allowing for the use of normal probability methods.
categorical variableA variable that takes on values that are category names or group labels rather than numerical values.
independenceThe condition that observations in a sample are not influenced by each other, typically ensured through random sampling or randomized experiments.
null hypothesisThe initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
number of failuresThe count of unfavorable outcomes in a sample, denoted as n(1-p̂), used to verify the normality condition.
number of successesThe count of favorable outcomes in a sample, denoted as np̂, used to verify the normality condition.
one-sample z-test for a population proportionA hypothesis test used to determine whether a sample proportion provides evidence that a population proportion differs from a hypothesized value.
one-sided alternative hypothesisAn alternative hypothesis that specifies the direction of the difference, either p₁ < p₂ or p₁ > p₂.
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
random sampleA sample selected from a population in such a way that every member has an equal chance of being chosen, reducing bias and allowing for valid statistical inference.
randomized experimentA study design where subjects are randomly assigned to treatment groups to establish cause-and-effect relationships.
sample proportionThe proportion of individuals in a sample that have a particular characteristic, denoted as p-hat (p̂).
sampling distributionThe probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population.
sampling without replacementA sampling method in which an item selected from a population cannot be selected again in subsequent draws.
statistical inferenceThe process of drawing conclusions about a population based on data collected from a sample.
two-sided alternative hypothesisAn alternative hypothesis that specifies the difference could be in either direction, stated as p₁ ≠ p₂.

6.5 Interpreting p-Values

TermDefinition
alternative hypothesisThe claim that contradicts the null hypothesis, representing what the researcher is trying to find evidence for.
null distributionThe probability distribution of the test statistic under the assumption that the null hypothesis is true.
null hypothesisThe initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
one-sample proportionA confidence interval or hypothesis test that estimates or tests a single population proportion based on data from one sample.
p-valueThe probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true.
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
probability modelA mathematical framework that describes the probability distribution of outcomes under specified assumptions.
sample statisticA numerical value calculated from sample data that is used to estimate the corresponding population parameter.
significance levelThe threshold probability (α) used to determine whether to reject the null hypothesis in a significance test.
significance testA statistical procedure used to determine whether there is sufficient evidence to reject the null hypothesis based on sample data.
standard errorThe standard deviation of a sampling distribution, which measures the variability of a sample statistic across repeated samples.
test statisticA calculated value used to determine whether to reject the null hypothesis in a hypothesis test, computed from sample data.
theoretical distributionA probability distribution based on a mathematical model, such as the normal distribution, used to approximate the distribution of a test statistic.
z-statisticA standardized test statistic for a population proportion calculated as (sample statistic - null value) divided by the standard deviation of the statistic.
z-testA hypothesis test that uses the standard normal distribution to determine whether a sample statistic differs significantly from a population parameter.

6.6 Concluding a Test for a Population Proportion

TermDefinition
alternative hypothesisThe claim that contradicts the null hypothesis, representing what the researcher is trying to find evidence for.
null hypothesisThe initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
p-valueThe probability of observing a test statistic as extreme as or more extreme than the one calculated from the sample data, assuming the null hypothesis is true.
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
reject the null hypothesisThe decision made when the p-value is less than or equal to the significance level, indicating sufficient evidence against the null hypothesis.
significance levelThe threshold probability (α) used to determine whether to reject the null hypothesis in a significance test.
significance testA statistical procedure used to determine whether there is sufficient evidence to reject the null hypothesis based on sample data.
statistical evidenceInformation from sample data that supports or fails to support a hypothesis about a population parameter.
test statisticA calculated value used to determine whether to reject the null hypothesis in a hypothesis test, computed from sample data.

6.7 Potential Errors When Performing Tests

TermDefinition
null hypothesisThe initial claim or assumption being tested in a hypothesis test, typically stating that there is no effect or no difference.
parameterA numerical summary that describes a characteristic of an entire population.
power of a testThe probability that a statistical test will correctly reject a false null hypothesis.
sample sizeThe number of observations or data points collected in a sample, denoted as n.
significance levelThe threshold probability (α) used to determine whether to reject the null hypothesis in a significance test.
standard errorThe standard deviation of a sampling distribution, which measures the variability of a sample statistic across repeated samples.
Type I errorAn error that occurs when a null hypothesis is rejected when it is actually true; the probability of committing this error is equal to the significance level (α).
Type II errorAn error that occurs when a null hypothesis is not rejected when it is actually false.

6.8 Confidence Intervals for the Difference of Two Proportions

TermDefinition
10% conditionThe requirement that sample size n is at most 10% of the population size N to ensure independence when sampling without replacement.
approximately normalA distribution that closely follows the shape of a normal distribution, allowing for the use of normal probability methods.
categorical variableA variable that takes on values that are category names or group labels rather than numerical values.
confidence intervalA range of values, calculated from sample data, that is likely to contain the true population parameter with a specified level of confidence.
difference in proportionsThe difference between two population proportions, calculated as p₁ - p₂, used to compare the prevalence of a characteristic across two populations.
difference of two population proportionsThe comparison between two population proportions, expressed as p₁ - p₂, to determine if they differ significantly.
independenceThe condition that observations in a sample are not influenced by each other, typically ensured through random sampling or randomized experiments.
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
randomized experimentA study design where subjects are randomly assigned to treatment groups to establish cause-and-effect relationships.
sample proportionThe proportion of individuals in a sample that have a particular characteristic, denoted as p-hat (p̂).
sampling distributionThe probability distribution of a sample statistic (such as a sample proportion) obtained from repeated sampling of a population.
sampling without replacementA sampling method in which an item selected from a population cannot be selected again in subsequent draws.
simple random sampleA sample selected from a population such that every possible sample of the same size has an equal chance of being chosen.
standard errorThe standard deviation of a sampling distribution, which measures the variability of a sample statistic across repeated samples.
success-failure conditionA requirement that the expected number of successes and failures in each sample (np̂ and n(1-p̂)) meet a minimum threshold, typically 5 or 10, to ensure the sampling distribution is approximately normal.
test statisticA calculated value used to determine whether to reject the null hypothesis in a hypothesis test, computed from sample data.
two-sample z-intervalA confidence interval procedure that uses the standard normal distribution to estimate the difference between two population proportions based on sample data.

6.9 Justifying a Claim Based on a Confidence Interval for a Difference of Population Proportions

TermDefinition
confidence intervalA range of values, calculated from sample data, that is likely to contain the true population parameter with a specified level of confidence.
difference in proportionsThe difference between two population proportions, calculated as p₁ - p₂, used to compare the prevalence of a characteristic across two populations.
population proportionThe true proportion or percentage of a characteristic in an entire population, typically denoted as p.
random samplingA method of selecting samples from a population where each member has an equal chance of being chosen, ensuring the sample is representative of the population.
sample sizeThe number of observations or data points collected in a sample, denoted as n.