Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Choosing the right statistical test is one of the most critical decisions you'll make in biostatistics—and it's exactly what exams love to test. You're not just being asked to memorize formulas; you're being evaluated on whether you understand when to use each test based on your data type, sample design, and research question. The tests in this guide fall into predictable patterns: comparing means vs. analyzing relationships, parametric vs. non-parametric approaches, and independent vs. paired designs.
Think of statistical tests as tools in a toolkit—each designed for a specific job. A t-test won't help you with categorical data any more than a hammer helps with screws. Master the decision logic behind test selection: What type of variable do I have? How many groups am I comparing? Does my data meet parametric assumptions? Don't just memorize that a Chi-square test exists—know that it's your go-to when both variables are categorical and you're testing independence.
When your research question asks "Is there a difference between groups?" and your outcome is continuous, you need a test that compares means. The choice depends on how many groups you're comparing and whether your data meets normality assumptions.
Compare: t-test vs. ANOVA—both compare means of continuous outcomes, but t-tests handle exactly two groups while ANOVA handles three or more. If an exam asks which test to use for comparing blood pressure across four treatment arms, ANOVA is your answer.
When your data violates normality assumptions, uses ordinal scales, or involves small samples, non-parametric tests provide robust alternatives. These tests work with ranks rather than raw values, making them distribution-free.
Compare: Mann-Whitney U vs. Wilcoxon signed-rank—both are non-parametric and rank-based, but Mann-Whitney handles independent groups while Wilcoxon handles paired/related samples. This mirrors the independent vs. paired t-test distinction. FRQ tip: If the question mentions "matched pairs" or "same subjects measured twice" with non-normal data, Wilcoxon is correct.
These tests ask "How are variables related?" rather than "Are groups different?" The choice depends on whether you're measuring association, predicting outcomes, or modeling probabilities.
Compare: Linear vs. logistic regression—both model relationships between predictors and outcomes, but linear regression requires a continuous dependent variable while logistic regression handles binary outcomes. If the outcome is "survived vs. died" or "positive vs. negative test," logistic regression is required.
When both your variables are categorical (nominal or ordinal), you need tests designed for frequency data rather than means.
Compare: Chi-square vs. correlation—both assess relationships, but Chi-square handles categorical-categorical associations while correlation handles continuous-continuous relationships. Exam trap: Don't use correlation for variables like "smoker/non-smoker" and "disease/no disease"—that's a Chi-square question.
| Concept | Best Examples |
|---|---|
| Comparing two group means (parametric) | Independent t-test, Paired t-test |
| Comparing three+ group means (parametric) | One-way ANOVA, Two-way ANOVA |
| Non-parametric two-group comparison | Mann-Whitney U (independent), Wilcoxon signed-rank (paired) |
| Non-parametric three+ group comparison | Kruskal-Wallis test |
| Variance comparison | F-test |
| Continuous outcome prediction | Linear regression, Correlation analysis |
| Binary outcome prediction | Logistic regression |
| Categorical variable association | Chi-square test |
A researcher wants to compare pain scores (ordinal scale) between three treatment groups with small, non-normally distributed samples. Which test should they use, and why is ANOVA inappropriate?
Compare and contrast the Mann-Whitney U test and the Wilcoxon signed-rank test. What study design features determine which one to use?
You're analyzing whether smoking status (yes/no) is associated with lung cancer diagnosis (yes/no). Which test is appropriate, and what assumption must be checked before proceeding?
A clinical trial measures blood glucose before and after a new medication in the same 50 patients. Data appear normally distributed. Which test should be used? What would change your answer to a non-parametric alternative?
An FRQ presents regression output with an odds ratio of 2.3 for a predictor variable. What type of regression produced this output, and how would you interpret this odds ratio in context?