Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Understanding study designs is foundational to everything you'll encounter in epidemiology. You can't interpret a research finding, evaluate an intervention, or critique a public health policy without knowing how the evidence was generated. Every study design comes with trade-offs between internal validity (how confident you can be about causation), external validity (how well findings generalize), and feasibility (time, cost, and ethical constraints). These trade-offs appear constantly on exams, especially when you're asked to recommend an appropriate design for a given research question.
You're being tested on your ability to match study designs to research scenarios, identify potential biases, and interpret the appropriate measures of association (relative risk, odds ratios, prevalence). Don't just memorize definitions. Know what each design can and cannot tell you about causation, which biases threaten each design, and when one approach is preferred over another.
These designs involve researcher-controlled manipulation of exposures or interventions. Because investigators assign participants to groups, experimental designs offer the strongest evidence for causation. But they come with significant ethical and practical constraints.
RCTs are the gold standard for causal inference. Random assignment distributes both known and unknown confounders equally across groups, so any difference in outcomes can be attributed to the intervention rather than pre-existing differences between participants.
These designs test interventions but lack randomization. Instead, they rely on pre-existing groups, natural experiments, or before-and-after comparisons when random assignment is infeasible or unethical.
Compare: RCTs vs. Quasi-experimental studies: both test interventions, but RCTs eliminate confounding through randomization while quasi-experiments must address it through design or analysis. If an exam asks which design provides stronger causal evidence, RCT wins. If it asks what's practical for evaluating a new public health policy already being rolled out, quasi-experimental is your answer.
These designs observe natural variation in exposures without manipulation. The key distinction is directionality: do you start with exposure status and follow forward, or start with disease status and look backward?
Cohort studies start with exposure and follow forward in time. You classify participants as exposed or unexposed, then watch both groups to see who develops the outcome. This establishes temporal sequence definitively, which is essential for causal inference.
Case-control studies work backward from disease. You identify people who already have the outcome (cases) and people who don't (controls), then compare their past exposure histories.
This is a hybrid design embedded within an existing cohort. As cases arise during cohort follow-up, controls are sampled from the same cohort members who were still at risk at the time each case occurred.
Compare: Cohort vs. Case-control: cohort studies follow exposure forward and calculate relative risk; case-control studies work backward from disease and calculate odds ratios. For rare diseases, case-control is practical. For rare exposures, cohort is preferred. Exam questions often ask you to justify design choice based on disease rarity, so keep this distinction sharp.
These designs describe patterns and generate hypotheses but cannot establish causation. They're the starting point of epidemiologic investigation, not the endpoint.
Cross-sectional studies are a snapshot design that measures exposure and outcome simultaneously in a defined population at a single point in time.
Ecological studies use populations, not individuals, as the unit of analysis. They compare disease rates across countries, regions, or time periods using aggregate data.
These provide detailed clinical documentation of individual cases without a comparison group.
Compare: Cross-sectional vs. Ecological studies: both are descriptive, but cross-sectional collects individual-level data while ecological uses population-level data. Cross-sectional can identify individual-level associations (though not causal ones); ecological cannot make individual-level inferences due to ecological fallacy.
These designs follow participants over extended periods to capture temporal relationships and disease progression. The defining feature is repeated observation of the same individuals.
Longitudinal studies take repeated measurements on the same individuals over time. This captures within-person change, disease natural history, and long-term exposure effects.
Compare: Longitudinal vs. Cross-sectional: longitudinal follows individuals over time and can establish temporal sequence; cross-sectional captures one moment and cannot. If an exam scenario asks about tracking disease progression or determining whether exposure precedes outcome, longitudinal is required.
These methods don't generate new data but systematically aggregate existing evidence. They sit at the top of the evidence hierarchy when done properly.
A systematic review uses a structured, reproducible protocol to identify, evaluate, and synthesize all relevant studies on a specific question. A meta-analysis goes one step further by pooling the data statistically.
| Concept | Best Examples |
|---|---|
| Strongest causal evidence | RCTs, Cohort studies |
| Efficient for rare diseases | Case-control, Nested case-control |
| Prevalence estimation | Cross-sectional studies |
| Hypothesis generation | Ecological studies, Case series |
| Policy/intervention evaluation | Quasi-experimental, RCTs |
| Long-term exposure effects | Longitudinal, Cohort studies |
| Evidence synthesis | Meta-analyses, Systematic reviews |
| First signal of emerging diseases | Case reports, Case series |
A researcher wants to study risk factors for a rare childhood cancer. Which study design is most efficient, and what measure of association would be calculated?
Compare cohort and case-control studies: What can cohort studies calculate that case-control studies cannot, and why?
An FRQ describes a study comparing heart disease rates between countries with different dietary fat consumption. What type of study is this, and what major limitation threatens its conclusions?
Why are RCTs considered the gold standard for causal inference, yet inappropriate for studying whether smoking causes lung cancer?
A nested case-control study is conducted within an ongoing cohort. What advantages does this hybrid design offer over a traditional case-control study conducted in the general population?