Data Collection Experiment Design
Design of data collection experiments
A well-designed experiment lets you test whether one thing actually causes another. Without careful design, you can't tell if your results are real or just the product of bias and confounding factors. Here's what goes into setting one up.
Start with a research question and hypothesis. Your research question defines what you want to find out. Your hypothesis is a testable prediction about the relationship between variables. For example, the question "Does a new tutoring method improve exam scores compared to the standard method?" pairs with the hypothesis "Students taught with the new method will score higher on the exam than students taught with the standard method."
Determine the variables:
- Independent variable: The factor you manipulate or change. In the tutoring example, it's the tutoring method.
- Dependent variable: The outcome you measure. Here, that's the exam score.
- Confounding variables: Other factors (age, prior knowledge, study habits) that could influence the dependent variable and muddy your results. You need to account for these so they don't lead you to false conclusions; the simulation after this list shows how a single confounder can fabricate an apparent effect.
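To see why confounders matter, here is a minimal simulation sketch (all names and numbers are hypothetical, and `statistics.correlation` requires Python 3.10+). Tutoring has zero true effect on scores here, yet the two appear linked because prior knowledge drives both:

```python
import random
import statistics

rng = random.Random(0)

# Hypothetical confounder: prior_knowledge influences both whether a
# student opts into tutoring and how well they score on the exam.
n = 1_000
prior_knowledge = [rng.gauss(0, 1) for _ in range(n)]

# Tutoring is self-selected, NOT randomly assigned: students with more
# prior knowledge are more likely to opt in.
tutored = [1 if pk + rng.gauss(0, 1) > 0 else 0 for pk in prior_knowledge]

# Exam score depends only on prior knowledge; tutoring has zero true effect.
score = [70 + 5 * pk + rng.gauss(0, 2) for pk in prior_knowledge]

# Tutoring still looks "effective" because both variables track the confounder.
print(statistics.correlation(tutored, score))   # ~0.5, despite no causal effect
```

Random assignment, described next, is exactly what breaks this link between the confounder and group membership.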
Establish control and treatment groups:
- Control group: Receives no treatment or the standard treatment. This serves as your baseline for comparison.
- Treatment group: Receives the new intervention you're testing.
- Random assignment: You randomly place participants into these groups. This is critical because it makes the groups roughly equivalent before the treatment, so any difference afterward is more likely due to the independent variable.
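A minimal sketch of random assignment for a simple two-group design (the function name `randomly_assign` and the fixed seed are illustrative choices, not a standard API):

```python
import random

def randomly_assign(participants, seed=42):
    """Shuffle the participant list, then split it evenly into two groups.

    A minimal sketch: real trials often add block or stratified
    randomization, but a plain shuffle already balances groups in
    expectation. The fixed seed just makes the assignment auditable.
    """
    rng = random.Random(seed)
    shuffled = participants[:]
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return {"control": shuffled[:half], "treatment": shuffled[half:]}

groups = randomly_assign([f"P{i:02d}" for i in range(1, 21)])
print(groups["control"])    # 10 randomly chosen participants
print(groups["treatment"])  # the other 10
```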
Identify potential sources of bias:
- Selection bias: Systematic differences between your groups before the experiment even starts (e.g., volunteers tend to be more motivated than non-volunteers).
- Experimenter bias: The researcher unintentionally influences participants or interprets results in a way that confirms their expectations.
- Measurement bias: Inaccuracies or inconsistencies in how the dependent variable is measured (e.g., a poorly calibrated instrument, or different observers recording data differently).
- Confounding bias: An uncontrolled outside variable influences the dependent variable, making it look like the independent variable had an effect when it didn't (or masking a real effect).
Strengths vs. limitations of sampling methods
How you select participants for your study determines how much you can trust and generalize your results. Each sampling method involves trade-offs.
Simple random sampling gives every member of the population an equal chance of being selected (think lottery drawing or a random number generator).
Strengths: Minimizes selection bias; results are highly generalizable. Limitations: You need a complete list of the entire population, which is often impractical. Can also be time-consuming and costly for large populations.
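In code, simple random sampling is a one-liner once you have the full sampling frame; this sketch assumes a hypothetical population list:

```python
import random

# Hypothetical sampling frame: the complete list of the population,
# which this method requires and which is often the hard part to obtain.
population = [f"person_{i}" for i in range(10_000)]

# random.sample draws without replacement, so every member has an equal
# chance of selection -- the lottery-drawing idea in code.
sample = random.sample(population, k=100)
```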
Stratified sampling divides the population into subgroups (strata) based on a shared characteristic, like age bracket or income level, then randomly samples from each subgroup.
Strengths: Guarantees representation of key subgroups, which can make results more precise. Limitations: You need to know the population's characteristics ahead of time, and it's more complex to set up than simple random sampling.
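A sketch of proportional stratified sampling on hypothetical data (`stratified_sample` and its parameters are illustrative, not a library function):

```python
import random
from collections import defaultdict

def stratified_sample(people, stratum_of, frac=0.10, seed=0):
    """Draw the same fraction from every stratum (proportional allocation).

    `stratum_of` maps a person to their subgroup label (e.g. an age bracket).
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for person in people:
        strata[stratum_of(person)].append(person)
    sample = []
    for members in strata.values():
        k = max(1, round(frac * len(members)))   # guarantee every stratum appears
        sample.extend(rng.sample(members, k))
    return sample

# Hypothetical population of (name, age) pairs, stratified into 20-year brackets.
people = [(f"person_{i}", random.randint(18, 80)) for i in range(1_000)]
sample = stratified_sample(people, stratum_of=lambda p: p[1] // 20)
```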
Cluster sampling divides the population into naturally occurring groups (clusters), like schools or city blocks, then randomly selects entire clusters to study.
Strengths: Cost-effective and practical for large or geographically spread-out populations. Limitations: Less precise than other methods because people within a cluster tend to be similar to each other, which can increase sampling error.
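A sketch of one-stage cluster sampling, with hypothetical schools as the clusters:

```python
import random

# Hypothetical clusters: 50 schools, each with its full student roster.
clusters = {f"school_{i}": [f"s{i}_{j}" for j in range(30)] for i in range(50)}

# One-stage cluster sampling: randomly pick whole clusters, then study
# every member inside the chosen clusters.
rng = random.Random(1)
chosen = rng.sample(sorted(clusters), k=5)
sample = [student for school in chosen for student in clusters[school]]
```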
Convenience sampling selects whoever is easiest to reach (e.g., surveying people at a mall, posting an online poll).
Strengths: Quick, cheap, and easy to carry out. Limitations: High risk of bias and poor representativeness. Results generally can't be generalized to the broader population.

Experimental Design and Results
Impact of experimental design on results
The way you design an experiment directly affects what kinds of conclusions you can draw. Statisticians evaluate this through three types of validity.
Internal validity is the extent to which your experiment establishes a true cause-and-effect relationship between the independent and dependent variables. Three things strengthen it:
- Controlled variables: Minimizing the influence of confounding variables so you can isolate the effect of the treatment.
- Random assignment: Ensuring that pre-existing differences between participants are spread evenly across groups, so differences in outcomes are more likely caused by the treatment.
- Blinding: Concealing group assignments from participants (single-blind) or from both participants and researchers (double-blind) to reduce bias and placebo effects.
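To make the blinding idea concrete, here is a hypothetical sketch of double-blind bookkeeping: group labels are replaced with opaque codes, and the code-to-group key is held back until data collection ends (the function name and code format are illustrative):

```python
import random

def blind_assignments(participants, seed=7):
    """Randomize groups, but expose only opaque codes to the people
    running the study; the code->group key stays with a third party
    until data collection ends (double-blind)."""
    rng = random.Random(seed)
    shuffled = participants[:]
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    codes = [f"{c:06d}" for c in rng.sample(range(10**6), len(shuffled))]  # unique IDs
    key = {}        # sealed: code -> true group
    blinded = {}    # visible: participant -> code only
    for code, person in zip(codes, shuffled):
        group = "control" if len(key) < half else "treatment"
        key[code] = group
        blinded[person] = code
    return blinded, key
```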
External validity is the extent to which your results generalize beyond your specific experiment to other populations, settings, or times.
- A representative sample that reflects the broader population (diverse demographics, relevant characteristics) increases external validity.
- Conducting the experiment in a real-world setting rather than a tightly controlled lab improves what's called ecological validity, meaning results are more likely to hold up outside the study.
Statistical conclusion validity is the accuracy of the statistical inferences you make about the relationship between variables. Three factors support it:
- Adequate sample size: Larger samples give you more statistical power to detect real effects and reduce the risk of Type II errors (failing to detect an effect that's actually there); see the simulation sketch after this list.
- Reliable measurements: Using consistent, validated tools to measure the dependent variable (e.g., standardized tests, calibrated instruments).
- Replication: Repeating the experiment to see if findings hold up, which increases confidence in the results.
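One rough way to gauge whether a sample size is adequate is to simulate the experiment many times and count how often a real effect is detected. This sketch uses an approximate two-sided z-test with a known standard deviation; all the numbers (effect size, spread, baseline score) are hypothetical:

```python
import random
import statistics

def estimated_power(n_per_group, effect=3.0, sd=10.0, trials=2_000):
    """Estimate power by simulation: the fraction of simulated experiments
    whose observed mean difference clears an approximate z-test cutoff."""
    rng = random.Random(0)
    se = sd * (2 / n_per_group) ** 0.5    # standard error of the mean difference
    cutoff = 1.96 * se                    # two-sided test at alpha = 0.05
    hits = 0
    for _ in range(trials):
        control = [rng.gauss(70.0, sd) for _ in range(n_per_group)]
        treated = [rng.gauss(70.0 + effect, sd) for _ in range(n_per_group)]
        if abs(statistics.mean(treated) - statistics.mean(control)) > cutoff:
            hits += 1
    return hits / trials

print(estimated_power(30))    # small sample: power around 0.2 -- most real effects missed
print(estimated_power(200))   # larger sample: power climbs above the usual 0.8 target
```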
Key elements of experimental design
These are the core principles that tie everything together:
- Randomization: Randomly assigning participants to groups to reduce bias and make groups comparable.
- Control: Using a control group to isolate the effect of the independent variable. Without a control, you have no baseline for comparison.
- Placebo effect: Participants sometimes improve just because they believe they're receiving treatment. Using a placebo (an inactive treatment that looks real) in the control group helps you separate genuine treatment effects from psychological ones.
- Experimental design type: The structure of your experiment should match your research question. In a between-subjects design, different people are in each group. In a within-subjects design, the same people experience all conditions. Each has trade-offs in terms of controlling for individual differences and practical feasibility.
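As a final sketch, a within-subjects design needs the order of conditions randomized (counterbalanced) per participant, so that practice or fatigue effects don't pile onto one condition and masquerade as a treatment effect. Names here are hypothetical:

```python
import random
from itertools import permutations

# Within-subjects sketch: every participant experiences all conditions,
# with the order randomized per participant so order effects average out.
conditions = ["standard_tutoring", "new_tutoring"]
orders = list(permutations(conditions))

rng = random.Random(3)
schedule = {f"P{i:02d}": list(rng.choice(orders)) for i in range(1, 11)}
for participant, order in schedule.items():
    print(participant, "->", " then ".join(order))
```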