Step 1: Categorical data (Topics 2.1-2.3)Start with two-way tables. Practice computing joint, marginal, and conditional relative frequencies from a single table, then sketch a segmented bar graph from the same data. Confirm you can state whether an association exists and explain why using conditional distributions.
Step 2: Scatterplots and correlation (Topics 2.4-2.5)Describe several scatterplots using form, direction, strength, and unusual features. Then practice interpreting r values in context, including explaining why a strong r does not imply causation and why r near 1 does not guarantee a linear form.
Step 3: Regression models and predictions (Topic 2.6)Given a regression equation, practice writing slope and intercept interpretations in context using the exact units of the variables. Calculate predicted values and identify when a prediction involves extrapolation.
Step 4: Residuals and model assessment (Topics 2.7-2.8)Calculate residuals by hand for a few data points, then practice reading residual plots to judge model fit. Use the formulas b = r(sy/sx) and a = y-bar - b(x-bar) to derive the LSRL, and interpret r² in a full sentence with context.
Step 5: Departures from linearity (Topic 2.9)Review the definitions of outlier, high-leverage point, and influential point with examples. Practice comparing residual plots and r² values before and after a log transformation to explain why the transformed model is a better fit.