Fiveable

🎣Statistical Inference Unit 3 Review

QR code for Statistical Inference practice questions

3.1 Bivariate and Multivariate Distributions

3.1 Bivariate and Multivariate Distributions

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
🎣Statistical Inference
Unit & Topic Study Guides

Bivariate and multivariate distributions help us understand how multiple random variables interact. They're crucial for analyzing complex systems where outcomes depend on multiple factors, like how height and weight relate or how education impacts income.

These distributions let us calculate probabilities for specific combinations of variables. We can also find marginal and conditional probabilities, giving us a deeper understanding of how variables influence each other in real-world scenarios.

Foundations of Bivariate and Multivariate Distributions

Joint probability distributions

  • Joint probability distribution describes probability of two or more random variables occurring simultaneously (coin flip and die roll)
  • Discrete joint probability distribution represented by joint probability mass function (PMF) denoted as P(X=x,Y=y)P(X=x, Y=y) for two variables (number of heads in coin flips and sum of dice rolls)
  • Continuous joint probability distribution represented by joint probability density function (PDF) denoted as f(x,y)f(x, y) for two variables (height and weight of individuals)
  • Interpretation describes relationship between multiple random variables allows calculation of probabilities for specific combinations of values
Joint probability distributions, Multivariate normal distribution - Wikipedia

Bivariate and multivariate probabilities

  • Marginal distributions obtained by summing or integrating over one variable
  • For discrete: P(X=x)=yP(X=x,Y=y)P(X=x) = \sum_y P(X=x, Y=y) (probability of getting a specific number of heads regardless of dice roll)
  • For continuous: fX(x)=f(x,y)dyf_X(x) = \int_{-\infty}^{\infty} f(x,y) dy (probability density of height regardless of weight)
  • Conditional distributions probability of one variable given a specific value of another
  • For discrete: P(Y=yX=x)=P(X=x,Y=y)P(X=x)P(Y=y|X=x) = \frac{P(X=x, Y=y)}{P(X=x)} (probability of dice sum given 3 heads in coin flips)
  • For continuous: fYX(yx)=f(x,y)fX(x)f_{Y|X}(y|x) = \frac{f(x,y)}{f_X(x)} (probability density of weight given a specific height)
  • Deriving joint PMF or PDF from given information about relationship between variables using transformation techniques for known distributions
Joint probability distributions, probability - Joint distribution of multiple binomial distributions - Mathematics Stack Exchange

Properties of multivariate distributions

  • Bivariate normal distribution joint PDF: f(x,y)=12πσXσY1ρ2exp(12(1ρ2)[(xμX)2σX2+(yμY)2σY22ρ(xμX)(yμY)σXσY])f(x,y) = \frac{1}{2\pi\sigma_X\sigma_Y\sqrt{1-\rho^2}} \exp\left(-\frac{1}{2(1-\rho^2)}[\frac{(x-\mu_X)^2}{\sigma_X^2} + \frac{(y-\mu_Y)^2}{\sigma_Y^2} - \frac{2\rho(x-\mu_X)(y-\mu_Y)}{\sigma_X\sigma_Y}]\right)
  • Parameters: means (μX,μY\mu_X, \mu_Y), standard deviations (σX,σY\sigma_X, \sigma_Y), correlation coefficient (ρ\rho)
  • Properties of bivariate normal distribution:
    • Marginal distributions univariate normal
    • Conditional distributions normal
    • Uncorrelated variables independent
  • Other common multivariate distributions:
    • Multinomial distribution models probability of different outcomes in multiple trials (rolling dice multiple times)
    • Dirichlet distribution continuous multivariate generalization of beta distribution (modeling proportions of different components in a mixture)

Visualization of multivariate data

  • Scatter plots display relationship between two variables useful for identifying patterns, correlations, and outliers (height vs weight)
  • Contour plots represent 3D surface on 2D plane show lines of constant probability density for bivariate distributions (bivariate normal distribution)
  • Heat maps visualize joint distribution of two discrete variables color intensity represents probability or frequency (contingency table for education level and income)
  • Pair plots matrix of scatter plots for multiple variables useful for exploring relationships in multivariate data (comparing multiple physical characteristics)
  • 3D surface plots visualize joint PDF for two continuous variables height represents probability density (bivariate normal distribution)
Pep mascot
Upgrade your Fiveable account to print any study guide

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Click below to go to billing portal → update your plan → choose Yearly → and select "Fiveable Share Plan". Only pay the difference

Plan is open to all students, teachers, parents, etc
Pep mascot
Upgrade your Fiveable account to export vocabulary

Download study guides as beautiful PDFs See example

Print or share PDFs with your students

Always prints our latest, updated content

Mark up and annotate as you study

Plan is open to all students, teachers, parents, etc
report an error
description

screenshots help us find and fix the issue faster (optional)

add screenshot

2,589 studying →