Fiveable

🎲Statistical Mechanics Unit 8 Review

QR code for Statistical Mechanics practice questions

8.1 Molecular velocity distribution

8.1 Molecular velocity distribution

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
🎲Statistical Mechanics
Unit & Topic Study Guides

The Maxwell-Boltzmann distribution describes how molecular speeds are spread out in a gas at thermal equilibrium. It's one of the most important results in kinetic theory because it bridges the gap between what individual molecules are doing and the bulk properties you can measure, like temperature and pressure.

Molecular velocity distribution basics

In any gas at thermal equilibrium, molecules aren't all moving at the same speed. Instead, there's a continuous range of velocities, from nearly zero to very fast. The molecular velocity distribution tells you what fraction of molecules have speeds in any given range. This is a statistical description: you can't predict what one molecule is doing, but you can precisely describe the behavior of a huge collection of them.

Maxwell-Boltzmann distribution

The Maxwell-Boltzmann distribution gives the probability density for molecular speeds in an ideal gas at thermal equilibrium. It was developed by James Clerk Maxwell and Ludwig Boltzmann in the 19th century and rests on three key assumptions:

  • The molecules are identical and obey classical (not quantum) mechanics
  • They don't interact with each other except through brief elastic collisions
  • The system has reached thermal equilibrium

The full expression for the speed distribution is:

f(v)=4π(m2πkT)3/2v2emv2/2kTf(v) = 4\pi \left(\frac{m}{2\pi kT}\right)^{3/2} v^2 \, e^{-mv^2/2kT}

where mm is the molecular mass, kk is Boltzmann's constant, TT is absolute temperature, and vv is the molecular speed. Each piece of this formula has a physical role: the exponential emv2/2kTe^{-mv^2/2kT} is the Boltzmann factor (penalizing high-energy states), the v2v^2 factor comes from the density of states in three dimensions (more ways to have speed vv when vv is larger), and the prefactor ensures normalization.

Probability density function

The function f(v)f(v) is a probability density, meaning f(v)dvf(v)\,dv gives the probability that a molecule's speed falls between vv and v+dvv + dv. The curve is not symmetric. It rises from zero, peaks at the most probable speed, then has a long tail extending toward high speeds (positive skew). This tail matters: it means a small but significant fraction of molecules always have speeds much higher than the average.

Because it's a proper probability density, the total area under the curve equals 1:

0f(v)dv=1\int_0^\infty f(v)\, dv = 1

Velocity components vs. speed

There's an important distinction between velocity components and speed:

  • Each individual component (vxv_x, vyv_y, vzv_z) follows a Gaussian (normal) distribution centered at zero. Positive and negative values are equally likely, reflecting the isotropy of the gas.
  • The speed v=vx2+vy2+vz2v = \sqrt{v_x^2 + v_y^2 + v_z^2} is always non-negative and follows the Maxwell-Boltzmann distribution shown above.

The three velocity components are statistically independent and identically distributed. The Maxwell-Boltzmann speed distribution emerges when you combine these three Gaussian components and convert to spherical coordinates. The v2v^2 factor in the distribution comes directly from the spherical shell volume element 4πv2dv4\pi v^2\,dv: there are more ways to arrange three components to give a larger speed than a smaller one.

Statistical derivation

Assumptions and constraints

The derivation starts from a few physical and mathematical requirements:

  • A large number of identical, non-interacting classical particles
  • Thermal equilibrium (the distribution doesn't change with time)
  • Conservation of total energy
  • Isotropy of space (no direction is special)
  • Equipartition of energy among degrees of freedom

Derivation steps

  1. Write the joint distribution. Because the three velocity components are independent, the probability of a molecule having velocity (vx,vy,vz)(v_x, v_y, v_z) factors as g(vx)g(vy)g(vz)g(v_x)\,g(v_y)\,g(v_z).
  2. Apply isotropy. The distribution can only depend on the speed v=vx2+vy2+vz2v = \sqrt{v_x^2 + v_y^2 + v_z^2}, not on direction. This constraint, combined with the factorization, leads to a functional equation whose solution is a Gaussian: g(vi)eαvi2g(v_i) \propto e^{-\alpha v_i^2}.
  3. Use equipartition to fix the constant. The equipartition theorem says each translational degree of freedom carries average energy 12kT\frac{1}{2}kT. This identifies α=m2kT\alpha = \frac{m}{2kT}.
  4. Normalize each component distribution. Integrating the Gaussian over all viv_i from -\infty to ++\infty fixes the prefactor, giving g(vi)=(m2πkT)1/2emvi2/2kTg(v_i) = \left(\frac{m}{2\pi kT}\right)^{1/2} e^{-mv_i^2/2kT}.
  5. Convert to the speed distribution. Transform from Cartesian (vx,vy,vz)(v_x, v_y, v_z) to spherical coordinates and integrate over angles. The spherical shell element 4πv2dv4\pi v^2\,dv introduces the v2v^2 factor, yielding the final Maxwell-Boltzmann speed distribution.

Normalization condition

The normalization integral 0f(v)dv=1\int_0^\infty f(v)\,dv = 1 is what pins down the prefactor in the distribution. Without it, f(v)f(v) would only tell you relative probabilities. With normalization, f(v)dvf(v)\,dv gives actual probabilities, so you can make quantitative predictions and compare distributions for different gases or temperatures.

Key features

Three characteristic speeds summarize the distribution. They differ because of the distribution's asymmetry, and each one appears in different physical contexts.

Most probable velocity

This is the speed at the peak of f(v)f(v). You find it by setting dfdv=0\frac{df}{dv} = 0:

vp=2kTmv_p = \sqrt{\frac{2kT}{m}}

It's the single most likely speed for any given molecule, but because of the long high-speed tail, most molecules actually have speeds above vpv_p.

Average velocity

The mean speed, obtained by integrating vf(v)v\,f(v):

vˉ=8kTπm\bar{v} = \sqrt{\frac{8kT}{\pi m}}

This is slightly larger than vpv_p (by a factor of 4/π1.128\sqrt{4/\pi} \approx 1.128). The average speed shows up in calculations of effusion rates and collision frequencies.

Root mean square velocity

The rms speed comes from the average of v2v^2:

vrms=3kTmv_{\text{rms}} = \sqrt{\frac{3kT}{m}}

This is the largest of the three characteristic speeds. It connects directly to the average translational kinetic energy: Ek=12mvrms2=32kT\langle E_k \rangle = \frac{1}{2}m\,v_{\text{rms}}^2 = \frac{3}{2}kT. That relationship is why vrmsv_{\text{rms}} appears in the kinetic theory derivation of pressure.

Ordering to remember: vp<vˉ<vrmsv_p < \bar{v} < v_{\text{rms}}, always. The ratios are fixed: vp:vˉ:vrms=1:1.128:1.225v_p : \bar{v} : v_{\text{rms}} = 1 : 1.128 : 1.225.

Temperature dependence

Effect on distribution shape

Temperature controls both the width and the height of the distribution:

  • Higher TT: The curve broadens and its peak drops. Molecules spread over a wider range of speeds, and the high-speed tail extends further.
  • Lower TT: The curve narrows and the peak grows taller. Molecules cluster more tightly around the most probable speed.

The total area under the curve stays at 1 regardless of temperature (normalization is preserved). The asymmetric shape also persists at all temperatures.

Maxwell-Boltzmann distribution, Category:Maxwell–Boltzmann distributions - Wikimedia Commons

Velocity scaling with temperature

All three characteristic speeds scale as T\sqrt{T}:

v2=v1T2T1v_2 = v_1 \sqrt{\frac{T_2}{T_1}}

So doubling the absolute temperature increases each characteristic speed by a factor of 21.41\sqrt{2} \approx 1.41. This square-root dependence means you need to quadruple the temperature to double the molecular speeds.

The same scaling applies when comparing molecules of different mass at the same temperature: heavier molecules move more slowly by a factor of m1/m2\sqrt{m_1/m_2}.

Applications and implications

Kinetic theory of gases

The Maxwell-Boltzmann distribution is the engine behind kinetic theory. Using it, you can derive:

  • Pressure from the rate of molecular impacts on container walls
  • Temperature as a measure of average translational kinetic energy (32kT=12mvrms2\frac{3}{2}kT = \frac{1}{2}m\,v_{\text{rms}}^2)
  • The ideal gas law PV=NkTPV = NkT from first principles
  • Heat capacities through the equipartition theorem applied to molecular degrees of freedom

This molecular-level picture also explains why Boyle's law and Charles's law work: they're consequences of how molecular speeds and collision rates respond to changes in volume and temperature.

Reaction rates

For a chemical reaction to occur, colliding molecules typically need kinetic energy exceeding some activation energy EaE_a. The Maxwell-Boltzmann distribution tells you what fraction of molecules meet this threshold. The fraction with energy above EaE_a is proportional to eEa/kTe^{-E_a/kT}, which is the core of the Arrhenius equation:

krate=AeEa/kTk_{\text{rate}} = A\,e^{-E_a/kT}

This explains why reaction rates increase sharply with temperature: even a modest temperature rise pushes significantly more molecules into the high-energy tail of the distribution.

Effusion and diffusion

  • Effusion is the escape of gas through a tiny hole (smaller than the mean free path). The effusion rate depends on vˉ\bar{v}, leading to Graham's law: the rate of effusion is inversely proportional to m\sqrt{m}. Lighter gases effuse faster.
  • Diffusion (mixing of gases) also depends on molecular speeds, along with the mean free path between collisions.

Graham's law has practical applications, including isotope separation. Gaseous diffusion of UF6UF_6 was historically used to enrich uranium, exploiting the slight mass difference between 235U^{235}U and 238U^{238}U isotopes.

Experimental verification

Molecular beam experiments

The most direct test of the Maxwell-Boltzmann distribution uses molecular beams. Otto Stern and colleagues performed the pioneering experiments in the 1920s. The basic method:

  1. Heat a gas in an oven with a small opening to produce a molecular beam
  2. Collimate the beam using slits so molecules travel in a well-defined direction
  3. Use a velocity selector (typically rotating slotted disks) to pass only molecules in a narrow speed range
  4. Measure the intensity of molecules arriving at a detector as a function of the selected speed

The resulting speed distributions match the Maxwell-Boltzmann prediction with excellent agreement across different gases and temperatures.

Spectroscopic measurements

Atoms and molecules in a gas emit or absorb light at frequencies that are Doppler-shifted by their motion. Faster molecules produce larger shifts, so the shape of a spectral line reflects the velocity distribution along the line of sight. This Doppler broadening produces a Gaussian line profile (matching the Gaussian distribution of a single velocity component). High-resolution and laser-based spectroscopy (such as laser-induced fluorescence) can map out velocity distributions in detail, providing another confirmation of the Maxwell-Boltzmann prediction.

Limitations and extensions

Non-ideal gas behavior

The Maxwell-Boltzmann distribution assumes non-interacting molecules, which breaks down when:

  • Pressure is high: molecules are close together and intermolecular forces (attractive and repulsive) become significant
  • Temperature is low: molecules move slowly enough that attractive forces can cause clustering or condensation

In these regimes, equations of state like the van der Waals equation account for molecular interactions, and the simple Maxwell-Boltzmann distribution no longer accurately describes the velocity distribution. Dense gases and liquids require more complex statistical mechanical treatments.

Quantum mechanical considerations

Classical statistics fail when the thermal de Broglie wavelength becomes comparable to the average spacing between particles. This happens at very low temperatures or for very light particles (like helium or hydrogen). In that regime:

  • Bosons (integer spin, e.g., 4He^4\text{He}) follow the Bose-Einstein distribution, which allows Bose-Einstein condensation
  • Fermions (half-integer spin, e.g., 3He^3\text{He}, electrons) follow the Fermi-Dirac distribution, which enforces the Pauli exclusion principle

The Maxwell-Boltzmann distribution emerges as the classical limit of both quantum distributions when the temperature is high enough that quantum effects are negligible.

Maxwell-Boltzmann vs. Boltzmann

These names are easy to confuse:

  • The Boltzmann distribution P(E)eE/kTP(E) \propto e^{-E/kT} is a general result: it gives the probability of any state with energy EE in a system at thermal equilibrium. It applies to any degree of freedom, not just translational motion.
  • The Maxwell-Boltzmann speed distribution is a specific application of the Boltzmann distribution to the translational kinetic energy of gas molecules in three dimensions. The v2v^2 factor (from the density of states) is what makes it different from a simple exponential decay.

Comparison with other distributions

  • Each velocity component follows a Gaussian (normal) distribution, centered at zero
  • The speed distribution is mathematically a Maxwell distribution, which is related to the chi distribution with 3 degrees of freedom (since speed is the magnitude of a 3D vector with normally distributed components)
  • The energy distribution f(E)EeE/kTf(E) \propto \sqrt{E}\,e^{-E/kT} follows a chi-squared distribution with 3 degrees of freedom
  • At relativistic speeds (vcv \sim c), the Maxwell-Boltzmann distribution is replaced by the Jüttner distribution, though this is rarely relevant for ordinary gases