The Maxwell-Boltzmann distribution describes how molecular speeds are spread out in a gas at thermal equilibrium. It's one of the most important results in kinetic theory because it bridges the gap between what individual molecules are doing and the bulk properties you can measure, like temperature and pressure.
Molecular velocity distribution basics
In any gas at thermal equilibrium, molecules aren't all moving at the same speed. Instead, there's a continuous range of velocities, from nearly zero to very fast. The molecular velocity distribution tells you what fraction of molecules have speeds in any given range. This is a statistical description: you can't predict what one molecule is doing, but you can precisely describe the behavior of a huge collection of them.
Maxwell-Boltzmann distribution
The Maxwell-Boltzmann distribution gives the probability density for molecular speeds in an ideal gas at thermal equilibrium. It was developed by James Clerk Maxwell and Ludwig Boltzmann in the 19th century and rests on three key assumptions:
- The molecules are identical and obey classical (not quantum) mechanics
- They don't interact with each other except through brief elastic collisions
- The system has reached thermal equilibrium
The full expression for the speed distribution is:
where is the molecular mass, is Boltzmann's constant, is absolute temperature, and is the molecular speed. Each piece of this formula has a physical role: the exponential is the Boltzmann factor (penalizing high-energy states), the factor comes from the density of states in three dimensions (more ways to have speed when is larger), and the prefactor ensures normalization.
Probability density function
The function is a probability density, meaning gives the probability that a molecule's speed falls between and . The curve is not symmetric. It rises from zero, peaks at the most probable speed, then has a long tail extending toward high speeds (positive skew). This tail matters: it means a small but significant fraction of molecules always have speeds much higher than the average.
Because it's a proper probability density, the total area under the curve equals 1:
Velocity components vs. speed
There's an important distinction between velocity components and speed:
- Each individual component (, , ) follows a Gaussian (normal) distribution centered at zero. Positive and negative values are equally likely, reflecting the isotropy of the gas.
- The speed is always non-negative and follows the Maxwell-Boltzmann distribution shown above.
The three velocity components are statistically independent and identically distributed. The Maxwell-Boltzmann speed distribution emerges when you combine these three Gaussian components and convert to spherical coordinates. The factor in the distribution comes directly from the spherical shell volume element : there are more ways to arrange three components to give a larger speed than a smaller one.
Statistical derivation
Assumptions and constraints
The derivation starts from a few physical and mathematical requirements:
- A large number of identical, non-interacting classical particles
- Thermal equilibrium (the distribution doesn't change with time)
- Conservation of total energy
- Isotropy of space (no direction is special)
- Equipartition of energy among degrees of freedom
Derivation steps
- Write the joint distribution. Because the three velocity components are independent, the probability of a molecule having velocity factors as .
- Apply isotropy. The distribution can only depend on the speed , not on direction. This constraint, combined with the factorization, leads to a functional equation whose solution is a Gaussian: .
- Use equipartition to fix the constant. The equipartition theorem says each translational degree of freedom carries average energy . This identifies .
- Normalize each component distribution. Integrating the Gaussian over all from to fixes the prefactor, giving .
- Convert to the speed distribution. Transform from Cartesian to spherical coordinates and integrate over angles. The spherical shell element introduces the factor, yielding the final Maxwell-Boltzmann speed distribution.
Normalization condition
The normalization integral is what pins down the prefactor in the distribution. Without it, would only tell you relative probabilities. With normalization, gives actual probabilities, so you can make quantitative predictions and compare distributions for different gases or temperatures.
Key features
Three characteristic speeds summarize the distribution. They differ because of the distribution's asymmetry, and each one appears in different physical contexts.
Most probable velocity
This is the speed at the peak of . You find it by setting :
It's the single most likely speed for any given molecule, but because of the long high-speed tail, most molecules actually have speeds above .
Average velocity
The mean speed, obtained by integrating :
This is slightly larger than (by a factor of ). The average speed shows up in calculations of effusion rates and collision frequencies.
Root mean square velocity
The rms speed comes from the average of :
This is the largest of the three characteristic speeds. It connects directly to the average translational kinetic energy: . That relationship is why appears in the kinetic theory derivation of pressure.
Ordering to remember: , always. The ratios are fixed: .
Temperature dependence
Effect on distribution shape
Temperature controls both the width and the height of the distribution:
- Higher : The curve broadens and its peak drops. Molecules spread over a wider range of speeds, and the high-speed tail extends further.
- Lower : The curve narrows and the peak grows taller. Molecules cluster more tightly around the most probable speed.
The total area under the curve stays at 1 regardless of temperature (normalization is preserved). The asymmetric shape also persists at all temperatures.

Velocity scaling with temperature
All three characteristic speeds scale as :
So doubling the absolute temperature increases each characteristic speed by a factor of . This square-root dependence means you need to quadruple the temperature to double the molecular speeds.
The same scaling applies when comparing molecules of different mass at the same temperature: heavier molecules move more slowly by a factor of .
Applications and implications
Kinetic theory of gases
The Maxwell-Boltzmann distribution is the engine behind kinetic theory. Using it, you can derive:
- Pressure from the rate of molecular impacts on container walls
- Temperature as a measure of average translational kinetic energy ()
- The ideal gas law from first principles
- Heat capacities through the equipartition theorem applied to molecular degrees of freedom
This molecular-level picture also explains why Boyle's law and Charles's law work: they're consequences of how molecular speeds and collision rates respond to changes in volume and temperature.
Reaction rates
For a chemical reaction to occur, colliding molecules typically need kinetic energy exceeding some activation energy . The Maxwell-Boltzmann distribution tells you what fraction of molecules meet this threshold. The fraction with energy above is proportional to , which is the core of the Arrhenius equation:
This explains why reaction rates increase sharply with temperature: even a modest temperature rise pushes significantly more molecules into the high-energy tail of the distribution.
Effusion and diffusion
- Effusion is the escape of gas through a tiny hole (smaller than the mean free path). The effusion rate depends on , leading to Graham's law: the rate of effusion is inversely proportional to . Lighter gases effuse faster.
- Diffusion (mixing of gases) also depends on molecular speeds, along with the mean free path between collisions.
Graham's law has practical applications, including isotope separation. Gaseous diffusion of was historically used to enrich uranium, exploiting the slight mass difference between and isotopes.
Experimental verification
Molecular beam experiments
The most direct test of the Maxwell-Boltzmann distribution uses molecular beams. Otto Stern and colleagues performed the pioneering experiments in the 1920s. The basic method:
- Heat a gas in an oven with a small opening to produce a molecular beam
- Collimate the beam using slits so molecules travel in a well-defined direction
- Use a velocity selector (typically rotating slotted disks) to pass only molecules in a narrow speed range
- Measure the intensity of molecules arriving at a detector as a function of the selected speed
The resulting speed distributions match the Maxwell-Boltzmann prediction with excellent agreement across different gases and temperatures.
Spectroscopic measurements
Atoms and molecules in a gas emit or absorb light at frequencies that are Doppler-shifted by their motion. Faster molecules produce larger shifts, so the shape of a spectral line reflects the velocity distribution along the line of sight. This Doppler broadening produces a Gaussian line profile (matching the Gaussian distribution of a single velocity component). High-resolution and laser-based spectroscopy (such as laser-induced fluorescence) can map out velocity distributions in detail, providing another confirmation of the Maxwell-Boltzmann prediction.
Limitations and extensions
Non-ideal gas behavior
The Maxwell-Boltzmann distribution assumes non-interacting molecules, which breaks down when:
- Pressure is high: molecules are close together and intermolecular forces (attractive and repulsive) become significant
- Temperature is low: molecules move slowly enough that attractive forces can cause clustering or condensation
In these regimes, equations of state like the van der Waals equation account for molecular interactions, and the simple Maxwell-Boltzmann distribution no longer accurately describes the velocity distribution. Dense gases and liquids require more complex statistical mechanical treatments.
Quantum mechanical considerations
Classical statistics fail when the thermal de Broglie wavelength becomes comparable to the average spacing between particles. This happens at very low temperatures or for very light particles (like helium or hydrogen). In that regime:
- Bosons (integer spin, e.g., ) follow the Bose-Einstein distribution, which allows Bose-Einstein condensation
- Fermions (half-integer spin, e.g., , electrons) follow the Fermi-Dirac distribution, which enforces the Pauli exclusion principle
The Maxwell-Boltzmann distribution emerges as the classical limit of both quantum distributions when the temperature is high enough that quantum effects are negligible.
Related distributions
Maxwell-Boltzmann vs. Boltzmann
These names are easy to confuse:
- The Boltzmann distribution is a general result: it gives the probability of any state with energy in a system at thermal equilibrium. It applies to any degree of freedom, not just translational motion.
- The Maxwell-Boltzmann speed distribution is a specific application of the Boltzmann distribution to the translational kinetic energy of gas molecules in three dimensions. The factor (from the density of states) is what makes it different from a simple exponential decay.
Comparison with other distributions
- Each velocity component follows a Gaussian (normal) distribution, centered at zero
- The speed distribution is mathematically a Maxwell distribution, which is related to the chi distribution with 3 degrees of freedom (since speed is the magnitude of a 3D vector with normally distributed components)
- The energy distribution follows a chi-squared distribution with 3 degrees of freedom
- At relativistic speeds (), the Maxwell-Boltzmann distribution is replaced by the Jüttner distribution, though this is rarely relevant for ordinary gases