Fiveable

🎲Statistical Mechanics Unit 9 Review

QR code for Statistical Mechanics practice questions

9.4 Mean field approximation

9.4 Mean field approximation

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
🎲Statistical Mechanics
Unit & Topic Study Guides

Fundamentals of Mean Field Theory

Mean field theory tackles a core problem in statistical mechanics: how do you analyze a system where every particle interacts with every other particle? The answer is to replace all those complicated pairwise interactions with a single average "effective field" that each particle feels. This converts an intractable many-body problem into a solvable single-body problem.

The trade-off is that you lose information about local correlations and fluctuations. But for many systems, especially those with long-range interactions or high coordination numbers, mean field theory captures the essential physics of phase transitions and collective behavior remarkably well.

Definition and Basic Concepts

The central idea: instead of tracking how spin ii interacts with each of its neighbors individually, you assume every spin sees the same average environment produced by all the others.

  • Each particle interacts with an effective field rather than with individual neighbors
  • The many-body problem reduces to a self-consistent single-particle problem
  • Fluctuations and correlations between particles are neglected by construction
  • Despite this simplification, the theory often gives qualitatively correct predictions for phase transitions and symmetry breaking

Assumptions and Limitations

Mean field theory rests on the assumption that the local environment of each particle is well-represented by the global average. This works when:

  • Interactions are long-range, so each particle couples to many others and local deviations average out
  • The spatial dimension is high, giving each site many neighbors (the theory becomes exact as dd \to \infty)
  • The system is far from a critical point, where fluctuations are small

It breaks down when:

  • Short-range correlations dominate the physics (low-dimensional systems)
  • The system is near a critical point, where fluctuations diverge and the correlation length grows large
  • You need accurate critical exponents, which mean field theory systematically gets wrong below the upper critical dimension

Mean Field Approximation Techniques

Several methods exist for implementing the mean field approximation. Each offers a different balance of accuracy, computational cost, and physical transparency.

Variational Approach

This method uses the Bogoliubov inequality to bound the true free energy from above. You choose a trial Hamiltonian (usually non-interacting) with adjustable parameters, then minimize the resulting free energy.

  1. Write a trial Hamiltonian H0H_0 with variational parameters (e.g., an effective field heffh_{\text{eff}})

  2. Compute the variational free energy: Fvar=F0+HH00F_{\text{var}} = F_0 + \langle H - H_0 \rangle_0, where 0\langle \cdot \rangle_0 denotes the average over the trial ensemble

  3. Minimize FvarF_{\text{var}} with respect to the variational parameters

  4. The result provides an upper bound on the true free energy FF

This is the basis of the Hartree-Fock approximation in quantum many-body theory. Adding more variational parameters (e.g., Jastrow factors for correlations) systematically improves accuracy.

Effective Field Method

This is the most physically intuitive approach, often introduced through Weiss molecular field theory for magnets.

  1. Pick a single spin sis_i in the lattice
  2. Replace all neighboring spins with their thermal average: sjsj=ms_j \to \langle s_j \rangle = m (the magnetization)
  3. The spin now sits in an effective field heff=zJm+hh_{\text{eff}} = zJm + h, where zz is the coordination number, JJ is the coupling constant, and hh is any external field
  4. Compute m=tanh(βheff)m = \tanh(\beta h_{\text{eff}}) for the Ising model, giving a self-consistency equation
  5. Solve this equation (graphically or numerically) to find the equilibrium magnetization

The self-consistency requirement is what makes this a mean field theory: the average field depends on mm, and mm depends on the average field.

Cluster Expansion

Rather than treating each site independently, cluster methods group nearby sites together and treat intra-cluster interactions exactly while applying the mean field approximation only to inter-cluster couplings.

  • Systematically includes short-range correlations that simple mean field theory misses
  • The Bethe-Peierls approximation (Bethe lattice) is the simplest cluster method, treating a central spin and its zz neighbors exactly
  • Higher-order cluster methods (Cluster Variation Method, or CVM) include larger groups of sites
  • Truncating at different cluster sizes gives a hierarchy of increasingly accurate approximations
  • Widely used in alloy thermodynamics and lattice gas models

Applications in Statistical Mechanics

Ising Model

The Ising model, with spins si=±1s_i = \pm 1 on a lattice interacting via H=JijsisjH = -J\sum_{\langle ij \rangle} s_i s_j, is the standard testbed for mean field theory.

  • Mean field predicts a second-order phase transition at Tc=zJ/kBT_c = zJ/k_B, where zz is the coordination number
  • In 1D (z=2z = 2), mean field incorrectly predicts a transition; the exact result shows no phase transition at finite temperature
  • In 2D, the exact Onsager solution gives Tc2.27J/kBT_c \approx 2.27 J/k_B for a square lattice (z=4z = 4), while mean field predicts Tc=4J/kBT_c = 4J/k_B, overshooting by about 76%
  • Mean field becomes increasingly accurate as dd increases, and is exact for dd \to \infty

This model cleanly illustrates both the qualitative successes and quantitative failures of the approximation.

Ferromagnetic Systems

Below the Curie temperature TcT_c, mean field theory predicts spontaneous magnetization even without an external field. The magnetization near TcT_c follows:

m(TcT)1/2m \propto (T_c - T)^{1/2}

This square-root behavior (exponent β=1/2\beta = 1/2) is a signature of mean field theory. Real 3D ferromagnets have β0.33\beta \approx 0.33, showing that fluctuations matter near the critical point.

  • The predicted TcT_c scales with the coordination number zz, which is qualitatively correct: more neighbors means stronger collective ordering
  • The overall shape of the magnetization curve m(T)m(T) is qualitatively right
  • The theory extends naturally to antiferromagnets (using sublattice magnetizations) and more complex magnetic orderings

Liquid-Gas Transitions

The van der Waals equation is a classic mean field theory for fluids:

(P+aV2)(Vb)=kBT\left(P + \frac{a}{V^2}\right)(V - b) = k_B T

Here aa accounts for the average attractive interaction between molecules, and bb for their finite volume. This equation predicts a critical point and a liquid-gas coexistence curve, capturing the qualitative phase diagram of real fluids. However, it gives the same mean field critical exponents (β=1/2\beta = 1/2, γ=1\gamma = 1) that fail quantitatively near the critical point.

Mathematical Formulation

Definition and basic concepts, Mean-field approximation and the Curie-Weiss model - 集智百科 - 复杂系统|人工智能|复杂科学|复杂网络|自组织

Mean Field Equations

The self-consistency equation is the heart of any mean field theory. For the Ising model with coordination number zz and coupling JJ:

m=tanh(zJm+hkBT)m = \tanh\left(\frac{zJm + h}{k_B T}\right)

where mm is the magnetization per spin and hh is the external field.

  • At h=0h = 0, this equation has a non-trivial solution (m0m \neq 0) only below Tc=zJ/kBT_c = zJ/k_B
  • Near TcT_c, you can expand the tanh\tanh to find the scaling of mm with temperature
  • These transcendental equations are typically solved graphically (plotting both sides vs. mm) or numerically
  • More complex systems yield coupled self-consistency equations for multiple order parameters

Free Energy Calculations

The mean field free energy for the Ising model can be written as a function of mm:

F(m)=12zJm2kBTln(2cosh(zJmkBT))F(m) = -\frac{1}{2}zJm^2 - k_B T \ln\left(2\cosh\left(\frac{zJm}{k_B T}\right)\right)

Near the critical point, this can be expanded in powers of mm to obtain the Landau form:

F(m)F0+a(TTc)m2+bm4+F(m) \approx F_0 + a(T - T_c)m^2 + bm^4 + \cdots

The sign change of the quadratic coefficient at TcT_c signals the phase transition. Minimizing F(m)F(m) with respect to mm recovers the self-consistency equation and determines the equilibrium state.

Order Parameters

An order parameter is a quantity that distinguishes the ordered phase from the disordered one. It's zero in the disordered (high-symmetry) phase and nonzero in the ordered (broken-symmetry) phase.

  • Magnetization mm for ferromagnets
  • Density difference ρlρg\rho_l - \rho_g for the liquid-gas transition
  • Superconducting gap Δ\Delta for superconductors

For continuous (second-order) transitions, the order parameter grows continuously from zero at TcT_c. For first-order transitions, it jumps discontinuously. Near the critical point, order parameters obey scaling laws: mTTcβm \sim |T - T_c|^\beta.

Critical Phenomena and Phase Transitions

Mean Field Critical Exponents

Mean field theory predicts a set of universal critical exponents that are independent of microscopic details:

ExponentDefinitionMean Field Value
β\betaOrder parameter: m(TcT)βm \sim (T_c - T)^\beta1/2
γ\gammaSusceptibility: $$\chi \simT - T_c
δ\deltaCritical isotherm: mh1/δm \sim h^{1/\delta} at T=TcT = T_c3
α\alphaSpecific heat: $$C \simT - T_c
These values are exact above the upper critical dimension dc=4d_c = 4 for short-range Ising-like systems. Below dcd_c, fluctuations modify the exponents. For example, the 3D Ising model has β0.33\beta \approx 0.33, γ1.24\gamma \approx 1.24.

Universality Classes

In reality, critical exponents depend on just a few features: spatial dimensionality, symmetry of the order parameter, and range of interactions. Systems sharing these features belong to the same universality class and have identical critical exponents.

Mean field theory misses this richness entirely. It predicts a single set of exponents for all continuous transitions, regardless of dimension or symmetry. This is one of its most significant failures and a key motivation for the renormalization group.

Landau Theory Connection

Landau theory is a phenomenological approach that expands the free energy in powers of the order parameter, guided by symmetry:

F(m)=F0+a(TTc)m2+bm4+F(m) = F_0 + a(T - T_c)m^2 + bm^4 + \cdots

  • The coefficients are determined by symmetry (e.g., if the system has mmm \to -m symmetry, only even powers appear)
  • Minimizing F(m)F(m) reproduces the same critical exponents as microscopic mean field theories
  • Landau theory is equivalent to mean field theory near TcT_c, but it's formulated without reference to any specific microscopic model
  • Adding gradient terms gives Ginzburg-Landau theory, which can describe spatial variations of the order parameter and the effects of fluctuations

Beyond Mean Field Theory

Fluctuations and Correlations

Mean field theory assumes the order parameter is spatially uniform. Near a critical point, this assumption fails badly because fluctuations grow and become correlated over long distances.

The Ginzburg criterion quantifies when mean field theory breaks down. It compares the magnitude of fluctuations to the mean field order parameter. For a dd-dimensional system with short-range interactions, mean field theory fails when:

TTc/TcGi|T - T_c| / T_c \lesssim \text{Gi}

where Gi\text{Gi} is the Ginzburg number. For 3D systems, this number can be small (mean field works over most of the phase diagram) or large (fluctuations dominate), depending on the system.

In mean field theory, correlation functions decay exponentially: G(r)er/ξG(r) \sim e^{-r/\xi}. At the critical point, the true behavior is a power law: G(r)r(d2+η)G(r) \sim r^{-(d-2+\eta)}, which mean field theory cannot capture.

Renormalization Group Approach

The renormalization group (RG) is the systematic framework for handling fluctuations at all length scales. It explains why universality exists and predicts the correct critical exponents.

The basic idea: coarse-grain the system by integrating out short-wavelength fluctuations, then rescale. Fixed points of this transformation correspond to critical points, and the behavior near fixed points determines the critical exponents.

  • RG reduces to mean field theory above dc=4d_c = 4, confirming that mean field exponents are exact there
  • Below dcd_c, RG predicts non-classical exponents that match experiments
  • The ϵ\epsilon-expansion (ϵ=4d\epsilon = 4 - d) provides a perturbative way to compute corrections to mean field exponents

Corrections to Mean Field

Several systematic methods improve on mean field predictions:

  • High-temperature and low-temperature series expansions: compute thermodynamic quantities as power series in βJ\beta J or eβJe^{-\beta J}, then extrapolate
  • ϵ\epsilon-expansion: expand critical exponents in powers of ϵ=dcd\epsilon = d_c - d, where dcd_c is the upper critical dimension
  • Bethe-Peierls approximation: treat a cluster of spins exactly, embedding it in a mean field environment
  • Effective field theories: incorporate local fluctuations while keeping the mean field structure at large scales
Definition and basic concepts, Phase transitions – TikZ.net

Numerical Methods

Monte Carlo Simulations

Monte Carlo methods sample configurations stochastically, weighted by the Boltzmann factor, to compute thermal averages without solving the partition function analytically.

  1. Start from an initial configuration
  2. Propose a move (e.g., flip a spin)
  3. Accept or reject the move based on the Metropolis criterion: accept if ΔE<0\Delta E < 0; if ΔE>0\Delta E > 0, accept with probability eβΔEe^{-\beta \Delta E}
  4. Repeat to generate a Markov chain of configurations
  5. Compute averages over the chain after equilibration

Monte Carlo provides numerically exact results (within statistical error) for finite systems and serves as a benchmark for testing mean field predictions. It can handle complex geometries and interactions that are analytically intractable.

Molecular Dynamics vs. Mean Field

Molecular dynamics (MD) integrates Newton's equations for all particles, giving access to both equilibrium and dynamical properties. Compared to mean field theory:

  • MD captures the full microscopic dynamics, including correlations and fluctuations
  • It's far more computationally expensive, scaling with system size and simulation time
  • MD can validate or refute mean field predictions for specific systems
  • Hybrid approaches like Car-Parrinello molecular dynamics combine quantum mean field (density functional theory) with classical dynamics for the nuclei

Strengths and Weaknesses

Accuracy vs. Simplicity

Mean field theory's greatest strength is that it provides a qualitatively correct picture of phase transitions with minimal computational effort. You get the existence of a critical temperature, spontaneous symmetry breaking, and the general shape of the phase diagram, all from a self-consistency equation you can solve on paper.

The cost is quantitative accuracy near critical points and in low dimensions. For building intuition and guiding more detailed calculations, mean field theory remains indispensable.

Range of Validity

The Ginzburg criterion is the practical tool for assessing when mean field theory applies. As a rule of thumb:

  • High dimensions (d>4d > 4 for Ising-like systems): mean field is exact
  • 3D systems far from TcT_c: mean field is usually a good approximation
  • 3D systems near TcT_c: mean field fails for critical exponents but may still give reasonable transition temperatures
  • 2D and 1D systems: mean field is often qualitatively wrong (e.g., predicting transitions that don't exist in 1D)
  • Long-range interactions: extend the range of validity, effectively increasing the "effective dimension"

Comparison with Exact Solutions

Where exact solutions exist, they provide a clear picture of mean field theory's accuracy:

  • 1D Ising model: exact solution shows no phase transition; mean field incorrectly predicts one
  • 2D Ising model (Onsager): exact TcT_c and critical exponents differ significantly from mean field values
  • Infinite-dimensional models: mean field becomes exact, confirming the theory's internal consistency
  • Spherical model: solvable in all dimensions, with mean field exponents above d=4d = 4

Advanced Topics

Spin Glasses and Disorder

The Sherrington-Kirkpatrick (SK) model applies mean field theory to disordered magnetic systems where couplings JijJ_{ij} are random. The physics is dramatically richer than the ferromagnetic case:

  • The free energy landscape has an exponential number of metastable states
  • Replica symmetry breaking (Parisi's solution) is needed to correctly describe the low-temperature phase
  • The mathematical structure connects to optimization problems, computational complexity, and neural network theory
  • This is one of the rare cases where mean field theory reveals genuinely new physics rather than just approximating known behavior

Quantum Mean Field Theory

Mean field ideas extend naturally to quantum systems:

  • Hartree-Fock: replaces electron-electron interactions with an average potential; each electron moves in the self-consistent field of all others
  • Bogoliubov theory: treats weakly interacting Bose gases by replacing the condensate with a classical field
  • Density functional theory (DFT): maps the interacting electron problem to a non-interacting one in an effective potential (Kohn-Sham scheme)
  • Dynamical mean field theory (DMFT): maps a lattice problem to a self-consistent impurity problem, capturing local quantum fluctuations while treating spatial correlations at the mean field level. This is particularly powerful for strongly correlated electron systems.

Non-Equilibrium Systems

Mean field approximations also apply outside equilibrium:

  • Reaction-diffusion systems: replace spatial correlations with average concentrations to get rate equations
  • Boltzmann equation: a mean field kinetic theory where the collision term depends on the single-particle distribution function
  • Fokker-Planck equations: describe the evolution of probability distributions under stochastic dynamics
  • Applications range from polymer dynamics to epidemiology (e.g., SIR models where infection rates depend on average population fractions)