Mean field theory tackles a core problem in statistical mechanics: how do you analyze a system in which a huge number of particles all interact with one another? The answer is to replace all those complicated pairwise interactions with a single average "effective field" that each particle feels. This converts an intractable many-body problem into a solvable single-body problem.

The trade-off is that you lose information about local correlations and fluctuations. But for many systems, especially those with long-range interactions or high coordination numbers, mean field theory captures the essential physics of phase transitions and collective behavior remarkably well.

Definition and Basic Concepts

The central idea: instead of tracking how spin $i$ interacts with each of its neighbors individually, you assume every spin sees the same average environment produced by all the others.

Each particle interacts with an effective field rather than with individual neighbors
The many-body problem reduces to a self-consistent single-particle problem
Fluctuations and correlations between particles are neglected by construction
Despite this simplification, the theory often gives qualitatively correct predictions for phase transitions and symmetry breaking

Assumptions and Limitations

Mean field theory rests on the assumption that the local environment of each particle is well-represented by the global average. This works when:

Interactions are long-range, so each particle couples to many others and local deviations average out
The spatial dimension is high, giving each site many neighbors (the theory becomes exact as $d \to \infty$ )
The system is far from a critical point, so fluctuations are small

It breaks down when:

Short-range correlations dominate the physics (low-dimensional systems)
The system is near a critical point, where fluctuations diverge and the correlation length grows large
You need accurate critical exponents, which mean field theory systematically gets wrong below the upper critical dimension

Mean Field Approximation Techniques

Several methods exist for implementing the mean field approximation. Each offers a different balance of accuracy, computational cost, and physical transparency.

Variational Approach

This method uses the Bogoliubov inequality to bound the true free energy from above. You choose a trial Hamiltonian (usually non-interacting) with adjustable parameters, then minimize the resulting free energy.

Write a trial Hamiltonian $H_0$ with variational parameters (e.g., an effective field $h_{\text{eff}}$ )
Compute the variational free energy: $F_{\text{var}} = F_0 + \langle H - H_0 \rangle_0$ , where $\langle \cdot \rangle_0$ denotes the average over the trial ensemble
Minimize $F_{\text{var}}$ with respect to the variational parameters
The result provides an upper bound on the true free energy $F$

The same variational logic underlies the Hartree-Fock approximation in quantum many-body theory. Enlarging the trial family (e.g., adding Jastrow factors for correlations) tightens the bound and systematically improves accuracy.
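As a worked illustration of the steps above, here is a sketch for the Ising model. It assumes the trial Hamiltonian is a set of independent spins in a field $h_{\text{eff}}$ , that the full Hamiltonian is $H = -J\sum_{\langle ij \rangle} s_i s_j - h\sum_i s_i$ with $Nz/2$ bonds, and that $\beta = 1/k_B T$ ; the symbol $N$ (number of spins) is introduced here for the example.

```latex
\begin{align}
H_0 &= -h_{\text{eff}} \sum_i s_i, \qquad
F_0 = -N k_B T \ln\!\left[2\cosh(\beta h_{\text{eff}})\right], \qquad
m \equiv \langle s_i \rangle_0 = \tanh(\beta h_{\text{eff}}) \\
\frac{F_{\text{var}}}{N} &= -k_B T \ln\!\left[2\cosh(\beta h_{\text{eff}})\right]
  - \frac{zJ}{2}\, m^2 - h\, m + h_{\text{eff}}\, m \\
\frac{1}{N}\frac{\partial F_{\text{var}}}{\partial h_{\text{eff}}}
  &= \left(h_{\text{eff}} - zJm - h\right)\frac{\partial m}{\partial h_{\text{eff}}} = 0
  \quad\Longrightarrow\quad h_{\text{eff}} = zJm + h
\end{align}
```

Under these assumptions the optimal trial field is exactly the Weiss effective field, so the variational route and the effective field route give the same self-consistency condition.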

Effective Field Method

This is the most physically intuitive approach, often introduced through Weiss molecular field theory for magnets.

Pick a single spin $s_i$ in the lattice
Replace all neighboring spins with their thermal average: $s_j \to \langle s_j \rangle = m$ (the magnetization)
The spin now sits in an effective field $h_{\text{eff}} = zJm + h$ , where $z$ is the coordination number, $J$ is the coupling constant, and $h$ is any external field
Compute $m = \tanh(\beta h_{\text{eff}})$ for the Ising model (with $\beta = 1/k_B T$ ), giving a self-consistency equation
Solve this equation (graphically or numerically) to find the equilibrium magnetization

The self-consistency requirement is what makes this a mean field theory: the average field depends on $m$ , and $m$ depends on the average field.
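The loop between $m$ and the effective field can be sketched as a fixed-point iteration. This is a minimal illustration, not a production solver: the function name `solve_magnetization`, the starting guess `m0`, and the default values ( $z = 4$ , $J = 1$ , $k_B = 1$ ) are assumptions chosen for the example.

```python
import math

def solve_magnetization(T, z=4, J=1.0, h=0.0, k_B=1.0, m0=0.5,
                        tol=1e-12, max_iter=100_000):
    """Iterate m -> tanh((z*J*m + h) / (k_B*T)) until it stops changing."""
    m = m0
    for _ in range(max_iter):
        h_eff = z * J * m + h                 # effective field felt by one spin
        m_new = math.tanh(h_eff / (k_B * T))  # thermal average in that field
        if abs(m_new - m) < tol:
            return m_new
        m = m_new
    return m  # not converged within max_iter; the last iterate is returned

m_low = solve_magnetization(T=2.0)   # below z*J/k_B = 4: settles at nonzero m
m_high = solve_magnetization(T=5.0)  # above z*J/k_B = 4: decays toward zero
print(m_low, m_high)
```

Plain iteration converges slowly close to the transition temperature, where a root finder such as bisection would be a more robust choice.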

Cluster Expansion

Rather than treating each site independently, cluster methods group nearby sites together and treat intra-cluster interactions exactly while applying the mean field approximation only to inter-cluster couplings.

Systematically includes short-range correlations that simple mean field theory misses
The Bethe-Peierls approximation is the simplest cluster method: it treats a central spin and its $z$ neighbors exactly, and it becomes exact on the Bethe lattice
Higher-order cluster methods (Cluster Variation Method, or CVM) include larger groups of sites
Truncating at different cluster sizes gives a hierarchy of increasingly accurate approximations
Widely used in alloy thermodynamics and lattice gas models

Applications in Statistical Mechanics

Ising Model

The Ising model, with spins $s_i = \pm 1$ on a lattice interacting via $H = -J\sum_{\langle ij \rangle} s_i s_j$ , is the standard testbed for mean field theory.

Mean field predicts a second-order phase transition at $T_c = zJ/k_B$ , where $z$ is the coordination number
In 1D ( $z = 2$ ), mean field incorrectly predicts a transition; the exact result shows no phase transition at finite temperature
In 2D, the exact Onsager solution gives $T_c \approx 2.27 J/k_B$ for a square lattice ( $z = 4$ ), while mean field predicts $T_c = 4J/k_B$ , overshooting by about 76%
Mean field becomes increasingly accurate as $d$ increases, and is exact for $d \to \infty$

This model cleanly illustrates both the qualitative successes and quantitative failures of the approximation.
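The square-lattice comparison is simple arithmetic, sketched below. It assumes units with $J = k_B = 1$ and uses Onsager's closed form $k_B T_c / J = 2/\ln(1 + \sqrt{2})$ for the exact value; the variable names are illustrative.

```python
import math

J = 1.0    # coupling constant (arbitrary units, assumed for the example)
k_B = 1.0  # Boltzmann constant set to 1 for convenience
z = 4      # coordination number of the square lattice

tc_mean_field = z * J / k_B
# Onsager's exact square-lattice result: k_B*T_c/J = 2 / ln(1 + sqrt(2))
tc_exact = 2.0 * J / (k_B * math.log(1.0 + math.sqrt(2.0)))
overshoot = tc_mean_field / tc_exact - 1.0

print(f"mean field T_c = {tc_mean_field:.3f} J/k_B")
print(f"exact T_c      = {tc_exact:.3f} J/k_B")
print(f"overshoot      = {overshoot:.1%}")
```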

Ferromagnetic Systems

Below the Curie temperature $T_c$ , mean field theory predicts spontaneous magnetization even without an external field. The magnetization near $T_c$ follows:

$m \propto (T_c - T)^{1/2}$

This square-root behavior (exponent $\beta = 1/2$ ) is a signature of mean field theory. Real 3D ferromagnets have $\beta \approx 0.33$ , showing that fluctuations matter near the critical point.

The predicted $T_c$ scales with the coordination number $z$ , which is qualitatively correct: more neighbors means stronger collective ordering
The overall shape of the magnetization curve $m(T)$ is qualitatively right
The theory extends naturally to antiferromagnets (using sublattice magnetizations) and more complex magnetic orderings
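The square-root law can be checked numerically. The sketch below assumes zero external field and units in which $T_c = zJ/k_B = 1$ , solves $m = \tanh(m T_c / T)$ by bisection at two temperatures just below $T_c$ , and estimates the exponent from the log-log slope. The helper name `magnetization` and the chosen reduced temperatures are assumptions for the example.

```python
import math

def magnetization(T, T_c=1.0, iterations=200):
    """Nonzero root of m = tanh(m*T_c/T) for T < T_c, found by bisection."""
    def g(m):
        return m - math.tanh(m * T_c / T)
    lo, hi = 1e-9, 1.0  # g(lo) < 0 and g(hi) > 0 whenever T < T_c
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if g(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

t_values = [1e-4, 1e-3]  # reduced temperatures t = (T_c - T)/T_c
m_values = [magnetization(1.0 - t) for t in t_values]

# Slope of log(m) against log(t) estimates the exponent beta
beta_estimate = (math.log(m_values[1]) - math.log(m_values[0])) / (
    math.log(t_values[1]) - math.log(t_values[0]))
print(f"estimated beta = {beta_estimate:.4f}")
```

The estimate should land very close to 1/2; the small deviation comes from higher-order terms that vanish as $T \to T_c$ .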

Liquid-Gas Transitions

The van der Waals equation is a classic mean field theory for fluids:

$\left(P + \frac{a}{V^2}\right)(V - b) = k_B T$

Here $V$ is the volume per molecule, $a$ accounts for the average attractive interaction between molecules, and $b$ for their finite volume. This equation predicts a critical point and a liquid-gas coexistence curve, capturing the qualitative phase diagram of real fluids. However, it yields the same mean field critical exponents ( $\beta = 1/2$ , $\gamma = 1$ ) as the magnetic case, and these fail quantitatively near the critical point.
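The critical point follows from requiring both $\partial P/\partial V = 0$ and $\partial^2 P/\partial V^2 = 0$ on the critical isotherm. The sketch below checks the standard closed-form results $V_c = 3b$ , $k_B T_c = 8a/(27b)$ , $P_c = a/(27b^2)$ against the equation of state. The values $a = b = k_B = 1$ are arbitrary choices for illustration, and $V$ is taken as the volume per molecule.

```python
def vdw_pressure(V, T, a, b, k_B=1.0):
    """van der Waals pressure, with V the volume per molecule."""
    return k_B * T / (V - b) - a / V**2

def dP_dV(V, T, a, b, k_B=1.0):
    """First derivative of the pressure along an isotherm."""
    return -k_B * T / (V - b)**2 + 2.0 * a / V**3

def d2P_dV2(V, T, a, b, k_B=1.0):
    """Second derivative of the pressure along an isotherm."""
    return 2.0 * k_B * T / (V - b)**3 - 6.0 * a / V**4

a, b, k_B = 1.0, 1.0, 1.0            # illustrative parameter values
V_c = 3.0 * b
T_c = 8.0 * a / (27.0 * b * k_B)
P_c = a / (27.0 * b**2)

print(dP_dV(V_c, T_c, a, b), d2P_dV2(V_c, T_c, a, b))  # both vanish up to rounding
print(P_c * V_c / (k_B * T_c))                         # critical compressibility factor
```

The last line gives the universal van der Waals ratio $P_c V_c / (k_B T_c) = 3/8$ , which does not depend on $a$ or $b$ .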

Mathematical Formulation


Mean Field Equations

The self-consistency equation is the heart of any mean field theory. For the Ising model with coordination number $z$ and coupling $J$ :

$m = \tanh\left(\frac{zJm + h}{k_B T}\right)$

where $m$ is the magnetization per spin and $h$ is the external field.

At $h = 0$ , this equation has a non-trivial solution ( $m \neq 0$ ) only below $T_c = zJ/k_B$
Near $T_c$ , you can expand the $\tanh$ to find the scaling of $m$ with temperature
These transcendental equations are typically solved graphically (plotting both sides vs. $m$ ) or numerically
More complex systems yield coupled self-consistency equations for multiple order parameters
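The expansion near $T_c$ can be written out explicitly. This is a sketch at $h = 0$ , using $T_c = zJ/k_B$ and keeping only the first two terms of $\tanh x \approx x - x^3/3$ .

```latex
\begin{align}
m &\approx \frac{T_c}{T}\, m - \frac{1}{3}\left(\frac{T_c}{T}\right)^{3} m^{3} \\
m^{2} &\approx 3\left(\frac{T}{T_c}\right)^{2} \frac{T_c - T}{T_c}
  \qquad (m \neq 0,\; T < T_c) \\
m &\approx \pm\sqrt{\frac{3\,(T_c - T)}{T_c}} \qquad \text{as } T \to T_c^{-}
\end{align}
```

The nonzero solution exists only for $T < T_c$ and vanishes as a square root, which is the mean field exponent $\beta = 1/2$ .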

Free Energy Calculations

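The mean field free energy per spin for the Ising model at $h = 0$ can be written as a function of $m$ :

$F(m) = \frac{1}{2}zJm^2 - k_B T \ln\left(2\cosh\left(\frac{zJm}{k_B T}\right)\right)$

The quadratic term enters with a plus sign: it corrects for the double counting of bonds in the effective-field energy, and with this sign the condition $dF/dm = 0$ gives $m = \tanh(zJm/k_B T)$ .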

Near the critical point, this can be expanded in powers of $m$ to obtain the Landau form:

$F(m) \approx F_0 + a(T - T_c)m^2 + bm^4 + \cdots$

The sign change of the quadratic coefficient at $T_c$ signals the phase transition. Minimizing $F(m)$ with respect to $m$ recovers the self-consistency equation and determines the equilibrium state.
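The link between minimization and self-consistency can be checked numerically. The sketch below assumes the per-spin form $F(m) = \frac{1}{2}zJm^2 - k_B T \ln(2\cosh(zJm/k_B T))$ at $h = 0$ , whose stationarity condition is $m = \tanh(zJm/k_B T)$ . It locates the minimum by a brute-force grid search and compares it with the fixed point of the self-consistency equation. The function names, grid size, and parameter values ( $z = 4$ , $J = k_B = 1$ ) are illustrative assumptions.

```python
import math

def free_energy(m, T, z=4, J=1.0, k_B=1.0):
    """Mean field free energy per spin at zero external field."""
    x = z * J * m / (k_B * T)
    return 0.5 * z * J * m**2 - k_B * T * math.log(2.0 * math.cosh(x))

def minimize_on_grid(T, n=100_001):
    """Grid point in [0, 1] with the lowest free energy."""
    grid = [i / (n - 1) for i in range(n)]
    return min(grid, key=lambda m: free_energy(m, T))

def self_consistent_m(T, z=4, J=1.0, k_B=1.0, iterations=10_000):
    """Fixed point of m = tanh(z*J*m / (k_B*T)), starting from m = 1."""
    m = 1.0
    for _ in range(iterations):
        m = math.tanh(z * J * m / (k_B * T))
    return m

m_min_low = minimize_on_grid(T=2.0)    # below z*J/k_B = 4
m_sc_low = self_consistent_m(T=2.0)
m_min_high = minimize_on_grid(T=5.0)   # above z*J/k_B = 4
print(m_min_low, m_sc_low, m_min_high)
```

Below the transition the two estimates agree to within the grid spacing, and above it the minimum sits at $m = 0$ .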

Order Parameters

An order parameter is a quantity that distinguishes the ordered phase from the disordered one. It's zero in the disordered (high-symmetry) phase and nonzero in the ordered (broken-symmetry) phase.

Magnetization $m$ for ferromagnets
Density difference $\rho_l - \rho_g$ for the liquid-gas transition
Superconducting gap $\Delta$ for superconductors

For continuous (second-order) transitions, the order parameter grows continuously from zero at $T_c$ . For first-order transitions, it jumps discontinuously. Near the critical point, order parameters obey scaling laws: $m \sim |T - T_c|^\beta$ .

Critical Phenomena and Phase Transitions

Mean Field Critical Exponents

Mean field theory predicts a set of universal critical exponents that are independent of microscopic details:

Exponent	Definition	Mean Field Value
$\beta$	Order parameter: $m \sim (T_c - T)^\beta$	1/2
$\gamma$	Susceptibility: $\chi \sim |T - T_c|^{-\gamma}$	1
$\delta$	Critical isotherm: $m \sim h^{1/\delta}$ at $T = T_c$	3
$\alpha$	Specific heat: $C \sim |T - T_c|^{-\alpha}$	0 (jump discontinuity)

These values are exact above the upper critical dimension $d_c = 4$ for short-range Ising-like systems. Below $d_c$ , fluctuations modify the exponents. For example, the 3D Ising model has $\beta \approx 0.33$ , $\gamma \approx 1.24$ .

Universality Classes

In reality, critical exponents depend on just a few features: spatial dimensionality, symmetry of the order parameter, and range of interactions. Systems sharing these features belong to the same universality class and have identical critical exponents.

Mean field theory misses this richness entirely. It predicts a single set of exponents for all continuous transitions, regardless of dimension or symmetry. This is one of its most significant failures and a key motivation for the renormalization group.

Landau Theory Connection

Landau theory is a phenomenological approach that expands the free energy in powers of the order parameter, guided by symmetry:

$F(m) = F_0 + a(T - T_c)m^2 + bm^4 + \cdots$

The coefficients are determined by symmetry (e.g., if the system has $m \to -m$ symmetry, only even powers appear)
Minimizing $F(m)$ reproduces the same critical exponents as microscopic mean field theories
Landau theory is equivalent to mean field theory near $T_c$ , but it's formulated without reference to any specific microscopic model
Adding gradient terms gives Ginzburg-Landau theory, which can describe spatial variations of the order parameter and the effects of fluctuations
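A short derivation shows how the exponents come out of the Landau form. As an assumption for this sketch, a field coupling $-hm$ is added to the expansion (it is not written in the formula above), and $a, b > 0$ .

```latex
\begin{align}
F(m) &= F_0 + a\,(T - T_c)\,m^{2} + b\,m^{4} - h\,m \\
\frac{\partial F}{\partial m} &= 2a\,(T - T_c)\,m + 4b\,m^{3} - h = 0 \\
h = 0,\; T < T_c: \quad & m^{2} = \frac{a\,(T_c - T)}{2b}
  \;\Longrightarrow\; \beta = \tfrac{1}{2} \\
T = T_c: \quad & m = \left(\frac{h}{4b}\right)^{1/3}
  \;\Longrightarrow\; \delta = 3 \\
T > T_c,\; h \to 0: \quad & \chi = \frac{\partial m}{\partial h} = \frac{1}{2a\,(T - T_c)}
  \;\Longrightarrow\; \gamma = 1
\end{align}
```

None of these steps used microscopic details, only the symmetry-allowed form of the expansion, which is why every mean field model shares the same exponents.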

Beyond Mean Field Theory

Fluctuations and Correlations

Mean field theory assumes the order parameter is spatially uniform. Near a critical point, this assumption fails badly because fluctuations grow and become correlated over long distances.

The Ginzburg criterion quantifies when mean field theory breaks down. It compares the magnitude of fluctuations to the mean field order parameter. For a $d$ -dimensional system with short-range interactions, mean field theory fails when:

$|T - T_c| / T_c \lesssim \text{Gi}$

where $\text{Gi}$ is the Ginzburg number. For 3D systems, this number can be small (mean field works over most of the phase diagram) or large (fluctuations dominate), depending on the system.

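In mean field theory (the Ornstein-Zernike form), correlation functions decay exponentially away from the critical point: $G(r) \sim e^{-r/\xi}$ . At the critical point $\xi$ diverges and the decay becomes a power law, $G(r) \sim r^{-(d-2+\eta)}$ . Mean field theory gives $\eta = 0$ , so it cannot capture the nonzero anomalous dimension $\eta$ found below the upper critical dimension.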

Renormalization Group Approach

The renormalization group (RG) is the systematic framework for handling fluctuations at all length scales. It explains why universality exists and predicts the correct critical exponents.

The basic idea: coarse-grain the system by integrating out short-wavelength fluctuations, then rescale. Fixed points of this transformation correspond to critical points, and the behavior near fixed points determines the critical exponents.

RG reduces to mean field theory above $d_c = 4$ , confirming that mean field exponents are exact there
Below $d_c$ , RG predicts non-classical exponents that match experiments
The $\epsilon$ -expansion ( $\epsilon = 4 - d$ ) provides a perturbative way to compute corrections to mean field exponents
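To make the perturbative idea concrete, here are the standard leading-order $\epsilon$ -expansion results for the Ising universality class (a single-component order parameter). They are quoted as a sketch, with higher-order terms omitted; the exponent $\nu$ (correlation length) is not otherwise used in this guide.

```latex
\begin{align}
\beta &= \frac{1}{2} - \frac{\epsilon}{6} + O(\epsilon^{2}), &
\gamma &= 1 + \frac{\epsilon}{6} + O(\epsilon^{2}), &
\nu &= \frac{1}{2} + \frac{\epsilon}{12} + O(\epsilon^{2})
\end{align}
```

Setting $\epsilon = 1$ for three dimensions gives $\beta \approx 0.33$ and $\gamma \approx 1.17$ at this order, already a clear shift away from the mean field values of $1/2$ and $1$ . Higher orders and resummation are needed for quantitative agreement.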

Corrections to Mean Field

Several systematic methods improve on mean field predictions:

High-temperature and low-temperature series expansions: compute thermodynamic quantities as power series in $\beta J$ or $e^{-\beta J}$ , then extrapolate
$\epsilon$ -expansion: expand critical exponents in powers of $\epsilon = d_c - d$ , where $d_c$ is the upper critical dimension
Bethe-Peierls approximation: treat a cluster of spins exactly, embedding it in a mean field environment
Effective field theories: incorporate local fluctuations while keeping the mean field structure at large scales


Numerical Methods

Monte Carlo Simulations

Monte Carlo methods sample configurations stochastically, weighted by the Boltzmann factor, to compute thermal averages without solving the partition function analytically.

Start from an initial configuration
Propose a move (e.g., flip a spin)
Accept or reject the move based on the Metropolis criterion: accept if $\Delta E \le 0$ ; if $\Delta E > 0$ , accept with probability $e^{-\beta \Delta E}$ , where $\beta = 1/k_B T$
Repeat to generate a Markov chain of configurations
Compute averages over the chain after equilibration

Monte Carlo provides numerically exact results (within statistical error) for finite systems and serves as a benchmark for testing mean field predictions. It can handle complex geometries and interactions that are analytically intractable.
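The steps above can be sketched for the 2D Ising model on a small periodic square lattice. This is a minimal illustration under stated assumptions: single-spin-flip moves, units with $k_B = 1$ , a fully ordered starting configuration, and a fixed random seed. The function name `metropolis_ising` and all default parameters are choices made for the example, and the lattice is far too small for quantitative results.

```python
import math
import random

def metropolis_ising(L=8, T=2.0, sweeps=400, burn_in=100, J=1.0, seed=0):
    """Return the mean |m| per spin for an L x L periodic Ising lattice."""
    rng = random.Random(seed)
    spins = [[1] * L for _ in range(L)]   # start from the fully ordered state
    beta = 1.0 / T                        # inverse temperature, k_B = 1
    abs_m_samples = []
    for sweep in range(sweeps):
        for _ in range(L * L):            # one sweep = L*L proposed flips
            i, j = rng.randrange(L), rng.randrange(L)
            neighbors = (spins[(i + 1) % L][j] + spins[(i - 1) % L][j]
                         + spins[i][(j + 1) % L] + spins[i][(j - 1) % L])
            delta_E = 2.0 * J * spins[i][j] * neighbors  # energy cost of the flip
            # Metropolis rule: always accept downhill, sometimes accept uphill
            if delta_E <= 0.0 or rng.random() < math.exp(-beta * delta_E):
                spins[i][j] = -spins[i][j]
        if sweep >= burn_in:              # measure only after equilibration
            m = sum(map(sum, spins)) / (L * L)
            abs_m_samples.append(abs(m))
    return sum(abs_m_samples) / len(abs_m_samples)

print(metropolis_ising(T=1.0))  # well below the transition: nearly saturated
print(metropolis_ising(T=5.0))  # well above the transition: small |m|
```

The absolute value of $m$ is averaged because a finite lattice can flip between the two ordered states, which would otherwise average the magnetization to zero.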

Molecular Dynamics vs. Mean Field

Molecular dynamics (MD) integrates Newton's equations for all particles, giving access to both equilibrium and dynamical properties. Compared to mean field theory:

MD captures the full microscopic dynamics, including correlations and fluctuations
It's far more computationally expensive, scaling with system size and simulation time
MD can validate or refute mean field predictions for specific systems
Hybrid approaches like Car-Parrinello molecular dynamics combine quantum mean field (density functional theory) with classical dynamics for the nuclei

Strengths and Weaknesses

Accuracy vs. Simplicity

Mean field theory's greatest strength is that it provides a qualitatively correct picture of phase transitions with minimal computational effort. You get the existence of a critical temperature, spontaneous symmetry breaking, and the general shape of the phase diagram, all from a self-consistency equation you can solve on paper.

The cost is quantitative accuracy near critical points and in low dimensions. For building intuition and guiding more detailed calculations, mean field theory remains indispensable.

Range of Validity

The Ginzburg criterion is the practical tool for assessing when mean field theory applies. As a rule of thumb:

High dimensions ( $d > 4$ for Ising-like systems): mean field critical exponents are exact
3D systems far from $T_c$ : mean field is usually a good approximation
3D systems near $T_c$ : mean field fails for critical exponents but may still give reasonable transition temperatures
2D and 1D systems: mean field is often qualitatively wrong (e.g., predicting transitions that don't exist in 1D)
Long-range interactions: extend the range of validity by raising the system's effective dimension

Comparison with Exact Solutions

Where exact solutions exist, they provide a clear picture of mean field theory's accuracy:

1D Ising model: exact solution shows no phase transition; mean field incorrectly predicts one
2D Ising model (Onsager): exact $T_c$ and critical exponents differ significantly from mean field values
Infinite-dimensional models: mean field becomes exact, confirming the theory's internal consistency
Spherical model: solvable in all dimensions, with mean field exponents above $d = 4$

Advanced Topics

Spin Glasses and Disorder

The Sherrington-Kirkpatrick (SK) model applies mean field theory to disordered magnetic systems where couplings $J_{ij}$ are random. The physics is dramatically richer than the ferromagnetic case:

The free energy landscape has an exponential number of metastable states
Replica symmetry breaking (Parisi's solution) is needed to correctly describe the low-temperature phase
The mathematical structure connects to optimization problems, computational complexity, and neural network theory
This is one of the rare cases where mean field theory reveals genuinely new physics rather than just approximating known behavior

Quantum Mean Field Theory

Mean field ideas extend naturally to quantum systems:

Hartree-Fock: replaces electron-electron interactions with an average potential; each electron moves in the self-consistent field of all others
Bogoliubov theory: treats weakly interacting Bose gases by replacing the condensate with a classical field
Density functional theory (DFT): maps the interacting electron problem to a non-interacting one in an effective potential (Kohn-Sham scheme)
Dynamical mean field theory (DMFT): maps a lattice problem to a self-consistent impurity problem, capturing local quantum fluctuations while treating spatial correlations at the mean field level. This is particularly powerful for strongly correlated electron systems.

Non-Equilibrium Systems

Mean field approximations also apply outside equilibrium:

Reaction-diffusion systems: replace spatial correlations with average concentrations to get rate equations
Boltzmann equation: a mean field kinetic theory where the collision term depends on the single-particle distribution function
Fokker-Planck equations: describe the evolution of probability distributions under stochastic dynamics
Applications range from polymer dynamics to epidemiology (e.g., SIR models where infection rates depend on average population fractions)
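The SIR example can be sketched as mean field rate equations in which the infection term depends only on the average fractions $S$ and $I$ , not on who is in contact with whom. The code below is a minimal illustration using forward-Euler integration; the function name `simulate_sir`, the parameter names, and the numerical values are assumptions chosen for the example.

```python
def simulate_sir(infection_rate, recovery_rate, i0=1e-3, t_max=300.0, dt=0.01):
    """Forward-Euler integration of the mean field SIR rate equations."""
    s, i, r = 1.0 - i0, i0, 0.0   # population fractions with s + i + r = 1
    for _ in range(int(t_max / dt)):
        new_infections = infection_rate * s * i * dt  # mean field contact term
        new_recoveries = recovery_rate * i * dt
        s -= new_infections
        i += new_infections - new_recoveries
        r += new_recoveries
    return s, i, r

# Ratio infection_rate / recovery_rate above 1: the outbreak spreads widely
print(simulate_sir(infection_rate=0.3, recovery_rate=0.1))
# Ratio below 1: the outbreak dies out with few infections
print(simulate_sir(infection_rate=0.05, recovery_rate=0.1))
```

The threshold at a ratio of 1 plays a role similar to a mean field transition: it separates a phase where the infected fraction decays from one where it grows collectively.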
