Fiveable

🔋Electromagnetism II Unit 5 Review

QR code for Electromagnetism II practice questions

5.3 Covariant formulation of Maxwell's equations

5.3 Covariant formulation of Maxwell's equations

Written by the Fiveable Content Team • Last updated August 2025
Written by the Fiveable Content Team • Last updated August 2025
🔋Electromagnetism II
Unit & Topic Study Guides

The covariant formulation of Maxwell's equations unifies electric and magnetic fields into a single framework built on special relativity. Instead of treating E\vec{E} and B\vec{B} as separate vector fields that transform in complicated ways between reference frames, this approach packages them into a single tensor object that transforms cleanly under Lorentz transformations. The result is a set of equations that are manifestly Lorentz-invariant and far more compact than the traditional vector calculus form.

Covariant formulation overview

The central idea is to rewrite electromagnetism using the language of four-vectors and tensors in Minkowski spacetime. Because these mathematical objects have well-defined transformation rules under Lorentz boosts and rotations, any equation built from them is automatically consistent with special relativity.

Advantages vs traditional formulation

  • Space and time are treated on equal footing, so Lorentz invariance is built in from the start rather than checked after the fact.
  • The six components of E\vec{E} and B\vec{B} merge into a single antisymmetric tensor FμνF^{\mu\nu}, cutting down on bookkeeping.
  • Relativistic effects (length contraction of field lines, mixing of electric and magnetic fields under boosts) emerge naturally from the tensor transformation law.
  • All four of Maxwell's equations reduce to just two tensor equations.

Relationship to special relativity

The formulation lives in four-dimensional Minkowski spacetime, where an event is labeled by coordinates xμ=(ct,x,y,z)x^{\mu} = (ct, x, y, z). Every quantity in the theory (potentials, currents, fields) is expressed as a four-vector or tensor, meaning it transforms via Lorentz transformation matrices Λμν\Lambda^{\mu}{}_{\nu}. This guarantees that the laws of electromagnetism take the same form in every inertial frame, which is exactly what the postulates of special relativity demand.

Mathematical framework

Four-vectors in spacetime

A four-vector has one timelike and three spacelike components and transforms under Lorentz transformations just like the position four-vector. Two key examples:

  • Four-position: xμ=(ct,r)x^{\mu} = (ct,\, \vec{r})
  • Four-velocity: uμ=γ(c,v)u^{\mu} = \gamma(c,\, \vec{v}), where γ=(1v2/c2)1/2\gamma = (1 - v^2/c^2)^{-1/2} is the Lorentz factor

The inner product of two four-vectors, computed with the metric tensor, is a Lorentz scalar (the same in all frames).

Metric tensor definition

The metric tensor gμνg_{\mu\nu} defines the geometry of spacetime. In flat Minkowski spacetime with the (,+,+,+)(-, +, +, +) signature:

gμν=diag(1,+1,+1,+1)g_{\mu\nu} = \text{diag}(-1, +1, +1, +1)

All off-diagonal components vanish. The metric serves two roles:

  1. It computes invariant intervals: ds2=gμνdxμdxν=c2dt2+dx2+dy2+dz2ds^2 = g_{\mu\nu}\,dx^{\mu}\,dx^{\nu} = -c^2 dt^2 + dx^2 + dy^2 + dz^2
  2. It raises and lowers tensor indices (see below).

Be careful with sign conventions. Some textbooks use (+,,,)(+, -, -, -). The physics is the same, but signs in component expressions for FμνF^{\mu\nu} will differ.

Raising and lowering indices

The metric tensor converts between contravariant (upper) and covariant (lower) indices:

  • Lowering: Aμ=gμνAνA_{\mu} = g_{\mu\nu}\,A^{\nu}
  • Raising: Aμ=gμνAνA^{\mu} = g^{\mu\nu}\,A_{\nu}

In Minkowski spacetime this is straightforward: raising or lowering a spatial index leaves the component unchanged, while raising or lowering the time index flips the sign. For example, if Aμ=(A0,A1,A2,A3)A^{\mu} = (A^0, A^1, A^2, A^3), then Aμ=(A0,A1,A2,A3)A_{\mu} = (-A^0, A^1, A^2, A^3).

Electromagnetic field tensor

The electromagnetic field tensor FμνF^{\mu\nu} is a rank-2 antisymmetric tensor that encodes all six components of the electric and magnetic fields. It's defined as the antisymmetric derivative of the four-potential:

Fμν=μAννAμF^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}

Tensor components and structure

Because Fμν=FνμF^{\mu\nu} = -F^{\nu\mu}, the diagonal entries vanish and only six independent components remain. Written as a matrix (using the (,+,+,+)(-, +, +, +) convention):

Fμν=(0Ex/cEy/cEz/cEx/c0BzByEy/cBz0BxEz/cByBx0)F^{\mu\nu} = \begin{pmatrix} 0 & -E_x/c & -E_y/c & -E_z/c \\ E_x/c & 0 & -B_z & B_y \\ E_y/c & B_z & 0 & -B_x \\ E_z/c & -B_y & B_x & 0 \end{pmatrix}

The time-space components (F0iF^{0i}) carry the electric field, while the space-space components (FijF^{ij}) carry the magnetic field. Specifically:

  • F0i=Ei/cF^{0i} = -E^i/c
  • Fij=ϵijkBkF^{ij} = -\epsilon^{ijk}B_k

Electric and magnetic fields

You can read off the fields from the tensor:

  • Electric field: Ei=cF0iE^i = -c\,F^{0i}
  • Magnetic field: Bk=12ϵijkFijB_k = -\tfrac{1}{2}\,\epsilon_{ijk}\,F^{ij}

The key conceptual point: E\vec{E} and B\vec{B} are not independent entities. They are different components of the same object, FμνF^{\mu\nu}. What one observer calls a pure electric field, another observer (in a different frame) may see as a mixture of electric and magnetic fields.

Transformation properties

Under a Lorentz transformation Λμν\Lambda^{\mu}{}_{\nu}, the field tensor transforms as:

Fμν=ΛμαΛνβFαβF'^{\mu\nu} = \Lambda^{\mu}{}_{\alpha}\,\Lambda^{\nu}{}_{\beta}\,F^{\alpha\beta}

This is the standard transformation law for a rank-2 contravariant tensor. It automatically mixes the E\vec{E} and B\vec{B} components in exactly the right way, reproducing the field transformation formulas you may have seen derived by other methods.

Advantages vs traditional formulation, Covariant formulation of classical electromagnetism - Wikipedia

Maxwell's equations in covariant form

All four of Maxwell's equations collapse into two tensor equations. The source information is carried by the four-current density:

Jμ=(cρ,J)J^{\mu} = (c\rho,\, \vec{J})

where ρ\rho is the charge density and J\vec{J} is the current density.

Inhomogeneous equations

μFμν=μ0Jν\partial_{\mu}F^{\mu\nu} = \mu_0 J^{\nu}

This single equation packages together two of Maxwell's equations:

  • Setting ν=0\nu = 0 gives Gauss's law: E=ρ/ϵ0\nabla \cdot \vec{E} = \rho/\epsilon_0
  • Setting ν=i\nu = i (spatial) gives Ampère's law with Maxwell's correction: ×Bμ0ϵ0E/t=μ0J\nabla \times \vec{B} - \mu_0\epsilon_0\,\partial\vec{E}/\partial t = \mu_0\vec{J}

Homogeneous equations

[μFνλ]=0\partial_{[\mu}F_{\nu\lambda]} = 0

The square brackets denote antisymmetrization over all three indices. This equation encodes the other two Maxwell equations:

  • Gauss's law for magnetism: B=0\nabla \cdot \vec{B} = 0
  • Faraday's law: ×E=B/t\nabla \times \vec{E} = -\partial\vec{B}/\partial t

An equivalent way to write the homogeneous equations uses the Levi-Civita tensor:

ϵμνλρνFλρ=0\epsilon^{\mu\nu\lambda\rho}\,\partial_{\nu}F_{\lambda\rho} = 0

The homogeneous equations are automatically satisfied whenever FμνF^{\mu\nu} is derived from a four-potential via Fμν=μAννAμF^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}. This is because the antisymmetric derivative of an antisymmetric derivative vanishes identically (a consequence of partial derivatives commuting).

Compact tensor notation

To summarize, the full content of Maxwell's equations in covariant form is:

  1. μFμν=μ0Jν\partial_{\mu}F^{\mu\nu} = \mu_0 J^{\nu} (inhomogeneous: sources generate fields)
  2. [μFνλ]=0\partial_{[\mu}F_{\nu\lambda]} = 0 (homogeneous: no magnetic monopoles, Faraday's law)

These two equations replace the four traditional Maxwell equations and are manifestly Lorentz-invariant.

Electromagnetic stress-energy tensor

The stress-energy tensor TμνT^{\mu\nu} describes how energy and momentum are distributed in the electromagnetic field and how they flow through space.

Tensor definition and components

Tμν=1μ0 ⁣(FμαFνα14gμνFαβFαβ)T^{\mu\nu} = \frac{1}{\mu_0}\!\left(F^{\mu\alpha}F^{\nu}{}_{\alpha} - \frac{1}{4}\,g^{\mu\nu}F_{\alpha\beta}F^{\alpha\beta}\right)

The physical content of each block of components:

  • T00T^{00} is the electromagnetic energy density: u=12 ⁣(ϵ0E2+B2μ0)u = \frac{1}{2}\!\left(\epsilon_0 E^2 + \frac{B^2}{\mu_0}\right)
  • T0iT^{0i} is the energy flux (Poynting vector) divided by cc: Si/cS^i/c, where S=1μ0(E×B)\vec{S} = \frac{1}{\mu_0}(\vec{E} \times \vec{B})
  • TijT^{ij} is the Maxwell stress tensor, representing the flux of the ii-th component of momentum in the jj-th direction.

Note that TμνT^{\mu\nu} is symmetric (Tμν=TνμT^{\mu\nu} = T^{\nu\mu}) and traceless in vacuum (Tμμ=0T^{\mu}{}_{\mu} = 0).

Conservation of energy and momentum

μTμν=FνμJμ\partial_{\mu}T^{\mu\nu} = -F^{\nu\mu}J_{\mu}

The right-hand side is the negative of the Lorentz force density. In regions with no charges or currents (Jμ=0J^{\mu} = 0), this reduces to μTμν=0\partial_{\mu}T^{\mu\nu} = 0, which is a local conservation law: the energy and momentum stored in the electromagnetic field are conserved.

When sources are present, the right-hand side accounts for energy and momentum being transferred between the field and the charged matter.

Lorentz force in covariant form

The relativistic equation of motion for a charged particle is:

Kμ=qFμνuνK^{\mu} = qF^{\mu\nu}u_{\nu}

Here Kμ=dpμ/dτK^{\mu} = dp^{\mu}/d\tau is the four-force (the rate of change of four-momentum with respect to proper time). The spatial components reproduce the familiar Lorentz force:

F=q(E+v×B)\vec{F} = q(\vec{E} + \vec{v} \times \vec{B})

The time component gives the rate at which the field does work on the particle: K0=γqEv/cK^0 = \gamma q\vec{E}\cdot\vec{v}/c.

Gauge invariance

The four-potential AμA^{\mu} contains redundant degrees of freedom. Gauge invariance is the statement that this redundancy doesn't affect any physical observable.

Four-potential and gauge transformations

The four-potential combines the scalar and vector potentials:

Aμ=(ϕ/c,A)A^{\mu} = (\phi/c,\, \vec{A})

A gauge transformation replaces AμA^{\mu} with:

Aμ=Aμ+μΛA'^{\mu} = A^{\mu} + \partial^{\mu}\Lambda

where Λ(x)\Lambda(x) is any smooth scalar function of spacetime. Because Fμν=μAννAμF^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}, the antisymmetry kills the Λ\Lambda terms:

Fμν=μAννAμ=Fμν+(μννμ)Λ=FμνF'^{\mu\nu} = \partial^{\mu}A'^{\nu} - \partial^{\nu}A'^{\mu} = F^{\mu\nu} + (\partial^{\mu}\partial^{\nu} - \partial^{\nu}\partial^{\mu})\Lambda = F^{\mu\nu}

So the physical fields E\vec{E} and B\vec{B} are completely unaffected.

Advantages vs traditional formulation, Covariant formulation of classical electromagnetism - Wikipedia

Lorenz and Coulomb gauges

You can exploit gauge freedom to impose a condition on AμA^{\mu} that simplifies calculations.

Lorenz gauge: μAμ=0\partial_{\mu}A^{\mu} = 0

  • This is the natural choice for relativistic problems because the condition itself is Lorentz-invariant.
  • The field equations decouple into four independent wave equations: Aμ=μ0Jμ\Box\, A^{\mu} = \mu_0 J^{\mu}, where =μμ\Box = \partial_{\mu}\partial^{\mu} is the d'Alembertian.

Coulomb gauge: A=0\nabla \cdot \vec{A} = 0

  • Useful in non-relativistic or radiation problems.
  • The scalar potential satisfies Poisson's equation: 2ϕ=ρ/ϵ0\nabla^2 \phi = -\rho/\epsilon_0 (instantaneous response).
  • The vector potential satisfies a modified wave equation.
  • Not Lorentz-invariant, so less natural in a relativistic context.

Physical significance of gauge choice

No measurement can distinguish between two potentials related by a gauge transformation. The electric and magnetic fields, the Lorentz force, and the stress-energy tensor are all gauge-invariant. The choice of gauge is purely a matter of mathematical convenience: pick whichever gauge makes your particular problem easiest to solve.

Applications and examples

Electromagnetic waves in vacuum

With no sources (Jμ=0J^{\mu} = 0) and in the Lorenz gauge, the potential satisfies:

Aμ=0\Box\, A^{\mu} = 0

This is the wave equation in four-dimensional spacetime. Its plane-wave solutions describe electromagnetic radiation propagating at speed cc, with E\vec{E} and B\vec{B} perpendicular to each other and to the propagation direction. The covariant form makes it transparent that these wave solutions are the same in every inertial frame.

Fields of moving charges

For a point charge qq moving with four-velocity uμu^{\mu}, the four-current density is:

Jμ(x)=quμδ(3)(rrq(t))J^{\mu}(x) = q\,u^{\mu}\,\delta^{(3)}(\vec{r} - \vec{r}_q(t))

Solving the inhomogeneous equation Aμ=μ0Jμ\Box\, A^{\mu} = \mu_0 J^{\mu} with retarded boundary conditions yields the Liénard-Wiechert potentials. The resulting fields show characteristic relativistic features: field lines of a charge in uniform motion are compressed in the transverse direction (a manifestation of Lorentz contraction), and the electric and magnetic fields mix under boosts exactly as the tensor transformation law predicts.

Relativistic electrodynamics

The covariant formulation is indispensable for problems where charges move at speeds comparable to cc:

  • Synchrotron radiation: Accelerating relativistic charges radiate in a narrow forward cone. The covariant framework handles the Lorentz boosts between the rest frame and the lab frame cleanly.
  • Particle accelerators: The transformation of fields between frames is essential for designing beam optics and understanding beam-beam interactions.
  • Relativistic Doppler effect: The four-vector description of the wave vector kμk^{\mu} makes frequency shifts under boosts straightforward.

Connection to Lagrangian formalism

Electromagnetic Lagrangian density

The dynamics of the electromagnetic field can be derived from a single scalar Lagrangian density:

LEM=14μ0FμνFμνJμAμ\mathcal{L}_{EM} = -\frac{1}{4\mu_0}\,F_{\mu\nu}F^{\mu\nu} - J^{\mu}A_{\mu}

The first term is the free-field kinetic term (it contains the energy stored in E\vec{E} and B\vec{B}). The second term couples the field to external sources. Because LEM\mathcal{L}_{EM} is a Lorentz scalar, the action S=LEMd4xS = \int \mathcal{L}_{EM}\, d^4x is Lorentz-invariant.

Derivation of field equations

Applying the principle of least action (δS=0\delta S = 0) with AμA_{\mu} as the dynamical variable:

  1. Write the action: S= ⁣(14μ0FμνFμνJμAμ)d4xS = \int\!\left(-\frac{1}{4\mu_0}F_{\mu\nu}F^{\mu\nu} - J^{\mu}A_{\mu}\right)d^4x

  2. Vary with respect to AνA_{\nu}: δS= ⁣(1μ0FμνμδAνJνδAν)d4x\delta S = \int\!\left(-\frac{1}{\mu_0}F^{\mu\nu}\,\partial_{\mu}\delta A_{\nu} - J^{\nu}\delta A_{\nu}\right)d^4x

  3. Integrate by parts (discard boundary terms): δS= ⁣(1μ0μFμνJν)δAνd4x\delta S = \int\!\left(\frac{1}{\mu_0}\partial_{\mu}F^{\mu\nu} - J^{\nu}\right)\delta A_{\nu}\,d^4x

  4. Since δAν\delta A_{\nu} is arbitrary, the integrand must vanish: μFμν=μ0Jν\partial_{\mu}F^{\mu\nu} = \mu_0 J^{\nu}

This recovers the inhomogeneous Maxwell equations. The homogeneous equations don't come from the action; they follow identically from the definition Fμν=μAννAμF^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}.

Note that the Lorenz gauge condition is not derived from the Lagrangian. It's an additional constraint you impose separately to fix the gauge freedom.

Noether's theorem and conserved quantities

Noether's theorem states that every continuous symmetry of the action yields a conserved current. For the electromagnetic Lagrangian:

  • Spacetime translation invariance (the Lagrangian doesn't depend explicitly on xμx^{\mu}) gives conservation of energy and momentum, encoded in μTμν=0\partial_{\mu}T^{\mu\nu} = 0 (in the absence of sources).
  • Gauge invariance (AμAμ+μΛA^{\mu} \to A^{\mu} + \partial^{\mu}\Lambda) leads to charge conservation: μJμ=0\partial_{\mu}J^{\mu} = 0. You can verify this directly by taking ν\partial_{\nu} of the inhomogeneous equation and using the antisymmetry of FμνF^{\mu\nu}.
  • Lorentz invariance yields conservation of angular momentum, including both orbital and "spin" contributions from the field.
2,589 studying →