The covariant formulation of Maxwell's equations unifies electric and magnetic fields into a single framework built on special relativity. Instead of treating $\vec{E}$ and $\vec{B}$ as separate vector fields that transform in complicated ways between reference frames, this approach packages them into a single tensor object that transforms cleanly under Lorentz transformations. The result is a set of equations that are manifestly Lorentz-invariant and far more compact than the traditional vector calculus form.

Covariant formulation overview

The central idea is to rewrite electromagnetism using the language of four-vectors and tensors in Minkowski spacetime. Because these mathematical objects have well-defined transformation rules under Lorentz boosts and rotations, any equation built from them is automatically consistent with special relativity.

Advantages vs traditional formulation

Space and time are treated on equal footing, so Lorentz invariance is built in from the start rather than checked after the fact.
The six components of $\vec{E}$ and $\vec{B}$ merge into a single antisymmetric tensor $F^{\mu\nu}$ , cutting down on bookkeeping.
Relativistic effects (length contraction of field lines, mixing of electric and magnetic fields under boosts) emerge naturally from the tensor transformation law.
All four of Maxwell's equations reduce to just two tensor equations.

Relationship to special relativity

The formulation lives in four-dimensional Minkowski spacetime, where an event is labeled by coordinates $x^{\mu} = (ct, x, y, z)$ . Every quantity in the theory (potentials, currents, fields) is expressed as a four-vector or tensor, meaning it transforms via Lorentz transformation matrices $\Lambda^{\mu}{}_{\nu}$ . This guarantees that the laws of electromagnetism take the same form in every inertial frame, which is exactly what the postulates of special relativity demand.

Mathematical framework

Four-vectors in spacetime

A four-vector has one timelike and three spacelike components and transforms under Lorentz transformations just like the position four-vector. Two key examples:

Four-position: $x^{\mu} = (ct,\, \vec{r})$
Four-velocity: $u^{\mu} = \gamma(c,\, \vec{v})$ , where $\gamma = (1 - v^2/c^2)^{-1/2}$ is the Lorentz factor

The inner product of two four-vectors, computed with the metric tensor, is a Lorentz scalar (the same in all frames).

Metric tensor definition

The metric tensor $g_{\mu\nu}$ defines the geometry of spacetime. In flat Minkowski spacetime with the $(-, +, +, +)$ signature:

$g_{\mu\nu} = \text{diag}(-1, +1, +1, +1)$

All off-diagonal components vanish. The metric serves two roles:

It computes invariant intervals: $ds^2 = g_{\mu\nu}\,dx^{\mu}\,dx^{\nu} = -c^2 dt^2 + dx^2 + dy^2 + dz^2$
It raises and lowers tensor indices (see below).

Be careful with sign conventions. Some textbooks use $(+, -, -, -)$ . The physics is the same, but signs in component expressions for $F^{\mu\nu}$ will differ.

Raising and lowering indices

The metric tensor converts between contravariant (upper) and covariant (lower) indices:

Lowering: $A_{\mu} = g_{\mu\nu}\,A^{\nu}$
Raising: $A^{\mu} = g^{\mu\nu}\,A_{\nu}$

In Minkowski spacetime this is straightforward: raising or lowering a spatial index leaves the component unchanged, while raising or lowering the time index flips the sign. For example, if $A^{\mu} = (A^0, A^1, A^2, A^3)$ , then $A_{\mu} = (-A^0, A^1, A^2, A^3)$ .

Electromagnetic field tensor

The electromagnetic field tensor $F^{\mu\nu}$ is a rank-2 antisymmetric tensor that encodes all six components of the electric and magnetic fields. It's defined as the antisymmetric derivative of the four-potential:

$F^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}$

Tensor components and structure

Because $F^{\mu\nu} = -F^{\nu\mu}$ , the diagonal entries vanish and only six independent components remain. Written as a matrix (using the $(-, +, +, +)$ convention):

$F^{\mu\nu} = \begin{pmatrix} 0 & -E_x/c & -E_y/c & -E_z/c \\ E_x/c & 0 & -B_z & B_y \\ E_y/c & B_z & 0 & -B_x \\ E_z/c & -B_y & B_x & 0 \end{pmatrix}$

The time-space components ( $F^{0i}$ ) carry the electric field, while the space-space components ( $F^{ij}$ ) carry the magnetic field. Specifically:

$F^{0i} = -E^i/c$
$F^{ij} = -\epsilon^{ijk}B_k$

Electric and magnetic fields

You can read off the fields from the tensor:

Electric field: $E^i = -c\,F^{0i}$
Magnetic field: $B_k = -\tfrac{1}{2}\,\epsilon_{ijk}\,F^{ij}$

The key conceptual point: $\vec{E}$ and $\vec{B}$ are not independent entities. They are different components of the same object, $F^{\mu\nu}$ . What one observer calls a pure electric field, another observer (in a different frame) may see as a mixture of electric and magnetic fields.

Transformation properties

Under a Lorentz transformation $\Lambda^{\mu}{}_{\nu}$ , the field tensor transforms as:

$F'^{\mu\nu} = \Lambda^{\mu}{}_{\alpha}\,\Lambda^{\nu}{}_{\beta}\,F^{\alpha\beta}$

This is the standard transformation law for a rank-2 contravariant tensor. It automatically mixes the $\vec{E}$ and $\vec{B}$ components in exactly the right way, reproducing the field transformation formulas you may have seen derived by other methods.

Advantages vs traditional formulation, Covariant formulation of classical electromagnetism - Wikipedia

Maxwell's equations in covariant form

All four of Maxwell's equations collapse into two tensor equations. The source information is carried by the four-current density:

$J^{\mu} = (c\rho,\, \vec{J})$

where $\rho$ is the charge density and $\vec{J}$ is the current density.

Inhomogeneous equations

$\partial_{\mu}F^{\mu\nu} = \mu_0 J^{\nu}$

This single equation packages together two of Maxwell's equations:

Setting $\nu = 0$ gives Gauss's law: $\nabla \cdot \vec{E} = \rho/\epsilon_0$
Setting $\nu = i$ (spatial) gives Ampère's law with Maxwell's correction: $\nabla \times \vec{B} - \mu_0\epsilon_0\,\partial\vec{E}/\partial t = \mu_0\vec{J}$

Homogeneous equations

$\partial_{[\mu}F_{\nu\lambda]} = 0$

The square brackets denote antisymmetrization over all three indices. This equation encodes the other two Maxwell equations:

Gauss's law for magnetism: $\nabla \cdot \vec{B} = 0$
Faraday's law: $\nabla \times \vec{E} = -\partial\vec{B}/\partial t$

An equivalent way to write the homogeneous equations uses the Levi-Civita tensor:

$\epsilon^{\mu\nu\lambda\rho}\,\partial_{\nu}F_{\lambda\rho} = 0$

The homogeneous equations are automatically satisfied whenever $F^{\mu\nu}$ is derived from a four-potential via $F^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}$ . This is because the antisymmetric derivative of an antisymmetric derivative vanishes identically (a consequence of partial derivatives commuting).

Compact tensor notation

To summarize, the full content of Maxwell's equations in covariant form is:

$\partial_{\mu}F^{\mu\nu} = \mu_0 J^{\nu}$ (inhomogeneous: sources generate fields)
$\partial_{[\mu}F_{\nu\lambda]} = 0$ (homogeneous: no magnetic monopoles, Faraday's law)

These two equations replace the four traditional Maxwell equations and are manifestly Lorentz-invariant.

Electromagnetic stress-energy tensor

The stress-energy tensor $T^{\mu\nu}$ describes how energy and momentum are distributed in the electromagnetic field and how they flow through space.

Tensor definition and components

$T^{\mu\nu} = \frac{1}{\mu_0}\!\left(F^{\mu\alpha}F^{\nu}{}_{\alpha} - \frac{1}{4}\,g^{\mu\nu}F_{\alpha\beta}F^{\alpha\beta}\right)$

The physical content of each block of components:

$T^{00}$ is the electromagnetic energy density: $u = \frac{1}{2}\!\left(\epsilon_0 E^2 + \frac{B^2}{\mu_0}\right)$
$T^{0i}$ is the energy flux (Poynting vector) divided by $c$ : $S^i/c$ , where $\vec{S} = \frac{1}{\mu_0}(\vec{E} \times \vec{B})$
$T^{ij}$ is the Maxwell stress tensor, representing the flux of the $i$ -th component of momentum in the $j$ -th direction.

Note that $T^{\mu\nu}$ is symmetric ( $T^{\mu\nu} = T^{\nu\mu}$ ) and traceless in vacuum ( $T^{\mu}{}_{\mu} = 0$ ).

Conservation of energy and momentum

$\partial_{\mu}T^{\mu\nu} = -F^{\nu\mu}J_{\mu}$

The right-hand side is the negative of the Lorentz force density. In regions with no charges or currents ( $J^{\mu} = 0$ ), this reduces to $\partial_{\mu}T^{\mu\nu} = 0$ , which is a local conservation law: the energy and momentum stored in the electromagnetic field are conserved.

When sources are present, the right-hand side accounts for energy and momentum being transferred between the field and the charged matter.

Lorentz force in covariant form

The relativistic equation of motion for a charged particle is:

$K^{\mu} = qF^{\mu\nu}u_{\nu}$

Here $K^{\mu} = dp^{\mu}/d\tau$ is the four-force (the rate of change of four-momentum with respect to proper time). The spatial components reproduce the familiar Lorentz force:

$\vec{F} = q(\vec{E} + \vec{v} \times \vec{B})$

The time component gives the rate at which the field does work on the particle: $K^0 = \gamma q\vec{E}\cdot\vec{v}/c$ .

Gauge invariance

The four-potential $A^{\mu}$ contains redundant degrees of freedom. Gauge invariance is the statement that this redundancy doesn't affect any physical observable.

Four-potential and gauge transformations

The four-potential combines the scalar and vector potentials:

$A^{\mu} = (\phi/c,\, \vec{A})$

A gauge transformation replaces $A^{\mu}$ with:

$A'^{\mu} = A^{\mu} + \partial^{\mu}\Lambda$

where $\Lambda(x)$ is any smooth scalar function of spacetime. Because $F^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}$ , the antisymmetry kills the $\Lambda$ terms:

$F'^{\mu\nu} = \partial^{\mu}A'^{\nu} - \partial^{\nu}A'^{\mu} = F^{\mu\nu} + (\partial^{\mu}\partial^{\nu} - \partial^{\nu}\partial^{\mu})\Lambda = F^{\mu\nu}$

So the physical fields $\vec{E}$ and $\vec{B}$ are completely unaffected.

Lorenz and Coulomb gauges

You can exploit gauge freedom to impose a condition on $A^{\mu}$ that simplifies calculations.

Lorenz gauge: $\partial_{\mu}A^{\mu} = 0$

This is the natural choice for relativistic problems because the condition itself is Lorentz-invariant.
The field equations decouple into four independent wave equations: $\Box\, A^{\mu} = \mu_0 J^{\mu}$ , where $\Box = \partial_{\mu}\partial^{\mu}$ is the d'Alembertian.

Coulomb gauge: $\nabla \cdot \vec{A} = 0$

Useful in non-relativistic or radiation problems.
The scalar potential satisfies Poisson's equation: $\nabla^2 \phi = -\rho/\epsilon_0$ (instantaneous response).
The vector potential satisfies a modified wave equation.
Not Lorentz-invariant, so less natural in a relativistic context.

Physical significance of gauge choice

No measurement can distinguish between two potentials related by a gauge transformation. The electric and magnetic fields, the Lorentz force, and the stress-energy tensor are all gauge-invariant. The choice of gauge is purely a matter of mathematical convenience: pick whichever gauge makes your particular problem easiest to solve.

Applications and examples

Electromagnetic waves in vacuum

With no sources ( $J^{\mu} = 0$ ) and in the Lorenz gauge, the potential satisfies:

$\Box\, A^{\mu} = 0$

This is the wave equation in four-dimensional spacetime. Its plane-wave solutions describe electromagnetic radiation propagating at speed $c$ , with $\vec{E}$ and $\vec{B}$ perpendicular to each other and to the propagation direction. The covariant form makes it transparent that these wave solutions are the same in every inertial frame.

Fields of moving charges

For a point charge $q$ moving with four-velocity $u^{\mu}$ , the four-current density is:

$J^{\mu}(x) = q\,u^{\mu}\,\delta^{(3)}(\vec{r} - \vec{r}_q(t))$

Solving the inhomogeneous equation $\Box\, A^{\mu} = \mu_0 J^{\mu}$ with retarded boundary conditions yields the Liénard-Wiechert potentials. The resulting fields show characteristic relativistic features: field lines of a charge in uniform motion are compressed in the transverse direction (a manifestation of Lorentz contraction), and the electric and magnetic fields mix under boosts exactly as the tensor transformation law predicts.

Relativistic electrodynamics

The covariant formulation is indispensable for problems where charges move at speeds comparable to $c$ :

Synchrotron radiation: Accelerating relativistic charges radiate in a narrow forward cone. The covariant framework handles the Lorentz boosts between the rest frame and the lab frame cleanly.
Particle accelerators: The transformation of fields between frames is essential for designing beam optics and understanding beam-beam interactions.
Relativistic Doppler effect: The four-vector description of the wave vector $k^{\mu}$ makes frequency shifts under boosts straightforward.

Connection to Lagrangian formalism

Electromagnetic Lagrangian density

The dynamics of the electromagnetic field can be derived from a single scalar Lagrangian density:

$\mathcal{L}_{EM} = -\frac{1}{4\mu_0}\,F_{\mu\nu}F^{\mu\nu} - J^{\mu}A_{\mu}$

The first term is the free-field kinetic term (it contains the energy stored in $\vec{E}$ and $\vec{B}$ ). The second term couples the field to external sources. Because $\mathcal{L}_{EM}$ is a Lorentz scalar, the action $S = \int \mathcal{L}_{EM}\, d^4x$ is Lorentz-invariant.

Derivation of field equations

Applying the principle of least action ( $\delta S = 0$ ) with $A_{\mu}$ as the dynamical variable:

Write the action: $S = \int\!\left(-\frac{1}{4\mu_0}F_{\mu\nu}F^{\mu\nu} - J^{\mu}A_{\mu}\right)d^4x$
Vary with respect to $A_{\nu}$ : $\delta S = \int\!\left(-\frac{1}{\mu_0}F^{\mu\nu}\,\partial_{\mu}\delta A_{\nu} - J^{\nu}\delta A_{\nu}\right)d^4x$
Integrate by parts (discard boundary terms): $\delta S = \int\!\left(\frac{1}{\mu_0}\partial_{\mu}F^{\mu\nu} - J^{\nu}\right)\delta A_{\nu}\,d^4x$
Since $\delta A_{\nu}$ is arbitrary, the integrand must vanish: $\partial_{\mu}F^{\mu\nu} = \mu_0 J^{\nu}$

This recovers the inhomogeneous Maxwell equations. The homogeneous equations don't come from the action; they follow identically from the definition $F^{\mu\nu} = \partial^{\mu}A^{\nu} - \partial^{\nu}A^{\mu}$ .

Note that the Lorenz gauge condition is not derived from the Lagrangian. It's an additional constraint you impose separately to fix the gauge freedom.

Noether's theorem and conserved quantities

Noether's theorem states that every continuous symmetry of the action yields a conserved current. For the electromagnetic Lagrangian:

Spacetime translation invariance (the Lagrangian doesn't depend explicitly on $x^{\mu}$ ) gives conservation of energy and momentum, encoded in $\partial_{\mu}T^{\mu\nu} = 0$ (in the absence of sources).
Gauge invariance ( $A^{\mu} \to A^{\mu} + \partial^{\mu}\Lambda$ ) leads to charge conservation: $\partial_{\mu}J^{\mu} = 0$ . You can verify this directly by taking $\partial_{\nu}$ of the inhomogeneous equation and using the antisymmetry of $F^{\mu\nu}$ .
Lorentz invariance yields conservation of angular momentum, including both orbital and "spin" contributions from the field.

2,589 studying →