Classical Fields
Classical Fields
Classical Fields
Asaf Pe’er1
Definition: a field is a physical quantity defined at every point in space and time: (~x, t).
While classical particle mechanics deals with a finite number of generalized coordinates - or
degrees of freedom, qa (t) (indexed by a label a), in field theory we are interested in the
dynamics of fields
φa (~x, t) (1)
where both a and ~x are considered as labels. Thus we are dealing with a system with an
infinite number of degrees of freedom - at least one for each point ~x in space. Notice that
the concept of position has been relegated from a dynamical variable in particle
mechanics to a mere label in field theory.
As opposed to QM, in QFT the dynamical degrees of freedom are the values
of φ at every single point. As opposed to QM, where X was changed into an operator,
in QFT x just stays as a label. Moreover, I stress that φ is not a wave function; although
it obeys certain equations that look like the same equations in QM, it has a different inter-
pretation.
The most familiar examples of fields from classical physics are the electric and magnetic
~ x, t) and B(~
fields, E(~ ~ x, t). Both of these fields are spatial 3-vectors. In a more sophisticated
treatment of electromagnetism, we derive these two 3-vectors from a single 4-component
~ where µ = 0, 1, 2, 3 shows that this field is a vector in spacetime.
field, Aµ (~x, t) ≡ (φ, A),
1
Physics Dep., University College Cork
–2–
Next we are interested in knowing the dynamics of fields, namely, how they evolve. We
could do that in various ways. As an example, for the electromagnetic field, we could write
the equations of motion, which are Maxwell’s equations. However, we choose a more concise
way: the dynamics of the field is governed by a Lagrangian which is a function
of φ(~x, t), φ̇(~x, t), and ∇φ(~x, t). In all the systems we study in this course, the Lagrangian
is of the form, Z
L(t) = d3 xL(φa , ∂µ φa ), (3)
where L is officially called Lagrangian density, although everyone simply calls it the
Lagrangian. The action is
Z t2 Z Z
S= dt d xL = d4 xL.
3
(4)
t1
Note: Recall that in particle mechanics L depends on q and q̇, but not q̈. In field theory we
similarly restrict to Lagrangians L depending on φ and φ̇, but not φ̈. In principle, there is
nothing to stop L depending on ∇φ, ∇2 φ, ∇3 φ, etc. However, with an eye to later Lorentz
invariance, we will only consider Lagrangians depending on ∇φ and not higher derivatives.
The equations of motion can be determined by the principle of least action. We
vary the path, keeping the end points fixed and require δS = 0,
R 4 h ∂L i
δS = d x ∂φa δφa + ∂(∂∂L δ(∂ φ
µ a )
R 4 nh ∂L µ φa ) i o (5)
∂L ∂L
= d x ∂φa − ∂µ ∂(∂µ φa ) δφa + ∂µ ∂(∂µ φa ) δφa
The last term is a total derivative and vanishes for any δφa (~x, t) that decays at spatial infinity
and obeys δφa (~x, t1 ) = δφa (~x, t2 ) = 0. Note that equation 5 is similar to the one derived
for particle mechanics, only now the Lagrangian depends on both the time derivate (φ̇)
and the spatial derivative (∇φ) of φ; hence the use of the 4-derivative ∂µ φ. Requiring
δS = 0 for all such paths yields the Euler-Lagrange equations of motion for the fields φa ,
∂L ∂L
∂µ − = 0. (6)
∂(∂µ φa ) ∂φa
–3–
We can compare the Lagrangian in Equation 7 to the usual expression for the Lagrangian,
L = T − V , and identify the kinetic energy of the field as
1
Z
T = d3 x φ̇2 , (9)
2
and the potential energy of the field as
1 1 2 2
Z
3 2
V = dx (∇φ) + m φ . (10)
2 2
The first term in Equation 10 is called the gradient energy, while the phrase “potential
energy”, or just “potential”, is usually reserved for the last term. This also provides some
“intuitive” explanation for the choice of sign in η µν , which is opposite to that used in GR:
using the sign convention in Equation 8, the kinetic term in Equation 9 is positive.
In order to determine the Equation of motion associated with the Lagrangian in Equa-
tion 7, we compute:
∂L ∂L
= −m2 φ and = ∂ µ φ ≡ (φ̇, −∇φ). (11)
∂φ ∂(∂µ φ)
φ̈ − ∇2 φ + m2 φ = 0, (12)
∂µ ∂ µ φ + m2 φ = 0. (13)
–4–
1 ∂V
L = ∂µ φ∂ µ φ − V (φ) ⇒ ∂µ ∂ µ φ + = 0. (14)
2 ∂φ
We could also consider a Lagrangian that is linear in time derivatives, rather than
quadratic. Take a complex scalar field ψ whose dynamics is defined by the real Lagrangian,
i ⋆
L= ψ ψ̇ − ψ̇ ψ − ∇ψ ⋆ · ∇ψ − mψ ⋆ ψ.
⋆
(15)
2
The equation of motion can be determined by treating ψ and ψ ⋆ as independent objects, so
that
∂L i ∂L i ∂L
= ψ̇ − mψ and = − ψ and = −∇ψ. (16)
∂ψ ⋆ 2 ∂ ψ̇ ⋆ 2 ∂∇ψ ⋆
This gives the equation of motion,
∂ψ
i = −∇2 ψ + mψ. (17)
∂t
Equation 17 looks very much like the Schrödinger equation - only it is not !. Or, at least,
the interpretation of this equation is very different: the field ψ is a classical field with none
of the probability interpretation of the wavefunction. We will return to this point later on.
The initial data required on a Cauchy surface 1 differs for the two examples above.
When L ∼ φ̇2 , both φ and φ̇ must be specified to determine the future evolution; however
when L ∼ ψ ⋆ ψ, only ψ and ψ ⋆ are needed.
1
Intuitively, a Cauchy surface is a plane in space-time which is like an instant of time; its significance is
that giving the initial conditions on this plane determines the future (and the past) uniquely.
–5–
1.2. Locality
In each of the examples above, the Lagrangian is local. This means that there are no
terms in the Lagrangian coupling φ(~x, t) directly to φ(~y , t) with ~x 6= ~y . For example, there
are no terms that look like Z
L = d3 xd3 yφ(~x)φ(~y ). (22)
A priori, there is no reason for this. After all, ~x is merely a label, and we are quite happy
to couple other labels together (for example, the term ∂3 A0 ∂0 A3 in the Maxwell Lagrangian
2
For further discussion, please compare with the GR notes on special relativity.
–6–
couples the µ = 0 field to the µ = 3 field). But the closest we get for the ~x label is a coupling
between φ(~x) and φ(~x + δ~x) through the gradient term(∇φ)2 . This property of locality is,
as far as we know, a key feature of all theories of Nature. Indeed, one of the main reasons
for introducing field theories in classical physics is to implement locality. In this course, we
will only consider local Lagrangians.
2. Lorentz invariance
The laws of nature are relativistic. In fact, one of the main motivations to develop
quantum field theory is to reconcile quantum mechanics with special relativity.
To this end, we want to construct field theories in which space and time are
placed on an equal footing and the theory is invariant under Lorentz transformations,
′ ′
xµ → xµ = Λ µ ν xν , (23)
where the requirement that the interval is conserved implies that the elements of the matrix
Λ satisfy
Λµ σ η στ Λν τ = η µν . (24)
The matrices which satisfy Equation 24 are known as Lorentz transformations. Examples
of Lorentz transformations are rotation by an angle θ about the x3 axis, and a boost by v(< 1)
along the x1 axis; these are respectively described by the matrices
1 0 0 0 cosh φ − sinh φ 0 0
0 cos θ sin θ 0 − sinh φ cosh φ 0 0
µ µ
Λ ν = and Λ ν = (25)
0 − sin θ cos θ 0 0 0 1 0
0 0 0 1 0 0 0 1
√
where γ ≡ cosh φ ≡ 1/ 1 − v 2 . The set of Lorentz transformations form a group under
matrix multiplication, known as Lorentz group. By itself the Lorentz transformations
form a Lie group under matrix multiplications.
You have seen in previous courses how the Lorentz transformation act on the space.
However, here we are interested in seeing how the Lorentz transformation act on the fields
that live in the space. Mathematically, we need to find a representation of the Lorentz
group on the fields.
The simplest example is the scalar field which, under the Lorentz transformation x →
x = Λx, transforms as φ′ (x′ = Λx) = φ(x), or
′
The inverse Λ−1 appears in the argument because we are dealing with an active trans-
formation in which the field is truly shifted, while the coordinates are still (see Figure 1).
To see why this means that the inverse appears, it will suffice to consider a non-relativistic
example such as a temperature field. Suppose we start with an initial field φ(~x) = T (~x)
which has a hotspot at, say, ~x = (1, 0, 0). After a rotation, ~x → R~x about the z-axis, the
new field, φ′ (~x) will have the hotspot at, say ~x = (0, 1, 0).
We want now to express the new field φ′ (~x) in terms of the old field φ(~x). We thus
place ourselves at ~x′ = (0, 1, 0) and ask what the old field looked like where we have come
from at R−1 (0, 1, 0) = (1, 0, 0). This R−1 is the origin of the inverse transformation. In other
words, the transformed field, evaluated at the new (boosted) point x, must give the same
result as the original field evaluated at the point before it was boosted (Λ−1 x). (If we were
instead dealing with a passive transformation in which we relabel our choice of coordinates,
we would have instead φ(~x) → φ′ (~x) = φ(Λx)).
Fig. 1.— Left: In active transformation, a point moves from position P to P ′ (e.g., by
rotating clockwise). In passive transformation (right), the point P does not move, while the
coordinate system rotates counterclockwise. Note that in the active rotation, the coordinates
of the point P ′ are the same as those of point P relative to the rotated coordinates in the
passive rotation.
For a real scalar field we have φ(x) → φ(x) = φ(Λ−1 x). The derivative of the scalar field
is a covariant vector (consult with the SR notes), and therefore its transformation law is
where y = Λ−1 x (Compare with SR, Equations 40, 45). This means that the derivative
terms in the Lagrangian density (Equation 7) transform as
The potential terms ( 21 m2 φ2 ) transform according to Equation 26, with φ2 (x) → φ2 (y).
Putting this all together, we find that the action is indeed invariant under Lorentz transfor-
mations, Z Z Z
S= d4 xL(x) → d4 xL(y) = d4 yL(y) = S, (28)
where, in the last step, we need the fact that we don’t pick up a Jacobian factor when we
change integration variables from d4 x to d4 y. This follows because det Λ = 1. (At least
R R
for Lorentz transformation connected to the identity which, for now, is all we deal with).
In the first-order Lagrangian (Equation 15), space and time are not on the same footing.
(L is linear in time derivatives, but quadratic in spatial derivatives). The theory is thus not
Lorentz invariant.
In practice, it is easy to see if the action is Lorentz invariant: just make sure all the
Lorentz indices µ = 0, 1, 2, 3 are contracted with Lorentz invariant objects, such as the metric
ηµν . Other Lorentz invariant objects you can use include the totally antisymmetric tensor
ǫµνρσ and the matrices γµ that we will introduce when we come to discuss spinors later on.
(Equation 21) is indeed invariant. Of course, historically electrodynamics was the first
Lorentz invariant theory to be discovered: it was found even before the concept of Lorentz
invariance.
The role of symmetries in field theory is possibly even more important than in particle
mechanics. There are Lorentz symmetries, internal symmetries, gauge symmetries, super-
symmetries.... We start here by recasting Noethers theorem in a field theoretic framework.
The relationship between symmetries and conservation laws in classical field theory are
summarized in Noether’s theorem. Noether’s theorem states that every continuous
symmetry of the Lagrangian gives rise to a conserved current j µ (x) such that the
equations of motion imply
∂µ j µ = 0, (29)
~ · ~j = 0.
or, in other words, ∂j 0 /∂t + ∇
Note. A conserved current implies a conserved charge Q, defined as
Z
Q= d3 xj 0 . (30)
R3
where A is the area bounding V , and we used Stokes’ theorem. Equation 33 implies that
any charge leaving V must be accounted for by a flow of the current 3-vector ~j out of the
volume. This kind of local conservation of charge holds in any local field theory.
Proof of Noether’s Theorem.
We consider infinitesimal3 transformations (e.g., rotation) of the form
where ǫ is an infinitesimal parameter and δφa is some deformation of the field configuration.
We call this transformation a symmetry, if it leaves the equations of motion invariant:
δL = 0. This is insured if the action is invariant under Equation 34.
In fact, we can allow the action to change by a surface term, as such a term will not
affect the derivation of the Euler-Lagrange equations of motion. Thus, the Lagrangian must
be invariant under Equation 34, up to a 4-divergence:
The first term (in the square brackets) vanishes by the Euler-Lagrange Equations (6). We
thus have
∂L
δL = ∂µ δφa (37)
∂(∂µ φa )
But by the definition of symmetry transformation, Equation 35 implies that δL must be
equal to ∂µ J µ , and we thus get that
∂L
∂µ j µ = 0 for j µ (x) = δφa − J µ . (38)
∂(∂µ φa )
Equation 38 thus implies that the current j µ is conserved; moreover, it also tells us what it
is !.
3
That’s where we make use of the “continuous” property in Noether’s theorem. E.g., when you look in
the mirror, there is a parity symmetry, but it does not give rise to a conserved charge.
– 11 –
Let us demonstrate Noether’s theorem using the following example. Recall that in
classical particle mechanics, invariance under spatial translations gives rise to the con-
servation of momentum, while invariance under time translations is responsible for the
conservation of energy. We will now see the analogue in field theories.
We can describe an infinitesimal translation, xν → xν − ǫν , alternatively as a transfor-
mation of the field,
φa (x) → φa (x) + ǫν ∂ν φa (x). (39)
(Note that the sign in the field transformation is plus, instead of minus, because we are doing
an active, as opposed to passive, transformation). Similarly, the Lagrangian - being a scalar
(and assuming it has no explicit x dependence) - must transform in the same way,
(This is the correct transformation for a Lagrangian that has no explicit x dependence, but
only depends on x through the fields φa (x). All theories that we consider in this course will
have this property).
Since the change in the Lagrangian is a total derivative, we may invoke Noether’s the-
orem. In this case, we have 4 independent translations - ǫν with ν = 0, 1, 2, 3, (three spatial
translations and one translation in time), resulting in 4 conserved currents: (j µ )ν , one for
each of the translations. Using J µ = δνµ L(x) from Equation 40, we can write the 4 currents
as
∂L
(j µ )ν = ∂ν φa − δνµ L ≡ T µ ν . (41)
∂(∂µ φa )
T µ ν is called the energy momentum tensor (or stress-energy tensor) of the field φa . It
satisfied
∂µ T µ ν = 0. (42)
The four conserved quantities are given by
Z Z
3 00
E = d xT and P = d3 xT 0i ,
i
(43)
where E is the total energy of the field configuration, and P i is the total momentum of the
field configuration.
– 12 –
Consider the simplest scalar field theory with Lagrangian given by Equation 7, L =
1
∂ φ∂ µ φ
2 µ
− 21 m2 φ2 . From the above discussion and Equation 41 we find
T µν = ∂ µ φ∂ ν φ − η µν L. (44)
One can verify using the equation of motion for φ that this expression indeed satisfies ∂µ T µν =
0. For this example, the conserved energy and momentum are given by
1 2 1 1 2 2
Z
3 2
E= dx φ̇ + (∇φ) + m φ , (45)
2 2 2
Z
P = d3 xφ̇∂ i φ
i
(46)
Note that E and P i are global quantities, obtained by integrating over the entire space.
Further note that for this field, T µν is symmetric, namely T µν = T νµ . This is not always
the case. Nevertheless, there is typically a way to massage the energy momentum tensor of
any theory into a symmetric form by adding an extra term
where Γρµν is some function of the fields that is anti-symmetric in the first two indices so
Γρµν = −Γµρν . This guarantees that ∂µ ∂ρ Γρµν = 0, so that the new energy-momentum tensor
is also a conserved current.
A Trick.4
One reason we are interested in a symmetric energy-momentum tensor is to make contact
with general relativity: such an object sits on the right-hand side of Einsteins field equations.
In fact this observation provides a quick and easy way to determine a symmetric energy-
momentum tensor. First, we transform from flat Minkowski spacetime to an arbitrary curved
spacetime in the usual way: we replace ηµν with the curved metric gµν (x), and replace the
kinetic terms with suitable covariant derivatives. Then a symmetric energy momentum
tensor in the flat space theory is given by
√
µν 2 ∂( −gL)
Θ = −√ . (48)
−g ∂gµν gµν =ηµν
It should be noted however that this trick requires a little more care when working with
spinors (to be introduced below).
4
If you study GR, this is clear. If not - sorry... you may skip this part for now.
– 13 –
j µ = − ∂(∂∂Lµ φ) ω ρ ν xν ∂ρ φ + ω µ ν xν L
h i (54)
= −ω ρ ν ∂(∂∂Lµ φ) xν ∂ρ φ − δρµ xν L = −ω ρ ν T µ ρ xν .
Note that as opposed to the translation case (Equation 41), we have left the infinitesimal
choice of ω ρ ν in the expression for the current in Equation 54. But clearly, we don’t need
it there (the same way we didn’t need ǫν in Equation 41). What we do need is 6 different
– 14 –
currents - one for each independent choice of ω ρ ν . Using the anti-symmetry property of ω ρ ν ,
these currents can be written as
We already know that P i is conserved, and thus dP i /dt = 0; Equation 58 thus implies that
under boost,
d
Z
d3 xxi T 00 = constant (59)
dt
This is the statement that the center of energy of the field travels with a constant velocity. Its
kind of like a field theoretic version of Newtons first law but, rather surprisingly, appearing
here as a conservation law.
The two examples given above involved transformations of spacetime, as well as transfor-
mations of the field. An internal symmetry is one that only involves a transformation
of the fields and acts the same at every point in spacetime. The simplest example occurs
√
for a complex scalar field ψ(x) = (φ1 (x) + iφ2 (x))/ 2. We can build a real Lagrangian by
L = ∂µ ψ ⋆ ∂ µ ψ − V (|ψ|2 ) (60)
where the potential is a general polynomial in |ψ|2 = ψ ⋆ ψ. To find the equations of motion,
we could expand ψ in terms of φ1 and φ2 and work as before. However, it’s easier (and
– 15 –
equivalent) to treat ψ and ψ ⋆ as independent variables5 and vary the action with respect to
both of them. For example, varying with respect to ψ ⋆ leads to the equation of motion
∂V (ψ ⋆ ψ)
∂µ ∂ µ ψ + = 0. (61)
∂ψ ⋆
The Lagrangian has a continuous symmetry which rotates φ1 and φ2 or, equivalently,
rotates the phase of ψ:
(where the last equation holds for infinitesimal constant α, which is not a function of x - this
is what “global” means!). The Lagrangian remains invariant under this change: δL = 0 (or:
J µ = const). The associated conserved current is
Conserved currents of this type are of extreme importance in QFT. We will see later that
the conserved charges arising from currents of this type have the interpretation of electric
charge or particle number (for example, baryon or lepton number; what we really mean
is that the number of baryons minus anti-baryons is fixed).
Non Abelian Internal Symmetries6
Consider a theory involving N scalar fields φa , all with the same mass, and Lagrangian:
N N N
!2
1X 1X X
L= ∂ µ φa ∂ µ φa − m2 φ2a − g φ2a (64)
2 a=1
2 a=1 a=1
In this case the Lagrangian is invariant under the non-Abelian symmetry group G = SO(N ).
(Actually O(N ) in this case).7 One can construct theories from complex fields in a similar
manner that are invariant under an SU (N ) symmetry group. Non-Abelian symmetries of
5
This is why the Lagrangian in Equation 60 does not have a (1/2) prefactor, as in the Lagrangian for the
Klein-Gordon field.
6
Abelian group is a commutative group, namely the result of applying an operation to two of the group
members does not depend on their order.
7
“SO” stands for “Special Orthogonal”. Orthogonal group is a group of distance-preserving transforma-
tions which preserved the origin (such as rotations in Euclidean space). The group operation can be described
by a n × n orthogonal matrix. The “special” implies that the determinant of the orthogonal matrix is +1
(it could also be −1).
– 16 –
this type are often referred to as global symmetries to distinguish them from the “local
gauge” symmetries that we will meet later. Isospin is an example of such a symmetry, albeit
realized only approximately in Nature.
A Trick for Determining the Conserved Current Associated with Internal
Symmetry.
There is a quick method to determine the conserved current associated with an internal
(global) symmetry. Suppose we have an internal symmetry, δφ = αφ for which the La-
grangian is invariant: δL = 0. Here, α is a constant real number. (We may generalize the
discussion easily to a non-Abelian internal symmetry for which α becomes a matrix).
The trick: re-do the transformation, assuming that α depends on spacetime: α = α(x).
The action is no longer invariant, and δL would not be equal to 0; however, the change in
δL must be such that in the limit where α is x-independent, δL = 0. Thus, the change must
be of the form
δL = (∂µ α)hµ (φ), (65)
for some function hµ (φ), since we know that δL = 0 when α is constant. The change in the
action is therefore Z Z
δS = d xδL = − d4 xα(x)∂µ hµ
4
(66)
Here comes the point: we know that when the equations of motion are satisfied, δS = 0 for
all variations - including the one we consider here, δφ = α(x)φ. This can only happen if
∂ µ hµ = 0 (67)
We see that we can identify the function hµ = j µ as the conserved current. This way of
viewing things emphasizes that it is the derivative terms, not the potential terms in the action
that contribute to the current. (The potential terms are invariant even when α = α(x)).
Exercise: using this trick, find the conserved current for the Lagrangian we derived above
(Equation 60).
The link between the Lagrangian formalism and the quantum theory goes via the path
integral. In this course we will not discuss path integral methods, and focus instead on
canonical quantization. For this we need the Hamiltonian formalism of field theory. We
start by defining the momentum π a (x) conjugate to φa (x),
∂L
π a (x) ≡ . (68)
∂ φ̇a
– 17 –
The conjugate momentum π a (x) is a function of x, just like the field φa (x) itself. It is not to
be confused with the total momentum P i defined in Equation 43, which is a single number
characterizing the whole field configuration. The Hamiltonian density is given by
where, as in classical mechanics, we view H as a function of φa (x) and π a (x), rather than
φa (x) and φ̇a (x). The Hamiltonian is then simply
Z
H = d3 xH (70)
Clearly, the Hamiltonian agrees with the definition of the total energy (Equation 45) that
we get from applying Noether’s theorem for the time translation invariance.
In the Lagrangian formalism, Lorentz invariance is clear for all to see since the action
is invariant under Lorentz transformations. In contrast, the Hamiltonian formalism is not
manifestly Lorentz invariant: we have picked a preferred time. For example, the equations
of motion for φ(x) = φ(~x, t) arise from Hamilton’s equations,
∂H ∂H
φ̇(~x, t) = and π̇(~x, t) = − . (73)
∂π(~x, t) ∂φ(~x, t)
Unlike the Euler-Lagrange Equations (Equation 6), Equations 73 do not look Lorentz in-
variant. However, clearly, even though the Hamiltonian framework doesn’t look Lorentz
invariant, the physics must remain unchanged. If we start from a relativistic theory, all final
answers must be Lorentz invariant even if it’s not manifest at intermediate steps. We will
pause at several points along the quantum route to check that this is indeed the case.