Dirac equation in curved spacetime

Generalization of the Dirac equation

In mathematical physics, the Dirac equation in curved spacetime is a generalization of the Dirac equation from flat spacetime (Minkowski space) to curved spacetime, a general Lorentzian manifold.

Mathematical formulation[edit]

Spacetime[edit]

In full generality the equation can be defined on

M{displaystyle M}

or

(M,g){displaystyle (M,mathbf {g} )}

a pseudo-Riemannian manifold, but for concreteness we restrict to pseudo-Riemannian manifold with signature

(+++){displaystyle (-+++)}

. The metric is referred to as

g{displaystyle mathbf {g} }

, or

gab{displaystyle g_{ab}}

in abstract index notation.

Frame fields[edit]

We use a set of vierbein or frame fields

{eμ}={e0,e1,e2,e3}{displaystyle {e_{mu }}={e_{0},e_{1},e_{2},e_{3}}}

, which are a set of vector fields (which are not necessarily defined globally on

M{displaystyle M}

). Their defining equation is

The vierbein defines a local rest frame, allowing the constant Gamma matrices to act at each spacetime point.

In differential-geometric language, the vierbein is equivalent to a section of the frame bundle, and so defines a local trivialization of the frame bundle.

Spin connection[edit]

To write down the equation we also need the spin connection, also known as the connection (1-)form. The dual frame fields

{eμ}{displaystyle {e^{mu }}}

have defining relation

The connection 1-form is then

where

a{displaystyle nabla _{a}}

is a covariant derivative, or equivalently a choice of connection on the frame bundle, most often taken to be the Levi-Civita connection.

One should be careful not to treat the abstract Latin indices and Greek indices as the same, and further to note that neither of these are coordinate indices: it can be verified that

ωμνa{displaystyle omega ^{mu }{}_{nu a}}

doesn’t transform as a tensor under a change of coordinates.

Mathematically, the frame fields

{eμ}{displaystyle {e_{mu }}}

define an isomorphism at each point

p{displaystyle p}

where they are defined from the tangent space

TpM{displaystyle T_{p}M}

to

R1,3{displaystyle mathbb {R} ^{1,3}}

. Then abstract indices label the tangent space, while greek indices label

R1,3{displaystyle mathbb {R} ^{1,3}}

. If the frame fields are position dependent then greek indices do not necessarily transform tensorially under a change of coordinates.

Raising and lowering indices is done with

gab{displaystyle g_{ab}}

for latin indices and

ημν{displaystyle eta _{mu nu }}

for greek indices.

The connection form can be viewed as a more abstract connection on a principal bundle, specifically on the frame bundle, which is defined on any smooth manifold, but which restricts to an orthonormal frame bundle on pseudo-Riemannian manifolds.

The connection form with respect to frame fields

{eμ}{displaystyle {e_{mu }}}

defined locally is, in differential-geometric language, the connection with respect to a local trivialization.

Clifford algebra[edit]

Just as with the Dirac equation on flat spacetime, we make use of the Clifford algebra, a set of four gamma matrices

{γμ}{displaystyle {gamma ^{mu }}}

satisfying

where

{,}{displaystyle {cdot ,cdot }}

is the anticommutator.

They can be used to construct a representation of the Lorentz algebra: defining

where

[,]{displaystyle [cdot ,cdot ]}

is the commutator.

It can be shown they satisfy the commutation relations of the Lorentz algebra:

They therefore are the generators of a representation of the Lorentz algebra

so(1,3){displaystyle {mathfrak {so}}(1,3)}

. But they do not generate a representation of the Lorentz group

SO(1,3){displaystyle {text{SO}}(1,3)}

, just as the Pauli matrices generate a representation of the rotation algebra

so(3){displaystyle {mathfrak {so}}(3)}

but not

SO(3){displaystyle {text{SO}}(3)}

. They in fact form a representation of

Spin(1,3).{displaystyle {text{Spin}}(1,3).}

However, it is a standard abuse of terminology to any representations of the Lorentz algebra as representations of the Lorentz group, even if they do not arise as representations of the Lorentz group.

The representation space is isomorphic to

C4{displaystyle mathbb {C} ^{4}}

as a vector space. In the classification of Lorentz group representations, the representation is labelled

(12,0)(0,12){displaystyle left({frac {1}{2}},0right)oplus left(0,{frac {1}{2}}right)}

.

The abuse of terminology extends to forming this representation at the group level. We can write a finite Lorentz transformation on

R1,3{displaystyle mathbb {R} ^{1,3}}

as

Λσρ=exp(i2αμνMμν)σρ{displaystyle Lambda _{sigma }^{rho }=exp left({frac {i}{2}}alpha _{mu nu }M^{mu nu }right){}_{sigma }^{rho }}


where

Mμν{displaystyle M^{mu nu }}

is the standard basis for the Lorentz algebra. These generators have components

or, with both indices up or both indices down, simply matrices which have

+1{displaystyle +1}

in the

μ,ν{displaystyle mu ,nu }

index and

1{displaystyle -1}

in the

ν,μ{displaystyle nu ,mu }

index, and 0 everywhere else.

If another representation

ρ{displaystyle rho }

has generators

Tμν=ρ(Mμν),{displaystyle T^{mu nu }=rho (M^{mu nu }),}

then we write

where

i,j{displaystyle i,j}

are indices for the representation space.

In the case

Tμν=σμν{displaystyle T^{mu nu }=sigma ^{mu nu }}

, without being given generator components

αμν{displaystyle alpha _{mu nu }}

for

Λσρ{displaystyle Lambda _{sigma }^{rho }}

, this

ρ(Λ){displaystyle rho (Lambda )}

is not well defined: there are sets of generator components

αμν,βμν{displaystyle alpha _{mu nu },beta _{mu nu }}

which give the same

Λσρ{displaystyle Lambda _{sigma }^{rho }}

but different

ρ(Λ)ji.{displaystyle rho (Lambda )_{j}^{i}.}

Covariant derivative for fields in a representation of the Lorentz group[edit]

Given a coordinate frame

α{displaystyle {partial _{alpha }}}

arising from say coordinates

{xα}{displaystyle {x^{alpha }}}

, the partial derivative with respect to a general orthonormal frame

{eμ}{displaystyle {e_{mu }}}

is defined

and connection components with respect to a general orthonormal frame are

These components do not transform tensorially under a change of frame, but do when combined. Also, these are definitions rather than saying that these objects can arise as partial derivatives in some coordinate chart. In general there are non-coordinate orthonormal frames, for which the commutator of vector fields is non-vanishing.

It can be checked that under the transformation

if we define the covariant derivative

then

Dμψ{displaystyle D_{mu }psi }

transforms as

This generalises to any representation

R{displaystyle R}

for the Lorentz group: if

v{displaystyle v}

is a vector field for the associated representation,

When

R{displaystyle R}

is the fundamental representation for

SO(1,3){displaystyle {text{SO}}(1,3)}

, this recovers the familiar covariant derivative for (tangent-)vector fields, of which the Levi-Civita connection is an example.

There are some subtleties in what kind of mathematical object the different types of covariant derivative are. The covariant derivative

Dαψ{displaystyle D_{alpha }psi }

in a coordinate basis is a vector-valued 1-form, which at each point

p{displaystyle p}

is an element of

EpTpM{displaystyle E_{p}otimes T_{p}^{*}M}

. The covariant derivative

Dμψ{displaystyle D_{mu }psi }

in an orthonormal basis uses the orthonormal frame

{eμ}{displaystyle {e_{mu }}}

to identify the vector-valued 1-form with a vector-valued dual vector which at each point

p{displaystyle p}

is an element of

EpR1,3,{displaystyle E_{p}otimes mathbb {R} ^{1,3},}

using that

R1,3R1,3{displaystyle {mathbb {R} ^{1,3}}^{*}cong mathbb {R} ^{1,3}}

canonically. We can then contract this with a gamma matrix 4-vector

γμ{displaystyle gamma ^{mu }}

which takes values at

p{displaystyle p}

in

End(Ep)R1,3{displaystyle {text{End}}(E_{p})otimes mathbb {R} ^{1,3}}

Dirac equation on curved spacetime[edit]

Recalling the Dirac equation on flat spacetime,

the Dirac equation on curved spacetime can be written down by promoting the partial derivative to a covariant one.

In this way, Dirac’s equation takes the following form in curved spacetime:[1].

Dirac equation on curved spacetime

(iγμDμm)Ψ=0.{displaystyle (igamma ^{mu }D_{mu }-m)Psi =0.}

where

Ψ{displaystyle Psi }

is a spinor field on spacetime. Mathematically, this is a section of a vector bundle associated to the spin-frame bundle by the representation

(1/2,0)(0,1/2).{displaystyle (1/2,0)oplus (0,1/2).}

Recovering the Klein–Gordon equation from the Dirac equation[edit]

The modified Klein–Gordon equation obtained by squaring the operator in the Dirac equation, first found by Erwin Schrödinger as cited by Pollock
[2] is given by

where

R{displaystyle R}

is the Ricci scalar, and

Fμν{displaystyle F_{mu nu }}

is the field strength of

Aμ{displaystyle A_{mu }}

. An alternative version of the Dirac equation whose Dirac operator remains the square root of the Laplacian is given by the Dirac–Kähler equation; the price to pay is the loss of Lorentz invariance in curved spacetime.

Note that here Latin indices denote the “Lorentzian” vierbein labels while Greek indices denote manifold coordinate indices.

Action formulation[edit]

We can formulate this theory in terms of an action. If in addition the spacetime

(M,g){displaystyle (M,mathbf {g} )}

is orientable, there is a preferred orientation known as the volume form

ϵ{displaystyle epsilon }

.
One can integrate functions against the volume form:

The function

Ψ¯(iγμμm)Ψ{displaystyle {bar {Psi }}(igamma ^{mu }partial _{mu }-m)Psi }


is integrated against the volume form to obtain the Dirac action

Dirac action on curved spacetime

IDirac=Md4xgΨ¯(iγμDμm)Ψ.{displaystyle I_{text{Dirac}}=int _{M}d^{4}x{sqrt {-g}},{bar {Psi }}(igamma ^{mu }D_{mu }-m)Psi .}

See also[edit]

References[edit]