Dirac equation in curved spacetime

Posted on October 25, 2016 by lordneo

Generalization of the Dirac equation

In mathematical physics, the Dirac equation in curved spacetime is a generalization of the Dirac equation from flat spacetime (Minkowski space) to curved spacetime, a general Lorentzian manifold.


Mathematical formulation

Spacetime

In full generality the equation can be defined on a pseudo-Riemannian manifold $M$, written $(M,\mathbf{g})$ when the metric is made explicit, but for concreteness we restrict to a pseudo-Riemannian manifold with signature $(-+++)$. The metric is referred to as $\mathbf{g}$, or $g_{ab}$ in abstract index notation.

Frame fields

We use a set of vierbein or frame fields $\{e_\mu\} = \{e_0, e_1, e_2, e_3\}$, which are a set of vector fields (not necessarily defined globally on $M$). Their defining equation is

$g_{ab}e_\mu^a e_\nu^b = \eta_{\mu\nu}.$

The vierbein defines a local rest frame, allowing the constant gamma matrices to act at each spacetime point.

In differential-geometric language, the vierbein is equivalent to a section of the frame bundle, and so defines a local trivialization of the frame bundle.
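As a concrete check of the defining equation, the following minimal sketch uses a spatially flat FLRW metric $ds^2 = -dt^2 + a^2(dx^2+dy^2+dz^2)$ with the frame $e_0 = \partial_t$, $e_i = a^{-1}\partial_i$. The choice of metric, the sample value of the scale factor, and the function names are illustrative assumptions, not part of the original text.

```python
import math

# Minkowski metric eta_{mu nu} in signature (-+++), as in the text.
ETA = [[-1.0, 0, 0, 0], [0, 1.0, 0, 0], [0, 0, 1.0, 0], [0, 0, 0, 1.0]]

def flrw_metric(a):
    """Components g_{ab} of ds^2 = -dt^2 + a^2 (dx^2 + dy^2 + dz^2)."""
    return [[-1.0, 0, 0, 0], [0, a * a, 0, 0], [0, 0, a * a, 0], [0, 0, 0, a * a]]

def flrw_vierbein(a):
    """Frame components e[mu][a] = e_mu^a: e_0 = d/dt, e_i = (1/a) d/dx^i."""
    return [[1.0, 0, 0, 0], [0, 1 / a, 0, 0], [0, 0, 1 / a, 0], [0, 0, 0, 1 / a]]

def frame_metric(g, e):
    """Contract g_{ab} e_mu^a e_nu^b over the coordinate indices a, b."""
    n = range(4)
    return [[sum(g[a][b] * e[mu][a] * e[nu][b] for a in n for b in n)
             for nu in n] for mu in n]

a = 2.5  # arbitrary sample value of the scale factor
h = frame_metric(flrw_metric(a), flrw_vierbein(a))
is_orthonormal = all(math.isclose(h[m][n], ETA[m][n], abs_tol=1e-12)
                     for m in range(4) for n in range(4))
print(is_orthonormal)
```

The frame is only one of many valid choices: any local Lorentz transformation of it satisfies the same defining equation.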

Spin connection

To write down the equation we also need the spin connection, also known as the connection (1-)form. The dual frame fields $\{e^\mu\}$ have defining relation

$e^\mu_a e^a_\nu = \delta^\mu{}_\nu.$

The connection 1-form is then

$\omega^\mu{}_{\nu a} := e^\mu_b \nabla_a e^b_\nu$

where $\nabla_a$ is a covariant derivative, or equivalently a choice of connection on the frame bundle, most often taken to be the Levi-Civita connection.

One should be careful not to treat the abstract Latin indices and Greek indices as the same, and further to note that neither of these is a coordinate index: it can be verified that $\omega^\mu{}_{\nu a}$ doesn't transform as a tensor under a change of coordinates.
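A short worked example may help fix the index conventions. The sketch below evaluates the connection 1-form for a spatially flat FLRW metric with the Levi-Civita connection; the metric, the frame, and the hats marking frame (Greek-type) indices are illustrative choices rather than anything taken from the original text.

```latex
% Illustrative metric:  ds^2 = -dt^2 + a(t)^2 \delta_{ij} dx^i dx^j
% Frame:       e_{\hat 0} = \partial_t,   e_{\hat\imath} = a^{-1}\partial_i
% Dual frame:  e^{\hat 0} = dt,           e^{\hat\imath} = a\,dx^i
\begin{align}
\Gamma^0{}_{ij} &= a\dot a\,\delta_{ij}, \qquad
\Gamma^i{}_{0j} = \frac{\dot a}{a}\,\delta^i_j \\
\nabla_j e^b_{\hat 0} &= \Gamma^b{}_{j0}
  \;\Rightarrow\;
  \omega^{\hat\imath}{}_{\hat 0 j} = e^{\hat\imath}_b\,\Gamma^b{}_{j0}
  = a\cdot\frac{\dot a}{a}\,\delta^i_j = \dot a\,\delta^i_j \\
\nabla_j e^0_{\hat\imath} &= a^{-1}\Gamma^0{}_{ji}
  \;\Rightarrow\;
  \omega^{\hat 0}{}_{\hat\imath j} = e^{\hat 0}_0\,a^{-1}\Gamma^0{}_{ji}
  = \dot a\,\delta_{ij}
\end{align}
```

All other components vanish for this frame. Lowering the first index with $\eta_{\mu\nu}$ gives $\omega_{\hat\imath\hat 0 j} = -\omega_{\hat 0\hat\imath j} = \dot a\,\delta_{ij}$, so the connection is antisymmetric in its frame indices, as expected for a metric-compatible connection in an orthonormal frame.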

Mathematically, the frame fields $\{e_\mu\}$ define an isomorphism at each point $p$ where they are defined from the tangent space $T_pM$ to $\mathbb{R}^{1,3}$. Then abstract indices label the tangent space, while Greek indices label $\mathbb{R}^{1,3}$. If the frame fields are position dependent then Greek indices do not necessarily transform tensorially under a change of coordinates.

Raising and lowering indices is done with $g_{ab}$ for Latin indices and $\eta_{\mu\nu}$ for Greek indices.

The connection form can be viewed as a more abstract connection on a principal bundle, specifically on the frame bundle, which is defined on any smooth manifold, but which restricts to an orthonormal frame bundle on pseudo-Riemannian manifolds.

The connection form with respect to frame fields $\{e_\mu\}$ defined locally is, in differential-geometric language, the connection with respect to a local trivialization.

Clifford algebra

Just as with the Dirac equation on flat spacetime, we make use of the Clifford algebra, a set of four gamma matrices $\{\gamma^\mu\}$ satisfying

$\{\gamma^\mu,\gamma^\nu\} = 2\eta^{\mu\nu}$

where $\{\cdot,\cdot\}$ is the anticommutator.
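One explicit set of matrices satisfying this relation for signature $(-+++)$ can be obtained by multiplying the usual chiral-basis matrices (which satisfy the relation for signature $(+---)$) by $i$. The sketch below checks this numerically; the choice of basis and the helper names are assumptions made for illustration.

```python
# Pauli matrices and the 2x2 identity/zero blocks.
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]

def scale(s, m):
    return [[s * x for x in row] for row in m]

def matmul(a, b):
    n = range(len(a))
    return [[sum(a[i][k] * b[k][j] for k in n) for j in n] for i in n]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

# Chiral-basis matrices for signature (+---), multiplied by i so that
# {gamma^mu, gamma^nu} = 2 eta^{mu nu} with eta = diag(-1, 1, 1, 1).
GAMMA = [scale(1j, block(Z2, I2, I2, Z2))] + [
    scale(1j, block(Z2, s, scale(-1, s), Z2)) for s in PAULI
]
ETA_DIAG = [-1, 1, 1, 1]  # diagonal of eta^{mu nu}

def clifford_holds():
    """Check every anticommutator against 2 eta^{mu nu} times the identity."""
    for mu in range(4):
        for nu in range(4):
            anti = add(matmul(GAMMA[mu], GAMMA[nu]), matmul(GAMMA[nu], GAMMA[mu]))
            expected = 2 * ETA_DIAG[mu] if mu == nu else 0
            for i in range(4):
                for j in range(4):
                    if anti[i][j] != (expected if i == j else 0):
                        return False
    return True

print(clifford_holds())
```

The entries are small Gaussian integers, so the comparison is exact rather than approximate.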

They can be used to construct a representation of the Lorentz algebra: defining

$\sigma^{\mu\nu} = -\frac{i}{4}[\gamma^\mu,\gamma^\nu] = -\frac{i}{2}\gamma^\mu\gamma^\nu + \frac{i}{2}\eta^{\mu\nu}$

where $[\cdot,\cdot]$ is the commutator.

It can be shown they satisfy the commutation relations of the Lorentz algebra:

$[\sigma^{\mu\nu},\sigma^{\rho\sigma}] = (-i)\left(\sigma^{\mu\sigma}\eta^{\nu\rho} - \sigma^{\nu\sigma}\eta^{\mu\rho} + \sigma^{\nu\rho}\eta^{\mu\sigma} - \sigma^{\mu\rho}\eta^{\nu\sigma}\right)$
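The commutation relations can be checked by brute force over all index combinations. The following sketch does so for one explicit choice of gamma matrices ($i$ times the chiral-basis matrices, an assumption made here so that the signature is $(-+++)$); the helper names are hypothetical.

```python
import itertools

def matmul(a, b):
    n = range(4)
    return [[sum(a[i][k] * b[k][j] for k in n) for j in n] for i in n]

def scale(s, m):
    return [[s * x for x in row] for row in m]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def comm(a, b):
    """Commutator [a, b] = ab - ba."""
    return add(matmul(a, b), scale(-1, matmul(b, a)))

def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]

I2, Z2 = [[1, 0], [0, 1]], [[0, 0], [0, 0]]
PAULI = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
ETA = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

# Gamma matrices obeying {gamma^mu, gamma^nu} = 2 eta^{mu nu}, eta = diag(-1,1,1,1).
GAMMA = [scale(1j, block(Z2, I2, I2, Z2))] + [
    scale(1j, block(Z2, s, scale(-1, s), Z2)) for s in PAULI]

# sigma^{mu nu} = -(i/4) [gamma^mu, gamma^nu]
SIGMA = [[scale(-0.25j, comm(GAMMA[m], GAMMA[n])) for n in range(4)]
         for m in range(4)]

def max_violation():
    """Largest entry of LHS - RHS of the commutation relation, over all indices."""
    worst = 0.0
    for m, n, r, s in itertools.product(range(4), repeat=4):
        lhs = comm(SIGMA[m][n], SIGMA[r][s])
        rhs = [[0] * 4 for _ in range(4)]
        for coeff, mat in ((ETA[n][r], SIGMA[m][s]), (-ETA[m][r], SIGMA[n][s]),
                           (ETA[m][s], SIGMA[n][r]), (-ETA[n][s], SIGMA[m][r])):
            rhs = add(rhs, scale(-1j * coeff, mat))
        diff = add(lhs, scale(-1, rhs))
        worst = max(worst, max(abs(x) for row in diff for x in row))
    return worst

print(max_violation() < 1e-12)
```

Because the relation follows from the Clifford algebra alone, the same check should pass for any other basis of gamma matrices with this signature.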

They therefore are the generators of a representation of the Lorentz algebra $\mathfrak{so}(1,3)$. But they do not generate a representation of the Lorentz group $\text{SO}(1,3)$, just as the Pauli matrices generate a representation of the rotation algebra $\mathfrak{so}(3)$ but not $\text{SO}(3)$. They in fact form a representation of $\text{Spin}(1,3)$. However, it is a standard abuse of terminology to refer to any representations of the Lorentz algebra as representations of the Lorentz group, even if they do not arise as representations of the Lorentz group.

The representation space is isomorphic to $\mathbb{C}^4$ as a vector space. In the classification of Lorentz group representations, the representation is labelled $\left(\frac{1}{2},0\right)\oplus\left(0,\frac{1}{2}\right)$.

The abuse of terminology extends to forming this representation at the group level. We can write a finite Lorentz transformation on $\mathbb{R}^{1,3}$ as

$\Lambda^\rho{}_\sigma = \exp\left(\frac{i}{2}\alpha_{\mu\nu}M^{\mu\nu}\right)^\rho{}_\sigma$

where $M^{\mu\nu}$ is the standard basis for the Lorentz algebra. These generators have components

$(M^{\mu\nu})^\rho{}_\sigma = \eta^{\mu\rho}\delta^\nu_\sigma - \eta^{\nu\rho}\delta^\mu_\sigma$

or, with both indices up or both indices down, simply matrices which have $+1$ in the $\mu,\nu$ index and $-1$ in the $\nu,\mu$ index, and 0 everywhere else.
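The component formula is easy to tabulate. The sketch below builds the mixed-index matrices and confirms the $\pm 1$ pattern for the case where the row index is lowered with $\eta$ (both matrix indices down); the function names are illustrative.

```python
ETA = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

def generator(mu, nu):
    """(M^{mu nu})^rho_sigma = eta^{mu rho} delta^nu_sigma - eta^{nu rho} delta^mu_sigma."""
    return [[ETA[mu][rho] * (nu == sigma) - ETA[nu][rho] * (mu == sigma)
             for sigma in range(4)] for rho in range(4)]

def lower_first_index(m):
    """Lower the row index: M_{rho sigma} = eta_{rho tau} M^tau_sigma."""
    return [[sum(ETA[rho][tau] * m[tau][sigma] for tau in range(4))
             for sigma in range(4)] for rho in range(4)]

def elementary(mu, nu):
    """Matrix with +1 at (mu, nu), -1 at (nu, mu), 0 elsewhere."""
    return [[(rho == mu and sigma == nu) - (rho == nu and sigma == mu)
             for sigma in range(4)] for rho in range(4)]

pattern_ok = all(lower_first_index(generator(mu, nu)) == elementary(mu, nu)
                 for mu in range(4) for nu in range(4))
print(pattern_ok)
```

With one index up and one down the pattern differs for boosts: `generator(0, 1)` is a symmetric matrix, while the rotation generators such as `generator(1, 2)` remain antisymmetric.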

If another representation $\rho$ has generators $T^{\mu\nu} = \rho(M^{\mu\nu})$, then we write

$\rho(\Lambda)^i{}_j = \exp\left(\frac{i}{2}\alpha_{\mu\nu}T^{\mu\nu}\right)^i{}_j$

where $i,j$ are indices for the representation space.

In the case $T^{\mu\nu} = \sigma^{\mu\nu}$, without being given generator components $\alpha_{\mu\nu}$ for $\Lambda^\rho{}_\sigma$, this $\rho(\Lambda)$ is not well defined: there are sets of generator components $\alpha_{\mu\nu},\beta_{\mu\nu}$ which give the same $\Lambda^\rho{}_\sigma$ but different $\rho(\Lambda)^i{}_j$.
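The standard illustration of this ambiguity is a rotation by $2\pi$: the components $\alpha_{12} = -\alpha_{21} = 2\pi$ and $\beta_{\mu\nu} = 0$ both give $\Lambda = 1$, yet the spinor matrices differ by a sign. The sketch below checks this numerically. It assumes the vector generators are normalized as $J^{\mu\nu} = -iM^{\mu\nu}$, so that they obey the same commutation relations as $\sigma^{\mu\nu}$ and $\frac{i}{2}\alpha_{\mu\nu}J^{\mu\nu} = \theta M^{12}$ is real; this normalization, the chiral basis, and the helper names are assumptions made for concreteness.

```python
import math

def matmul(a, b):
    n = range(4)
    return [[sum(a[i][k] * b[k][j] for k in n) for j in n] for i in n]

def scale(s, m):
    return [[s * x for x in row] for row in m]

def expm(m, terms=80):
    """Matrix exponential by its Taylor series (adequate for these small cases)."""
    result = [[complex(i == j) for j in range(4)] for i in range(4)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in matmul(term, m)]
        result = [[x + y for x, y in zip(r, t)] for r, t in zip(result, term)]
    return result

# Vector representation: (M^{12})^rho_sigma has +1 at (1, 2) and -1 at (2, 1).
M12 = [[0, 0, 0, 0], [0, 0, 1, 0], [0, -1, 0, 0], [0, 0, 0, 0]]

# Spinor representation: sigma^{12} = -(i/4)[gamma^1, gamma^2]. For i times the
# chiral-basis gamma matrices (an assumed basis) this is (1/2) diag(1, -1, 1, -1).
SIGMA12 = [[0.5, 0, 0, 0], [0, -0.5, 0, 0], [0, 0, 0.5, 0], [0, 0, 0, -0.5]]

theta = 2 * math.pi
vector_rot = expm(scale(theta, M12))           # Lambda = exp(theta M^{12})
spinor_rot = expm(scale(1j * theta, SIGMA12))  # rho(Lambda) = exp(i theta sigma^{12})

def close_to(m, c):
    """True if m is numerically c times the 4x4 identity."""
    return all(abs(m[i][j] - c * (i == j)) < 1e-9
               for i in range(4) for j in range(4))

print(close_to(vector_rot, 1), close_to(spinor_rot, -1))
```

Both checks should come out true: the vector matrix returns to the identity while the spinor matrix is minus the identity, which is the sign ambiguity resolved by passing to $\text{Spin}(1,3)$.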

Covariant derivative for fields in a representation of the Lorentz group

Given a coordinate frame $\{\partial_\alpha\}$ arising from, say, coordinates $\{x^\alpha\}$, the partial derivative with respect to a general orthonormal frame $\{e_\mu\}$ is defined as

$\partial_\mu\psi = e_\mu^\alpha\,\partial_\alpha\psi,$

and connection components with respect to a general orthonormal frame are

$\omega^\mu{}_{\nu\rho} = e_\rho^\alpha\,\omega^\mu{}_{\nu\alpha}.$

These components do not transform tensorially under a change of frame, but do when combined. Also, these are definitions rather than saying that these objects can arise as partial derivatives in some coordinate chart. In general there are non-coordinate orthonormal frames, for which the commutator of vector fields is non-vanishing.
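For a concrete non-coordinate frame, consider again a spatially flat FLRW metric with frame $e_0 = \partial_t$, $e_i = a(t)^{-1}\partial_i$ (an illustrative choice, not taken from the original text). Acting on a test function $f$:

```latex
\begin{align}
[e_0, e_i]\,f
  &= \partial_t\!\left(a^{-1}\partial_i f\right) - a^{-1}\partial_i\partial_t f \\
  &= -\frac{\dot a}{a^2}\,\partial_i f
   = -\frac{\dot a}{a}\, e_i f .
\end{align}
```

The commutator is non-zero whenever $\dot a \neq 0$, so no coordinate chart has these $e_\mu$ as its coordinate vector fields, whereas coordinate vector fields always commute.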

It can be checked that under the transformation

$\psi \mapsto \rho(\Lambda)\psi,$

if we define the covariant derivative

$D_\mu\psi = \partial_\mu\psi + \frac{1}{2}(\omega_{\nu\rho})_\mu\,\sigma^{\nu\rho}\psi$

then $D_\mu\psi$ transforms as

$D_\mu\psi \mapsto \rho(\Lambda)D_\mu\psi.$

This generalises to any representation $R$ for the Lorentz group: if $v$ is a vector field for the associated representation,

$D_\mu v = \partial_\mu v + \frac{1}{2}(\omega_{\nu\rho})_\mu R(M^{\nu\rho})v = \partial_\mu v + \frac{1}{2}(\omega_{\nu\rho})_\mu T^{\nu\rho}v.$

When $R$ is the fundamental representation for $\text{SO}(1,3)$, this recovers the familiar covariant derivative for (tangent-)vector fields, of which the Levi-Civita connection is an example.
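The reduction to the vector case can be made explicit with the components of $M^{\nu\rho}$ given earlier. The sketch below assumes the connection is antisymmetric in its frame indices, $\omega_{\nu\rho} = -\omega_{\rho\nu}$, which holds for a metric-compatible connection in an orthonormal frame.

```latex
\begin{align}
\tfrac{1}{2}(\omega_{\nu\rho})_\mu\,(M^{\nu\rho})^\alpha{}_\beta
  &= \tfrac{1}{2}(\omega_{\nu\rho})_\mu
     \left(\eta^{\nu\alpha}\delta^\rho_\beta - \eta^{\rho\alpha}\delta^\nu_\beta\right) \\
  &= \tfrac{1}{2}\left(\omega^\alpha{}_{\beta\mu} - \omega_\beta{}^\alpha{}_\mu\right)
   = \omega^\alpha{}_{\beta\mu}, \\
D_\mu v^\alpha &= \partial_\mu v^\alpha + \omega^\alpha{}_{\beta\mu}\,v^\beta .
\end{align}
```

The last line is the usual covariant derivative of a vector field written in frame components.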

There are some subtleties in what kind of mathematical object the different types of covariant derivative are. The covariant derivative $D_\alpha\psi$ in a coordinate basis is a vector-valued 1-form, which at each point $p$ is an element of $E_p\otimes T_p^*M$. The covariant derivative $D_\mu\psi$ in an orthonormal basis uses the orthonormal frame $\{e_\mu\}$ to identify the vector-valued 1-form with a vector-valued dual vector which at each point $p$ is an element of $E_p\otimes\mathbb{R}^{1,3}$, using that ${\mathbb{R}^{1,3}}^*\cong\mathbb{R}^{1,3}$ canonically. We can then contract this with a gamma matrix 4-vector $\gamma^\mu$ which takes values at $p$ in $\text{End}(E_p)\otimes\mathbb{R}^{1,3}$.

Dirac equation on curved spacetime

Recalling the Dirac equation on flat spacetime,

$(i\gamma^\mu\partial_\mu - m)\psi = 0,$

the Dirac equation on curved spacetime can be written down by promoting the partial derivative to a covariant one.

In this way, Dirac's equation takes the following form in curved spacetime:^[1]

Dirac equation on curved spacetime

$(i\gamma^\mu D_\mu - m)\Psi = 0,$

where $\Psi$ is a spinor field on spacetime. Mathematically, this is a section of a vector bundle associated to the spin-frame bundle by the representation $(1/2,0)\oplus(0,1/2)$.

Recovering the Klein–Gordon equation from the Dirac equation

The modified Klein–Gordon equation obtained by squaring the operator in the Dirac equation, first found by Erwin Schrödinger as cited by Pollock,^[2] is given by

$\left(\frac{1}{\sqrt{-\det g}}\,\mathcal{D}_\mu\left(\sqrt{-\det g}\,g^{\mu\nu}\mathcal{D}_\nu\right) - \frac{1}{4}R + \frac{ie}{2}F_{\mu\nu}s^{\mu\nu} - m^2\right)\Psi = 0,$

where $R$ is the Ricci scalar, and $F_{\mu\nu}$ is the field strength of $A_\mu$. An alternative version of the Dirac equation whose Dirac operator remains the square root of the Laplacian is given by the Dirac–Kähler equation; the price to pay is the loss of Lorentz invariance in curved spacetime.

Note that here Latin indices denote the “Lorentzian” vierbein labels while Greek indices denote manifold coordinate indices.

Action formulation

We can formulate this theory in terms of an action. If in addition the spacetime $(M,\mathbf{g})$ is orientable, there is a preferred orientation known as the volume form $\epsilon$. One can integrate functions against the volume form:

$\int_M \epsilon f = \int_M d^4x\,\sqrt{-g}\,f$
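To make the factor $\sqrt{-g}$ concrete, the following sketch evaluates it for the Schwarzschild metric in $(t, r, \theta, \phi)$ coordinates, where it should reduce to $r^2\sin\theta$. The choice of metric, the units $G = c = 1$, the sample point, and the function names are illustrative assumptions.

```python
import math

def det(m):
    """Determinant by Laplace expansion along the first row (fine for 4x4)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def schwarzschild_metric(r, theta, mass=1.0):
    """g_{ab} in (t, r, theta, phi) coordinates, signature (-+++), G = c = 1."""
    f = 1.0 - 2.0 * mass / r
    return [[-f, 0, 0, 0],
            [0, 1.0 / f, 0, 0],
            [0, 0, r * r, 0],
            [0, 0, 0, (r * math.sin(theta)) ** 2]]

def volume_density(g):
    """The factor sqrt(-g) = sqrt(-det g_{ab}) multiplying d^4x."""
    return math.sqrt(-det(g))

r, theta = 5.0, 1.1  # arbitrary sample point outside the horizon r = 2 * mass
print(math.isclose(volume_density(schwarzschild_metric(r, theta)),
                   r * r * math.sin(theta)))
```

The mass drops out of the result because the $g_{tt}$ and $g_{rr}$ factors cancel in the determinant.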

The function $\bar{\Psi}(i\gamma^\mu D_\mu - m)\Psi$ is integrated against the volume form to obtain the Dirac action

Dirac action on curved spacetime

$I_{\text{Dirac}} = \int_M d^4x\,\sqrt{-g}\,\bar{\Psi}(i\gamma^\mu D_\mu - m)\Psi.$
