Keywords

1 Introduction and Mathematical Notions

Since the publication of the Einstein’s theory of General Relativity in 1915, we understand gravitation as a geometrical effect. Many extensions of this theory have been formulated in order to solve various problems in theoretical physics, such as dark matter or the first corrections to General Relativity that could come from the quantum gravity regime.

In the geometrical framework introduced by Einstein, the spacetime is defined as a differentiable manifold ℳ. Omitting some mathematical details, a D-dimensional manifold is essentially a topological space that looks, locally, as the Euclidean space ℝD. For example, spheres, planes and hyperboloids are 2-dimensional manifolds.

Additionally, we include a lorentzian metric tensor, gμν, which allows to measure lengths, volumes and so on. Hence it is possible to talk about the module of a vector that is not necessarily non-negative, due to the lorentzian signature. Those vectors that are not trivial but have zero norm determine the lightlike paths and, then, light cones that define the casual structure of the spacetime.

Another fundamental notion that can be defined, even in the absence of metric, is parallelism. The motivation for this additional concept is the following. Consider the Euclidean space ℝD and a couple of vectors in different points, p and q (Fig. 5.1). If we want to compare them, we simply take, for example, the one in p and move it to q keeping the vector parallel to itself and without changing the module. And, finally, we subtract both vectors to see the difference.

Fig. 5.1
figure 1

Comparison of vectors in Euclidean space (natural notion of parallelism)

However, if the manifold is general, the initial vectors live in different spaces (the tangent spaces at p and q, respectively, Tpℳ and Tqℳ) and there is no natural way to relate them (Fig. 5.2). In the Euclidean space, both tangent spaces can be identified making the comparison trivial. In the general case, we need to introduce an additional structure that carries the information about parallelism, the affine structure, whose fundamental object is the (affine) connection \( {\Gamma}_{\mu \nu}^{\sigma } \). Once we have a connection, given a curve between two points, we have a rule to relate vectors in them: the parallel transport associated to the connection.

Fig. 5.2
figure 2

An affine connection represents a notion of parallel in the manifold and allows to parallely transport vectors along curves. The dashed line is, by definition, the parallel transport of Vμ(p) from p to q along the xλ direction. This vector and Vμ(q) can be compared since both live in Tq

The connection permits the definition of a covariant derivative (“covariant” means that once applied to a tensor, the result is also a tensor), and other intrinsic geometrical properties of the spacetime, such as the curvature and the torsion, respectively:

$$ {R_{\mu \nu \rho}}^{\lambda}\equiv {\partial}_{\mu }{\Gamma}_{\nu \rho}^{\lambda }-{\partial}_{\nu }{\Gamma}_{\mu \rho}^{\lambda }+{\Gamma}_{\mu \sigma}^{\lambda }{\Gamma}_{\nu \rho}^{\sigma }-{\Gamma}_{\nu \sigma}^{\lambda }{\Gamma}_{\mu \rho}^{\sigma }, $$
$$ {T_{\mu \nu}}^{\lambda}\equiv {\Gamma}_{\mu \nu}^{\lambda }-{\Gamma}_{\nu \mu}^{\lambda }. $$

Consider a manifold with a metric structure gμν, then it can be proved that there is only one connection, \( {\Gamma^{(g)}}_{\mu \nu}^{\lambda } \), compatible with the metric and torsionless, namely

$$ {\nabla}_{\lambda }{g}_{\mu \nu}=0,\kern1em {T_{\mu \nu}}^{\lambda }=0. $$

This connection is called the Levi-Civita connection of gμν, and it is completely determined by the metric:

$$ {\Gamma^{(g)}}_{\mu \nu}^{\lambda }=\frac{1}{2}{g}^{\lambda \sigma}\left({\partial}_{\mu }{g}_{\sigma \nu}+{\partial}_{\nu }{g}_{\mu \sigma}-{\partial}_{\sigma }{g}_{\mu \nu}\right). $$

In fact, given a metric, the Levi-Civita affine structure is the simplest connection we can deal with. The metric compatibility and the nullity of the torsion simplify many geometrical identities. Moreover, we are not introducing extra degrees of freedom in the theory, just the ones that come from the metric.

Before starting with the physics, let us introduce a few useful definitions. A curve γ(α) is a differentiable function γ : I → ℳ, where I is an interval of the real line. The image of a curve in the manifold is what we will call trajectory or path. So a trajectory is a set of spacetime points joined in a continuous and differentiable way, while the curve is the function that generates this set.

Let ℳ be a spacetime equipped with a connection \( {\Gamma}_{\mu \nu}^{\rho } \). An autoparallel of this affine structure is the image of a curve whose velocity is parallel to itself (with respect to \( {\Gamma}_{\mu \nu}^{\rho } \)). In other words, given a curve γ(α) with velocity vμ(α), its image is an autoparallel if the following equation holds:

$$ {v}^{\mu }{\nabla}_{\mu }{v}^{\rho}\equiv \kern1em \frac{{d v}^{\rho }}{d\alpha}+{\Gamma}_{\mu \nu}^{\rho }{v}^{\mu }{v}^{\nu }=f\left(\alpha \right){v}^{\rho }. $$

for some function f(α). If we reparametrize the trajectory, by doing α → β(α), we change the velocity as well as the function f. This function can always be set to zero (identically) with an appropriate choice of the parameter. Those are called affine parameters for the trajectory.

Consider that the manifold also has a metric structure. The autoparallels of the associated Levi-Civita connection are special because they can be derived from a completely metric approach, i.e. without using the Levi-Civita parallelism. If the velocity is timelike or spacelike, they correspond to trajectories that are critical points of the length functional:

$$ s\left[\gamma \right]\left(\alpha \right)=\underset{0}{\overset{\alpha }{\int }}\sqrt{\left|{g}_{\mu \nu}{v}^{\mu }{v}^{\nu}\right|}d{\alpha}^{\prime }. $$

The lightlike case should be treated separately, because the length functional cannot be varied smoothly. However, they can be seen as critical points of other functionals that, again, only involve the metric structure. For these reasons, in general, we will call the autoparallels of Levi-Civita critical trajectories. The associated affine parameters have a special meaning, because their changes are proportional to the length between the considered points. In the timelike case, it is essentially the proper time, so these parameters represent the rhythm of a proper (i.e. freely falling) clock along them.

2 Einstein’s Equations and Variational Principles

General relativity is a geometric theory of the spacetime whose dynamics is described by the Einstein’s equation:

$$ {R}_{\mu \nu}(g)-\frac{1}{2}{g}_{\mu \nu}R(g)=-\kappa {\mathcal{T}}_{\mu \nu}, $$
(5.1)

where \( {R}_{\mu \nu}(g)\equiv {R_{\mu \lambda \nu}}^{\lambda }(g) \) is the Levi-Civita Ricci tensor, R(g) ≡ gμνRμν(g) is the Levi-Civita Ricci scalar, κ ≡ 8πG (G is the Newton’s constant) and \( {\mathcal{T}}_{\mu \nu} \) is the (Hilbert) energy-momentum tensor that contains the information about the matter and energy content.

This equation can be obtained from a more fundamental object through a variational principle, the Einstein–Hilbert action:

$$ S\left[g,\psi \right]=\frac{1}{2\kappa}\int R(g)\sqrt{\left|g\right|}{d}^Dx+{S}_{\mathrm{matter}}\left[g,\psi \right]. $$

The energy-momentum tensor is then defined by

$$ {\mathcal{T}}_{\mu \nu}\equiv \frac{2}{\sqrt{\left|g\right|}}\frac{\delta {S}_{\mathrm{matter}}}{\delta {g}^{\mu \nu}}. $$

Notice that we are assuming (from the start) a particular affine structure, the one fixed by the metric. We are selecting the Levi-Civita connection by hand and this can be considered natural because it is the simplest one. When we obtain from a gravitational action the equations of motion admitting that the affine structure is Levi-Civita, we are using the so-called metric formalism, because the metric determines everything related to the gravitational field.

Another approach, which is called Palatini or metric-affine formalism, consists in considering the metric and the connection as independent fields. Now the connection is general and the corresponding equations of motion should determine whether it is Levi-Civita or not. The action in this formalism is

$$ S\left[g,\Gamma, \psi \right]=\frac{1}{2\kappa}\int {g}^{\mu \nu}{R}_{\mu \nu}\left(\Gamma \right)\sqrt{\left|g\right|}{d}^Dx+{S}_{\mathrm{matter}}\left[g,\psi \right]. $$

It is worth remarking that we are assuming that the matter part of the action does not depend on the affine connection Γ.

This formalism is interesting because we expect Levi-Civita connection to be fixed by the corresponding equations of motion, in contrast with the metric formalism in which it is selected artificially.

3 Palatini Solutions of the Einstein–Hilbert Action

If we vary the Einstein–Hilbert action in the metric-affine formalism, we obtain the following equations of motion for the metric and the connection, respectively:

$$ 0=\frac{1}{2}\left({R}_{\mu \nu}\left(\Gamma \right)+{R}_{\nu \mu}\left(\Gamma \right)\right)-\frac{1}{2}{g}_{\mu \nu}R\left(\Gamma \right)+\kappa {\mathcal{T}}_{\mu \nu}, $$
(5.2)
$$ 0={\nabla}_{\lambda }{g}^{\mu \nu}-{\nabla}_{\sigma }{g}^{\sigma \nu}\ {\delta}_{\lambda}^{\mu }+\frac{1}{2}{g}^{\mu \nu}{g}^{\rho \tau}{\nabla}_{\lambda }{g}_{\rho \tau}-\frac{1}{2}{g}^{\rho \tau}{\nabla}^{\nu }{g}_{\rho \tau}{\delta}_{\lambda}^{\mu }+{T_{\lambda \sigma}}^{\sigma }{g}^{\mu \nu}-{T_{\rho \sigma}}^{\sigma }{g}^{\rho \nu}{\delta}_{\lambda}^{\mu }+{T_{\sigma \lambda}}^{\mu }{g}^{\sigma \nu}. $$
(5.3)

Equation (5.3) can be simplified if the dimension of the spacetime is D > 2.Footnote 1 We then obtain:

$$ 0={\nabla}_{\lambda }{g}_{\mu \nu}-{T_{\nu \lambda}}^{\sigma }{g}_{\mu \sigma}+\frac{1}{D-1}{T_{\lambda \sigma}}^{\sigma }{g}_{\nu \mu}+\frac{1}{D-1}{T_{\nu \sigma}}^{\sigma }{g}_{\lambda \mu}. $$
(5.4)

Clearly, Levi-Civita is a solution, because in that case each term of the right hand side vanishes. However, let us try for other solutions. Consider only those that are torsionless, then, necessarily, ∇λgμν should be zero, so Levi-Civita is the only possibility. The same happens for metric-compatible solutions. In fact, when Palatini formalism is presented (in textbooks for example), one of these two conditions is assumed. Consequently, we lose the information about the general solution and it reduces to Levi-Civita.

The most general solution of Eq. (5.4) has the form:

$$ {\Gamma}_{\mu \nu}^{\sigma }={\Gamma^{(g)}}_{\mu \nu}^{\sigma }+{\mathcal{A}}_{\mu }{\delta}_{\nu}^{\sigma }, $$

where \( {\mathcal{A}}_{\mu } \) is an arbitrary 1-form (Bernal et al. 2017). We will call it Palatini connection from now on.

The associated covariant derivative of the metric (also called non-metricity tensor) and torsion are

$$ {\nabla}_{\lambda }{g}_{\mu \nu}=-2{\mathcal{A}}_{\lambda }{g}_{\mu \nu}, $$
$$ {T_{\mu \nu}}^{\sigma }={\mathcal{A}}_{\mu }{\delta}_{\nu}^{\sigma }-{\mathcal{A}}_{\nu }{\delta}_{\mu}^{\sigma }. $$

Here, we clearly notice what we stated before: switching off one of them implies \( {\mathcal{A}}_{\lambda }=0 \) and, then, Levi-Civita as the only possibility.

The Palatini Riemann tensor, Ricci tensor and Ricci scalar are given by

$$ {R_{\mu \nu \rho}}^{\lambda}\left(\Gamma \right)={R_{\mu \nu \rho}}^{\lambda }(g)+{\mathcal{F}}_{\mu \nu}{\delta}_{\rho}^{\lambda }, $$
$$ {R}_{\mu \nu}\left(\Gamma \right)={R}_{\mu \nu}(g)+{\mathrm{\mathcal{F}}}_{\mu \nu}, $$
$$ R\left(\Gamma \right)={R}_{\mu \nu}(g), $$

respectively, where \( {\mathrm{\mathcal{F}}}_{\mu \nu}={\partial}_{\mu }{\mathcal{A}}_{\nu }-{\partial}_{\nu }{\mathcal{A}}_{\mu } \). As a consequence of these expressions, the equation of motion of the metric (5.2) becomes the Einstein’s Eq. (5.1), see (Dadhich and Pons 2012). We now present some properties of these solutions.

3.1 Projective Relation Between Solutions

Any two Palatini connections, for example

$$ {\Gamma}_{\mu \nu}^{\rho }={\Gamma^{(g)}}_{\mu \nu}^{\sigma }+{\mathcal{A}}_{\mu }{\delta}_{\nu}^{\sigma },\kern1em {\Gamma^{\prime}}_{\mu \nu}^{\rho }={\Gamma^{(g)}}_{\mu \nu}^{\sigma }+{\mathcal{A}}_{\mu}^{\prime }{\delta}_{\nu}^{\sigma }, $$

are related by a transformation:

$$ {\Gamma}_{\mu \nu}^{\rho}\to {\Gamma^{\prime}}_{\mu \nu}^{\rho }={\Gamma}_{\mu \nu}^{\rho }+{k}_{\mu }{\delta}_{\nu}^{\sigma }, $$

for certain covector kμ. This transformation is a projective transformation which means that preserves autoparallels. This can be proved easily. First, consider an autoparallel trajectory for the connection \( {\Gamma^{\prime}}_{\mu \nu}^{\rho } \),

$$ \frac{{d v}^{\rho }}{d\beta}+{\Gamma^{\prime}}_{\mu \nu}^{\rho }{v}^{\mu }{v}^{\nu }=0. $$

Then, imposing the projective relation between both connections and defining −kμvμ ≡ f(β) we get to the expression:

$$ \frac{{d v}^{\rho }}{d\beta}+{\Gamma}_{\mu \nu}^{\rho }{v}^{\mu }{v}^{\nu }=f\left(\beta \right){v}^{\rho }. $$

And this is the equation of an autoparallel for the connection \( {\Gamma}_{\mu \nu}^{\rho } \) with a non-affine parametrization. If we parametrize the path affinely (β → α and vρ → uρ) we obtain:

$$ \frac{{d u}^{\rho }}{d\alpha}+{\Gamma}_{\mu \nu}^{\rho }{u}^{\mu }{u}^{\nu }=0. $$

Q.E.D.

Consequently, the whole set of Palatini connections shares the same autoparallels, up to reparametrizations, which have no physical meaning. As a matter of fact, since Levi-Civita is a particular Palatini connection (the case with \( {\mathcal{A}}_{\mu }=0 \)) we conclude: the autoparallels of any Palatini connection are critical trajectories of the metric.

3.2 Homothety Property

Let γ(τ) be a general curve in the spacetime and vμ its velocity, and a vector Wμ. If we compare the change of the vector along γ(τ) under Palatini and Levi-Civita parallel transport, we see that the difference between both transports is proportional to Wμ:

$$ \left({v}^{\mu }{\nabla}_{\mu }-{v}^{\mu }{\nabla}_{\mu}^{(g)}\right){W}^{\rho }=-{\mathcal{A}}_{\mu }{v}^{\mu }{W}^{\rho}\equiv \lambda \left(\tau \right){W}^{\rho }. $$

The module is not conserved but the direction does. Due to this, we say the Palatini parallel transport is homothetic with respect to the Levi-Civita transport. It can be proved that the only connections with this property are the Palatini connections (Bernal et al. 2017). Any other connection would generate a perturbation in the direction of Wμ.

4 Observability and Physical Implications

We introduced the Palatini formalism in order to see if the dynamics could fix Levi-Civita as the fundamental connection of the theory, in contrast with the metric formalism in which it is selected by hand. However, we have obtained a family of connections that differ in a vector field, with Levi-Civita as a particular case. In this section, we analyse the physical implications of this field. Indeed, we will see that it is undetectable or, equivalently, that metric and Palatini formalism describe the same physics.

The main point is that the gravitational dynamics is the same in both formalisms. The equation of the matter is clearly the same, because the corresponding action does not depend on the connection, so the difference between formalisms does not affect this part of the total action. And, as we previously showed, Palatini connections imply the reduction of the equation of the metric to the Einstein’s equation. The resulting dynamics for the metric and the matter content is given by

$$ {R}_{\mu \nu}(g)-\frac{1}{2}{g}_{\mu \nu}R(g)=-\kappa {\mathcal{T}}_{\mu \nu},\kern1em \frac{\delta {S}_{\mathrm{matter}}}{\delta \psi}=0, $$

in both formalisms.

Furthermore, in Einstein–Hilbert gravity the distinction between critical and autoparallel trajectories disappears due to the projective symmetry. In fact, defining the trajectory of a test particle is often presented as a basic problem of metric-affine theories. Those of critical length and those with covariantly constant velocity are candidates because both of them infinitesimally reduce to straight lines. The critical paths are the simplest approach, but there are authors who defend the description with autoparallels (Kleinert and Pelster 1999) and others who state that only the conserved currents determine the test paths (Hehl and Obukhov 2007).

In Einstein–Hilbert gravity, the conservation of the energy-momentum tensor selects the critical paths. However, fortunately, the autoparallels set by the Palatini dynamics coincide, as a consequence of the projective relation between Palatini connections (a family that includes Levi-Civita). Indeed, we have seen that the field \( {\mathcal{A}}_{\mu } \) can be eliminated by the freely falling observer with an appropriate choice of the parameter.

All of these ideas point in the same direction: the field \( {\mathcal{A}}_{\mu } \) has no physical effects (Bernal et al. 2017).

5 Equivalence in Other Theories

Finally, we add a few remarks about Palatini connections in other theories. One example is Lovelock theory in Palatini formalism:

$$ S\left[g,\Gamma, \psi \right]={S}_{\mathrm{Lov}}\left[g,\Gamma \right]+{S}_{\mathrm{matter}}\left[g,\psi \right],\kern1em {S}_{\mathrm{Lov}}\left[g,\Gamma \right]\equiv \int \sum \limits_{n=1}^K{a}_n{\mathrm{\mathcal{L}}}_n\left[g,\Gamma \right]\sqrt{\left|g\right|}{d}^Dx, $$

where an are certain dimensionful parameters, ℕ ∋ K ≤ ceiling(D/2 − 1) and the nth-order Lovelock lagrangian is defined by

$$ {\mathcal{L}}_n\left[g,\Gamma \right]={\delta}_{\nu_1}^{\Big[{\mu}_1}\dots {\delta}_{\nu_{2n}}^{\mu_{2n}\Big]}{g}^{\rho_1{\nu}_1}\dots {g}^{\rho_n{\nu}_{2n-1}}{R_{\mu_1{\mu}_2{\rho}_1}}^{\nu_2}\left(\Gamma \right)\dots {R_{\mu_{2n-1}{\mu}_{2n}{\rho}_n}}^{\nu_{2n}}\left(\Gamma \right). $$

It was shown in Borunda et al. (2008) that Levi-Civita is a solution for the equation of the connection in any of these theories. As far as we know, the general solution remains unknown, but we have found that the action presents the projective symmetry, \( {\Gamma}_{\mu \nu}^{\sigma}\to {\Gamma}_{\mu \nu}^{\sigma }+{\mathcal{A}}_{\mu }{\delta}_{\nu}^{\sigma } \). The proof is the following. Under the projective transformation, the Riemann tensor is modified:

$$ {R_{\mu \nu \rho}}^{\lambda}\to {R_{\mu \nu \rho}}^{\lambda }+{\mathcal{F}}_{\mu \nu}{\delta}_{\rho}^{\lambda }. $$

Then, the lagrangian transforms:

$$ {\mathcal{L}}_n\to {\delta}_{\Big[{\nu}_1}^{\mu_1}\dots {\delta}_{\nu_{2n\Big]}}^{\mu_{2n}}\left({R_{\mu_1{\mu}_2}}^{\nu_1{\nu}_2}+{\mathcal{F}}_{\mu_1{\mu}_2}{g}^{\nu_1{\nu}_2}\right)\dots \left({R_{\mu_{2n-1}{\mu}_{2n}}}^{\nu_{2n-1}{\nu}_{2n}}+{\mathcal{F}}_{\mu_{2n-1}{\mu}_{2n}}{g}^{\nu_{2n-1}{\nu}_{2n}}\right) $$

and the ν’s antisymmetrization cancels all the terms proportional to the metric, so

$$ {\delta}_{\mathrm{proj}}{\mathrm{\mathcal{L}}}_n=0,\kern1em \forall n\kern0.5em \Rightarrow \kern0.5em {\delta}_{\mathrm{proj}}\left\{{S}_{\mathrm{Lov}}\left[g,\Gamma \right]+{S}_{\mathrm{matter}}\left[g,\psi \right]\right\}=0. $$

Q.E.D.

Consequently, since Levi-Civita (\( {\Gamma^{(g)}}_{\mu \nu}^{\sigma } \)) is a solution, then \( {\Gamma^{(g)}}_{\mu \nu}^{\sigma }+{\mathcal{A}}_{\mu }{\delta}_{\nu}^{\sigma } \) is also a solution. Indeed, the whole set of solutions can be separated into equivalence classes of projectively related connections. So if a new solution \( {\Gamma^{\mathrm{sol}}}_{\mu \nu}^{\sigma } \) of a Lovelock theory that has not the form \( {\Gamma^{(g)}}_{\mu \nu}^{\sigma }+{\mathcal{A}}_{\mu }{\delta}_{\nu}^{\sigma } \) (for some \( {\mathcal{A}}_{\mu } \)) is found, then we could build a family of new solutions just adding a term \( {\mathrm{\mathcal{B}}}_{\mu }{\delta}_{\nu}^{\sigma } \) where ℬμ is arbitrary. This property also holds for any other lagrangians with projective invariance, such as f(R) gravity.

Other theories we have tested are those with quadratic torsion corrections to the Einstein–Hilbert lagrangian. The torsion corrections we consider are only those with even parity:

$$ S\left[g,\Gamma, \psi \right]=\frac{1}{2\kappa}\int {g}^{\mu \nu}{R}_{\mu \nu}\left(\Gamma \right)\sqrt{\left|g\right|}{d}^Dx+{S}_{matter}\left[g,\psi \right]+\frac{1}{2\kappa}\int \left[{b}_1{T}_{\mu \nu \rho}^{(1)}{T}^{(1)\mu \nu \rho}+{b}_2{T}_{\mu \nu \rho}^{(2)}{T}^{(2)\mu \nu \rho}+{b}_3{T}_{\mu \nu \rho}^{(3)}{T}^{(3)\mu \nu \rho}\right]\sqrt{\left|g\right|}{d}^Dx, $$

where bi are arbitrary dimensionless real constants and \( {T}_{\mu \nu \rho}^{(i)} \) are the irreducible components of the torsion (see (McCrea 1992)). For these extensions, the equivalence between metric and Palatini formalism holds.

6 Conclusions

To conclude we summarize our results. We have seen that Einstein–Hilbert gravity in the Palatini formalism has some interesting features. If we couple this theory with a matter action through the metric (and not the connection), the result is physically indistinguishable from the dynamics obtained assuming Levi-Civita as the fundamental affine structure from the beginning (metric formalism).

The general solution of the equation of the connection is Levi-Civita plus the term \( {\mathcal{A}}_{\mu }{\delta}_{\nu}^{\sigma } \) where \( {\mathcal{A}}_{\mu } \) is an undetermined field. However, the equations of motion are the same as in metric formalism. Therefore, we get to different mathematical descriptions (related through the projective symmetry) that describe the same physics. In other words, it is not necessary to set the connection to be Levi-Civita by hand. The dynamics fixes the affine structure.

Another additional property of the Palatini connections is that they are the only affine structures whose parallel transport is homothetic with respect to the Levi-Civita transport. So the directions obtained in both cases are coincident.

We have also proved that an autoparallel of a given Palatini connection is a trajectory with critical length (autoparallel of Levi-Civita). The undetermined field \( {\mathcal{A}}_{\mu } \) for a free falling observer can be absorbed in a reparametrization of its worldline, so it has no physical meaning since a particular choice of the parameter is meaningless.

We have found solutions of this kind in more general theories, for example, in Lovelock gravity. In addition, if we admit matter that does not feel the connection, the equivalence between formalisms can be extended from Einstein–Hilbert to any theory with additional quadratic torsion terms in the action. Current work involves the treatment with more general matter and with additional terms that introduce, for example, dynamics for the torsion field.