1 Introduction

The principle of equivalence requires that the metric is Lorentzian and the affine connection is zero in a freely falling frame. In metric theories of gravity, the simplest way to arrange this is to take the connection to be Riemannian (ie., that of Levi-Civita) and demand that the derivatives of the metric also vanish in the free-fall frame. That such a frame exists in Riemannian geometry is easily demonstrated: Riemann normal coordinates will do the job.

In this paper we will construct theories where the restriction that the metric derivatives have to vanish in the free-fall frame will be relaxed, while still requiring that the connection coefficients are zero. The basic idea is an old one going back to Riemann himself,Footnote 1 namely that of letting the geometry depend on higher rank tensor fields instead of the metric.

However, a dynamical metric that is relevant for geometry is crucial if one wants to have the possibility of connecting the theory to special relativity, so in this paper we will not completely sacrifice the metric. Some key features of our apporoach are:

  • We work with a coupled system containing the metric, higher rank symmetric tensors and the affine connection.

  • We work in the context of a diff-invariant dynamical (ie., action-based) theory of geometry, whose construction we will describe.

In these theories, by going to the geodesic frame, it becomes possible to bring the metric to the Minowski form as well as arrange the vanishing of the connection, while having the extra flexibility of new tensor fields in the theory. There will typically be a non-trivial background value for the higher rank tensor field in this frame, that spontaneously breaks Lorentz invariance. But this is a solution-dependent feature of the resulting theory, which one might (in principle) attempt to ameliorate or exploit. We will briefly discuss some salient features of these theories in the concluding section.

Various modifications and generalizations of general relativity have been reported in the literature over the last century, too many to be listed here in toto. Instead we will refer the reader to the Wikipedia page on the subject [1] which has a pretty elaborate sample of references.

2 Ingredients

One way to motivate our approach is to look for a generalization of the Levi-Civita condition

$$\begin{aligned} \nabla _a g_{bc}=0 \end{aligned}$$
(1)

in a way that incorporates higher tensors along with the metric. The primary problem in trying to find a generalized Levi-Civita condition is that the number counts for the independent components on either side do not match if one includes higher rank fields. If one assumes that the connection is torsion-free,Footnote 2 both sides of (1) contain the same number of independent components, \(d^2 (d+1)/2\) in d dimensions. This seems indeed like a precarious balance, so we might be tempted to give up any hope of generalizing the Levi-Civita condition.Footnote 3 But motivated by the fact that the relationship between the spin connection and the vielbeins does get modified in higher spin theories,Footnote 4 we will instead look for another perspective on the Levi-Civita condition which might be more suitable for generalization.

That perspective is provided by the Palatini approach to general relativity where the connection and metric are treated as independent fields. In this formalism, we will viewFootnote 5 the relationship between the two fields as a consequence of the choice of action: the specific choice of the Einstein–Hilbert action gives rise to the standard Levi-Civita condition as the connection equation of motion (when torsion is zero). This provides us with a natural avenue for generalizing (1), by generalizing the action to include higher tensors. Note that the Einstein–Hilbert action had the feature that the connection was determined algebraically: Levi-Civita condition contains no derivatives on \(\Gamma ^a_{bc}\). We are not guaranteed that this will remain true if we generalize the action, and indeed it doesn’t hold even in pure gravity if one includes higher derivative terms [16, 17]. For a general action the connection is determined in terms of its (differential) equation of motion. We will see later that there exist classes of actions (which have Riemann appearing linearly) where the connection is determined algebraically in terms of the metric and higher tensors. For simplicity, the explicit examples in this paper are all taken to be of this form. The Einstein–Hilbert action is a special case in this class where the dependence is only on the metric.

In any event, for now we will drop the algebraic determination of the connection as a strict requirement, and instead look at the equations of motion (algebraic or differential) from a Palatini formulation as the definition of our connection.

So our task then is to write down diff-invariant actionsFootnote 6 containing the metric, higher tensors and the connection. Variations with respect to each of these fields will give us equations of motion, which together will define a geometrical theory of “higher” gravity. To get an action, we need to construct an appropriate scalar Lagrangian density and a volume form out of these fields, and integrate the former using the latter over the manifold. Furthermore, this Lagrangian density needs to contain derivatives, because we want to get differential (as opposed to algebraic) equations of motion for the tensors. In a manifold with an affine connection, we automatically have a good candidate tensor to help us do this: the Riemann tensor. When there are no tensors besides the metric, the natural action to write down at lowest order in derivatives is the Einstein-Palatini action

$$\begin{aligned} S_{EH}[g, \Gamma ]=\int d^4x \sqrt{g} \ g^{ab} R_{ab}(\Gamma ). \end{aligned}$$
(2)

Note that the definition of both the standard Riemann tensor and the Ricci tensor do not require the metric. But construction of the Ricci scalar does require the metric for contracting the indices. In principle we could consider higher powers and covariant derivatives of Riemann (which again would only depend on the connection) and construct scalars by appropriate contractions with the metric, and we would have a well-defined starting point for the variational principle. Typically this will then lead to higher derivative equations of motions.

To include higher tensors (we will call them \(\phi _{a_1a_2 \ldots a_n}\)), we need two ingredients then. Firstly we need to be able to construct volume forms, and the second is we need to be able to write down scalar Lagrangians by contracting indices appropriately. We turn to the volume form first. Surely, we already have a well-defined volume form in

$$\begin{aligned} \sqrt{g}\ dx^1 \wedge \cdots \wedge dx^d. \end{aligned}$$
(3)

But we can do something more drastic. We can build our volume forms in ways that depend on the higher tensors as well. One of our goals in this paper is to see how much of the “geometrodynamic” perspective on gravity is reliant on the metric. Put another way, general relativity tells us that rank two tensors can have an interpretation as geometry: we would like to see if other fields (“matter”) can in fact be geometrized in some sense as well. So we would like an approach that can stand on its own even when there is no metric.

The defining property of the volume form is that it is a nowhere vanishing top form on the manifold, and that under diffeomorphisms, it transforms trivially (ie., upto a sign on an orientable manifold). Under a diffeomoprhism \(x \rightarrow x'\), the form \(dx^1 \wedge dx^2\wedge dx^3 \wedge dx^4\) transforms by a factor of \(\mathrm{det} (\Lambda )\) where \(\Lambda ^{a'}_{b}=\partial x^{a'}/\partial x^b\). The factor of \(\sqrt{g}\) is there to compensate for this in (3). This is accomplished because the determinant of the metric

$$\begin{aligned} g= \frac{1}{4!}\epsilon _{a_1b_1c_1d_1}\epsilon _{a_2b_2c_2d_2}\ \left( g_{a_1a_2}g_{b_1b_2}g_{c_1c_2}g_{d_1d_2}\right) , \end{aligned}$$
(4)

transforms precisely by a compensating (square of) \(\mathrm{det} (\Lambda )\). Note that the epsilons here stand for the epsilon symbol: they are not the Levi-Civita tensor and do not contain metric. The 4! here is a useful convention that corresponds to the dimensionality of spacetime which we have taken (for concreteness) to be 4. In d-dimensions, the determinant will be defined with a factor \(\frac{1}{d!}\), there will be two (note that this two is the rank of the metric) epsilon tensors with d legs each and there will be d copies of the metric tensor.

Now it is trivial to generalize this construction of such a volume form to a rank-r tensor. In d-dimensions, the generalization of the determininant will be defined with a factor of \(\frac{1}{d!}\), there will be r epsilon tensors with d legs each and there will be d copies of the rank-r symmetric tensor \(\phi ^{a_1 \ldots a_r}\). I.e., if one defines

$$\begin{aligned} \phi _r\equiv \frac{1}{d!}\epsilon _{a^1_1a^2_1 \ldots a ^d_1} \cdots \epsilon _{a^1_ra^2_r \ldots a^d_r}\ \phi _{a^1_1a^1_2 \ldots a^1_r} \cdots \phi _{a^d_1a^d_2 \ldots a^d_r} \end{aligned}$$
(5)

then it transforms with a factor of \((\mathrm{det} \Lambda )^r\) under GL(dR) so that

$$\begin{aligned} (\phi _r)^{1/r}\ dx^1\wedge \cdots \wedge dx^d \end{aligned}$$
(6)

is a perfectly acceptable volume form. We adopt the convention that the rank-r hyperdeterminant in (5) is denoted by \(\phi _r\), with \(\phi _2 \equiv g\). We can generalize this further and construct more general volume forms based on various symmetric tensors on the manifold by combining these:

$$\begin{aligned} dV=\prod _i(\phi _{r_i})^{q_{r_i}}\ dx^1\wedge \cdots \wedge dx^d \end{aligned}$$
(7)

This will be a well-defined volume form iff \(\sum _i r_i q_{r_i}=1\). The existance of such general volume forms that allow for much more general geometric couplings between various symmetric tensors, is a simple yet key observation of this paper. Of course, if we have multiple tensors of the same rank, a slight generalization of the above volume form is possible, but we will not present it explicitly to avoid too much clutter.

There is one caveat: The object (5) above only exists for even r. For odd r, the quantity identially vanishes.Footnote 7 But a simple way to adapt our previous definition to the odd rank case is to first construct a fully symmetric even rank object out of it. For example for \(\phi _{abc}\), we can construct \(\Phi _{abcdef}=\phi _{(abc}\phi _{def)}\) which is even rank. This symmetrization approach works for any rank. Now we can define \((\Phi _{2r})^{1/2r}\ dx^1\wedge \cdots \wedge dx^d\) as our volume form. We will mostly be dealing with even rank tensors in this paper.

The construction of the so-called Cayley hyperdeterminant for the specific case of rank 3 and dimension 2 has been discussed in [18] in the context of black holes and qubits. We have stumbled on this object by the apporach above of looking at the transformation properties of the volume form, but we can use the explicit expression in [18] as a sanity check: indeed, upto a numerical factor normalization the expression (4.2) in [18] is the same as our construction for rank three and dimension two. The even rank case is more well-known,Footnote 8 and we have checked that the object does not vanish.

That the volume forms we have constructed are “good” objects to construct diff-invariant theories will become even more evident, when we vary our action to construct equations of motion. We will see that the resulting equations of motion can all be combined into manifestly covariant forms, which is a consequence of the diff-invariance of our action.

Armed with these volume forms, now we go on to construct scalar Lagrangian densities. But what are the independent tensors available to us for doing these contractions? For concreteness lets consider the case where the metric and a rank-3 tensor are the fields in the theory. One can of course invert the metric,

$$\begin{aligned} g^{ac}g_{cb}=\delta ^a_b \end{aligned}$$
(8)

and get the inverse metric as a possible tensor, because the right hand side is a well-defined invariant object under the coordinate transformation matrix GL(dR). But if we are allowed to use only the inverse metric as an object that can raise indices, it is easy to see that one can never construct scalars in a theory where there are only higher rank (covariant) tensors and no metric.Footnote 9

A natural thing to try generalizing from (8) is something like

$$\begin{aligned} \phi ^{abe}\phi _{ecd}\sim \delta ^{(ab)}_{(cd)}, \end{aligned}$$
(9)

which would again define a well-defined tensor \(\phi ^{abe}\) (if it exists) because again the RHS is a GL(dR) invaraint. However the trouble is it doesn’t exist in general: the system is overconstrained as one can easily see by counting the number of equations, which is \(\frac{d^2(d+1)^2}{4}\), and the number of independent degrees of freedom in \(\phi \), which is \(\frac{d(d+1)(d+2)}{6}\). This leads to trivial solutions in \(d \ge 2\). The situation only gets worse if we go beyond rank 3 and consider even higher rank tensors. The basic problem is that in the case of the metric, since the product of two \(d \times d\) matrices was a \(d \times d\) matrix, multiplication had closure. Here we don’t. So what are we to do?

A hint is provided by variations of the volume form. From usual differential geometry, we are familar with the relation

$$\begin{aligned} g^{-1} \delta g= \ g^{a b}\ \delta g_{ab}. \end{aligned}$$
(10)

The left hand side is a tensor because the right hand side is. Now, explicitly evaluating the LHS and noting that \(\delta g_{ab}\) are arbitrary variations, in 4 dimensions we get the relation

$$\begin{aligned} g^{ab}=\frac{g^{-1}}{3!}\epsilon _{aa_1b_1c_1}\epsilon _{ba_2b_2c_2}g_{a_1a_2}g_{b_1b_2}g_{c_1c_2}. \end{aligned}$$
(11)

This gives us an alternate definition of \(g^{ab}\) indepedent of (8), which we can try to generalize to our higher determinants. Explicitly evaluating \(\phi _r^{-1}\delta \phi _r\) we get

$$\begin{aligned} \phi _r^{-1}\delta \phi _r= & {} \frac{\phi _r^{-1}}{(d-1)!}\epsilon _{a_1 a^2_1 \ldots a^d_1}\cdots \epsilon _{a_r a^2_r \ldots a^d_2}\times \nonumber \\&\times \phi _{a^2_1 \ldots a^2_r} \cdots \phi _{a^d_1 \ldots a^d_r}\ \delta \phi _{a_1 \ldots a_r}\nonumber \\\equiv & {} \ \phi ^{a_1 \dots a_r}\delta \phi _{a_1 \dots a_r} \end{aligned}$$
(12)

The object \( \phi ^{a_1 \ldots a_r}\) is perfectly well-defined, the question is if it is a tensor. But from our construction of the volume form we already know that \(\varepsilon ^{\phi }_{a_1 \ldots a_d}\equiv \phi _r^{1/r} \epsilon _{a_1 \ldots a_d}\) and its inverse \(\varepsilon ^{a_1 \ldots a_d}_{\phi } = \phi _r^{-1/r} \epsilon _{a_1 \ldots a_d}\) are tensors,Footnote 10 so

$$\begin{aligned} \phi ^{a_1 \ldots a_r}\equiv \frac{1}{(d-1)!}\varepsilon ^{a_1 a^2_1 \ldots a^d_1}\cdots \varepsilon ^{a_r a^2_r \ldots a^d_2} \phi _{a^2_1 \ldots a^2_r} \cdots \phi _{a^d_1 \ldots a^d_r} \nonumber \\ \end{aligned}$$
(13)

is a tensor, which is exactly what definition (12) is. So we have managed to obtain an upper index object purely from the higher tensor without using the metric, which can be used for constructing scalars.

So now a general theory that we can write down using these ingredients will consist of a Lagrangian density that is a scalar constructed from

$$\begin{aligned} \phi _{a_1 \ldots a_{r_i}}, \ \ \phi ^{a_1 \ldots a_{r_k}}, \ \ \nabla _a, \ \ R^a_{bcd}, \ \ R_{bd} \end{aligned}$$
(14)

where there can be tensors (including the metric) of various ranks and their “inverses”,Footnote 11 the covariant derivative \(\nabla _a\) is taken with the connection \(\Gamma ^a_{bc}\) which is thought of as an independent field, and the Riemann and Ricci are obtained as usual from the connection (and do not require the metric for their definition).

Integrating Lagrangian densities of this kind over volume forms of the kind (7) gives us theories that generalize the standard Palatini construction of Einstein’s gravity. Schematically, we can write such a theory as

$$\begin{aligned} S=\int dV\ {\mathcal L}(\phi _{a_1 \ldots a_{r_i}}, \ \phi ^{a_1 \ldots a_{r_k}}, \ \nabla _a, \ R^a_{bcd}, \ R_{bd}) \end{aligned}$$
(15)

where all the indices in \(\mathcal L\) are understood to be contracted so that it is a scalar. Note that the connection is treated as an independent field as well, and is to be varied with.Footnote 12

Of course there is always the possibility of coupling vaious terms of this kind together, as well as coupling these terms to covariantized matter: some interesting directions of this kind will be pursued in a forthcoming publication.

Next, we turn to a few simple explicit example theories of this kind.

3 Examples

Lets first introduce a 0-tensor (scalar) together with the metric. The simplest action is of the form

$$\begin{aligned} S_=\int d^4x e^{\phi }\sqrt{g} \ g^{ab} R_{ab} \end{aligned}$$
(16)

The fact that the volume form is nowhere-vanishing immediately leads to an \(e^{\phi }\) structure to its scalar part, leading to a dilaton-like structure. The dilaton equation of motion leads to \(g^{ab} R_{ab}=0\), and the metric equation of motion leads to the usual Einstein equation, both together leading to \(R_{ab}=0\). But note that the connection is determined by its own equation of motion and is no longer Levi-Civita. It is determined by

$$\begin{aligned} \nabla _ag^{bc}=\frac{2}{d-2} (\nabla _a\phi ) \ g^{bc} \end{aligned}$$
(17)

The connection can be algebraically computed, and using the usual cycling of the indices and adding an subtracting, one can find its explicit form, but we will not present it here. As it stands the above theory does not have equations of motion for the scalar, if we do not couple the action above to something else (say a standard scalar kinetic term). This is a generic feature when a field appears only in the volume form and not in the Lagrangian density. Note however that once we go from a Palatini formulation to a metric-like formulation by explicitly solving for the connection and plugging back in, we do have dilaton derivative terms in our action because our connection (17) is scalar-dependentFootnote 13!

Other possible theories include new types of bi-metric theories (see [23] for a review of bi-metric theories in cosmology): \(\int d^4x(g)^{1/p} (\tilde{g})^{1/q} \ g^{ab} R_{ab}\), \(\int d^4x(g)^{1/p} (\tilde{g})^{1/q} \ {\tilde{g}}^{ab}{\tilde{g}}^{cd}g_{ab} R_{cd}\) etc are examples. Note that here \(1/p+1/q=1/2\).

As long as the Riemann tensor appears without covariant derivatives, the connection is determined algebraically. And it is determined linearly in terms of field derivatives when the action is linear in the Riemann (or Ricci) tensors. Note that we have specifically chosen such actions here for illustration, but there is nothing that prevents one from considering more complicated actions with mode derivatives and powers of the curvatures.

We will present one more theory (richer than the ones we considered so far) before concluding. This one contains a metric and a higher rank field. Note first that the higher fields have to have non-zero vev in order for the volume form to be well-defined, but a non-zero vev will typically break Lorentz invariance. This is unavoidable if we are wroking with odd rank fields, but for even rank fields we can in principle have vevs for the fields which are constructed from \(\eta _{ab}\). So we will for the moment limit ourselves to even rank fields. An interesting theory is

$$\begin{aligned} S=\int d^dx (g)^{1/q}(\phi _4)^{1/p} \phi ^{abcd}g_{ab}R_{cd} \end{aligned}$$
(18)

with \(2/p+1/q=1/2\). The equations of motion are

$$\begin{aligned} g_{(ab}R_{cd)}-\frac{1}{p}\phi _{abcd}(\phi ^{pqrs}g_{pq}R_{rs})=0,\end{aligned}$$
(19)
$$\begin{aligned} \phi ^{abcd}R_{cd}+\frac{1}{q}g^{ab}(\phi ^{pqrs}g_{pq}R_{rs})=0, \end{aligned}$$
(20)
$$\begin{aligned} \nabla _a \beta ^{bc}+\beta ^{bc}\left( \frac{1}{q}g^{mn}\nabla _ag_{mn}+\frac{1}{p}\phi ^{pqrs}\nabla _a\phi _{pqrs}\right) =0. \end{aligned}$$
(21)

where \(\beta ^{ab}\equiv \phi ^{abcd}g_{cd}\), it is a symmetric rank-2 matrix whose inverse \(\beta _{ab}\) is also useful. The last equation is the connection equation of motion, and it algebraically determines the connection:

$$\begin{aligned} \Gamma ^q_{a d}=\frac{1}{2}\beta ^{qp}\Big (\beta _{pd,a}+\beta _{pa,d}-\beta _{a d,p} + \nonumber \\ +\alpha _{a}\beta _{pd}+\alpha _d \beta _{a d}- \alpha _p \beta _{d a}\Big ) \end{aligned}$$
(22)

Here \(\alpha _a\) is defined by

$$\begin{aligned} \alpha _a=\frac{\beta _{bc}\partial _a \beta ^{bc}}{d-2}-\frac{2}{d-2} \Big (\frac{1}{q}g_{mn}\partial _ag^{mn}+\frac{1}{p}\phi _{pqrs}\partial _a\phi ^{pqrs}\Big ). \end{aligned}$$
(23)

In many ways, this theory is a direct generalization of the Einstein–Hilbert action to include both rank 2 and rank 4 tensors. It will be interesting to investigate whether this connection here can be obtained from a curve length minimizationFootnote 14 like the Levi-Civita connection arises from proper-time minimization.Footnote 15 We will discuss some physical aspects and solutions of this theory in a follow-up paper.

Note that in principle we can work with much more general theories than what we have presented here, involving higher powers and higher derivatives. If one works with purely higher rank tensors without metric, one will typically have to deal with higher derivative equations of motion.

4 Discussion

One question is if some of these theories are of phenomenological viability. It is tempting to speculate that they might be relevant for dark matter/energy. To have an idea about this, we will first have to construct (cosmological, black-hole-like,...) solutions in specific theories, which seems certainly doable. Finding the Newtonian/quadratic limit of our actions will also be useful for identifying theories which might have phenomenologial viability [22]. Examples of gravity modification for dark sector phenomenology can be found in [2426].

Our theories have a Lorentzian metric in them, and one can always write them in a manifestly locally Lorentz invariant way by introducing local frames. We can also write locally Lorentz invariant matter couplings via frames, coupling fields of various spin. These actions will look manifestly local Lorentz invariant, and this is the way one imagines coupling standard quantum field theory to a curved geometry.

A generic solution of these theories will break Lorentz invariance spontaneously.Footnote 16 But the vacuum can be Lorentz invariant: in theory (18) consider \(g_{ab}=\eta _{ab}\) with \(\phi _{abcd}= \alpha (\eta _{ab}\eta _{cd}+ \eta _{ac}\eta _{bd} + \eta _{ad}\eta _{bc})\). There are three terms in this last expression and together they make the expression fully symmetric and \(\alpha \) is the vev of the field. It is easy to check that this solves the equation of motion: we have therefore constructed a Lorentz-invariant vacuum, for each value of \(\alpha \). Note that setting \(\phi _{abcd}=0\) is not acceptable because \(\phi \) shows up in the volume form. Note also that this type of non-trivial vacuum cannot be found with odd-rank fields, because giving them a non-trivial vev will break Lorentz.

However there is a subtelty here that is not there in standard general relativity. Even though we can make the theory manifestly locally Lorentz invariant by erecting local frames, the causal structure is not disctated by lightcones. The reason is easy to understand: the Levi-Civita condition is precisely the condition that guarantees that the proper-time element remains invariant along a geodesic. In our case the connection depends on derivatives of the higher field(s) as well, and therefore the notion of causality is no longer only based on the metric and lightcone. A somewhat similar situation happens in Born-Infeld theory where there is tension between two notions of causality [27].

This seems like a new type of example for what usually falls under the rubric of Lorentz violation, and might be worth exploring: typically, breaking of Lorentz invariance is due to the addition of explicit terms in the Lagrangian that break it, or spontaneously in the vacuum. Here, on the other hand a potential superluminality shows up because the parallel transport of the frames is not metric compatible. Note however that this does not necessarily mean that the theory is inconsistent, the notion of what causality means is modified in these theories.Footnote 17 For example, the usual argument that there exists Lorentz frames in which superluminal propagation looks backward in time is strictly speaking true only when the world is globally Lorentz invariant, so that there is sense in comparing the propagated event with the original event using local Lorentz transformations at the same point. Metric compatibility is what lets us extend this over different points on the manifold in standard GR. And here, we don’t have metric compatibility. Clearly, the issue of causality requires further study.

Another direction is to consider these theories in various dimensions. String theory is the interesting object to study in two dimensions using the volume form constructed from a worldsheet 2-tensor, the metric. In other dimensions and for other objects, it might be interesting to consider other rank tensors or combinations of them.

Note also that the question of what kind of theories of this kind (if any) are ghost-free is very interesting. There are various subtle issues related to higher spin fields which evidently need to be clarified [28]. Part of the purpose of the present note was reconnaissance of theory-space. At the moment, we suspect that these theories have similarities to (higher generalizations of) massive gravity in the massless limit [29] (as obtained via a Stuckelberg trick), but we will postpone discussions in this direction to a different paper [22].

What is the point of all this? For one, it is interesting that a whole class of theories, closely related to and generalizing Einstein–Hilbert gravity, exists. The interest in these theories is at first blush theoretical, but it could also have some phenomenological viability in making IR modifications to gravity. The ease with which all the constructions fell into place here is surprising, and it is evident that there is a whole plethora of questions that need answering.

I thank Aiyalam Parameswaran Balachandran, Pallab “No-Longer-Kumar” Basu and Mudumbai Seshachalu Narasimhan for discussions and/or encouragement and Payingattery Natarajan Bala Subramanian for a related collaboration.