1 Introduction

General Relativity (GR) is, almost by definition, a theory of geometry. The equations of motion follow from varying an action whose form is uniquely fixed by general coordinate invariance.Footnote 1 However, in the middle of the last century an alternate, though of not logically distinct, approach was instigated in which the Einstein equations were derived starting from the assumption of massless spin two particle in a flat background. These ideas were pioneered by Gupta [1] and Kraichnan [2], and independently by Feynman [3]. In the ladder case it was clear that the motivation arose from a desire to quantize gravity in analogy with Yang-Mills theory. Indeed 50 years ago at this very conference in Warsaw, Feynman presented his results on the subject. Later Weinberg [4] gave a very elegant proof of the need for the graviton to couple to a conserved current, at linearized level, and the accompanying (strong) equivalence principle by assuming that quantum mechanical amplitudes involving (soft) graviton emission should obey the standard assumptions of unitarity and analyticity in quantum field theory. In this methodology the structure of GR stems from the mismatch between number of degrees of freedom represented by on-shell asymptotic states (two transverse helicities) and the symmetric two index Lorentz tensor field. In particular, the diffeomorphism invariance is necessary in order to be able to eliminate the unphysical degrees of freedom that are forced upon us by the Lorentz group.

This non-geometricFootnote 2 (quantum based) approach does have the added benefit that one can utilize the machinery developed for quantum field theory, including Feynman diagrams, to organize calculations. Indeed, in Duff [5] the Schwarzschild solution was generated by coupling the graviton a point particle classical source and calculating all the tree level diagrams which contribute to the expectation value of the metric. The solution is then built up order by order in \(G_N\). Whether or not the full solution (including the horizon) emerges by summing the series (i.e. whether or not the series is asymptotic) is unknown. This question is complicated by the fact that this calculation is inherently gauge dependent. Later Feynman diagrams were used to calculate the potential in the Post-Newtonian (PN) approximation by using scattering amplitudes [68]. The idea was to utilize the relation between the quantum mechanical scattering amplitude and the Fourier transform of the potential (\(\tilde{V}(\varvec{q}^2)\)) in the center of mass frame

$$\begin{aligned} M(\varvec{p}_1,\varvec{p}_3,m_i)= -\int dt \tilde{V}((\varvec{q}^2,v_1,v_2)). \end{aligned}$$
(1)

While certainly these results were very interesting from a formal point of view, past this point it was not clear if the non-geometric approach could bear much fruit, at least as far as classical GR is concerned. It does seem that this line of reasoning did not have much impact on the relativity community, perhaps because no new results were generated.

It is interesting to note that in his talk at this conference, Feynman bemoaned the lack of experimental results in relativity [3]. But it is exactly phenomenological questions regarding newly developed experiments that has recently brought these non-geometric ideas back to relevance. Gravitational wave detectors hold the promise to open new windows in astronomy and astro-physics by detecting the signatures of binary inspirals. The utility of these experiments is predicated on our ability to make precise predictions. Not surprisingly there has been a tremendous effort put into generating these predictions both analytically and numerically.Footnote 3 Numerical techniques concentrate on the late stages of inspiral, which are (at least at this time) out of the reach of analytic techniques as, in this regime, there is no controlled approximation in which to work. Analytic calculations can be performed during early stages of infall when the constituents are well separated. Within this window there are two possible expansion parameters with which one can perform a well controlled approximation. For Extreme Mass Ratio Inspirals (EMRI) the ratio of masses \(\tau \equiv (M_L/M_H)\ll 1\) determines how the background geometry traversed by the light mass deviates from Schwarzchild, i.e. this is the probe limit. While in the case of a small relative velocity \(v/c\), we have the PN approximation in which the space-time is nearly Minkowskian. The former case is really semi-analytic as calculating Greens function in the Schwarzchild background entails numerical work.

In both the EMRI and PN cases the relevant physical observable is the wave form at null infinity. If we think of the binary system as a black box then clearly we should be organizing our calculation by considering the binary system as a single entity. In the case of EMRI we calculate the radiation of the small mass constituent moving in the Schwarzchild background while in the PN case the system can be considered a point like object endowed with a set of time dependent multipole moments. The calculational challenges in the two cases are quite distinct. In the EMRI problem one needs to utilize the nettlesome Schwarzchild propagator. Indeed, this is sufficiently challenging that even getting leading order results is no simple task. On the other hand the leading order PN calculations are essentially trivial, moreover, phenomenologically it is important to produce accurate wave forms in order to be able to do parameter estimation. Thus the challenge in the PN case is to be able to calculate systematically to as higher an order as possible.

In both cases we have problems with diverse scales. In the EMRI case we have only two relevant scales. The size of the smaller constituent (or the mass) and the curvature scale of the background. While for the PN case we have: the size of the constituents \(R_i\), the radius of the system \(r\) and the wavelength of the radiation, which scales as \(r/v\). The multi-scaled nature of these problema is what motivates the use of effective field theory (EFT).

EFT is a tool developed for quantum field theories. The fundamental ideas involved date back to the original work on the renormalization group where one utilizes the fact that short distance and long distance physics factorize as a consequence of the local nature of physical laws. That is to say, if we have an action (or Hamiltonian) which correctly describes the microscopic physics up to some scale \(\varLambda \)

$$\begin{aligned} H=\sum _i C_iO_i, \end{aligned}$$
(2)

where \(O_i\) are some generic operators and \(C_i\) are a set of accompanying constants sometimes called “ Wilson coefficients”. If we are interested in getting an approximate description of the theory at low energies \((E\ll \varLambda )\) then we may utilize an effective Hamiltonian

$$\begin{aligned} H^\prime =\sum _i C^\prime _iO^\prime _i. \end{aligned}$$
(3)

In general \(H^\prime \) is “simpler” then \(H\). For instance in a QFT \(H\) will in general involve more particles species then \(H^\prime \). In principle the sum in (3) is infinite while the sum in (2) is finite.Footnote 4 However, the sum is cut-off after one utilizes the fact that we can power count the operators in \(E/\varLambda \). This is typically done by dimensional analysis, where any operator with mass dimensionsFootnote 5 \(d\) is suppressed by powers of \((E/\varLambda )^{(4-d)}\). In more general theories, such as will be discussed in a rest of this article, the operators will be power counted in some other expansion parameter such as \(v\) in the case of the PN expansion. The \(C^\prime \) are determined, if the microscopic theory is not strongly interacting,Footnote 6 by a matching procedure, in which one matches the value some object (which need not be physical) in the effective theory (3) to its full theory result. This matching procedure is often said to entail “integrating out the degrees of freedom between \(\varLambda \) and \(E\)”. This seems to be an inherently quantum/statistical mechanical notion since it implies we are performing a path integrals over fluctuating degrees of freedom. However, the idea applies classically as well, in which case it simply means we are eliminating a field from the action by using its equation of motion and then expanding the propagator (in momentum space) to generate a local action.Footnote 7 As a trivial example suppose we have an action for heavy field \(\phi _H\) and light field \(\phi _L\) given by

$$\begin{aligned} S=\int d^4x \left( \frac{1}{2}(\partial \phi _H^2) -\frac{1}{2}m_H^2 \phi _H^2+\frac{1}{2}(\partial \phi _L^2)-\lambda \phi _L^2\phi _H\right) . \end{aligned}$$
(4)

At energies and momenta much below \(m_H\) we may approximate \(\phi _H\approx \lambda \phi _L^2/m_H^2\) leading to an effective theory with a self interaction \(\lambda ^2 \phi _L^4/m_H^2\). This corresponds to performing a saddle point approximation for the \(\phi _H\) path integral. In classical theories these masses would be replaced by some other scales such as typical sizes of macroscopic objects, or distance between objects. Finally we note that we can avoid calculating coefficients of operatorsFootnote 8 which are forbidden by symmetries since they necessarily vanish. That is, we can write down the effective theory directly simply by including all possible operators allowed by the symmetries.

At first site it appears that integrating out the short distance modes does not buy you much, given that we are asking the user to calculate in the full theory which presumably we were trying to avoid in the first place. However, the choice of matching calculation is up to the practitioner. That is, one can choose to match anything which is well defined, even if it is gauge dependent, since the coefficients themselves are gauge invariant. Thus the full theory calculation that needs to be done is typically much simpler then any full theory calculation that one would be interested in performing should one choose not to use EFT. This will be clarified and exemplified below.

In the EFT the low energy theory only contains light degrees of freedom, but where is the pay off in this procedure? Technically, reducing the number of scales simplifies calculations in the low energy theory. In particular, a plurality of scales can lead to integrals which may not have closed form solutions. Moreover, working in the EFT allows one to sum logarithms of the form \(\alpha Log(m_H/m_L)\) which may jeopardize the utility of perturbation theory.Footnote 9 Formally, this explicit mode separation (factorization), with the short distance modes being absorbed into the coefficients \(C_i\), allows one to control the systematics. In the cases of classical EFT that will be discussed here, working in the EFT allows us to utilize techniques from the non-geometric approach to GR such as Feynman diagrams. Furthermore, working at the level of the action allows one to handle ultra-violet divergences, which inevitably arise in any EFT, even classically, as consequence of the fact that the EFT mutilates the UV physics, via standard renormalization techniques. In particle physics mode factorization allows one to calculate scattering cross sections between nucleons at high energies [9] which is of crucial importance for understanding the scattering of protons at the LHC.

2 The extreme mass ratio problem

From the EFT point of view the EMRI problem is formally rather simple. We first integrate out all of the modes which are responsible for the internal dynamics of the probe (small mass) constituent. The resulting EFT, valid at distance larger then the size of the object is nothing but the theory of a point particle. Following EFT reasoning we write down all terms consistent with the symmetries, which in this case are general coordinate invariance and world-line reparameterization invariance

$$\begin{aligned} S_{E+B}=\int d\tau (-m+C_E E_{\mu \nu }E^{\mu \nu }+C_B B_{\mu \nu }B^{\mu \nu }+\cdots ), \end{aligned}$$
(5)

where the electric and magnetic Weyl tensors are defined as

$$\begin{aligned} {E}_{\rho \mu }=C_{\rho \sigma \mu \nu } v^\sigma v^\nu \end{aligned}$$
(6)
$$\begin{aligned} {B}_{\alpha \mu }=\frac{1}{2}\star C_{\rho \sigma \mu \nu } v^\sigma v^\nu \end{aligned}$$
(7)

with \(v\) being the word-line tangent vector. Notice that operators which vanish via the equations of motion have not been included, as they can be removed via a field redefinition and can thus not contribute to any physical quantity (for a discussion of this see [14]). The series (5) has been truncated as higher dimension terms are suppressed by powers of the size. Indeed, for a black hole coefficients \(C_{E,B}\) scale as \(M_{pl}^2r_s^5\). These operators induce geodesic deviation and can be thought of as polarizabilities. In fact, \(C_E\) is related to the static tidal Love number. These matching coefficients are, in general, scale dependent. That is, when we calculate in this effective theory we will face UV divergent integrals which need to be regulated. When these divergences are logarithmic the regulator necessitates the introduction of a scale. It is easiest to regulate using dimensional regularization as it preserves the symmetries of the action. Once regulated the divergences can be absorbed into the matching coefficients, which induces the aforementioned scale dependence. It is exactly this scale dependence which allows for the resummation of logs, as discussed below.

Fig. 1
figure 1

The diagram on the LHS is the leading order contribution to the effective action, while the diagram on the right is a correction due to bulk interactions

At this point there are no further short distance scales to remove in the case of the EMRI. The only remaining relevant scale is the \({R}\),

$$\begin{aligned} {R} \sim \sqrt{\frac{M_{pl}^2 r^3}{M_H}}. \end{aligned}$$
(8)

We can then see that the expansion parameter \(r_0/{R}\) (where \(r_0\) is the size of the smaller mass object) becomes \( M_L/M_H\) and at strong coupling \(r=M_H/M^2_{pl}\).

One may now calculate the radiation by calculating the motion of the probe particle and convolving the resulting trajectory with a retarded Greens function, which is technically challenging. Even the leading order radiation is troublesome due to the lack of a closed form expression for the retarded propagator in the Schwarzchild background. Recent progress in this direction was made in Wardell et al. [16], Zenginoglu and Galley [17]. Beyond leading order in the mass ratio one has to include the self-force effects which are both dissipative as well as conservative in nature. The calculations of the self-force in the EFT were pioneered by Galley and his collaborators [1820]. One calculates the effective action within the closed-time path formalism by calculating all diagram where there are no open graviton lines as shown in Fig. (1). One then varies this action with respect to the world-line to derive the self-force equation. Diagrams with higher order graviton vertices in the bulk are suppressed.

3 The Post-Newtonian case

In the PN case there is an additional hierarchy of scales. The relevant scales are now \(R,r\) and \(r/v\) corresponding to the typical size of the constituents, the radius of the orbit and the wavelength of the radiation, respectively. Thus once we have integrated out the scale \(1/R\) we must now integrate out the scale \(r\). The resulting theory will only have one relevant scale, \(vr\). To achieve this goal we must determine the relevant modes in the theory. I will not go through the derivation here, but suffice it to say that the analysis very similar to the case of NRQCD, the theory of non-relativistic quarks. For a discussion of the mode analysis see [14]. The result is that the relevant regions of momentum space correspond to potential modes and radiation modesFootnote 10 with momenta which scale as \(p^\mu \sim (v/r,1/r)\) and \(p^\mu \sim (v/r,v/r)\) respectively. EFT’s with these types mode decompositions are sometimes called “modal” to distinguish them from more canonical EFT’s. The potential mode has a invariant mass which is large compared to the radiation mode and gets integrated out. This is accomplished by decomposing the graviton field into a potential (\(H\)) and radiation (\(h\)) modes

$$\begin{aligned} g_{\mu \nu }=\eta _{\mu \nu } +\frac{H_{\mu \nu }}{M_{pl}}+\frac{h_{\mu \nu }}{M_{pl}}. \end{aligned}$$
(9)

In this procedure manifest diffeomorphism invariance is preserved by working in the background field gauge. That is, the long wavelength radiation mode is frozen while we integrate out the potential mode to generate an action for the radiation mode. To do this systematically, as an expansion in \(v\), entails first deriving a set of power counting rules for the fields such that each term in the action \(S(h,H)\) scales homogeneously in \(v\). This forces us to perform a multipole expansion [15] at the level of the action, which will play an important role when we renormalize the theory. By having a manifest power counting for each term in the action we can determine which Feynman diagrams contribute at a given order in \(v\).

Next one integrates out the potential mode by calculating all Feynman diagrams with no open potential lines.Footnote 11 We do however include diagrams with open radiation lines, as these will generate couplings in the low energy action between the potentials and the radiation (see Fig. (2)). Formally integrating out the potential modes can we written as

$$\begin{aligned} \int Dh DH e^{i S(h,H)}= \int Dh e^{iS(V,O,h)} \end{aligned}$$
(10)

where \(V\) are a set of potentials which depend upon the orbital radius and velocities, and \(O\) is a set of multipole moments (which are functions of the world lines of the constituents) for the coarsed grained binary, which is now treated as a point particle.

The calculation of potentials using this methodology can be automatized as was done by Foffa and Sturani [21] who calculate the 3PN potential, as well as pieces of the 4PN result [22]. By using Kaluza-Klein variables [23] one can reduce the number loops needed to calculate for odd power of \(n\) when working at \(n-PN\), but even so, the number of diagrams grows as \(n\)-factorial. At the end of this article I will discuss ways of avoiding this problem. The full 4PN potential was recently calculated in Damour et al. [25] using more traditional methods. Three body, and beyond, potentials can be calculated simply by adding more world lines and keeping track of all the possible Feynman diagrams [13]. Finally, by power counting [1012] the finite size operators (5) one concludes that these effects don’t show up until \(5PN\). This reproduces the so-called “effacement theorem” [24].

Fig. 2
figure 2

On the left is a generic diagram that contribute to the potential. The blob corresponds to a collection of potential graviton lines. The diagram on the right represent the generic form which contributes to the coupling of the radiation field to the multipole moments

3.1 Spin

The introduction of a spin degree of freedom onto the world line was accomplished by Porto [26]. One intrduces \(S_{\mu \nu }(\tau )\) which is conjugate to the rotational frequency. The action is then written as a Routhian, i.e. a Hamiltonian/Lagrangian for the spin/worldline degrees of freedom

$$\begin{aligned} {R} =-\sum _i \left( M_i \sqrt{u_i^2}+\frac{1}{2} S_i^{ab} \omega _{ab \mu }u^\mu _i\right) . \end{aligned}$$
(11)

Given this action one generates a set of vertices in the action which can be used to build up Feynman diagrams for the potentials. \(\omega \) is the spin connection and the small Roman letters represent the local orthogonal frame. The finite size effects for spin contribute at order 3PN. These can be calculated by including the the higher dimension operators analogous to (5)

$$\begin{aligned} L_{ES^2}= \frac{C_{1}}{2 m M_{pl}} \frac{E_{ab}}{\sqrt{u^2}}S^a_{~c}S^{cb}. \end{aligned}$$
(12)

In addition there is a term that arises from necessitating that the spin supplementarity condition be preserved upon evolution

$$\begin{aligned} L_{RS^2}= -\frac{1}{2M}R_{deab}S^{cd}S^{ab}\frac{u^eu_c}{\sqrt{u^2}}. \end{aligned}$$
(13)

Notice that the coefficient of the ladder term is fixed, while in the former \(C_1\) depends upon the short distance physics and is given by \(C_1=1\) for a spinning black hole. The 3PN potential proportional \(S_1 S_2\) was calculated in Porto and Rothstein [27]. These results did not agree with the subsequent results calculated using more traditional methods (TM) in Steinhoff et al. [28]. This was shown [29, 30] to be due to the fact that the results in Porto and Rothstein [27] did not include the contributions which carry over from the SSC effect in the lower order spin-orbit contribution. Once these terms were included the results did agree. The terms proportional to \(S_1^2\) were calculated in Porto and Rothstein [31] and were corroborated in Steinhoff et al. [32]. The 4PN \(S_1S_2\) potential was calculating using EFT in Levi [34] and using TM in Steinhoff et al. [32]. Whether or not the results agree is not known at this time.

3.2 Radiation

The radiative moments arise from two sources in the low energy theory \(L(O,h)\). The direct coupling of the radiation field to the matching coefficients (in this case being the moments \(O\)) and radiative moments which arise from the scattering of radiation off of the background geometry (tail effects) and or radiation (memory effects). These arise from the diagrams shown in Fig. (3), respectively. The moments \(O\) are calculated as matching coefficients when integrating out the potential modes. In the EFT methodology the higher order calculations were performed in the case of spinning constituents. All of the moments necessary to calculate the 3PN phase for spinning binaries and 2.5 amplitude were calculated in Porto et al. [35, 36]. Figure (4) shows a set of non-linear contributions to the \(3PN\) moments. When calculating the moments the external radiation graviton is truncated.

Fig. 3
figure 3

On the left is the relevant diagram for the tail effects. The dotted line is a potential graviton, while the full curly line is a propagating radiation graviton. The diagrams on the right is the non-local in time memory effect. The heavy black dots represent the insertion of a time dependent moment

Fig. 4
figure 4

First non-linear diagrams that contribute to the current (magnetic) octopole \(J^{ijk}\)

The procedure for calculating linear piece of the wave-from (i.e. ignoring bulk interaction) is canonical (see for instance [37]). To extract radiative moments, we start with the expression for the GW amplitude for a given stress energy source \(T^{ij}\)

$$\begin{aligned} h_{ij}^{TT}(t,\mathbf{x}) = -\frac{4 G}{|\mathbf x|} \varLambda _{ij,kl} \int \limits _{-\infty }^{+\infty } \frac{d\omega }{2\pi } T^{kl}(\omega ,\omega \mathbf{n}) e^{i\omega t_{ret}} + O \left( 1/|\mathbf{x}|^2\right) . \end{aligned}$$
(14)

where \(\varLambda _{ij,kl}\) is the transverse traceless projector ant \(t_{ret}\) is the retarded time. We then calculate \(T_{ij}\) by calculating the one-point function \(\langle h_{ij}(x,t) \rangle \), as in Fig. (4) truncating the external line. When calculating radiative moments, such as those in Fig. (3), IR divergence arises from integrating over the position of the three graviton vertex which can be argitrarily far from the source since the potential generated from the mass only drops as \(1/r\). These divergences can be resumed into the Coulomb phase, and are not of any physical significance as the IR divergence should be cut off by the finite time interval of the measurement. One needs to measure the phase at a point in time, to absorb the IR divergence. By matching the result to the form of the coupling to the quadrapole moment one can can extract the radiative moment

$$\begin{aligned} I_{rad}^{ij}(\omega )=I_0^{ij}\left( 1+G M \omega (sign(\omega )\pi + i \left[ \frac{2}{\epsilon _{IR}} + log\frac{\omega ^2}{\mu ^2}+\gamma _E-\frac{11}{6}\right] \right) , \end{aligned}$$
(15)

The arbitrariness in the scale \(\mu \) reflects the arbitrary initial phase of the system.

3.3 Renormalization and summing logs

In the EFT formalism one also encounters UV divergences. Given that the full theory is UV finite, we know that these singularities are artifacts of our approximation scheme. Recall that we broke the EFT process into two steps. First we integrated out the modes which are responsible for the internal dynamics of the individual constituents. This led to a theory of point particles. The point particle limit can obviously be singular, and responsible for (some) sub-set of UV divergences. While in the second stage, where we integrate out the potential modes, leading to the theory of a point particle with dynamical moments, we expect to generate further UV divergences. In both case we know how to handle these divergences. Since UV divergences are always local they can be absorbed into “counter-terms” (i.e. coefficients of operators in the action). Since those coefficients, after renormalizaion, are fixed by a matching procedure there is no loss in predictive power. It is important to remember however, that not all UV divergences are equivalent. Power divergences can have no effect on any observable. The reason is that that they do no generate any non-analytic dependence on momenta. Thus they are what’s known as “pure counter-term”, and can simply be discarded. Note that when we use dimensional regularization this happens automatically (e.g. \(\int \frac{d^{4-2 \epsilon } k}{k^2}=0\)). Logarithmic divergences, on the other hand, are always accompanied by \(Log(p^2/\mu ^2)\) which can not be absorbed into a counter-term. Furthermore, given that the result must be \(\mu \) independent we know that the operator coefficients must depend upon \(\mu \) in such a way as to make the result \(\mu \) independent. This leads to the renormalization group (RG) equations equation, which, when solved, generate an infinite tower of logarithms for free. Using the RG equations Goldberger and Ross were able to sum an infinite set of logs which correct the quadrapole moment. The relevant UV divergence arises from diagrams such as the tail diagram shown on the LHS of (3) but with yet another potential graviton strung from the particle to the radiation, forming a four graviton vertex. The divergence arises when that vertex overlaps with the effective world line. The divergence is absorbed into the quadrapole. The resulting RG equation is given by

$$\begin{aligned} \mu \frac{d}{d\mu } Q=-\frac{214}{105}(Gm\omega )^2 Q \end{aligned}$$
(16)

such that

$$\begin{aligned} Q(\omega ,\mu )=(\mu /\mu _0)^{-214/105 (Gm\omega )^2} Q(\omega ,\mu _0) \end{aligned}$$
(17)

This running corresponds to integrating out the modes between \(\mu _0=1/r\) and \(\mu =1/(rv)\). Expanding this result for small \(v\) leads to a series of logs. In the test mass limit the results have been checked up to orger \(Log^3(v)\) and agreement has been found [39].

In addition to the quadrapole renormalization, the one body theory of moments also involves mass renormalization [40]. This renormalization arises from self energy diagrams involving one mass insertion and two quadrapole insertions. The log which arises is the classical equivalent of the “Bethe Log” which arises in the Lamb shift. The shift in the mass results in a non-local in time contribution to the HamiltonianFootnote 12, as calculated in Damour et al. [25]. The resulting RG equation is given by

$$\begin{aligned} \mu \frac{d}{d\mu }\log \bar{m}=-2G^2 \langle Q_{ij}^{(3)}Q_{ij}^{(3)} \rangle (\mu ), \end{aligned}$$
(18)

where the brackets denote averages over a period. In the parlance of quantum field theory we would say that the mass “runs”, i.e. it is scale dependent. It is important to remember that this mass is scheme dependent (i.e. it depends upon the choice of counter-terms which has a finite ambiguity) and unphysical. It is only a place holder, i.e. it only shows up in intermediate stages of calculations of physical quantities which are scheme independent. However, using this renormalized perturbation theory allows us to calculate a set of logs which contribute to the energy [40]. One can expand the solution to this equation and generate logarithmic contributions to the energy. The 4PN log for a quasi circular orbit is found to agree with the results in Blanchet et al. [41]. Expanding out the mass to higher orders would generate an infinite tower of logs, but at any given order there are other sources of logs as well, coming from the higher renormalization of the multipole moments. So an additional calculation would have to be done to get the complete set of logs at a higher orders.

3.4 Finite size effects and dissipation

Let us now return to the finite size operators (5). These operators are the leading contributions to deviations from geodesic motion. As previously noted, they do not contribute until \(5PN\), however, it’s possible that for neutron stars they may be numerically enhanced. In any case formally they are still interesting as in principle they could also get logarithmically renormalized. Physically, these operators are generalized polarizabilities. However, if we are going to allow for deformations we should also allow for dissipation, as distorting the objects will naturally change their internal energy. Dissipation is notoriously difficult to handle at the level of a Lagrangian, which naturally leads to unitary (conservative) dynamics. To get around this issue we allow for new degrees of freedom [42, 43] on the world line to absorb the internal energy Footnote 13 \(Q_{ab}(\tau )\) whose coupling to the world line is uniquely fixed by symmetry

$$\begin{aligned} S_{dis}=\int d\tau \left( E_{ab}Q_{ab}(\tau )+B_{ab}Q_{ab}(\tau )\right) . \end{aligned}$$
(19)

\(Q(\tau )\) is an operator which acts of on a Hilbert space representing some unknown degrees of freedom. This picture seems, at first glance, to be surrendering predictive power. Nonetheless, let us see how these terms in the action effect the potential between our constituents. The leading contribution to the potential arises from the box diagram shown in fig (3.4). The state of the system \(\mid \! \varOmega \rangle \) most likely is a mixed state which we might expect to be thermal (more on this below). Note that this diagram looks two particle irreducible, however, it is not since the intermediate lines are now dynamical, and not part of the iterative potential. The imaginary part of this diagram generates an absorptive potential. Note that the degrees of the freedom on the world line can not be gapped, otherwise the imaginary part of the \(Q(\tau )Q(0)\) correlator would not have an imaginary part in the zero energy limit (Fig. 5).

Fig. 5
figure 5

The leading contribution to the dissipative potential. The blobs indicate an insertion of an operator of the form (19). The line between the blobs corresponds to the dynamical propagation of the underlying/unknown degrees of freedom

Evaluating this diagram gives [42]

$$\begin{aligned} \!-\!iVT\!=\! \frac{M_2}{1024 \pi ^2 M_{pl}^4} \int d\tau d\tau ^\prime \langle \varOmega \mid T(Q_1^{ij}(\tau )Q_1^{kl}(\tau ^\prime ) ) \mid \varOmega \rangle q_{ij}q_{kl} +(1 \leftrightarrow 2),\quad \end{aligned}$$
(20)

where \(q_{ij}=\partial _i \partial _j \frac{1}{\mid \varvec{x}_1-\varvec{x}_2 \mid }\) and \(\mid \varOmega \rangle \) is the ground-state of the object. Now we use the fact that the imaginary part of the correlator is directly proportional to the graviton absorptive cross section, and thus we can write the differential power loss as

$$\begin{aligned} \frac{dP}{d\omega }=-\frac{1}{T}\frac{G}{64\pi ^2}\sum _{a\ne b} \sigma ^b_{abs} (\omega )M_a^2 \mid q_{ij}(\omega ) \mid ^2. \end{aligned}$$
(21)

The power of this result is that it is universal. The dependence on the internal dynamics resides only in the absorptive cross section.

What about the real part of the correlator? In the limit of vanishing frequency the real part generates a local interaction. In fact, if we are considering the electric or magnetic quadruples then this limit of the real part will exactly mimic the contribution of the the operators in (12).Footnote 14 Remarkably, it has been shown by several groups that the coefficient \(a_R\) is zero for black holes [4650]. In Chakrabarti et al. [49, 50] this coefficient was calculated by solving the Regge Wheeler equation in the full theory and comparing it to the solution in the effective theory with a point particle source and a dynamical quadruple degree of freedom . The Fourier transformed of the the correlatorFootnote 15 can then be read off from the linear response relation

$$\begin{aligned} Q^{ab}(\omega )= -\frac{1}{2}E_{BG}^{ab}(\omega )F(\omega ). \end{aligned}$$
(22)

Expanding this response function in small \(\omega \)

$$\begin{aligned} F(\omega )\approx (a_R+i a_I+\omega (i b_I)+\omega ^2(c_R+i c_I)+\cdots ) \end{aligned}$$
(23)

By comparing to the full theory one may extract the coefficients it was found that \(a_R=0\) which in turn implies that \(C_E=0\). It was also found that \(c_R\) is non-zero and that it depends logarithmically on \(\omega \). That is, the Wilson coefficient for the operator \((\dot{E})^2\) is scale dependent. The matching for the magnetic piece has yet to be done to date.

Note that the vanishing of \(a_R\) does not imply that the black hole does not have a DC (electric) susceptibility since there is still a non-zero response, as the Schwarzchild solutions deforms. However, it does mean that if we we assumed that the magnetic response also vanished, then there would be no geodesic deviation up to four derivatives. Indeed, given the lack of a symmetry argument that would pick out the spin two representation as being special, it is tempting to conjecture that all the static susceptibilities vanish. This would be quite remarkable. However given the singular (in the literary sense) nature of black holes it would not be a complete surprise.

Also note that \(C_E\) is gauge invariant despite the necessary choice of frames in doing the matching, so it is unambiguous. It is important to appreciate that these results do not mean that that this coefficient does not get renormalized. Indeed in the effective theory one needs power law counter-terms. However, crucially this result implies that there are no logarithmic divergences. Thus the coupling does not run, if it’s zero at one scale its zero at all scales. Its vanishing however, is quite “unnatural” (for a discussion of naturalness in the context of QFT see [14]). This is a fine tuning problem, we would expect the coefficient to take a value on which scales with the UV cut-off (\(r_s\)), but instead it’s zero, with no apparent symmetry to protect it. The unnaturalness is also manifest when considering [52] the spectral decomposition of the retarded correlator,Footnote 16 we find that

$$\begin{aligned} Re F(\omega )= P \sum _{m} \frac{\mid \langle \varOmega \mid Q_{ab}\mid m \rangle \mid ^2 }{E_\varOmega -E_m-\omega }. \end{aligned}$$
(24)

If we look at the DC response we are left with the rather remarkable result

$$\begin{aligned} \sum _{m} \frac{\mid \langle \varOmega \mid Q_{ab}\mid m \rangle \mid ^2 }{E_\varOmega -E_m }=0. \end{aligned}$$
(25)

which seems to imply that the state \(\varOmega \) can not be pure given the the denominator has a fixed sign. Suppose that the state the state \(\mid \varOmega \rangle \) is thermal then we find

$$\begin{aligned} \sum _{m,n} e^{-\beta (E_n)}\frac{ \langle n \mid Q_{ab}\mid m \rangle \langle m \mid Q_{ab} \mid n \rangle }{E_n-E_m }=0, \end{aligned}$$
(26)

which would imply a delicate cancellation. It is interesting to note that a similar effect arises in the theory of dissipative fluids [51]. Given the membrane paradigm the possible connection is quite tantalizing.

3.5 Further De-Geometrization

Finally I would like to come back to the topic introduced at the beginning of the talk. We have seen that thinking about GR as a gauge theory, much like QCD, has some calculational benefits. However, we have not really fully de-geometrized the problem. We still needed to rely on the Einstein-Hilbert action to generate a set of Feynman rules. A completely ageometric approach should not depend upon an action which followed from the principles of GR. Furthermore, we have also seen that calculating Feynman diagrams when going to higher orders can be quite cumbersome. As previously discussed the number of diagrams grows factorially with the order of the calculation. Moreover, as one goes to order \(n\) one needs to include the interactions which have \(n+1\) gravitons, which becomes rapidly computationally intensive.

In fact the proliferation of Feynman diagrams is a direct result of the fact that the Einstein-Hilbert action carries with it gauge redundancies which are forced upon us by general coordinate invariance. Modern amplitude techniques [53] have taught us that calculating Feynman diagrams is a highly inefficient way to proceed. The reason being that individual diagrams are not gauge invariant and as a consequence all the extra baggage associated with the unphysical degrees are freedom need to be kept. If, on the other hand, we concentrate on calculating physical quantities things simplify greatly. Indeed when calculating on-shell scattering amplitudes, we only need the (on-shellFootnote 17) three graviton amplitude as all the higher point scattering amplitude can be fixed using the BCFW recursion relations [54, 55]. Moreover, the three graviton amplitude can be fixed by Lorentz covariance [56]. So if we are only interested in calculating on-shell quantities all of the non-linearities inherent in the Einstein-Hilbert action are completely unnecessary. In a sense we can avoid Einstein formulation of GR completely. All on-shell scattering amplitudes can be fixed starting only with the assumptions of: Lorentz invariance, the existence of a massless spin-two particle and that the only poles in tree level amplitudes arise from particle exchange (this is a restatement of the locality assumption which goes into the BCFW recursion relations.). Most importantly, there is no need for Feynman diagrams since we use the aforementioned recursion relations. If one uses spinor-helicity notation the results for the full diagrams are quite compact.

Fig. 6
figure 6

One loop diagrams which contribute to the 1PN potential

This line of reasoning begs the question, what do on-sell quantities have to do with the potential, which is an inherently off-shell object? The answer resides in (1). If we can calculate the on-shell scattering amplitude, say between scalars, then we can extract the potential. However, as I mentioned above the on-shell construction of the n-point amplitude applies only to tree level diagrams while the potentials (beyond Newtons’ law) arise from loop diagrams such as the ones shown in Fig. (6). Nonetheless, one can construct these diagrams from a pair of on-shell scalar-graviton n-point amplitudes using unitarity-sewing methods [57, 58] whereby the loop diagrams are constructed by sewing together two on-shell amplitudes and removing the on-shell conditions for the intermediate states. A generic potential generated in this way is shown in Fig. (7). Once the cut is lifted this generates the full scattering amplitude. Of course this includes quantum corrections, but it is simple to extract the classical pieces by looking at the form of the scalar integral. The final potential can be extracted after subtracting the iterations of lower order potentials. This procedure is demonstrated in Neill and Rothstein [59] where the 1-PN potential is calculated. In this paper is was also pointed that using this technique one can generate solutions to Einsteins’ equation directly without ever writing down the equations themselves. That is, one can generate classical space-times directly from the S-matrix elements of massless spin two particles. This is done by calculating the potentials and taking the probe limit, i.e. where one of the masses goes to zero. The metric can then be extracted or by order in \(G_N\) by formulating the most general ansatz and fixing the coefficients via a matching procedure. Of course this technique is limited as it assumes asymptotic flatness. Nevertheless it is quite remarkable that classical space times emerge naturally from S-matrices with the only assumptions being: The existence of a massless spin two particle, Lorentz invariance and locality. It would be interesting to determine if this procedure can be generalized.

Fig. 7
figure 7

Reconstructing the full scalar-scalar S-matrix by sewing together the scalar-scalar n point on shell scattering amplitudes. The horizontal lines represent the cuts which are lifted to allow the lines to be off-shell. The blobs represent a sum over all Feynman diagrams. A calculation which is avoided using the BCFW recursion relations

3.6 Conclusions

The EFT formalism has led to considerable progress in a relatively short period of time. In several areas the EFT formalism has matched the state of the art and in some case, in particular calculations involving spin, exceeded it. Of course the long term impact of the program remains to be seen. I should also mention that the type of world line effective theory discussed here has been utilized in other contexts as well including: The calculation of black hole solutions in compact space-times [6668], finite size effects in radiation reaction [64, 65], fluctuation forces on soft membranes[6063], and Casimir forces between cogs [69].