Abstract
This paper shows how geometric algebra can be used to derive a novel generalization of Heron’s classical formula for the area of a triangle in the plane to higher dimensions. It begins by illustrating some of the many ways in which the conformal model of three-dimensional Euclidean space yields provocative insights into some of our most basic intuitive notions of solid geometry. It then uses this conceptual framework to elucidate the geometric meaning of Heron’s formula in the plane, and explains in detail how it extends naturally to the volumes of tetrahedra in space. The paper closes by outlining a proof of a previously conjectured extension of the formula to the hyper-volumes of simplices in all dimensions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Background and Introduction
The intuitions which inspired Hermann Günther Grassmann to invent the inner and outer products of vectors and points are seldom emphasized in modern presentations of geometric algebra (for relevant discussions, see Refs. [5, 9,10,11, 15, 17]). This is unfortunate because they still have something to offer, both pedagogically and as a source of inspiration for its applications and further developments. I recently had the rare pleasure of experiencing some of Grassmann’s spirit in the course of discovering a natural extension of Heron’s formula to the tetrahedron [14], and envisioning how those same geometric principles can be applied in higher dimensions. The algebra was tough going, but the payoff was in the geometric insights it led to.
In order to emphasize the elementary nature of those results and render them accessible to the widest possible audience, I eschewed the use of modern geometric algebra in my earlier papers on the tetrahedron and confined myself to the better-known vector algebra of Gibbs and Heaviside [14]. Nevertheless, the full power of geometric algebra is needed to build upon the geometric insights obtained in three dimensions in order to extend Heron’s formula to yet higher dimensions. We shall see that the algebraic framework obtained by applying geometric algebra to the conformal model of Euclidean geometry unites and codifies Grassmann’s original geometric intuitions in an eminently satisfactory fashion.Footnote 1
Accordingly, the first three sections of this paper use conformal geometric algebra to derive the relevant algebraic relations, which always boil down to scalar equations, among the invariants that connect various covariant multi-vector quantities of increasing grades in the conformal model. These algebraic relations are then used to give a new derivation of Heron’s formula for triangles, followed by its extension to tetrahedra. The paper closes by using the geometric insights thereby obtained to outline a proof of my previously conjectured [14, Part II] extension of the formula to higher dimensions.
2 Barycentric Sums in the Conformal Model
One curious feature of the conformal model in \(\mathcal G_{4,1}\) is that, although it contains the 4-dimensional homogeneous (projective) model as a subspace, barycentric sums in that subspace do not correspond to weighted sums of the corresponding conformal points.Footnote 2 This is because a naive barycentric sum of conformal points \(\varvec{a} = \varvec{n}_0 + \textbf{a}+ \varvec{n}_\infty \, \textbf{a}^2 / 2\), \(\varvec{b} = \varvec{n}_0 + \textbf{b}+ \varvec{n}_\infty \, \textbf{b}^2 / 2\) (\(\varvec{n}_0^2 = \varvec{n}_\infty ^2 = 0\), \(\varvec{n}_0 {\,\varvec{\cdot }\,}\varvec{n}_\infty = -1\), \(\textbf{a}, \textbf{b}\in \mathbb R^3\)) is not itself null vector:
Instead a non-linear correction term has to be added on, specifically:
For a general indexed sum \(\sum _{i=1}^N \beta _i\, \varvec{a}_i\) of points with \(\sum _{i=1}^N \beta _i = 1\), the corresponding formula is:
Far from being an encumbrance, this correction term contains valuable metrical information. For example, if \(\varvec{b}\) is such a corrected barycentric sum over a set of conformal points \(\{ \varvec{a}_i \}\), we can easily derive Lagrange’s first identity [12] for the radius of gyration about the barycenter \({\textbf {b}}\,{:}{=}\,\sum _i \beta _i\, {\textbf {a}}_i\) thereof as follows:
We can also derive Lagrange’s second identity, which connects these distances with those to an arbitrary third point \(\varvec{c}\), namely:
Lagrange’s identities in turn imply the Huygens-Leibniz identity which (given \(\sum _i \beta _i = 1\) and letting \({\textbf {c}} \,{:}{=}\, {\textbf {a}}_j\) for \(j = 1, \ldots , N\)) allows the barycentric coordinates \(\{ \beta _i \}\) of \(\textbf{b}\) to be computed from the distances among the points \(\{ \textbf{b}, \textbf{a}_i \}\):
If we are given a second barycentric sum \(\textbf{b}' = \sum _i \beta _i'\, \textbf{a}_i\), the squared distance between the two is
but since \(\varvec{n}_\infty {\,\varvec{\cdot }\,}\varvec{a}_k = -1\) for all k and \(\sum _k \beta _k' = 1\), we also have
Together with Eq. (7), this yields:
This last expression is known as Schönberg’s quadratic form [4]. It is well known that it is non-negative for all values of the variables \(\delta _i = \beta _i - \beta _i'\) with \(\sum _i \delta _i = 0\) if and only if the coefficients \( \Vert \textbf{a}_i - \textbf{a}_j \Vert ^2\) are indeed the squared distances among a system of points in Euclidean space. Its geometric interpretation as the squared Euclidean norm of an affine sum,Footnote 3 which is itself just a squared distance, shows very clearly why this is so.
A special class of particular interest occurs when the barycentric coefficients \(\beta _i\), \(\beta _j'\) vanish except on subsets \(\mathrm B\), \(\mathrm B'\) \(\subseteq \) \(\mathrm A {:}{=}\{ 1,\, \ldots \,, N \}\) of sizes \(| \mathrm B |\), \(| \mathrm B' |\), and \(\beta _i = |\mathrm B|^{-1}\), \(\beta _j' = |\mathrm B'|^{-1}\) for all \(i \in \mathrm B\), \(j \in \mathrm B'\) respectively. This results in a formula for the squared distance between the centroids \(\textbf{b}= |\mathrm B|^{-1} \sum _i\, \textbf{a}_i\,\), \(\textbf{b}' = |\mathrm B'|^{-1} \sum _j\, \textbf{a}_j\) of the two (possibly overlapping) subsets, namely:
This formula shows easily that the squared length of the median of a triangle \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \) from vertex \(\textbf{a}\) to the midpoint of \(\llbracket \hspace{0.5pt}\textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \) is \(\frac{1}{2} \Vert \textbf{a}- \textbf{b}\Vert ^2 + \frac{1}{2} \Vert \textbf{a}- \textbf{c}\Vert ^2 - \tfrac{1}{4} \Vert \textbf{b}- \textbf{c}\Vert ^2\), that the squared length of the bimedian of a tetrahedron \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \) from the midpoint of \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\hspace{0.5pt}\rrbracket \) to that of \(\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \) is \(\frac{1}{4} \big ( \Vert \textbf{a}\!-\! \textbf{c}\Vert ^2 + \Vert \textbf{a}\!-\! \textbf{d}\Vert ^2\) \(+\) \(\Vert \textbf{b}- \textbf{c}\Vert ^2 + \Vert \textbf{b}- \textbf{d}\Vert ^2 - \Vert \textbf{a}- \textbf{b}\Vert ^2 - \Vert \textbf{c}- \textbf{d}\Vert ^2 \big )\), and that the squared distance of the centroid \(\textbf{b}= |\mathrm A|^{-1} \sum _i \textbf{a}_i\) of a set of points indexed by \(\mathrm A = \{ 1,\, \ldots \,,\, N \}\) to each member \(i \in \mathrm A\) thereof is [6]
In the homogeneous model the difference between two points \(\varvec{n}_0 + \textbf{a}\) and \(\varvec{n}_0 + \textbf{b}\) yields a “free” vector \(\textbf{v}= \textbf{b}- \textbf{a}\). It acts as a translation by addition, producing in particular \(\varvec{n}_0 + \textbf{b}= \varvec{n}_0 + \textbf{a}+ \textbf{v}\). The difference between two conformal points \(\varvec{a}\) and \(\varvec{b}\), however, contains another term proportional to \(\varvec{n}_\infty \), namely
Even though \(\varvec{b} = \varvec{a} + \varvec{v}\) and \(\varvec{v}^2 = \Vert \textbf{b}- \textbf{a}\Vert ^2\), the addition of \(\varvec{v}\) to an arbitrary conformal point does not yield the translated point in general. Instead, multiplying it from the left by \(\varvec{n}_\infty / 2\) yields the bivector \(\varvec{n}_\infty \textbf{v}/ 2\), which in turn generates a rotor \(\exp (\varvec{n}_\infty \textbf{v}/ 2) = 1 + \varvec{n}_\infty \textbf{v}/ 2\) that translates a point \(\varvec{c} = \varvec{n}_0 + \textbf{c}+ \varvec{n}_\infty \textbf{c}^2 / 2\) via the usual multiplicative two-sided action:
Half the difference of the two “flat” points \({\varvec{n}}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}{} {\textbf {b}} - {\varvec{n}}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}{\varvec{a}} = {{\varvec{n}}}_\infty {\textbf {v}}\) generates this same rotor \(\exp ( \varvec{n}_\infty \textbf{v}/ 2)\), and more generally any affine sum of flat points \(\sum _i \delta _i (\varvec{n}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}\varvec{a}_i)\) \(=\) \(\varvec{n}_\infty \sum _i \delta _i\, \textbf{a}_i\) \(=\) \(\varvec{n}_\infty \textbf{v}\) with \(\sum _i \delta _i = 0\) also generates a translation. Alternatively, the difference between two flat points \(\varvec{n}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}(\varvec{b} \hspace{0.5pt}-\hspace{0.5pt} \varvec{a}) = \varvec{n}_\infty {\textbf {v}}\) does translate flat points by simple addition, i.e. \(\varvec{n}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}\varvec{c}' = \varvec{n}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}\varvec{c} + \varvec{n}_\infty {\textbf {v}}= \varvec{n}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}(\varvec{n}_0 + {\textbf {c}}+ {\textbf {v}})\), from which the corresponding conformal point can be extracted as \(\varvec{c}' = (\varvec{n}_\infty \hspace{-0.05em}\wedge \hspace{-0.05em}\varvec{c}') {\,\varvec{\cdot }\,}\varvec{n}_0 + \varvec{n}_\infty \, \Vert {\textbf {c}}+ {\textbf {v}}\Vert ^2 / 2 \).
3 Line-Bound Vectors and Tetrahedra
The next step up from flat points are line-bound vectors, which have the form
with \(\varvec{N} \,{:}{=}\, \varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{n}_0\,\). This can be constructed directly from the two flat points in question as \(\varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{a} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{b} = \frac{1}{2}\, \varvec{n}_\infty \wedge (\varvec{a} + \varvec{b}) \wedge \big ( \varvec{n}_0 \,{\varvec{\cdot }}\, (\varvec{n}_\infty \wedge (\varvec{a} - \varvec{b})) \big )\).
The inner product of this with \(\varvec{n}_0\) gives a bivector \(({\textbf {b}}- {\textbf {a}}) \varvec{n}_0 -\hspace{0.5pt} {\textbf {a}} \hspace{-0.05em}\wedge \hspace{-0.05em}{} {\textbf {b}}\) that is converted by inversion in an origin-centered unit sphere into \((\textbf{b}- \textbf{a})\varvec{n}_\infty / 2 - \textbf{a}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{b}\), which then generates a Euclidean “screw” motion (translation and rotation about an axis thereof). The line-bound vector itself is determined by an oriented segment \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\hspace{0.5pt}\rrbracket \) of the line , where the segment’s orientation is \((\textbf{b}- \textbf{a}) \,/\, \Vert \textbf{b}- \textbf{a}\Vert \) and the line’s (minimum) distance from the origin is \(\Vert \textbf{a}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{b}\Vert \,/\, \Vert \textbf{b}- \textbf{a}\Vert \). The same line-bound vector is however obtained for any other two points \(\varvec{a}', \varvec{b}' \in \mathcal G_{4,1}^{\,1}\) with \(\textbf{b}' \!- \textbf{a}' = \textbf{b}- \textbf{a}\) and \(\textbf{a}' \hspace{-0.1em}\wedge \hspace{-0.1em}\hspace{1pt} \textbf{b}' = \textbf{a}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{b}\), meaning that it only determines the equivalence class of all translates of the original oriented segment \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\hspace{0.5pt}\rrbracket \) parallel to (see Fig. 1). Today this indeterminacy would be described as an Abelian gauge symmetry.
The squared length of the (defining) segment \(\varvec{|}\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\hspace{0.5pt}\rrbracket \varvec{|}^2\) may be obtained by squaring the line-bound vector, which yields a simple example of what is more generally known as a Cayley-Menger determinant [4, 8, 13]:
A little more generally, the inner product of two line-bound vectors is the inner product of the corresponding free vectors:
In order to obtain the higher-grade parts of the geometric product of two such line-bound vectors, we expand it via Eq. (14) to get:
The two-vector part of this, which its anti-symmetry under reversion shows to be the same as the commutator product “\(\bowtie \)” of the line-bound vectors, is:
Finally, the four-vector part of this product of line-bound vectors is:
This is \(3!\, \varvec{n}_\infty \) times the oriented (w.r.t. the pseudo-scalar \(\varvec{\iota } \,{:}{=}\, {\textbf {e}}_1 {\textbf {e}}_2 {\textbf {e}}_3\) of \(\mathbb R^3\)) volume of the tetrahedron \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \). Since the line-bound vectors do not change when \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\hspace{0.5pt}\rrbracket \) and \(\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \) are translated along their respective lines, the volume of the tetrahedron is also unchanged by such translations, as well as by translations of its other edges along the lines they span.
If \(\varvec{L} \,{:}{=}\, \varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{a} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{b}\) and \(\varvec{M} \,{:}{=}\, \varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{c} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{d}\), then Eq. (15) and the reversion symmetry of the product of \(\varvec{LM}\) with its reverse shows that it is the scalar
This product, however, can also be written as
(since both \(\langle \varvec{LM} \rangle _{0}\) \(\langle \varvec{LM} \rangle _{2}^\sim \) \(+\) \(\langle \varvec{LM} \rangle _{2}\) \(\langle \varvec{LM} \rangle _{0}^\sim \) \(= 0\) and \(\langle \varvec{LM} \rangle _{2}\) \(\langle \varvec{LM} \rangle _{4}^\sim \) \(+\) \(\langle \varvec{LM} \rangle _{4}\) \(\langle \varvec{LM} \rangle _{2}^\sim \) \(= 0\)). Because \(\langle \varvec{LM} \rangle _{4}\) contains a factor of \(\varvec{n}_\infty \) by Eq. (19), we have \(\langle \varvec{LM} \rangle _{4}\) \(\langle \varvec{LM} \rangle _{4}^\sim = 0\), while \(\langle \varvec{LM} \rangle _{0} = ({\textbf {b}}- {\textbf {a}}) {\,\varvec{\cdot }\,} ({\textbf {d}}- {\textbf {c}})\) by Eq. (16) and the last term on the right-hand side of Eq. (21) contains no scalar part. Thus the squared magnitude of \(\langle \varvec{LM} \rangle _{2}\), or scalar product “\(*\)” with itself, is
Geometrically, this quantity is four times the squared area of the medial parallelogram \(\llbracket \hspace{0.5pt}(\textbf{a}+ \textbf{c})/2, (\textbf{c}+ \textbf{b})/2, (\textbf{b}+ \textbf{d})/2, (\textbf{d}+ \textbf{a})/2 \hspace{1pt} \hspace{0.5pt}\rrbracket \) of the tetrahedron \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \). To see that this is indeed a parallelogram, we need only note that the geometric products of the vectors along its two pairs of opposite sides are
The vanishing bivector parts of these equations show those pairs are parallel, while their scalar parts show their lengths are \(\Vert \textbf{a}- \textbf{b}\Vert / 2\) and \(\Vert \textbf{c}- \textbf{d}\Vert / 2\), resp. Furthermore, the outer product of the vectors along any two consecutive sides, say from \((\textbf{a}+ \textbf{c})/2\) to \((\textbf{c}+ \textbf{b})/2\) and \((\textbf{d}+ \textbf{a})/2\), is simply
thereby proving this paragraph’s first assertion.
In striking analogy to the volume of a tetrahedron being 1/3 the height of any vertex over its opposite face times the area of that face, that volume is also equal to 2/3 the area of the medial parallelogram times the perpendicular distance between the corresponding pair of lines. Up to sign, this distance is
where the duality of the inner, outer and cross products “\(\times \)” with respect to the pseudo-scalar \(\varvec{\iota }= -\tilde{\varvec{\iota }}\) of \(\mathbb R^3\) was utilized together with \((\textbf{c}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{c}) = (\textbf{c}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{a}+ \textbf{a}- \textbf{c}) = (\textbf{c}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{a})\). Since \(\Vert (\textbf{b}- \textbf{a}) {\,\hspace{-0.1em}\wedge \hspace{-0.1em}\,} (\textbf{d}- \textbf{c}) \Vert / 4\) is the medial parallelogram’s area and \(\tilde{\varvec{\iota }}\, (\textbf{c}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{b}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{a}) / 6\) is the volume \(\varvec{|}\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \varvec{|}\) (or its negative if the orientation of \((\textbf{c}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{b}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{a})\) is opposite that of \(\varvec{\iota }\)), our claim follows.
In general, the sum of line-bound vectors is not itself a line-bound vector, but a composite entity which can be interpreted in various ways. Classical interpretations include the result of a system of forces acting at various points on a rigid body, and the result of a system of infinitesimal motions applied to a rigid body. We will not develop these theories here, but refer the interested reader to the extensive literature on the subject [16]. Instead we shall seek to interpret the graded components of the product of such an entity with itself in the context of the conformal model, as above.
Hence consider an arbitrary composite line-bound vector of the form:
By Eq. (16), the inner square of this expands to:
Thus the length of this diagonal of the medial parallelogram (which is also a bimedian of the tetrahedron) remains unchanged as \( \llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\hspace{0.5pt}\rrbracket \) and \( \llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \) are translated along their respective lines. The lengths of its sides \(\Vert \textbf{b}- \textbf{a}\Vert / 2\) and \(\Vert \textbf{d}- \textbf{c}\Vert / 2\) are of course also invariant under such translations. Finally, Eq. (18) shows that the bivector of the medial parallelogram is likewise invariant. It follows that translations of \( \llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\hspace{0.5pt}\rrbracket \) and \( \llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \) along their respective lines merely translate the corresponding medial parallelogram in the mid-plane of the lines without changing its shape or aspect (as illustrated in Fig. 2), and in accord with the above volume formula, cf. Eq. (25).
The anti-symmetry of Eq. (18) under the \((\textbf{a}, \textbf{b}) \leftrightarrow (\textbf{c}, \textbf{d})\) swap shows that the 2-vector part of the square of a composite line-bound vector vanishes. The 4-vector part, however, is:
This clearly does vanish if \(\textbf{b}- \textbf{a}= \textbf{d}- \textbf{c}\) or \(\textbf{a}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{b}= \textbf{c}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{d}\), i.e. the line-bound vector in question is simple (non-composite). On taking its dual with respect to \(\varvec{\iota }\), we obtain:
The vanishing of the scalar coefficient of \(2\, \varvec{n}_\infty \) on the right is a well-known condition for the lines and to either intersect or be parallel [16]. In fact for it is just a way of writing the famous Plücker identity for the six Plücker coordinates of a line in projective three-space.
It is seldom noted, nevertheless, that this same expression is equal to \(\pm 3!\) times the volume \(\varvec{|}\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \varvec{|}\) of the tetrahedron. This follows from the fact that the 4 vector parts of \(2\, \varvec{LM}\) and \((\varvec{L} + \varvec{M})^2\) are obviously the same (cf. Eq. (19)). A thorough study of the representation of the more general Grassmann-Plücker relations in geometric algebra was recently given by Sobczyk [27].
Finally, we have the relation (first noted in the homogeneous model by Grassmann) between the sum of the line-bound vectors around any triangle \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \) and (twice) the free bivector of that triangle, namely
because the free vector coefficients of \(\varvec{N}\) from Eq. (14) cancel. This can be regarded as a discrete version of Green’s theorem. A similar relation can also be obtained by considering the sum of the line-bound vectors around any one of the three Hamiltonian cycles of a tetrahedron \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \), e.g.
which is \(\varvec{n}_\infty \) times the free bivector of the medial parallelogram separating the line from (as in Fig. 2).
4 Plane-Bound Bivectors and Tetrahedra
We now consider the relations between two plane-bound bivectors, along with the space-bound trivector (oriented volume) which a plane-bound bivector and a flat point mutually define. In the conformal model, a plane-bound bivector has the form:
The free bivector \((\textbf{c}- \textbf{b}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{b})\) has a squared magnitude of
which is of course four times the squared area \(\varvec{|}\llbracket \hspace{0.5pt}\textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \varvec{|}^2\). Its normal vector parallel to \((\textbf{c}- \textbf{b}) \times (\textbf{d}- \textbf{b}) = -\varvec{\iota }\, (\textbf{c}- \textbf{b}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{b})\) determines the orientation of the plane containing the triangle \(\llbracket \hspace{0.5pt}\textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \subset \mathbb R^3\), while the distance of the plane from the origin is \(\Vert {\textbf {b}} \hspace{-0.05em}\wedge \hspace{-0.05em} {\textbf {c}} \hspace{-0.05em}\wedge \hspace{-0.05em}{} {\textbf {d}} \Vert \,/\, \Vert ({\textbf {c}} - {\textbf {b}}) \hspace{-0.05em}\wedge \hspace{-0.05em} ({\textbf {d}} - {\textbf {b}}) \Vert \). The same plane-bound bivector, however, is obtained for any other triple of conformal points \(\varvec{b}'\), \(\varvec{c}'\), \(\varvec{d}'\) that lie in the same plane and span a triangle \(\llbracket \hspace{0.5pt}\textbf{b}', \textbf{c}', \textbf{d}' \hspace{0.5pt}\rrbracket \) with the same oriented areaFootnote 4 (as depicted in Fig. 3).
The three-point Cayley-Menger determinant, obtained by multiplying the plane-bound bivector with the negative of its reverse, is:
In this following, this determinant will often be written as where is the magnitude of any \(\textbf{X}\in \mathcal G_{4,1}\). By expanding the squared dot product in Eq. (33) via the law of cosines \(\textbf{x}\,{\varvec{\cdot }}\, \textbf{y}= (\Vert \textbf{x}\Vert ^2 + \Vert \textbf{y}\Vert ^2 - \Vert \textbf{x}- \textbf{y}\Vert ^2)/2\), it can readily be shown that this Cayley-Menger determinant is also that free bivector’s squared norm. The factorized expression on the right-hand side of this equation is of course Heron’s formula for the squared area (times 4) of the triangle \(\llbracket \hspace{0.5pt}\textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \), which will be studied in much greater detail in the next section. At this point it need only be noted that, as a polynomial in three variables variables \(d_\textsf{bc}\), \(d_\textsf{bd}\), \(d_\textsf{cd}\) \(\ge 0\) representing the corresponding distances, this formula is non-negative if and only if those variables satisfy all three triangle inequalities among them.
The negative inner product of different plane-bound bivectors gives a non-symmetric Cayley–Menger determinant (one of which will be reversed for consistency even though it’s reverse symmetric). Providing their planes intersect, we may assume without loss of generality that the point triples \(\varvec{a}, \varvec{b}, \varvec{c}\) and \(\varvec{a}, \varvec{b}, \varvec{d}\) have a pair \(\varvec{a}, \varvec{b}\) in common, so that:
Geometrically, this inner product is \(-\Vert (\textbf{b}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{a}) \Vert \Vert (\textbf{b}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{c}- \textbf{a}) \Vert \) \(\cos (\varphi _\textsf{ab})\), where \(\varphi _\textsf{ab}\) is the internal dihedral angle between the planes and . We also have the trivial identity among free bivectors,
which, upon taking the squared norms of both sides, becomes
Together with Eqs. (18), (22), (35) and (37), this leads to the little-known areal law of cosines,
where \(\varvec{L} \bowtie \varvec{M} = \big \langle \varvec{LM} \big \rangle _2\) is the commutator product of the line bound vectors as they were defined in Eq. (20).
There is also a corresponding areal law of sines, obtained from a simple case of the dual (Grassmann’s regressive) outer product “\(\vee \)” corresponding to the meet of the associated subspaces [18, 22]. Letting \(\textbf{X}\) be an arbitrary blade (here of grade 1 in \(\mathcal G_3\) or 3 in \(\mathcal G_{4,1}\), though the definition is general) and \(\textbf{y}, \textbf{z}\) be vectors with \(\Vert \textbf{X}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{y}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{z}\Vert ^2 \ne 0\), this is given by
where \((\textbf{X}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{y}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{z})^*\) is the dual w.r.t. the unit pseudo-scalar of the subspaces’ join. Upon taking squared norms and recalling (see e.g. Ref. [7, §5.6]) that the magnitude of such a dual outer product is where \(\varphi \) is the internal angle between the subspaces of \(\textbf{X}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{y}\) and \(\textbf{X}\hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{z}\), this implies:
where \(\varphi _\textsf{ab}\) is the dihedral angle as defined above.
Now consider the product of a flat point with a plane-bound bivector:
The \(\varvec{n}_\infty \, {\textbf {a}} {\,\varvec{\cdot }\,}\big ( ({\textbf {c}}- {\textbf {b}}) \hspace{-0.1em}\wedge \hspace{-0.1em} ({\textbf {d}} - {\textbf {b}}) \big )\) term is unfortunately not translation invariant, but the rest of its 2-vector part \((\textbf{c}- \textbf{b}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{b})\) is, as is its 4-vector part \(\varvec{n}_\infty \, (\textbf{a}- \textbf{b}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{c}- \textbf{b}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{b})\). The latter is also the commutator product of the flat point and plane-bound bivector, which may be expanded using the general relation \(({\textbf {x}}\hspace{-0.1em}\wedge \hspace{-0.1em}{} {\textbf {y}}) \bowtie {\textbf {Z}}= {\textbf {x}}\hspace{-0.1em}\wedge \hspace{-0.1em}({\textbf {y}} {\,\varvec{\cdot }\,} {\textbf {Z}}) - {\textbf {y}} \hspace{-0.1em}\wedge \hspace{-0.1em} ({\textbf {x}} {\,\varvec{\cdot }\,} {\textbf {Z}})\) [18, Eq. (1.60b)] as:
Its translation invariance shows that the volume \(\varvec{|}\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}'\!, \textbf{c}'\!, \textbf{d}' \hspace{0.5pt}\rrbracket \varvec{|}\) of the tetrahedron is the same for all \(\textbf{b}'\), \(\textbf{c}'\), \(\textbf{d}'\) in the same plane and spanning a triangle of the same oriented area (cf. Fig. 3). It is also the same, of course, for any translate of \(\textbf{a}\) in the plane through \(\textbf{a}\) parallel to .
We can put \(\textbf{a}\) at the origin by translating all the points with the rotor \(\varvec{T}_{\textbf {a}} \,{:}{=}\, 1 - {\textbf {a}}\varvec{n}_\infty / 2\), obtaining:
Thus our remarks following Eq. (33) show that the height of \(\varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{a}\) above the plane of \(\varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{b} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{c} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{d}\) is
which is the same as the ratio of the norms of the corresponding translation-independent terms in Eq. (41).
Last but not least, we have the following discrete version of Stoke’s theorem among the plane-bound bivectors of the four faces of the tetrahedron, which follows from Eq. (42):
Together with Eq. (30), this implies that the sum (times \(\varvec{N}\), as in Eq. (32)) of the free bivector parts of these plane-bound bivectors vanishes, or
a result usually attributed to Hermann Minkowski although it must have been known to Grassmann. An alternative derivation of this formula, based upon the relation among the line-bound vectors around any face of the tetrahedron from Eq. (30), is sketched in Fig. 4 and its caption.
Note that the bivectors of the four faces determine those of the medial parallelogram \((\textbf{b}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{c})\) via Eq. (36) and its analogues for the other two medial parallelogram bivectors \((\textbf{c}- \textbf{a}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{b})\) and \((\textbf{c}- \textbf{b}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}- \textbf{a})\).
This system of linear relations among these bivectors can be inverted to express the bivector of each face as a signed sum of those of the three medial parallelograms (cf. [14, Part I]), namely:
These analogues of Minkowski’s identity (together with Eq. (25)) justify the heterodox point-of-view that the tetrahedron actually has seven faces, where its three medial parallelograms qualify as interior faces. Further justification derives from the fact (first noted in unpublished work by B. D. S. McConnell circa 2012; see also Theorem 11 in Ref. [14, Part I]) that the areas of these seven faces mutually determine a non-degenerate tetrahedron up to isometry.
5 Heron’s Formula, the In-Circle, and the In-Touch Triangle
The geometric essence of Heron’s formula is that it is a relation between the radius of the in-circle of a triangle \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \subset \mathbb R^2\) and the lengths of the segments into which its edges are divided by the points at which the in-circle “touches” those edges.Footnote 5 As may be seen in Fig. 5, these segments come in pairs of equal lengths each with a vertex in common. Those lengths u, v, w have been called the “Heron parameters” of the triangle [3], and clearly determine the lengths of its edges \(a \,{:}{=}\, \varvec{|}\llbracket \hspace{0.5pt}{} {\textbf {b}}, {\textbf {c}}\hspace{0.5pt}\rrbracket \varvec{|}\), \(b \,{:}{=}\, \varvec{|}\llbracket \hspace{0.5pt}{} {\textbf {a}}, {\textbf {c}}\hspace{0.5pt}\rrbracket \varvec{|}\), \(c \,{:}{=}\, \varvec{|}\llbracket \hspace{0.5pt}{} {\textbf {a}}, {\textbf {b}}\hspace{0.5pt}\rrbracket \varvec{|}\) as \(a = v + w\), \(b = u + w\), \(c = u + v\). Conversely, one can solve this non-singular system of linear equations to obtain the Heron parameters from the edge lengths as \(u = (-a + b + c)/2\), \(v = (a - b + c)/2\), \(w = (a + b - c)/2\).
It follows that the Heron parameters specify the triangle up to isometry in a manifestly coordinate-free fashion. The same, of course, is true for the edge lengths a, b, c themselves, but the Heron parameters have the distinct advantage over the edge lengths of not being constrained by the triangle inequality, because the edge lengths obtained as above from any \(u, v, w \ge 0\) necessarily obey the triangle inequality. This means the Heron parameters can be varied wholly independently of one another, and that makes them truly the natural parameters for the venerable discipline of triangle geometry.
If \(\textbf{i}\in \mathbb R^2\) is the in-center of the triangle \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \), r is the in-radius thereof and \(t \,{:}{=}\, \varvec{|}\llbracket \hspace{0.5pt}{} {\textbf {a}}, {\textbf {b}}, {\textbf {c}}\hspace{0.5pt}\rrbracket \varvec{|}\) is the triangle’s area, we see easily from Fig. 5 that
where \(s \,{:}{=}\,(a+b+c)/2 = u+v+w\) is the triangle’s semi-perimeter. This shows that a formula for r is pretty much the same as a formula for t. Because the barycentric coordinates of \(\textbf{i}\) are the ratios of these signed areas to the total signed area and the signs of the areas of \(\llbracket \hspace{0.5pt}\textbf{i}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \), \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{i}, \textbf{c}\hspace{0.5pt}\rrbracket \) and \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{i}\hspace{0.5pt}\rrbracket \) agree with that of \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \), the in-center \(\textbf{i}\) is given by
Letting \(\varvec{a}, \varvec{b}, \varvec{c}, \varvec{i} \in \mathcal G_{3,1}^{\,1} \subset \mathcal G_{4,1}^{\,1}\) be the corresponding conformal points including the correction terms from Eq. (3), a neat proof of these claims can be obtained via Eq. (42) simply by noting that the commutator product \((\varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{a} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{b} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{c}) \bowtie (\varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{i}) = \varvec{n}_\infty \, ({\textbf {a}}- {\textbf {i}}) \hspace{-0.1em}\wedge \hspace{-0.1em}({\textbf {b}}- {\textbf {i}}) \hspace{-0.1em}\wedge \hspace{-0.1em}({\textbf {c}}- {\textbf {i}})\) vanishes because \(\textbf{i}\in \llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \). This also shows that the dual representation [7, §14.1] of the in-circle itself can be written as
wherein \(\Vert \hspace{1pt} {\textbf {i}}\hspace{1pt}\Vert ^2 = -2\, \varvec{i} {\,\varvec{\cdot }\,} \varvec{n}_0 = (a\, {\textbf {a}}^2 + b\, {\textbf {b}}^2 + c\, {\textbf {c}}^2 - abc) / (2\, s)\). As is well known, \(abc = 4\, t R\) where R is the triangle’s circum-radius, so taking \(\textbf{i}\) as the origin shows that \((a \Vert {\textbf {a}} - {\textbf {i}} \Vert ^2 + b \Vert {\textbf {b}} - {\textbf {i}} \Vert ^2 + c \Vert {\textbf {c}} - {\textbf {i}} \Vert ^2) / (2\,s) = 2\, r R = R^2 - d^2\) by Euler’s theorem, where d is the distance of the in-center from the circum-center.
The Heron parameters can also be interpreted as the barycentric coordinates of the in-touch points \(\textbf{j}\in \llbracket \hspace{0.5pt}\textbf{b}, \textbf{c}\rrbracket \), \(\textbf{k}\in \llbracket \hspace{0.5pt}\textbf{a}, \textbf{c}\rrbracket \), \(\textbf{l}\in \llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}\rrbracket \), at which the in-circle touches the triangle’s edges, times their lengths:
The corresponding conformal points are
From this, we find easily that the plane-bound bivector of \(\llbracket \hspace{0.5pt}\hspace{1pt}\textbf{j}, \textbf{k}, \textbf{l}\hspace{1pt} \hspace{0.5pt}\rrbracket \) is
and hence
where \(\Omega (u,v,w) \,{:}{=}\, 2\, uvw\) and \(\textbf{X}\, := \, \sqrt{ |\Vert \textbf{X}\Vert ^2 |}\) for \(\varvec{X} \in \mathcal G_{3,1}\) as before.
It follows that if we can find the area of \(\llbracket \hspace{0.5pt}\hspace{1pt}\textbf{j}, \textbf{k}, \textbf{l}\hspace{1pt} \hspace{0.5pt}\rrbracket \) as a function of the Heron parameters we will also get the area of \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \), again as a function of u, v, w. The triangle \(\llbracket \hspace{0.5pt}\hspace{1pt}\textbf{j}, \textbf{k}, \textbf{l}\hspace{1pt} \hspace{0.5pt}\rrbracket \) is variously known as the Gergonne, contact or in-touch triangle [1, 20, 25], and we shall use the latter nomenclature. As may be seen in Fig. 5, the circum-circle of the in-touch triangle coincides with the in-circle of the original and, as a result, the perpendicular bisectors of the in-touch triangle’s edges also coincide with the internal angle bisectors of the original. It is also apparent that the edge lengths of the in-touch triangle \(j \,{:}{=}\, \Vert \hspace{0.5pt} {\textbf {k}}- {\textbf {l}}\hspace{0.5pt}\Vert \), \(k \,{:}{=}\, \Vert \hspace{0.5pt} {\textbf {j}} - {\textbf {l}}\hspace{0.5pt}\Vert \), \(\ell \,{:}{=}\, \Vert \hspace{0.5pt} {\textbf {j}} - {\textbf {k}}\hspace{0.5pt}\Vert \) are twice the heights of \(\textbf{j}\), \(\textbf{k}\), \(\textbf{l}\) over the hypotenuses of the congruent pairs of right triangles \(\llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{i}, \textbf{k}, \textbf{a}\hspace{0.5pt}\rrbracket \approx \llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{i}, \textbf{l}, \textbf{a}\hspace{0.5pt}\rrbracket \), \(\llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{i}, \textbf{j}, \textbf{b}\hspace{0.5pt}\rrbracket \approx \llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{i}, \textbf{l}, \textbf{b}\hspace{0.5pt}\rrbracket \) and \(\llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{i}, \textbf{j}, \textbf{c}\hspace{0.5pt}\rrbracket \approx \llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{i}, \textbf{k}, \textbf{c}\hspace{0.5pt}\rrbracket \), respectively.
This latter fact can readily be used to show that the in-touch triangle’s edge lengths \(j, k, \ell \) are given by
Because they depend on the quantity we seek, namely \(r^2\), in a non-linear fashion, these simple expressions are not well-suited to our immediate purposes (although their three-dimensional analogs play an essential role in the next section). Hence we will instead derive three equivalent but quite different formulae for these distances, one of which is:
On the second line, we have used the definitions of the in-touch points and in-radius \(r = \Vert \, \textbf{j}- \textbf{i}\,\Vert = \Vert \, \textbf{k}- \textbf{i}\,\Vert \), the fact that the angle between \(\textbf{j}- \textbf{i}\) and \(\textbf{k}- \textbf{i}\) is supplementary to the original triangle’s angle \(\vartheta _\mathsf c\) at the vertex labeled \(\mathsf C\) in Fig. 5, and finally the law of cosines for this latter angle. On the third line, we used the definition \(s = (a\!+\!b\!+\!c)/2\) and formula \(w = (a\!+\!b\!-\!c)/2\) from this section’s first paragraph. In a similar fashion one can also show that \(k^2 = 4\, r^2\,s v / (ac)\) and \(j^2 = 4\, r^2\,s u / (bc)\).
We can utilize these formulae to calculate the squared area of the in-touch triangle (times 4) by expanding its three-point Cayley-Menger determinant \(-\Vert \varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{j} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{k} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{l}\, \Vert ^2\) via Eq. (34), but the algebra is a bit messy. The determinant can however be simplified considerably by defining the rescaled vectors \(\varvec{j}' \,{:}{=}\, a\, \varvec{j} / q\), \(\varvec{k}' \,{:}{=}\, b\, \varvec{k} / q\), \(\varvec{l}' \,{:}{=}\, c\, \varvec{l} / q\) with \(q \,{:}{=}\, \sqrt{4\, r^2\,s}\), so that
It follows from Eq. (56) and its analogs for the remaining edges of the in-touch triangle that the inner products among these rescaled vectors are
The determinant of the resulting matrix of inner products therefore assumes the relatively simple form:
(where the last equality follows from the definitions of q and s, and \(\Omega \) is as defined following Eq. (54)). Together with Eq. (57), this gives us
Finally, on squaring both sides of Eq. (54), multiplying through by , and substituting for using Eq. (60) together with \(r = t / s\), we obtain
This derivation of Heron’s formula is, of course, quite painful as compared to, for example, just factorizing the three-point Cayley-Menger determinant \(-\, \Vert \varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{a} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{b} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{c} \Vert ^2\) as in Eq. (34). Its advantage is that it can be generalized to higher dimensions, as will now be shown for tetrahedra.
6 A Natural Extension of Heron’s Formula to Tetrahedra
One well-known extension of Heron’s formula to tetrahedra may be found in the four-point Cayley-Menger determinant \(-\Vert \varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{a} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{b} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{c} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{d}\, \Vert ^2\), which gives the squared volume as a homogeneous cubic polynomial in the squared inter-vertex distances. Unfortunately, this four-point determinant is an irreducible polynomial (in the language of commutative algebra), meaning it cannot be factorized like the three-point determinant was in Eq. (34). Consequently, this algebraic expression does not provide much insight into the geometric structure of its locus (set of zeros), any more than it did when Nicolo Tartaglia discovered a rational version of it in the sixteenth century.Footnote 6
In contrast, the extension given here (which was first derived by the author in Part II of Ref. [14]) expresses the fourth power of the volume as a polynomial in the tetrahedral analogs of the Heron parameters of a triangle, which in turn are simple rational functions of the areas of the tetrahedron’s four exterior and three interior (medial parallelogram) faces. Unlike Heron’s and Tartaglia’s formulae, moreover, its locus turns out to be a new kind of four-point configuration which is, in some sense, another boundary for the set of all full-dimensional Euclidean tetrahedra that differs greatly from the usual boundary (planar quadrilaterals). These configurations exhibit infinite inter-vertex distances and so cannot be realized in physical space, even though they are perfectly well defined mathematically by the (finite) values of their areas. This will be further discussed at the end of this section (and was studied in depth in Part III of Ref. [14]).
Triangles and tetrahedra nevertheless do have a great deal in common. In particular, the in-radius r of the in-sphere of a tetrahedron \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \subset \mathbb R^3\) satisfies \(r = t / s\) where t is now six (3!) times its volume and s is twice its (exterior) surface area.Footnote 7 Also, as can be seen in Fig. 6, the in-sphere “touches” the tetrahedron’s four facets (or exterior faces) at points \(\textbf{j}\in \llbracket \hspace{0.5pt}\textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \), \(\textbf{k}\in \llbracket \hspace{0.5pt}\textbf{a}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \), \(\textbf{l}\in \llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{d}\hspace{0.5pt}\rrbracket \), \(\textbf{m}\in \llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \) which divide those facets into three triangles each. Finally, just as with triangles, the Pythagorean theorem shows that the distances from each vertex to its three adjacent in-touch points are all equal, and this in turn implies the pairs of such “contact triangles” sharing a common edge of the tetrahedron are congruent, giving rise to only six independent areas. Accordingly, the natural parameters of the tetrahedron will be defined as twice these areas’ values [14, Part II], specifically
If we denote twice the areas of the facets opposite to the vertices \(\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\) by a, b, c, d respectively, then in analogy to Eq. (49) the in-center of the tetrahedron is the barycentric sum of its vertices given by
In analogy to Eq. (51), moreover, the in-touch points are given by the barycentric sums
wherein the natural parameters satisfy the linear equations
The main obstacle to extending Heron’s formula to tetrahedra stems from the fact that these four equations do not uniquely determine the six natural parameters, as the corresponding relations (\(a = v\!+\!w\), \(b = u\!+\!w\), \(c = u\!+\!v\)) between the edge lengths and the Heron parameters of a triangle did.
Like the in-circle of a triangle (Eq. (50)), the dual representation of the in-sphere of a tetrahedron in \(\mathcal G_{4,1}\) can be written as
but now a, b, c, d are areas not distances while \(\textbf{a} \varvec{\cdot } \textbf{b}\) etc. are still (negative one half times the) squared inter-vertex distances. This makes the geometric interpretation of the correction term \(\varvec{n}_\infty (ab\, \varvec{a} {\,\varvec{\cdot }\,} \varvec{b} +\cdots + cd\, \varvec{c} {\,\varvec{\cdot }\,} \varvec{d})\) relatively challenging. Fortunately, because the outer product with \(\varvec{n}_\infty \) wipes out this term, Eqs. (64) still allow us to express the conformal oriented volume of the in-touch tetrahedron as
which in a fashion similar to Eq. (54) works out to just
where \(\Omega (u,v,w,x,y,z) \,{:}{=}\, 2\, (uvyz + uwxz + vwxy) - (uz)^2 - (vy)^2 - (wx)^2\). (The minus sign in front means that the orientation of the in-touch tetrahedron \(\llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{j}, \textbf{k}, \textbf{l}, \textbf{m}\hspace{0.5pt}\rrbracket \) is opposite to that of the original.)
Thus we arrive, much we did above with triangles, at the problem finding the volume of the “in-touch tetrahedron,” or of evaluating the Cayley-Menger determinant \(-\Vert \varvec{n}_\infty \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{j} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{k} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{l} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{m}\, \Vert ^2\) of the in-touch points. This requires us to find expressions for its squared inter-vertex distances \(\Vert \hspace{0.5pt}{} {\textbf {j}}- {\textbf {k}}\Vert ^2 = -2\, \varvec{j} {\,\varvec{\cdot }\,} \varvec{k}\), etc. Note, however, that this time the formula we seek is a polynomial in the natural parameters of a tetrahedron, which are areal magnitudes that cannot be expressed as rational functions of the squared distances in either the in-touch tetrahedron or the original (square roots must be extracted to do that). The squared distances among the vertices of the in-touch tetrahedron can nevertheless be obtained by an approach that is quite similar to that taken for the in-touch triangle in the previous section, but using the areal law of cosines (38) instead of the usual one as in Eq. (56), e.g.
where \(\varphi _\textsf{cd}\) is now the (internal) dihedral angle between the planes and , \(e \,{:}{=}\, \Vert ({\textbf {b}}- {\textbf {a}}) \hspace{-0.1em}\wedge \hspace{-0.1em}({\textbf {d}}- {\textbf {c}}) \Vert \) is four times the area of the medial parallelogram as defined after Eq. (22), and \(Z \,{:}{=}\, (a\!+\!b\!+\!e) (a\!+\!b\!-\!e) / (2\,s)\). All that is required to make the analogy between this equation and the corresponding equation (56) for triangles complete is \(Z = z\), where z is the natural parameter of the tetrahedron defined above as \(2\, \varvec{|}\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{j}\hspace{0.5pt}\rrbracket \varvec{|}= 2\, \varvec{|}\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{k}\hspace{0.5pt}\rrbracket \varvec{|}\).
A direct derivation of these algebraic incarnations of the natural parameters as rational functions of the seven areas \(a, b, c, d, e, f \,{:}{=}\, \Vert ({\textbf {c}}- {\textbf {a}}) \hspace{-0.1em}\wedge \hspace{-0.1em}({\textbf {d}}- {\textbf {b}}) \Vert \) and \(g \,{:}{=}\, \Vert ({\textbf {d}}- {\textbf {a}}) \hspace{-0.1em}\wedge \hspace{-0.1em}({\textbf {c}}- {\textbf {b}}) \Vert \) may be found in the Part II of the original reference [14]. It utilized classical three-dimensional vector algebra and basic trigonometry together with the areal laws of sines and cosines, which after several long pages of algebra yielded the results:
The non-negativity of the numerators in these expressions follows from the triangle inequality for Euclidean bivectors, namely \(\textbf{A}= \textbf{B}+ \textbf{C}\implies \Vert \textbf{A}\Vert \le \Vert \textbf{B}\Vert + \Vert \textbf{C}\Vert \), applied to Eq. (36) and its analogs with the other facets and interior faces; this constitutes a heretofore unremarked extension of the triangle inequality from line-bound vectors in the plane to plane-bound bivectors in space.Footnote 8
Instead of reformulating that derivation in \(\mathcal G_{4,1}\), here we give a conceptually simpler and rather more geometric (although unfortunately not much more compact) derivation of these formulae. This new derivation is based upon the consistency of two different expressions for the squared inter-vertex distances in the in-touch tetrahedron, where an example of the first may be found in Eq. (69) and the second are the three-dimensional analogs of the expressions given in Eq. (55) for those squared distances in the in-touch triangle. These analogs are based upon the congruence of the pairs of contact triangles meeting in a common edge as in Eq. (62) (see also Fig. 6), which implies the congruence of the six pairs of sub-tetrahedra given by
(whose disjoint union is \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{b}, \textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \)). The orthogonality of the lines spanned by the in-center and each in-touch point (cf. Fig 7) to the facet containing the latter ensures that these are all “right tetrahedra” at their respective in-touch points.
Taking \(\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{i}, \textbf{j}\, \hspace{0.5pt}\rrbracket \) as an example, Minkowsi’s identity (46) implies that
or, upon taking the inner-squares of both sides and noticing that the inner products of \((\textbf{c}\hspace{-1pt}-\hspace{-1pt} \textbf{j}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{d}\hspace{-1pt}-\hspace{-1pt} \textbf{j})\) with the other two terms on the right vanish as a consequence of \(\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{i}, \textbf{j}\hspace{1pt} \hspace{0.5pt}\rrbracket \) being a right tetrahedron at \(\textbf{j}\), that
By the areal law of cosines (38), however, the sum of the last three terms on the right is just \(\Vert (\textbf{i}- \textbf{j}) \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{c}- \textbf{d}) \Vert ^2\), which in turn is \(\Vert \hspace{0.5pt} \textbf{i}- \textbf{j}\hspace{0.5pt}\Vert ^2 \Vert \hspace{0.5pt} \textbf{c}- \textbf{d}\hspace{0.5pt}\Vert ^2\) by the orthogonality of to . Recalling that \(r = \Vert \hspace{0.5pt} \textbf{i}- \textbf{j}\hspace{0.5pt}\Vert \) and \(z \,{:}{=}\, \Vert ({\textbf {c}}- {\textbf {j}}) \hspace{-0.1em}\wedge \hspace{-0.1em}({\textbf {d}}- {\textbf {j}}) \Vert \), this leads to the following areal analog of the Pythagorean theorem (cf. Fig. 7),
with similar equations for all the remaining pairs of congruent right tetrahedra listed in Eq. (71).
Next, note that again as a consequence of \(\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{i}, \textbf{j}\hspace{1pt} \hspace{0.5pt}\rrbracket \) being a right tetrahedron at \(\textbf{j}\), its volume \(\varvec{|}\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{i}, \textbf{j}\hspace{1pt} \hspace{0.5pt}\rrbracket \varvec{|}\) is simply \(\varvec{|}\llbracket \hspace{0.5pt}\hspace{0.5pt}\textbf{i}, \textbf{j}\hspace{0.5pt} \hspace{0.5pt}\rrbracket \varvec{|}\, \varvec{|}\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{j}\hspace{0.5pt} \hspace{0.5pt}\rrbracket \varvec{|}\,/\, 3 = r\, z \,/\, 6\). Much as we saw for a triangle in Fig. 5, however, the plane bisects the dihedral angle at \(\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}\hspace{0.5pt}\rrbracket \) and is also the perpendicular bisector of the edge \(\llbracket \hspace{0.5pt}\textbf{j}, \textbf{k}\hspace{0.5pt}\rrbracket \) of the in-touch tetrahedron, so that the height \(h_\textsf{j;cdi}\) of \(\textbf{j}\) over \(\llbracket \hspace{0.5pt}\textbf{c}, \textbf{d}, \textbf{i}\hspace{1pt} \hspace{0.5pt}\rrbracket \) is \(\Vert \hspace{0.5pt} \textbf{j}- \textbf{k}\hspace{0.5pt}\Vert / 2\). From this together with Eq. (74), we obtain
The other five analogs of the formulae (56) for a triangle now follow from the permutational symmetry of the equations. Upon equating this new expression for \(\Vert \textbf{j}- \textbf{k}\Vert ^2\) with the one in Eq. (69), we find that
where \(Z \,{:}{=}\, (a \!+\! b \!+\! e) (a \!+\! b \!-\! e) / (2s)\) as above. Our next task is to show that this implies the natural parameter \(z = Z\).
To do so we employ the areal law of sines for free bivectors, first by taking squared norms on both sides of Eq. ( 39) with \(\textbf{X}= \textbf{c}- \textbf{d}\), \(\textbf{y}= \textbf{b}- \textbf{d}\), \(\textbf{z}= \textbf{a}- \textbf{d}\), and then using the second equality of Eq. (40) followed by the areal law of cosines (38) to substitute for the squared sine in the result, obtaining
where t, a, b and e are all defined as above. The obvious resemblance of the right-hand side to Heron’s formula means it is equal, as a polynomial, to a Cayley-Menger determinant in the areas a, b, e rather than the distances.Footnote 9
Finally, we use Eq. (77) plus \(r = t / s\) to eliminate \(r^2\, \Vert \hspace{0.5pt} \textbf{c}- \textbf{d}\hspace{0.5pt}\Vert ^2\) from the numerator on the right-hand side of Eq. (76), and we use the definition of Z to eliminate it from the denominator, which gives us
The result now follows on taking positive square roots of both sides; similar derivations can of course also be given for all of the formulae in Eq. (70).Footnote 10
Now that we have validated these explicit formulae for the natural parameters in terms of the areas of the seven faces of the tetrahedron, we can use Eq. (69) together with its obvious analogs for the other squared distances among the in-touch points to compute the volume of the in-touch tetrahedron much as was done for the area of the in-touch triangle in the previous section. Utilizing rescaled versions of the conformal in-touch points just like we did with the in-touch triangle (cf. Eqs. (58) and (59)) together with \(s/2 = u \!+\! v \!+\! w \!+\! x \!+\! y \!+\! z\) (cf. Eq. (65)), one can readily show that
Then, on substituting this result into the equation obtained by taking the squared norms of both sides of Eq. (68), we arrive at long last at our extension of Heron’s formula to tetrahedra (cf. Eq. (61)):
In a fashion that closely resembles how Ptolemy’s inequalities among the distances can be derived by factorizing \(-\Vert \varvec{a} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{b} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{c} \hspace{-0.1em}\wedge \hspace{-0.1em}\varvec{d}\hspace{1pt} \Vert ^2\) as a polynomial in the squared inter-vertex distances [4], the polynomial \(\Omega (u,v,w,x,y,z)\) can be written as a product of four factors each of which is linear in the products of the square roots \(\hat{u} \,{:}{=}\, \sqrt{u},\, \ldots \,,\, \hat{z} \,{:}{=}\,\sqrt{z}\) of “opposite” pairs of natural parameters [14, Part II, Remark 4]), specifically:
At least one of the last three factors must vanish whenever \(\Omega = 0\), and the similarity between those expressions and the Plücker identity among the Plücker coordinates (as in Eq. (29)), which defines the famous Klein quadric \(\mathcal K \,{:}{=}\, \{ {\textbf {p}}\in \mathbb R^6 \!\mid \! p_1 p_6 - p_2 p_5 + p_3 p_4 = 0 \}\), is more than a coincidence. In fact the zeros of \(\Omega \) are canonically homeomorphic to a quotient of the Klein quadric by the action of a discrete group of reflections isomorphic to \(\mathbb Z_2^4\) on the Plücker coordinates. Moreover, within that five-parameter manifold lives a three-parameter sub-manifold homeomorphic to the set of all quadruples of points in the special (area invariant) affine plane [14, Part III].
The most astounding thing, though, is that the points of this manifold are the limits of a sequence of affine transformations with diagonal matrices \(\textbf{Diag}(\, \varepsilon _i, 1/\varepsilon _i, 1/\varepsilon _i )\), applied to any “generic” (random) tetrahedron in \(\mathbb R^3\), with \(\varepsilon _i \rightarrow +\infty \) as \(i \rightarrow \infty \). Such a sequence acts asymptotically on the (exterior and interior) faces as an area-preserving affine squeeze \(\textbf{Diag}(\varepsilon , 1/\varepsilon )\) in the plane of each face, so that the areas in such a sequence of tetrahedra converge to well-defined finite values. In the process, however, the tetrahedra volumes go to zero while their inter-vertex distances approach infinity. The ratios of these distances nevertheless remain well defined, and any finite distances with those same ratios correspond to four points on the Euclidean line \(\mathbb R^1 \approx \mathbb R\). Non-generic tetrahedra with one or more edges perpendicular to the first coordinate axis generate sequences that also converge to zeros of \(\Omega \) wherein the corresponding distances are zero, and when the initial tetrahedron lies in any plane not perpendicular to that axis, the resulting zero corresponds to a quadruple in the special affine plane as above.
The generic zeros clearly correspond to limiting cases of the equi-volume sets of tetrahedra defined by skew pairs of line-bound vectors or to those defined by a plane-bound bivector together with an out-of-plane flat point, as described in Secs. 3 and 4, respectively. The difference is that, while the inter-vertex distances in those sets can also approach infinity, when the volume of the tetrahedra is non-zero this is accompanied by all but one of the seven areas also becoming infinite. In Appendix B of [14, Part IV], I conjectured that there is a canonical one-to-one mapping between the zeros of \(\Omega \) and planar Euclidean quadrilaterals, where the former are the limits of sequences of affine transformations as above and the latter are orthogonal projections of the tetrahedra onto the plane perpendicular to the first coordinate axis. Otherwise, the very existence of such “areally degenerate” tetrahedra begs the question: Is Flatland a bigger place than A. Square ever knew [29]?Footnote 11
7 Heron’s Formula for \(\varvec{n}\)-Dimensional Simplices
Based upon the above insights into the geometric interpretation of Heron’s formula and its extension to three dimensions, it is not hard to see how it generalizes to higher dimensions. Given an n-simplex \(\llbracket \hspace{0.5pt}\textbf{a}_0,\, \textbf{a}_1,\, \textbf{a}_2,\, \ldots ,\, \textbf{a}_n \hspace{0.5pt}\rrbracket \subset \mathbb R^n\), let \(a_k \,{:}{=}\, \varvec{|}\llbracket \hspace{0.5pt}{} {\textbf {a}}_0,\, \ldots ,\, \check{{\textbf {a}}}_k,\, \ldots ,\, {\textbf {a}}_n \hspace{0.5pt}\rrbracket \varvec{|}\) be the “hyper-area” of the facet opposite to \(\textbf{a}_k\) for \(k = 0,\, \ldots ,\, n\),Footnote 12\(s_n \, := \, (n-1)!\, \sum _k a_k\) be its normalized surface hyper-area, and \(t_n \,{:}{=}\, n!\, \varvec{|}\llbracket \hspace{0.5pt}\textbf{a}_0\, ,\ldots , \mathbf \, {\textbf {a}}_n \hspace{0.5pt}\rrbracket \varvec{|}\) be its normalized hyper-volume. Via considerations analogous to those for triangles and tetrahedra above, it is easily seen that the in-radius of the n-simplex is again \(r_n = t_n / s_n\). Also let \(\textbf{i}= (n\!-1\!)!\, \sum _k a_k\, \textbf{a}_k / s_n\) be the in-center of the n-simplex and \(\textbf{j}_k \in \llbracket \hspace{0.5pt}\textbf{a}_0,\, \ldots ,\, \check{\textbf{a}}_k,\, \ldots ,\, \textbf{a}_n \hspace{0.5pt}\rrbracket \) be the “in-touch points” at which its in-sphere intersects its facets. In perfect analogy to triangles and tetrahedra, each facet of the n-simplex is divided into n sub-simplices \(\llbracket \hspace{0.5pt}\textbf{a}_0,\, \ldots ,\, \check{\textbf{a}}_k,\, \ldots ,\, \check{\textbf{a}}_\ell ,\, \ldots ,\, \textbf{a}_n,\, \textbf{j}_k \hspace{0.5pt}\rrbracket \) (\(0 \le \ell \ne k \le n\)) by its in-touch point and, because \(\Vert \, \textbf{i}- \textbf{j}_k \Vert = r_n\) for all k, the Pythagorean theorem shows that the distances \(\Vert \, \textbf{j}_k - \textbf{a}_\ell \Vert \) (\(k \ne \ell \)) from each vertex to its n adjacent in-touch points are once more all equal. This in turn implies that these “contact \((n\!-\!1)\)-simplices” are again congruent whenever they intersect in a common \((n\!-\!2)\)-dimensional face \(\llbracket \hspace{0.5pt}\textbf{a}_0,\, \ldots ,\, \check{\textbf{a}}_k,\, \ldots ,\, \check{\textbf{a}}_\ell ,\, \ldots ,\, \textbf{a}_n \hspace{0.5pt}\rrbracket \). The n-dimensional natural parameters may now be defined as the hyper-areas of these pairs of contact \((n\!-\!1)\)-simplices, now times \((n\!-\!1)!\) to normalize them. Since there are \(n(n\!+\!1)\) such \((n\!-\!1)\)-simplices, there are \(n(n\!+\!1)/2\) natural parameters \(\{\, u_{k\ell } \!=\! u_{\ell k},\, u_{kk} \!=\! 0 \mid k, \ell = 0, \ldots , n \,\}\) in all. Finally, because these natural parameters are the normalized hyper-areas of the members of a partition of each facet, they satisfy \((n\!-\!1)!\, a_k = \sum _\ell u_{k\ell }\) for \(k = 0,\ldots ,n\) and \(s_n = 2\, \sum _{k<\ell } u_{k\ell \,}\).
Given that for \(n = 2\) and 3 the polynomials \(\Omega _n \,{:}{=}\, \Omega \) as above are \((-1)^n\) times the determinant of the matrix of natural parameters, the imputed extension of Heron’s formula to higher dimensions is simply
where \(\Omega _n \,{:}{=}\, (-1)^n\, \text {det}({\textbf {U}}_n) \ge 0\) with \({\textbf {U}}_n \,{:}{=}\, [u_{k\ell }]_{k,\ell =0}^{\,n,n}\) as the matrix of natural parameters [14, Part II, Conjecture 11]. This extension is almost too elegant to doubt, but the question of how best to prove it in full generality remains. In this section we will simply outline the essential formulae needed to derive this extension by an approach which directly generalizes that taken above for \(n = 2\) and 3 dimensions, without taking up the (considerable) space that would be needed for a complete and formal proof.Footnote 13
The determinant multiplication formula shows that the oriented hyper-volume of any n-simplex \(\llbracket \hspace{0.5pt}\, \textbf{b}_0, \textbf{b}_1,\, \ldots ,\, \textbf{b}_n \hspace{0.5pt}\rrbracket \) with vertices given by the barycentric sums \(\textbf{b}_k = \sum _\ell \, \beta _{k\ell }\, \textbf{a}_\ell \) (\(\sum _\ell \beta _{k\ell } = 1\), \(k = 0, \ldots , n\)) of another’s will be related to that of the other by the determinant of the matrix of coefficients \(\textrm{det}[\beta _{k\ell }]\). Thus in analogy to Eq. (68) the conformal oriented hyper-volume of the in-touch n-simplex \(\llbracket \hspace{0.5pt}\, \textbf{j}_0,\, \textbf{j}_1,\, \ldots ,\, \textbf{j}_n \hspace{0.5pt}\rrbracket \) with \(\textbf{j}_k = \sum _\ell u_{k\ell }\, \textbf{a}_\ell / ((n\!-1\!)!\, a_k)\) for all k is related to that of the original as
where \(\varvec{j}_0,\, \ldots ,\, \varvec{j}_n, \varvec{a}_0,\, \ldots \,, \varvec{a}_n \in \mathcal G_{n+1,1}\) are the corresponding conformal points as usual. Accordingly, we wish to express the squared volume of the in-touch n-simplex as a Cayley-Menger determinant, which requires us to find the squared distances between the in-touch points via the higher-dimensional analogs of Eq. (69). This in turn requires a hyper-areal generalization of the areal law of cosines (38), which may be obtained from the \((n\hspace{-1pt}-\!1)\)-vector analog of the relation among bivectors in Eq. (36), as exemplified by
In analogy with Eq. (23), the norm of the left-hand side of this equation can be shown to be \((n-2)!\, 2^{n-1}\) times the hyper-area of the \((n\hspace{-0.5pt}-\hspace{-1pt}1)\)-dimensional convex polytope \(\llbracket \hspace{0.5pt}\textbf{b}_0,\, \ldots ,\, \textbf{b}_{n\!-\!2}, \textbf{b}_0',\, \ldots ,\, \textbf{b}_{n\!-\!2}' \hspace{0.5pt}\rrbracket \) spanned by the midpoints \({\textbf {b}}_k \,{:}{=}\, ({\textbf {a}}_k + {\textbf {a}}_{n\!-\!1}) / 2\) and \({\textbf {b}}_{k}' \,{:}{=}\, ({\textbf {a}}_k + {\textbf {a}}_{n}) / 2\) of the line segments connecting \(\{ \textbf{a}_0,\, \ldots ,\, \textbf{a}_{n\!-\!2} \}\) with \(\{ \textbf{a}_{n\!-\!1},\, \textbf{a}_n \}\) for \(k = 0, \ldots , n-2\). Because they are also equal to the intersection of the hyper-plane through those midpoints with the n-simplex itself, such polytopes are known as the \((n\!-\!1,2)\)-medial sections of the n-simplex. A general formula for these hyper-areas in terms of the edge lengths of the original simplex is available [30].Footnote 14
Just as we found with tetrahedra, we obtain an instance of the hyper-areal law of cosines by taking squared norms of both sides of Eq. (84) and rearranging. More generally, letting \(\textbf{c}_k \,{:}{=}\, \textbf{a}_k - \textbf{a}_0\) for \(k = 1, \ldots , n\), \(\textbf{C}_k \,{:}{=}\, \textbf{c}_1 \hspace{-0.1em}\wedge \hspace{-0.1em}\cdots \hspace{-0.1em}\wedge \hspace{-0.1em}\check{\textbf{c}}_k \hspace{-0.1em}\wedge \hspace{-0.1em}\cdots \hspace{-0.1em}\wedge \hspace{-0.1em}\textbf{c}_n\) and \(\textbf{C}_0 \,{:}{=}\, (\textbf{a}_2 - \textbf{a}_1) \hspace{-0.1em}\wedge \hspace{-0.1em}\cdots \hspace{-0.1em}\wedge \hspace{-0.1em}(\textbf{a}_n - \textbf{a}_1)\), this hyper-areal law of cosines may be written as
where \(\varphi _{k\ell }\) is the (smaller) angle between the hyper-planes represented by \(\textbf{C}_k\), \(\textbf{C}_\ell \) and \(\textbf{D}_{k\ell } \,{:}{=}\, \textbf{C}_k - \textbf{C}_\ell \) is an \((n\!-\!1)\)-vector representing the hyper-plane containing an \((n\!-\!1,2)\)-medial section of the n-simplex (\(0 \le k < \ell \le n\)). Noting that the barycentric sum for the in-center is \(\textbf{i}= \sum _k c_k\, \textbf{a}_k \,/\, s_n\) with \(c_k \,{:}{=}\, \Vert \textbf{C}_k \Vert = (n\!-1\!)!\, a_k\) and \(s_n \,{:}{=}\, \sum _k c_k\) as above, the higher-dimensional analog of Eq. (69) for the squared distances between in-touch points is
where \(d_{k\ell } \,{:}{=}\, \Vert \textbf{D}_{k\ell } \Vert \), \(\varphi _{k\ell }\) is the angle between the subspaces of \(\textbf{C}_k\), \(\textbf{C}_\ell \), and \(U_{k\ell } \,{:}{=}\, (c_k + c_\ell + d_{k\ell }) (c_k + c_\ell - d_{k\ell }) / (2\, s_n)\). What remains to be shown, just as with the tetrahedron, is that \(U_{k\ell } = u_{k\ell }\), the natural parameter given by the common normalized hyper-area of the two contact \((n\!-\!1)\)-simplices adjacent to the \((n\!-\!2)\)-face \(\llbracket \hspace{0.5pt}\textbf{a}_0,\, \ldots ,\, \check{\textbf{a}}_k,\, \ldots ,\, \check{\textbf{a}}_\ell ,\, \ldots ,\, \textbf{a}_n \hspace{0.5pt}\rrbracket \) of the original simplex.
This may be done using the n-dimensional generalization of Eq. (75),
which can be obtained via geometric considerations entirely analogous to those used in the \(n = 2\) and \(n = 3\) cases above, together with the generalization of Eq. (77) given by:
The rest of the proof is equally straightforward and likewise perfectly analogous to those given above for the triangle and the tetrahedron: Since \(c_k = \sum _\ell u_{k\ell }\) for \(k = 0, \ldots , n\), the volume of the in-touch n-simplex is given (in the n-dimensional conformal algebra \(\mathcal G_{n\!+\!1,1}\)) by
which generalizes Eq. (79). From that together with Eq. (83), we find that the volume of the original n-simplex satisfies the generalization of Eq. (80),
which in turn is equivalent to our extension (82) of Heron’s formula to n-dimensional simplices for all \(n > 2\).
It will be interesting to see if the zeros of \(\Omega _n\) with \(n > 3\) also exhibit infinite edge lengths and/or other divergent facial contents, and what the dimension of the subspace of \(\mathbb R^n\) that such degenerate n-simplices live in may be. In four dimensions, for example, there are ten natural parameters \(u_{k\ell } = u_{\ell k}\) (\(0 \le k \ne \ell \le 4\)), the sums of which will determine the volumes (hyper-areas) of the five tetrahedral facets \(c_k = \sum _\ell u_{k\ell }\) as above.Footnote 15 A 4-simplex also has ten (3, 2)-medial sections, which are not parallelopipeds but rather triangular prisms with parallel sides of equal length and parallel congruent triangles at their two ends (which are not necessarily perpendicular to the prism’s axis) [30].Footnote 16 Denoting their volumes (times 16) by \(d_{k\ell }\) as above, we see that \(d_{k\ell }^2 = (c_k + c_\ell )^2 -\, 2\, s_4\, u_{k\ell }\) with \(s_4 = 2\, \sum _{k<\ell } u_{k\ell }\). It follows that the (normalized) volumes \(u_{k\ell }\) of the contact tetrahedra determine those of the exterior and interior faces \(c_k\) and \(d_{k\ell }\) together, and vice versa. In addition to these volumes there are also ten areas for the 2-faces of the 4-simplex, as well as ten inter-vertex distances. Knowledge of any ten of those quantities must, in the generic case, reduce the possible values of any others to at most a (small) finite number of values, but these relations involve polynomials of much higher degrees and divergences are expected, presumably even in cases for which the hyper-volume of the 4-simplex is non-zero.Footnote 17
Despite the many similarities between the tetrahedron and higher dimensional n-simplices, one property that does not seem to generalize is the factorization of \(\Omega _3\) as a polynomial in the square roots of the natural parameters noted in Eq. (81); this is likely related to the fact that in \(n = 3\) dimensions there are only three \((n-1,2)\)-medial sections (parallelograms) and not \(n (n+1) / 2\) (i.e. six for \(n = 3\)) of them as in higher dimensions. As a result, the structure of the locus of the equation \(\Omega _n = 0\), which was analyzed in depth for \(n = 3\) in Ref. [14, Part III] making essential use of this factorization, appears to be much more complicated in higher dimensions. Indeed it is likely that the sophisticated machinery of algebraic geometry in projective spaces will be needed for such a study, although that machinery generally presumes that one is working over the field of complex numbers — without any of the geometric interpretations geometric algebra can provide; see e.g. Refs. [23, 24, 28]. It seems likely, however, that geometric algebra could significantly enhance that branch of mathematics as well.
Data availability
No data is associated with this article.
Notes
The term “affine sum” is widely used as a synonym for what I’m calling a barycentric sum; this defines an affine (sub)space known as the affine span of the points \(\textbf{a}_i\) (where N is not necessarily \(n\!+\!1\)). Instead, I’m using “affine sum” to mean a weighted sum \(\sum _i \delta _i\, \textbf{a}_i\) with \(\sum _i \delta _i = 0\), which is a free vector in (or “point at infinity” of) that affine (sub)space. The points \(\{ \textbf{a}_i \}\) are affinely independent if \(\sum _i \delta _i\, \textbf{a}_i = 0\) implies \(\delta _j = 0\) for \(j = 1, \ldots , N\), in which case \(N \le n\!+\!1\). A “convex sum,” in contrast, is the convex polytope \(\llbracket \hspace{0.5pt}{} {\textbf {a}}_i \in \mathbb R^n \mid i = 1, \ldots , N \hspace{0.5pt}\rrbracket \,{:}{=}\,\big \{ \sum _i \beta _i\, {\textbf {a}}_i \mid \sum _i \beta _i = 1 ~\text{ and }~ \beta _j \ge 0 ~\text{ for } \text{ all }~ j \in \{1, \ldots , N \} \big \}\), also known as the convex hull of the points \(\{ \textbf{a}_i \}\); this polytope is an n-simplex when \(N = n\!+\!1\) and the points are affinely independent.
As an exercise, the reader may wish to show that 4 times the plane-bound bivector of the medial parallelogram with \((\textbf{b}- \textbf{a}) \wedge (\textbf{d}- \textbf{c}) \,/\, 4\) as its free bivector is (up to an arbitrary sign) \(\varvec{N}\, (\textbf{b}- \textbf{a}) \wedge (\textbf{d}- \textbf{c}) + \varvec{n}_\infty \, \textbf{g}\wedge (\textbf{b}- \textbf{a}) \wedge (\textbf{d}- \textbf{c})\), where \(\textbf{g}\) is any point in the plane spanned by the midpoints of the edges \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{c}\hspace{0.5pt}\rrbracket \), \(\llbracket \hspace{0.5pt}\textbf{a}, \textbf{d}\hspace{0.5pt}\rrbracket \), \(\llbracket \hspace{0.5pt}\textbf{b}, \textbf{c}\hspace{0.5pt}\rrbracket \), \(\llbracket \hspace{0.5pt}\textbf{b}, \textbf{d}\hspace{0.5pt}\rrbracket \), most parsimoniously the centroid \(\textbf{g}= (\textbf{a}\!+\! \textbf{b}\!+\! \textbf{c}\!+\! \textbf{d})/4\).
As observed by Heron (aka Hero) of Alexandria himself in the \(1^\text {st}\) century AD [26].
Rumored to have also been known to Piero della Francesca in the \(15^\text {th}\) century.
This practice of “normalizing” (hyper-)volumes by including the factorial in their definition simplifies their inter-relationships in higher dimensions, and is widely used for example in studying the combinatorics of lattice polytopes, see e.g. Ref. [2].
All in all, there are 18 such tetrahedron inequalities consisting of six triples, an example of which is \(e \le a+b\), \(a \le b+e\), \(b \le a+e\) [14, Part I].
Adventurous readers are invited to find a blade in the conformal geometric algebra with a negative inner square that equals such a Cayley-Menger determinant.
The denominator in Eq. (78), divided by \(s^2\), is a geometric parameter \(\tilde{z}\) that is “inverse” (or complementary) to z in the sense that \(z\tilde{z} = r^2\, \Vert \hspace{0.5pt} \textbf{c}- \textbf{d}\hspace{0.5pt}\Vert ^2\). All in all, there are a total of six such inverse natural parameters \(\tilde{u},\, \ldots ,\, \tilde{z}\), which can be shown to be rational functions of the natural parameters [14, Part II]. Since they do not seem to determine the volume in any simple way by themselves, they play no essential role in the present saga.
Despite considerable effort, I have found only one rather dated paper that even considers a similar phenomenon, which proved that the set of n-simplices all having facets with the same hyper-areas contains members with arbitrarily small hyper-volumes [19].
As is common practice, “\(\textbf{a}_0,\, \ldots ,\, \check{\textbf{a}}_k,\, \ldots ,\, \textbf{a}_n\)” means that the k-th vertex is omitted from the sequence \(\textbf{a}_0,\ldots ,\textbf{a}_n\).
A formal proof will be published separately once the author is convinced he has found the most concise exposition possible, based on much the same approach.
That formula involves an \((n\!+\!3)\)-by-\((n\!+\!3)\) analog of Cayley-Menger determinants, which obviously cannot be represented as the inner square of any blade in \(\mathcal G_{n+1,1}\). Conformal geometric algebra does however provide a nice representation of these hyper-areas which generalizes that given for \(n = 3\) in Eq. (22), namely \(\Vert \textbf{D}_{k\ell } \Vert ^2 = \Vert \langle \varvec{L}_{k\ell } \tilde{\varvec{M}}_{k\ell } \rangle _{n-1} \Vert ^2 = \Vert \varvec{L}_{k\ell } \varvec{M}_{k\ell } + (-1)^n \varvec{M}_{k\ell } \varvec{L}_{k\ell } \Vert ^2 /\, 4\), where \(\varvec{L}_{k\ell }\) and \(\varvec{M}_{k\ell }\) are the blades of grade 3 and n respectively given by \(\varvec{L}_{k\ell } \,{:}{=}\, \varvec{n}_\infty \wedge \varvec{a}_k \wedge \varvec{a}_\ell \) and \(\varvec{M}_{k\ell } \,{:}{=}\, \varvec{n}_\infty \wedge \varvec{a}_0 \wedge \cdots \wedge \check{\varvec{a}}_k \wedge \cdots \wedge \check{\varvec{a}}_\ell \wedge \cdots \wedge \varvec{a}_n\).
An explicit numerical example on a random 4-simplex is worked out in full in the Jupyter notebook included in this paper’s Supplementary Materials, which uses the SageMath computer algebra system.
A parallelopiped with twice the volume may nonetheless be obtained by reflecting a copy of the prism in the mid-point of any one of its parallelogram faces.
In contrast to the \(n = 3\) case considered at the end of section 6, for \(n = 4\) there exist affine squeeze transformations \(\textbf{Diag}(\varepsilon ,\varepsilon ,1/\varepsilon ,1/\varepsilon )\) with \(\varepsilon > 0\) that will, in the limit as \(\varepsilon \rightarrow \infty \), map any generic 4-simplex in \(\mathbb R^4\) to a “degenerate” 4-simplex in the plane of the first two coordinate axes having the same 4-volume, but with infinite inter-vertex distances.
References
Altshiller-Court, N.: College Geometry: An Introduction to the Modern Geometry of the Triangle and the Circle (2nd Ed.). Barnes & Noble, Inc., New York (1952). (Reprinted by Dover Publ. Co., New York, NY, USA in 1980 and 2007)
Balletti, G.: Enumeration of lattice polytopes by their volume. Discrete Comput. Geom. 65, 1087–1122 (2021). https://doi.org/10.1007/s00454-020-00187-y
Beardon, A.F., Stephenson, P.: The Heron parameters of a triangle. Math. Gazette 99(545), 205–212 (2015). https://doi.org/10.1017/mag.2015.27
Blumenthal, L.M.: Theory and Applications of Distance Geometry. Oxford University Press, Oxford (1953). (Reprinted by Chelsea Publ. Co., Bronx, NY, USA in 1970)
Crapo, H.: An algebra of pieces of space—Hermann Grassmann to Gian–Carlo Rota. In: From Combinatorics to Philosophy, pp. 61–90. Springer, New York (2009). https://doi.org/10.1007/978-0-387-88753-1_5
Crippen, G.M., Havel, T.F.: Stable calculation of coordinates from distance information. Acta Cryst. A34, 282–284 (1978). https://doi.org/10.1107/S0567739478000522
Dorst, L., Fontijne, D., Mann, S.: The conformal model: operational Euclidean geometry. In: Geometric Algebra for Computer Science (Revised Edition): An Object-Oriented Approach to Geometry, chap. 13. Elsevier, Amsterdam (2009). https://doi.org/10.1016/B978-0-12-374942-0.00018-X
Dress, A.W.M., Havel, T.F.: Distance geometry and geometric algebra. Found. Phys. 23, 1357–1374 (1993). https://doi.org/10.1007/bf01883783
Fearnley-Sander, D.: Hermann Grassmann and the creation of linear algebra. Am. Math. Mon. 86(10), 809–817 (1979). https://doi.org/10.1080/00029890.1979.11994921
Fearnley-Sander, D.: The royal road to geometry. Math. Mag. 53, 259–268 (1980). https://doi.org/10.1080/0025570x.1980.11976868
Forder, H.G.: The Calculus of Extension. Cambridge University Press, Cambridge (1941). (Reprinted by Chelsea Publ. Co., 1960)
Gidea, M., Niculescu, C.P.: A brief account on Lagrange’s algebraic identity. Math. Intell. 34(3), 55–61 (2012). https://doi.org/10.1007/s00283-012-9305-0
Havel, T.F.: Geometric algebra and Möbius sphere geometry as a basis for Euclidean invariant theory. In: N.L. White (ed.) Invariant Methods in Discrete and Computational Geometry: Proc. Curaçao Conf., 13–17 June, 1994, pp. 245–256. Springer, Netherlands (1995). https://doi.org/10.1007/978-94-015-8402-9_11
Havel, T.F.: An extension of Heron’s formula to tetrahedra, and the projective nature of its zeros. In press: The \(\Pi \)ME Journal (2023). (Cornell Preprint Archive arXiv:2204.08089. https://doi.org/10.48550/arXiv.2204.08089)
Hestenes, D.: Grassmann’s vision. In: G. Schubring (ed.) Hermann Günther Graßmann (1809–1877): Visionary Mathematician, Scientist and Neohumanist Scholar, pp. 243–254. Springer, Netherlands (1996). https://doi.org/10.1007/978-94-015-8753-2_20
Hestenes, D.: New tools for computational geometry and rejuvenation of screw theory. In: E. Bayro-Corrochano, G. Scheuermann (eds.) Geometric Algebra Computing in Engineering and Computer Science, pp. 3–33. Springer, London (2010). https://doi.org/10.1007/978-1-84996-108-0_1
Hestenes, D.: Grassmann’s legacy. In: H.J. Petsche, A.C. Lewis, J. Liesen, S. Russ (eds.) From Past to Future: Graßmann’s Work in Context, pp. 243–260. Birkhäuser, Basel (2011). https://doi.org/10.1007/978-3-0346-0405-5_22
Hestenes, D., Sobczyk, G.: Clifford Algebra to Geometric Calculus. D. Reidel Publ. Co., New York (1984). https://doi.org/10.1007/978-94-009-6292-7
Iyengar, K.S.K., Iyengar, K.V.: On a problem relating to a tetrahedron. Proc. Indian Acad. Sci. A7, 269–278 (1938). https://doi.org/10.1007/bf03045489
Johnson, R.A.: Modern Geometry: An Elementary Treatise on the Geometry of the Triangle and the Circle. Houghton Mifflin, Inc., Boston (1929). (Reprinted by Dover Publ. Co., New York, NY, USA in 1960 and 2007)
Lavor, C., Souza, M., Aragón, J.L.: Orthogonality of isometries in the conformal model of the 3D space. Graph. Models 114(101100), 1–7 (2021). https://doi.org/10.1016/j.gmod.2021.101100
MacDonald, A.: Linear and Geometric Algebra. Alan MacDonald, Luther College, Decorah, IA, USA (2010)
Michalek, M., Sturmfels, B.: Invitation to Nonlinear Algebra. Am. Math. Soc., Providence, RI, USA (2021)
Miller, E., Sturmfels, B.: Combinatorial Commutative Algebra. Springer, Berlin-Heidelberg-New York (2005). https://doi.org/10.1007/b138602
Oldknow, A.: The Euler–Gergonne–Soddy triangle of a triangle. Am. Math. Mon. 103, 319–329 (1996). https://doi.org/10.2307/2975188
Scriba, C.J., Schreiber, P.: 5000 Years of Geometry, chap. 2.5.1, pp. 84–86. Birkhäuser (now Springer-Verlag), Berlin-Heidelberg-New York (2015). https://doi.org/10.1007/978-3-0348-0898-9
Sobczyk, G.: Notes on Plücker’s relations in geometric algebra. Adv. Math. 363, 106959 (2020). https://doi.org/10.1016/j.aim.2019.106959
Sottile, F.: Real Solutions to Equations from Geometry. Am. Math. Soc., Providence (2011). https://doi.org/10.1090/ulect/057
Stewart, I.: The Annotated Flatland, A Romance of Many Dimensions by Edwin A. Abbott. Perseus Book Group, Philadelphia (2002). (Paperback version published 2008)
Talata, I.: A volume formula for medial sections of simplices. Discrete Comput. Geom. 30, 343–353 (2003). https://doi.org/10.1007/s00454-003-0015-6
Funding
No funds, grants, or other support was received.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author has no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the Topical Collection for the First International Conference on Advanced Computational Applications of Geometric Algebra (ICACGA) held in Denver, Colorado, USA, 2–5 October 2022, edited by David DaSilva, Eckhard Hitzer and Dietmar Hildenbrand.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Havel, T.F. Heron’s Formula in Higher Dimensions. Adv. Appl. Clifford Algebras 34, 9 (2024). https://doi.org/10.1007/s00006-023-01305-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00006-023-01305-8