Shape Analysis of Surfaces Using General Elastic Metrics

Su, Zhe; Bauer, Martin; Preston, Stephen C.; Laga, Hamid; Klassen, Eric

doi:10.1007/s10851-020-00959-4

Shape Analysis of Surfaces Using General Elastic Metrics

Published: 14 May 2020

Volume 62, pages 1087–1106, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Mathematical Imaging and Vision Aims and scope Submit manuscript

Shape Analysis of Surfaces Using General Elastic Metrics

Download PDF

Zhe Su¹,
Martin Bauer¹,
Stephen C. Preston²,
Hamid Laga^3,4 &
…
Eric Klassen¹

435 Accesses
13 Citations
Explore all metrics

Abstract

In this article, we introduce a family of elastic metrics on the space of parametrized surfaces in 3D space using a corresponding family of metrics on the space of vector-valued one-forms. We provide a numerical framework for the computation of geodesics with respect to these metrics. The family of metrics is invariant under rigid motions and reparametrizations; hence, it induces a metric on the “shape space” of surfaces. This new class of metrics generalizes a previously studied family of elastic metrics and includes in particular the Square Root Normal Field (SRNF) metric, which has been proven successful in various applications. We demonstrate our framework by showing several examples of geodesics and compare our results with earlier results obtained from the SRNF framework.

Elastic Metrics on Spaces of Euclidean Curves: Theory and Algorithms

Article 25 April 2024

Elastic Shape Analysis of Surfaces with Second-Order Sobolev Metrics: A Comprehensive Numerical Framework

Article Open access 21 January 2023

Elastic Shape Analysis of Surfaces and Images

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Shape analysis of surfaces in ${\mathbb {R}}^3$ has been motivated by many applications in bioinformatics, computer graphics and medical imaging, see, e.g., [2, 14, 16, 19, 22, 32]. In most applications, the actual parametrization of the surfaces under consideration is unknown and one is only able to observe the “shape” of the object, i.e., a priori the point correspondences between the surfaces are unknown and should be an output of the performed analysis. Furthermore, we will often identify surfaces that only differ by a rigid motion. Thus, we define the shape space of surfaces as the quotient space of all parametrized surfaces modulo the group of reparametrizations and/or the group of rigid motions. One goal in shape analysis is to quantify the differences and find the optimal deformations between the given objects; see Fig. 1 for two examples of optimal deformations between distinct surfaces.

The main challenge in the context of shape analysis of surfaces consists in the registration problem, i.e., finding the (optimal) point correspondences between distinct surfaces, which can then be used as the basis for the resulting statistical analysis. In the previous work, the correspondence problem has often been solved in a preprocessing step, which is then followed by an independent statistical analysis of the resulting parametrized surfaces. This approach can yield several undesirable consequences on the statistical analysis, see, e.g., [29].

The goal of elastic shape analysis is to formulate this problem in a unified framework: Using a reparametrization invariant metric on the space of all parametrized surfaces, one then studies the induced Riemannian metric on the quotient space. Using this approach, the point correspondences and the resulting statistical analysis can be performed in a consistent way.

In the past years, several metrics and frameworks have been proposed as potential approaches to this goal, see, e.g., [4, 18, 24, 25, 29, 33]. In particular, a class of elastic metrics has been proposed in [17], which is defined as a weighted sum of three components that measure the differences in shearing, stretching and bending of the surface. This family of metrics is actually a subfamily of the general class of reparametrization invariant Sobolev metrics, as studied in [4,5,6]. It is also a natural generalization of the family of elastic $G^{a,b}$-metrics on the space of curves [26], which has been proven efficient and successful in numerous shape analysis applications [10, 28,29,30,31, 34,35,36].

To obtain a numerically efficient representation, Srivastava et al. [28] introduced the so-called Square Root Velocity Function (SRVF) for comparing curves. In this framework, the space of curves endowed with the elastic metric for a particular choice of coefficients is isometric to an $L^2$-space, which makes the computation of geodesics extremely easy and efficient. Motivated by this progress, Jermyn et al. [18] introduced the Square Root Normal Field (SRNF) representation for elastic shape analysis of surfaces and showed that the $L^2$-metric on the space of SRNFs corresponds to one member of a more general class of elastic metrics on the space of surfaces. While it is computationally efficient, there are several drawbacks to this approach: The SRNF metric only consists of the last two terms of the general elastic metric for surfaces and is thus highly degenerate; i.e., there exists a high-dimensional space of deformations that has no cost in this framework.^{Footnote 1} Furthermore, the SRNF map is neither injective nor surjective, and its image is not fully understood. In consequence, there exists no analytic formula for geodesics in the image space and geodesics are usually approximated by numerically inverting the straight line between the given SRNFs, where each inversion is calculated as the solution to an optimization problem [25].

Contributions of this article The purpose of the present article is to introduce a numerical framework for the computation of the geodesic initial and boundary value problem with respect to a family of metrics that contains the general elastic metric as a special case. The framework complements [7] which defined, using vector-valued one-forms, a metric on the space of surfaces that is invariant under rigid motions and reparametrizations. It does not require a numerical inversion of the SRNF map and thus overcomes some of the aforementioned difficulties. Furthermore, this framework will allow us in the future to choose the constants of the metric in a data-driven way, which has potential importance in many applications. See [3, 23] for related considerations regarding the choice of constants for the elastic metric on the space of curves.

2 Mathematical Framework and Background

In this section, we will give the formal definition of the space of shapes and describe the general elastic metric. Then, we will introduce a new representation for the elastic metric using vector valued one-forms, which will still allow us to obtain an efficient discretization of the geodesic boundary value problem.

From here on, we will model a surface as an immersion f from a model space M into ${\mathbb {R}}^3$, i.e., a smooth map from M to ${\mathbb {R}}^3$ that has an injective tangent mapping. Here, M is a two-dimensional compact manifold encoding the topology of the objects under consideration. Typically, choices of M include the two-sphere $M=S^2$ or the sheet $M=[0,1]^2$.

Denote by ${\text {Imm}}(M, {\mathbb {R}}^3)$ the space of all immersions. To define the space of shapes, we now consider the actions of the group of rigid motions and the group of diffeomorphisms on ${\text {Imm}}(M, {\mathbb {R}}^3)$. The group of rigid motions is given by the semidirect product of the group of rotations and the group of translations, i.e., ${\text {SO}}(3)\ltimes {\mathbb {R}}^3$, where ${\text {SO}}(3)$ is the set of all rotation matrices. It acts on ${\text {Imm}}(M, {\mathbb {R}}^3)$ as follows:

$$\begin{aligned} \left( {\text {SO}}(3)\ltimes {\mathbb {R}}^3\right) \times {\text {Imm}}(M, {\mathbb {R}}^3)&\rightarrow {\text {Imm}}(M, {\mathbb {R}}^3) \\ \left( (R,v), f\right)&\mapsto Rf+v. \end{aligned}$$

Denote by ${\text {Diff}}_+(M)$ the group of diffeomorphisms that preserve the orientation of M. The action of ${\text {Diff}}_+(M)$ on ${\text {Imm}}(M, {\mathbb {R}}^3)$ is given by composition from the right:

$$\begin{aligned} {\text {Imm}}(M, {\mathbb {R}}^3)\times {\text {Diff}}_+(M)&\rightarrow {\text {Imm}}(M, {\mathbb {R}}^3) \\ (f, \gamma )&\mapsto f\circ \gamma . \end{aligned}$$

We say that two immersions $f_1$ and $f_2$ have the same shape if they are in the same orbit of the action of ${\text {Diff}}_+(M)$, or both actions depending on whether we want to mod out rigid motions. The space of shapes can then be defined as the quotient space:

$$\begin{aligned} {\mathcal {S}}(M,{\mathbb {R}}^3) = {\text {Imm}}(M, {\mathbb {R}}^3)/{\mathcal {G}}, \end{aligned}$$

where ${\mathcal {G}} = {\text {Diff}}_+(M)$ or ${\mathcal {G}} = {\text {Diff}}_+(M)\times ({\text {SO}}(3)\ltimes {\mathbb {R}}^3) $.

This quotient space has some mild singularities and does not carry the structure of a smooth manifold but only of an infinite-dimensional orbifold [11]. However, for the purpose of this article, we can ignore these subtleties and assume that we are always working away from the singularities, which allows us to treat ${\mathcal {S}}(M,{\mathbb {R}}^3)$ as an infinite-dimensional manifold.

By endowing the space of immersions ${\text {Imm}}(M,{\mathbb {R}}^3)$ with a Riemannian metric that is invariant under the actions of ${\text {SO}}(3)\ltimes {\mathbb {R}}^3$ and ${\text {Diff}}_+(M)$, the space of shapes ${\mathcal {S}}(M, {\mathbb {R}}^3)$ becomes a Riemannian manifold (orbifold), where the metric is induced by the Riemannian metric on ${\text {Imm}}(M, {\mathbb {R}}^3)$.

In the following, we will denote by ${\text {dist}}_{{\text {Imm}}}$ the geodesic distance function of a Riemannian metric on the space of immersions ${\text {Imm}}(M, {\mathbb {R}}^3)$ and by [f] the equivalence class of f under the action of ${\mathcal {G}}$. Given two surfaces $f_1$ and $f_2$, we can define the distance between $[f_1]$ and $[f_2]$ as the infimum of the distance between the orbits of $f_1$ and $f_2$ under the action of ${\mathcal {G}}$. For example, the distance function on the space of unparametrized surfaces ${\mathcal {S}} = {\text {Imm}}(M, {\mathbb {R}}^3)/{\text {Diff}}_+(M)$ can be defined as follows:

$$\begin{aligned} {\text {dist}}_{{\mathcal {S}}}([f_1], [f_2]) = \inf _{\gamma \in {\text {Diff}}_+(M)}{\text {dist}}_{{\text {Imm}}}(f_1\circ \gamma , f_2). \end{aligned}$$

We will use this induced distance as our measure for comparing unparametrized surfaces. Given two parametrized surfaces, to measure the similarity between them, we will need to find the optimal reparametrization in ${\text {Diff}}_+(M)$ that realizes the infimum. If we also want to mod out rigid motions and find the distance between two elements in the space of unparametrized surfaces modulo rigid motions ${\text {Imm}}(M, {\mathbb {R}}^3)/\left( {\text {Diff}}_+(M)\times {\text {SO}}(3)\ltimes {\mathbb {R}}^3\right) $, we will need to solve a joint optimization problem of finding the best reparametrization, rotation and translation.

2.1 The General Elastic Metric and the SRNF Framework

Jermyn et al. introduced in [18] the general elastic metric which has the desired invariance properties under shape-preserving deformations. To define this metric, we first introduce a transformation that maps an immersion onto its induced surface metric and normal vector field:

$$\begin{aligned} {\text {Imm}}(M,{\mathbb {R}}^3)&\mapsto {\text {Met}}(M)\times C^{\infty }(M,{\mathbb {R}}^3) \\ f&\rightarrow \left( g:=g^f,n:=n^f\right) \;, \end{aligned}$$

where $n^f$ is the unit normal vector field to the surface f, which is given in local coordinates by

$$\begin{aligned} n=\frac{f_x\times f_y}{|f_x\times f_y|} \end{aligned}$$

and where the surface metric is given by

$$\begin{aligned} g=f^*\langle .,.\rangle _{{\mathbb {R}}^3}=\langle Tf.,Tf. \rangle _{{\mathbb {R}}^3}. \end{aligned}$$

It is classical result in Riemannian geometry that any surface can be reconstructed uniquely by these two quantities [1]. Thus, this representation allows one to define a Riemannian metric on the space of immersions by describing it on the image ${\text {Met}}(M)\times C^{\infty }(M,\mathbb R^3)$. The general elastic metric as introduced in [18] is defined by:

$$\begin{aligned}&G_{g,n}((\delta g,\delta n),(\delta g, \delta n)) \nonumber \\&\quad = A\int _M {\text {tr}} (g^{-1} \delta g g^{-1}\delta g) \mu _g + B\int _M {\text {tr}} (g^{-1} \delta g)^2 \mu _g\nonumber \\&\qquad + C\int _M \langle \delta n, \delta n \rangle _{{\mathbb {R}}^3} \mu _g\; \end{aligned}$$

(1)

where $A,B,C\ge 0$ are constants and where $\mu _g$ denotes the induced volume density of the surface f.

Each of the three terms appearing in the metric (1) has a natural geometric interpretation: The first term penalizes local change in the metric (shearing), the second term measures the change in the volume density (scaling) and the third term quantifies the change in the normal vector (bending).

Instead of using the (g, n) representation for comparing surfaces, in the same paper [18], Jermyn et al. introduced the SRNF framework, where a surface is represented only as a rescaled normal vector field:

$$\begin{aligned} {\mathcal {Q}}: {\text {Imm}}(M, {\mathbb {R}}^3)&\rightarrow C^{\infty }(M, {\mathbb {R}}^3)\\ f(s)&\mapsto \sqrt{A(s)}n(s), \end{aligned}$$

where A(s) denotes the local area-multiplication factor, which is given in local coordinates by $A(s)=|f_x(s)\times f_y(s)|$. After equipping the target space $C^{\infty }(M, {\mathbb {R}}^3)$ with the flat $L^2$ metric, the map Q becomes an infinitesimal isometry, where the space ${\text {Imm}}(M, {\mathbb {R}}^3)$ is equipped with the elastic metric $G^{A,B,C}$ with $A = 0, B = \frac{1}{16}$ and $C = 1$, i.e., the pullback of the $L^2$ metric on $C^{\infty }(M, {\mathbb {R}}^3)$ along the map Q is equal to the metric $G^{0,\frac{1}{16},1}$. Note, however, that the resulting metric is degenerate for this choice of constants, i.e., there might exist deformation fields that have no cost with respect to the metric. Furthermore, given $q\in C^{\infty }(M, {\mathbb {R}}^3)$, there may be either no preimage $Q^{-1}(q) \in {\text {Imm}}(M, {\mathbb {R}}^3)$ of q or many preimages. Most importantly, the image of the space of immersions under the SRNF map cannot be easily characterized and, so far, it is not well understood.

Although the distance between two surfaces, which is given by the $L^2$ difference between their SRNFs, can be easily calculated, finding the inversion of the linear path between their SRNFs that realizes this distance is not possible as the linear path will usually leave the image of the SRNF representation. In [25], Laga et al. introduced a way to approximate the inversion of arbitrary paths between SRNFs by formulating inversion as an optimization problem. In practice, this has been used to approximate geodesics, by numerically inverting straight lines between the SRNFs. However, since the image of the SRNF map is not convex in $L^2$, this method will not yield geodesics with respect to the SRNF metric, see Table 3.

2.2 Immersions and Vector Valued One-Forms

In the following, we will introduce our framework for comparing surfaces. Therefore, let $\varOmega ^1(M,{\mathbb {R}}^3)$ denote the space of all smooth ${\mathbb {R}}^3$ valued one-forms on M and $\varOmega _+^1(M,{\mathbb {R}}^3)$ denote the subset of $\varOmega ^1(M,{\mathbb {R}}^3)$, which contains all full-ranked one-forms on M. Given a metric g on M, in a local chart with a field of orthonormal bases, an element of $\varOmega _+^1(M,{\mathbb {R}}^3)$ can be represented as a field of full-ranked $3\times 2$ matrices. We consider the differential as a mapping

$$\begin{aligned} d:{\text {Imm}}(M,{\mathbb {R}}^3)/{{\text {trans}}}&\rightarrow \varOmega _+^1(M,{\mathbb {R}}^3) \nonumber \\ f&\mapsto df\;. \end{aligned}$$

(2)

The differential d as defined above is injective, but not surjective. Furthermore, in contrast to the SRNF mapping ${\mathcal {Q}}$ mentioned in Sect. 2.1, it is easy to characterize the image of the differential d. The following theorem contains this characterization and a result concerning the manifold structure space of full-ranked one forms $\varOmega _+^1(M, {\mathbb {R}}^3)$:

Theorem 1

The space of smooth full-ranked one-forms $\varOmega _+^1(M, {\mathbb {R}}^3)$ is an open subset of an infinite-dimensional vector space of one-forms $\varOmega ^1(M, {\mathbb {R}}^3)$, and thus it is an infinite-dimensional Fréchet manifold, where the tangent space at each point is simply $\varOmega ^1(M, {\mathbb {R}}^3)$.

Furthermore, the image of the differential d is the space of all exact full-ranked one-forms, which is the intersection of $\varOmega ^1_ +(M, {\mathbb {R}}^3)$ with a linear subspace of $\varOmega ^1 (M, {\mathbb {R}}^3)$.

Proof

The proof of this result follows directly from the definition of these spaces. $\square $

This theorem allows us to define a Riemannian metric on these spaces as follows. Let $\alpha \in \varOmega _+^1(M, {\mathbb {R}}^3)$ and $\xi \in T_{\alpha }\varOmega _+^1(M, {\mathbb {R}}^3)$. For the volume form $\mu $ on M induced by the metric g, we let

$$\begin{aligned} G_{\alpha }(\xi ,\xi ) = \int _M{\text {tr}}\left( \xi _x(\alpha _x^T\alpha _x)^{-1}\xi _x^T\right) \sqrt{\det (\alpha _x^T\alpha _x)}\mu . \end{aligned}$$

(3)

It is easy to see that the integrand is positive definite, and thus the formula defines a nondegenerated Riemannian metric. This metric does not depend on the choice of orthonormal bases we choose and is actually independent of the metric g on M, see [7] for more details. Thus, we can choose any convenient metric g on M and use it to calculate this metric on $\varOmega _+^1(M, {\mathbb {R}}^3)$.

Using the injection (2), we obtain a pullback metric on the space ${\text {Imm}}(M, {\mathbb {R}}^3)$ modulo translations and it turns out that this metric is related to the full elastic metric. The space of immersions equipped with this inner product is an infinite-dimensional Riemannian manifold. It should be noted that, with respect to this metric, ${\text {Imm}}(M,{\mathbb {R}}^3)/{{\text {trans}}}$ is neither geodesically complete nor geodesically convex. In addition, there exists no explicit formula to calculate minimizing geodesics between two given immersions $f_1$ and $f_2$. Instead, we will rely on numerical methods to minimize the path length over all paths of immersions connecting the given immersions $f_1$ and $f_2$. Alternatively, these minimizing deformations can be found by solving the Lagrangian optimality condition for the energy functional, called the geodesic equation. Although we will not follow this strategy, we will present this equation in “Appendix A.”

First, however, we will orthogonally decompose the tangent space at $\alpha $ in a similar manner as in the definition of the elastic metric earlier. In the following, we will denote the Moore–Penrose inverse of $\alpha $ by $\alpha ^+$, which is defined by $\alpha ^+=(\alpha ^T\alpha )^{-1}\alpha ^T$ where $\alpha $ is a $3\times 2$ matrix of rank 2. Using this notation, we let

$$\begin{aligned} \xi = \xi _m + \frac{1}{2}{{\,\mathrm{tr}\,}}(\alpha ^+\xi )\alpha +\xi ^{\perp } +\xi _0, \end{aligned}$$

where

$$\begin{aligned} \xi _m&= \frac{1}{2}\alpha (\alpha ^T\alpha )^{-1}(\alpha ^T\xi +\xi ^T\alpha ) - \frac{1}{2}{{\,\mathrm{tr}\,}}(\alpha ^+\xi )\alpha \\ \xi ^{\perp }&= \xi - \alpha (\alpha ^T\alpha )^{-1}\alpha ^T\xi \\ \xi _0&= \frac{1}{2}\alpha (\alpha ^T\alpha )^{-1}(\alpha ^T\xi -\xi ^T\alpha ) \end{aligned}$$

It is easy to check that these terms are orthogonal with respect to metric (3). We can now obtain a family of metrics on $\varOmega _+^1(M, {\mathbb {R}}^3)$:

$$\begin{aligned}&G^{{\mathfrak {a}},{\mathfrak {b}},{\mathfrak {c}},{\mathfrak {d}}}_{\alpha }(\xi , \xi ) \nonumber \\&\quad = {\mathfrak {a}} G_{\alpha }(\xi _m, \xi _m) + {\mathfrak {b}} G_{\alpha }\left( \frac{1}{2}{{\,\mathrm{tr}\,}}(\alpha ^+\xi )\alpha , \frac{1}{2}{{\,\mathrm{tr}\,}}(\alpha ^+\xi ){\alpha }\right) \nonumber \\&\qquad + {\mathfrak {c}}G_{\alpha }( \xi ^{\perp }, \xi ^{\perp }) + {\mathfrak {d}}G_{\alpha }(\xi _0, \xi _0), \end{aligned}$$

(4)

where the first summand is measuring the deformation of the metric (within the class of metrics with the same volume form), the second summand is measuring the deformation of the volume density, the third summand is measuring the deformation of the normal vector. The interpretation of the last summand is less intuitive: it measures changes in the one-form that locally come from rotations about the normal vector.

The following theorem shows the connection of our split metric (4) with the elastic metric (1) on surfaces.

Theorem 2

If ${\mathfrak {d}} = 0$, then the pullback of the split metric (4) gives rise to the elastic metric (1) on the space of immersions.

Proof

See “Appendix B” for a proof of this result. $\square $

In Fig. 2, we show geodesics between two parametrized cylinders with respect to the split metric (4) for different choices of coefficients ${\mathfrak {a}},{\mathfrak {b}},{\mathfrak {c}}$ and ${\mathfrak {d}}$. One can see how the choice of coefficients affects the resulting geodesic. Thus, in each specific application, we are now able to adjust the coefficients of the metric in a data-driven way to obtain desired deformations between the shapes under consideration.

Remark 1

In [7], we have presented a detailed study of metric (3) on the space $\varOmega _+^1(M, {\mathbb {R}}^3)$. In particular, we have obtained an explicit formula for the corresponding geodesic initial value problem; in that situation, geodesics can be computed pointwise, so the problem reduces to a finite-dimensional ODE which can be solved explicitly, and gives the solution in the infinite-dimensional context we are dealing with here.

The space of full-ranked exact one-forms $\varOmega ^1_{+,ex}(M,{\mathbb {R}}^3)$ is, however, a proper subspace of the space of full-ranked one-forms $\varOmega ^1_+(M,{\mathbb {R}}^3)$ and is not a totally geodesic submanifold of $\varOmega ^1_+(M,{\mathbb {R}}^3)$ with respect to metric (3). As the space of immersions corresponds to the space of full-ranked exact one-forms, the obtained explicit formula for geodesics does not directly help to calculate geodesics on the space of immersions, which is the main goal of this article. In order to solve the geodesic problem, we will thus introduce a discretization of the metric and solve the geodesic matching problem using path-straightening algorithms.

Note that the split metric (4) is defined on differentials and thus is, by definition, independent of translations. To show the invariance of the split metric under rigid motions and diffeomorphisms, we now consider the action of the group of rotations ${\text {SO}}(3)$ on $\varOmega _+^1(M,{\mathbb {R}}^3)$, which is defined by pointwise left multiplication:

$$\begin{aligned} {\text {SO}}(3)\times \varOmega _+^1(M,{\mathbb {R}}^3)&\rightarrow \varOmega _+^1(M,{\mathbb {R}}^3) \\ (R, \alpha )&\mapsto R\alpha , \end{aligned}$$

where $(R\alpha )_x = R\alpha _x$, and the action of the group of diffeomorphisms ${\text {Diff}}_+(M)$ on $\varOmega _+^1(M,{\mathbb {R}}^3)$, which is defined via pullback:

$$\begin{aligned} \varOmega _+^1(M, {\mathbb {R}}^3)\times {\text {Diff}}_+(M)&\rightarrow \varOmega _+^1(M, {\mathbb {R}}^3) \\ (\alpha , \varphi )&\mapsto \varphi ^*\alpha , \end{aligned}$$

where $ (\varphi ^*\alpha )_x = \alpha _{\varphi (x)}\circ d\varphi _x. $ The following proposition summarizes the most important invariances of the metric on $\varOmega _+^1(M, {\mathbb {R}}^3)$:

Proposition 1

Let $\alpha \in \varOmega ^1_+(M,{\mathbb {R}}^3)$ and $\zeta , \eta \in T_\alpha \varOmega ^1_+(M,{\mathbb {R}}^3)$.

1.
Metric (4) is invariant under pointwise left multiplication with ${\text {SO}}(3)$, i.e., if $R \in {\text {SO}}(3)$, then
$$\begin{aligned} G_{\alpha }(\zeta , \eta )=G_{R\alpha }(R\zeta ,R\eta ) \end{aligned}$$
2.
Metric (4) is invariant under the right action of the diffeomorphism group, i.e., for any $\varphi \in {\text {Diff}}_+(M)$, we have
$$\begin{aligned} G_{\alpha }(\zeta , \eta )=G_{\varphi ^*\alpha }(\varphi ^*\zeta ,\varphi ^*\eta ). \end{aligned}$$

Proof

The proof of the proposition follows exactly as for the metric (3), which can be found in [7]. $\square $

The group of rotations ${\text {SO}}(3)$ acts on the space of immersions by left multiplication, which is the same as it acts on the space of one forms. Thus, by the first statement of Proposition 1, the pullback metric on ${\text {Imm}}(M, {\mathbb {R}}^3)$ is also invariant under the group of rigid motions ${\text {SO}}(3)\ltimes {\mathbb {R}}^3$. For the standard action of ${\text {Diff}}_+(M)$ by composition from the right on ${\text {Imm}}(M, {\mathbb {R}}^3)$, the following commutative diagram illustrates that the pullback action of ${\text {Diff}}_+(M)$ on $\varOmega _+^1(M, {\mathbb {R}}^3)$ is compatible with the action of ${\text {Diff}}_+(M)$ on ${\text {Imm}}(M, {\mathbb {R}}^3)$:

Therefore, the second statement of Proposition 1 gives the reparametrization invariance of the pullback metric on the space ${\text {Imm}}(M,{\mathbb {R}}^3)$.

Thus, the metric on the space of immersions ${\text {Imm}}(M, {\mathbb {R}}^3)$ induces a metric on the space of unparametrized surfaces ${\text {Imm}}(M, {\mathbb {R}}^3)/{\text {Diff}}_+(M)$ and a metric on the space of unparametrized surfaces modulo rigid motions ${\text {Imm}}(M, {\mathbb {R}}^3)/\left( {\text {Diff}}_+(M)\times {\text {SO}}(3)\ltimes {\mathbb {R}}^3\right) $. In Fig. 3, we show geodesics between two cylinders in the space ${\text {Imm}}(M, {\mathbb {R}}^3)/{\text {Diff}}_+(M)$ with respect to the split metric (4) for different choices of coefficients ${\mathfrak {a}},{\mathfrak {b}},{\mathfrak {c}}$ and ${\mathfrak {d}}$. The corresponding geodesics in the space ${\text {Imm}}(M, {\mathbb {R}}^3)/\left( {\text {Diff}}_+(M)\times {\text {SO}}(3)\ltimes {\mathbb {R}}^3\right) $ are shown in Fig. 4.

3 A Numerical Framework for the General Elastic Metric

In this section, we will describe the discretization and optimization procedure that we implemented to solve the geodesic boundary value problem. From here on, we assume that $M = S^2$ and use a spherical coordinate system to represent an immersion $f: S^2\rightarrow {\mathbb {R}}^3$ as a function $f:[0,2\pi ]\times [0,\pi ]\rightarrow {\mathbb {R}}^3$ such that $f(0, \phi ) = f(2\pi , \phi ), f(\theta , 0) = f(0,0)$ and $f(\theta , \pi ) = f(0, \pi )$, see Remark 2 below on how we obtain such (discrete) parametrizations in practice from a triangulated surface.

Remark 2

We represent the surface of a given 3D shape with its embedding on a sphere $f: S^2 \rightarrow {\mathbb {R}}^3$, which is always possible for genus-0 surfaces. In practice, methods such as conformal mapping introduce significant distortions when dealing with complex shapes that contain many elongated parts. Since the proposed approach does not require the mapping to be conformal, we adopt the approach of Praun and Hoppe [27], which has been implemented by Kurtek et.al. [24]. The idea is to progressively embed a surface on a sphere while minimizing area distortion. The approach starts by reducing the mesh, using progressive mesh simplification, to a basic polyhedra that can be easily embedded on $S^2$. Then, it iteratively inserts vertices and embeds each new vertex inside the spherical kernel of its one-ring neighborhood while optimizing for the area distortion. Using the implementation provided in [24], we reconstruct the mesh up to 1500 vertices, which is sufficient for computing geodesics. This procedure produces spherical maps that preserve important shape features as shown in all of the examples in this paper. We want to remark here that finding parameterizations of higher genus surfaces is still an open problem. Since we are not aiming at solving the parameterization problem, we focus in this paper on genus-0 manifold surfaces only.

The identity immersion $i: S^2\rightarrow {\mathbb {R}}^3$ induces the spherical metric on $S^2$, which will serve as a background metric for the discretization; the vector fields

$$\begin{aligned} \left\{ \dfrac{1}{\sin \phi }\dfrac{\partial }{\partial \theta }, \dfrac{\partial }{\partial \phi }\right\} \end{aligned}$$

form an orthonormal basis of the tangent space for any $(\theta , \phi )\in [0, 2\pi ]\times (0, \pi )$. With respect to this basis and the standard basis on ${\mathbb {R}}^3$, the differential df of an immersion $f = (x, y, z)^T$ can be represented by a field of $3\times 2$ matrices:

$$\begin{aligned} df\left( \dfrac{1}{\sin \phi }\dfrac{\partial }{\partial \theta }, \dfrac{\partial }{\partial \phi }\right) = \begin{pmatrix} \dfrac{1}{\sin \phi }\dfrac{\partial x}{\partial \theta }, \dfrac{\partial x}{\partial \phi } \\ \dfrac{1}{\sin \phi }\dfrac{\partial y}{\partial \theta }, \dfrac{\partial y}{\partial \phi } \\ \dfrac{1}{\sin \phi }\dfrac{\partial z}{\partial \theta }, \dfrac{\partial z}{\partial \phi } \\ \end{pmatrix}. \end{aligned}$$

In the following, we denote by $\left\| \cdot \right\| _f$ the norm induced by the pullback of the split metric (4) and let $u\in T_f{\text {Imm}}(S^2, {\mathbb {R}}^3)$ be a tangent vector. Since u can be seen as a function from $S^2$ to ${\mathbb {R}}^3$, using this representation, the norm of u with respect to the split metric will be given as follows:

$$\begin{aligned} \left\| u\right\| _f = \big [G^{{\mathfrak {a}},{\mathfrak {b}},{\mathfrak {c}},{\mathfrak {d}}}_{df}(du, du)\big ]^{1/2}. \end{aligned}$$

3.1 Geodesics in the Space of Surfaces

We will now describe the solution of the boundary value problem in the preshape space of all parametrized surfaces.

Remark 3

We should emphasize here that the space of immersions with respect to the proposed family of Riemannian metrics is not geodesically convex. Thus, the solution of the minimization problem might not exist in the space of immersions but only in a larger space of functions (including those with possibly degenerate differential). In fact, for ${\text {dim}} M\ge 2$, there exists no Riemannian metric on the space of immersions for which geodesic completeness or convexity results have been obtained; in the case of immersed curves, such results have been achieved for metrics of order two or higher [9], but it remains unknown for our higher dimensional situation.

Given two parametrized surfaces $f_1$ and $f_2$, we can discretize the linear path connecting $f_1$ and $f_2$ in T time steps:

$$\begin{aligned} f_{\text {lin}}(t_i) = (1 - t_i)f_1 + t_if_2. \end{aligned}$$

where $t_i = i/T, i = 0,\ldots ,T$. The differential $df_{\text {lin}}$ is then the linear path between $df_1$ and $df_2$, which stays by definition in the space of exact one-forms for all $i= 0,\ldots , T$. Note that this path does not necessarily stay in the space of full-ranked one-forms, e.g., if $df_1=-df_2$ for some $x\in S^2$. However, we did not encounter any problems with this possible degeneracy. To solve the geodesic boundary value problem, we will perturb f(t) in all possible directions that fix the end points and that remain in the space of immersions. Since the map d, as defined in Equation (2), is injective, this is equivalent to perturbing the differential df(t) in all possible directions in the space of exact one-forms that keep the two boundary one-forms fixed.

To obtain a basis of perturbations in the space of immersions, we use the fact that the set of spherical harmonics in each component form a Hilbert basis of $L^2(S^2,{\mathbb {R}}^3)$. We truncate this basis at a chosen maximal degree ${\text {deg}}$ and denote the obtained set by $\{S_i\}$. The number of elements in this basis is $L = 3(({\text {deg}}+1)^2-1)$ (here, we remove the spherical harmonic of degree 0 and order 0 since it is a constant function, which corresponds to a pure translation). To calculate the optimal deformation between two given surfaces, we aim to minimize the (discrete) path energy over all curves of the form

$$\begin{aligned} f(t_0)&= f_1,\quad f(t_T) = f_2 \nonumber \\ f(t_i)&= (1 - t_i)f_1 + t_if_2 + \sum _{j =1}^L{\text {Coeff}}(j, i)S_j, \end{aligned}$$

(5)

where $i = 1,\ldots ,T-1$ and ${\text {Coeff}}$ is a $L\times (T-1)$ coefficient matrix.

The discrete energy functional $F: {\mathbb {R}}^{L\times (T-1)}\rightarrow {\mathbb {R}}$ is then given by

$$\begin{aligned} F({\text {Coeff}}) = \sum _{i =1}^T\left\| f_t(t_{i-1})\right\| ^2_{f(t_{i-1})}\varDelta T \end{aligned}$$

(6)

where the norm $\left\| \cdot \right\| $ is induced by the pullback of the split metric (4),

$$\begin{aligned} f_t(t_{i-1}) = \frac{f(t_{i}) - f(t_{i-1})}{\varDelta T} \end{aligned}$$

(7)

is the (discrete) derivative of f(t) at $f(t_{i-1})$ and $\varDelta T = \frac{1}{T}$ is the width of a subinterval. Alternatively, one can also discretize the derivative of f using the central difference for interior data points, which makes the energy functional symmetric, but leads to slightly higher computational cost. To find the optimal coefficient matrix ${\text {Coeff}}$, we employ a BFGS method, which is a quasi-Newton method for solving unconstrained minimization problems [13], as provided in the optimize package of scipy. We calculate the gradient using automatic differentiation in Pytorch, which leads to the algorithm described in Algorithm 1. See [21] for more examples of applying tools of deep learning and in particular automatic differentiation in shape and image analysis.

3.2 Geodesics in the Space of Unparametrized Surfaces

Now we present our algorithm for calculating geodesics in the space of unparametrized surfaces ${\text {Imm}}(S^2, {\mathbb {R}}^3)/{\text {Diff}}_+(S^2)$. The main difficulty for this task is to find the optimal $\gamma \in {\text {Diff}}_+(S^2)$ that realizes the distance

$$\begin{aligned} {\text {dist}}_{{\mathcal {S}}}([f_1], [f_2]) = \inf _{\gamma \in {\text {Diff}}_+(S^2)}{\text {dist}}_{{\text {Imm}}}(f_1\circ \gamma , f_2), \end{aligned}$$

where [f] is the equivalence class of f under the action of the group of orientation-preserving diffeomorphisms ${\text {Diff}}_+(S^2)$ and ${\text {dist}}_{{\mathcal {S}}}$ denotes the distance function on the space ${\text {Imm}}(S^2, {\mathbb {R}}^3)/{\text {Diff}}_+(S^2)$ with respect to the metric that is induced from the split metric (4).

In order to practically perform the minimization over the infinite-dimensional space ${\text {Diff}}_+(S^2)$, we have to choose a suitable discretization of this group: Let ${\text {Id}}$ be the identity map from $S^2$ to itself. The tangent space $T_{{\text {Id}}}{\text {Diff}}_+(S^2)$ is the set of all (smooth) vector fields on $S^2$. The set of gradient and skew gradient vector fields of the set of spherical harmonics provides an orthogonal basis for this tangent space—here orthogonal means with respect to the standard $L^2$ metric, see, e.g., [22]. Normalizing these basis, we obtain an orthonormal basis for the tangent space $T_{{\text {Id}}}{\text {Diff}}_+(S^2)$. To choose a finite-dimensional discretization of the tangent space, we truncate this basis at a maximal degree $\overline{{\text {deg}}}$; then, the number of elements in this basis is ${\bar{L}} = 2(\overline{{\text {deg}}}+1)^2-2$. From here on, we will denote this truncated basis by $\{v_i, i = 1,...,{\bar{L}}\}$. Let $X^v = (X^v_1, X^v_2,\ldots ,X^v_{{\bar{L}}})$ be the coefficients of a vector field with respect to this basis, and consider the induced mapping

$$\begin{aligned} \gamma = {\text {Proj}}\left( {\text {Id}} + \sum _{k=1}^{{\bar{L}}}X^v_kv_k\right) , \end{aligned}$$

(8)

where ${\text {Proj}}(x)=\frac{x}{|x|}$ denotes the map that projects nonzero vectors in ${\mathbb {R}}^3$ onto the unit sphere $S^2\subset {\mathbb {R}}^3$. The following result gives an explicit bound on the size of $X^v$ that ensures that the corresponding $\gamma $, defined by (8), is a diffeomorphisms of $S^2$.

Theorem 3

Let $ U= \sum _{k=1}^{{\bar{L}}}X^v_kv_k$ be a vector field on the sphere $S^2$ and let $\gamma = {\text {Proj}}\left( {\text {Id}} + t U \right) $ be the corresponding map as defined in (8), for some real t. Then, $\gamma $ is a diffeomorphism if

$$\begin{aligned} |t|< -\frac{1}{\inf _{p\in M} \lambda _-(\nabla U)}, \end{aligned}$$

(9)

where $\nabla U$ is the (1, 1) tensor field $v\mapsto \nabla _vU$ and $\lambda _-(\nabla U)$ is the smaller of the two real eigenvalues of the symmetrized matrix $\overline{\nabla U} = \tfrac{1}{2} \big (\nabla U + (\nabla U)^T\big )$.

Note that $\nabla U$ is a tensor that for each point $x\in S^2$ gives a linear transformation $T_xS^2\rightarrow T_xS^2$, which is defined by $\nabla U (v)$ being the covariant derivative of U in the direction v.

Proof

The proof of this result is postponed to “Appendix B.” Note that since $\mathrm {Tr}{(\nabla U)} = \mathrm {div}{U}$, which integrates to zero over the compact manifold M, we know that $\lambda _-(\nabla U)$ is always negative somewhere; hence, the bound on $|t|$ is some positive number. $\square $

We are now able to describe the discrete optimization problem on the space of unparametrized surfaces, i.e., we aim to minimize the discrete functional ${{\bar{F}}}: {\mathbb {R}}^{{{\bar{L}}}+ L\times (T-2)}\rightarrow \mathbb R$ given by

$$\begin{aligned} {{\bar{F}}}(X^v, {\text {Coeff}}) = \sum _{i =1}^T\left\| f_t(t_{i-1})\right\| ^2_{f(t_{i-1})}\varDelta T, \end{aligned}$$

(10)

where the norm $\left\| \cdot \right\| $ is induced by the pullback of the split metric (4), ${\text {Coeff}}$, $S_i$, $\varDelta T = \frac{1}{T}$ are as in Sect. 3.1 and where the discrete curve f is now of the form

$$\begin{aligned} f(t_0)&= f_1\circ \gamma ,\quad f(t_T) = f_2 \nonumber \\ f(t_i)&= (1 - t_i)f_1 + t_if_2 + \sum _{j =1}^L{\text {Coeff}}(j, i)S_j, \end{aligned}$$

(11)

and where the reparametrization $\gamma $ is given by formula (8) with coefficient vector $X^v = (X^v_1, X^v_2,\ldots ,X^v_{{{\bar{L}}}})$.

Remark 4

(Initialization over ${\text {Diff}}_+(S^2)$) When using a gradient-based optimization method, it is always an important issue to find a good initialization, as the optimization procedure can get stuck in local minima and is usually sensitive to this initialization. In order to find a good initial guess for the optimal reparametrization of the surface $f_1$, we first align the corresponding SRNFs of the two boundary surfaces $f_1$ and $f_2$. This seems a natural initialization for the $(0,\frac{1}{2},1,0)$ metric as the $L^2$-distance on the space of SRNFs is a first-order approximation of the geodesic distance of this metrics. However, in all our experiments, it turned out that this initialization works well for other choices of constants as well, as the optimal point correspondences for different choices of constants, albeit different, are still similar on a global scale. Furthermore, we note that any three-dimensional rotation can be seen as a diffeomorphism of $S^2$. We use this fact to first minimize only over this finite-dimensional subgroup of the infinite-dimensional reparametrization group. Finally, to initialize the optimization over this finite-dimensional group, we first consider the icosahedral group, which contains 60 orientation-preserving rotations denoted by $h_i, i = 1,\ldots , 60$, as a finite subset of ${\text {SO}}(3)$. We then choose the best diffeomorphism among these 60 elements as our initial guess. See Fig. 5 for examples of registration before and after this initialization and the whole optimization process over the group of orientation-preserving diffeomorphisms ${\text {Diff}}_+(S^2)$.

In the following, we will describe two algorithms for calculating geodesics in the space of unparametrized surfaces ${\text {Imm}}(S^2, {\mathbb {R}}^3)$: a joint optimization procedure and a coordinate descent approach, where we minimize alternating in the space of parametrized surfaces and over the reparametrization group separately.

We will start by describing the joint optimization procedure, which is analogous to the optimization for parametrized surfaces with one caveat: Since Formula (8) only leads to diffeomorphisms near the identity, i.e., reparametrizations that map points on $S^2$ to nearby points, we will describe large deformations between $S^2$ as a composition of N such (small) deformations. This will lead us to iteratively solve the joint optimization problem. The corresponding algorithm is described in Algorithm 2.

As an alternative to the joint optimization, we will present in the following a coordinate descent method, where we separate the variables in the space of surfaces from the variables that govern the reparametrization of the initial surface, i.e., we alternate between calculating a discrete geodesic, denoted by $f_{{\text {opt}}}$, between the parametrized surfaces $f_1$ and $f_2$ in the space of immersions ${\text {Imm}}(S^2, {\mathbb {R}}^3)$ and reparametrizing the initial surface ${{\bar{f}}}=f_1$. To update the reparametrization, we consider only the first two time points of $f_{{\text {opt}}}$, i.e., ${{\bar{f}}}$ and $f_{{\text {opt}}}(t_1)$ and define the following functional

$$\begin{aligned} F_r(X^v) = \left\| f_{{\text {opt}}}(t_1) - \bar{f}\circ \gamma \right\| _{{\bar{f}}\circ \gamma }^2, \end{aligned}$$

(12)

where $\gamma $ is given by Formula (8) and $X^v=(X^v_1, X^v_2,\ldots ,X^v_{{{\bar{L}}}})$. We can now employ a BFGS method to find the optimal coefficient vector $X^v_{{\text {opt}}}$, compute $\gamma $ using Formula (8) and then update ${{\bar{f}}} = \bar{f}\circ \gamma $. Then, we repeat this process by recalculating the geodesic in the space of parametrized surfaces (with the changed initial surface ${{\bar{f}}}$). The whole optimization process is summarized in Algorithm 3.

3.3 Geodesics in the Space of Unparametrized Surfaces Modulo Rigid Motions

Note that the split metric (4) associates no cost with translation and thus the obtained geodesic is automatically in the space of surfaces modulo translations. To calculate the geodesic between two surfaces $[f_1]$ and $[f_2]$ in the space of unparametrized surfaces modulo rigid motions ${\text {Imm}}(S^2, {\mathbb {R}}^3)/\left( {\text {Diff}}_+(S^2)\times {\text {SO}}(3)\ltimes {\mathbb {R}}^3\right) $, we will need to minimize in addition over the rotation group, i.e., solve the optimization problem on ${\text {SO}}(3)\times {\text {Diff}}_+(S^2)$:

$$\begin{aligned} {\text {dist}}_{{\mathcal {S}}}([f_1], [f_2])&= \inf _{\begin{array}{c} R\in {\text {SO}}(3) \\ \gamma \in {\text {Diff}}_+(S^2) \end{array}}{\text {dist}}_{{\text {Imm}}}(f_1\circ \gamma , Rf_2), \end{aligned}$$

where [f] is the equivalence class of f under the actions of ${\text {Diff}}_+(S^2)$ and ${\text {SO}}(3)$ and ${\text {dist}}_{{\mathcal {S}}}$ denotes the distance function on the space ${\text {Imm}}(S^2, {\mathbb {R}}^3)/\left( {\text {Diff}}_+(S^2)\times {\text {SO}}(3)\ltimes {\mathbb {R}}^3\right) $.

Let $\left\| \cdot \right\| , {\text {Coeff}}, S_i, \varDelta T$ be as in Sect. 3.1, and let ${{\bar{f}}} $ be the current parametrization of the first boundary surface. It is known that the group of rotations ${\text {SO}}(3)$ is a three-dimensional Lie group and the matrix exponential $\exp $ from its Lie algebra $\mathfrak {so}(3)$ is surjective. Since there is an isomorphism between ${\mathbb {R}}^3$ and $\mathfrak {so}(3)$, the discrete optimization problem on the space of unparametrized surfaces modulo rigid motions will be minimizing the discrete functional ${\tilde{F}}:{\mathbb {R}}^{3+ {{\bar{L}}}+ L\times (T-2)}\rightarrow {\mathbb {R}}$ given by

$$\begin{aligned} {\tilde{F}}(X^R, X^v, {\text {Coeff}}) = \sum _{i =1}^{T}\left\| f_t(t_{i-1})\right\| ^2_{f_{i-1}}\varDelta T, \end{aligned}$$

where the discrete curve in this case is of the form

$$\begin{aligned} f(t_0)&= {{\bar{f}}}\circ \gamma ,\quad f(t_T) = \exp (X^R)f_2 \\ f(t_i)&= (1 - t_i)f_1 + t_if(t_T) + \sum _{j =1}^L{\text {Coeff}}(j, i)S_j, \end{aligned}$$

$i = 1,\cdots , T-1$ and where the reparametrization $\gamma $ is given by Formula (8) with coefficient vector $X^v = (X^v_1, X^v_2,\ldots ,X^v_{{{\bar{L}}}})$. We will tackle this simpler (finite-dimensional) optimization problem using an analogous approach as in the previous section and will thus omit further details (Fig. 6).

4 Experiments

In this section, we will present examples of geodesics as calculated using our optimization procedures. The human body shapes have been kindly provided by Nil Hasler [15], and the hand shape is taken from SHREC07 watertight models. All other shapes are courtesy of the TOSCA shape database [8].

4.1 Geodesics and Karcher Mean

In Fig. 8, we present examples of geodesics between given surfaces in the space ${\text {Imm}}(S^2, {\mathbb {R}}^3)/{\text {Diff}}_+(S^2)$ with respect to the split (1, 1, 0.1, 0) metric and the corresponding evolutions of energies. In all our examples, we observed a good and relatively fast convergence of the optimization procedure, and we present some selected results of the resulting deformation and the corresponding computation times in Table 1. In Fig. 7, we present the Karcher mean of a family of cat surfaces with respect to the split (1, 1, 0.1, 0) metric in the space of unparametrized surfaces modulo rigid motions ${\text {Imm}}(S^2, {\mathbb {R}}^3)/({\text {Diff}}_+(S^2)\times {\text {SO(3)}}\ltimes {\mathbb {R}}^{3})$. One can observe that the mean captures the overall characteristics of the family of surfaces under consideration, but simplifies some of the features that undergo high variability. To calculate the Karcher mean, we followed an iterative algorithm as described e.g. in [12] in the context of diffeomorphism-based shape analysis. In this method, one arbitrarily orders the data points as $f_1,\ldots , f_n$ and let $q_1$ be the midpoint of the geodesic between $f_1$ and $f_2$. Then, one defines $q_{i+1}$ by travelling until time $\frac{1}{i+1}$ on the geodesic that connects $q_{i-1}$ to $f_{i+1}$ in time 1. The approximation of the Karcher mean is then given by $q_{n-1}$. We want to remark here that this method depends on the ordering of the data points (initialization), and in future work, we plan to further investigate this and compare it to other Karcher mean algorithms, such as directly minimizing the sum of squares of geodesic distances. All results were obtained on a standard laptop without any parallelization or GPU implementation, which could certainly be used to obtain a significant increase in speed (Fig. 8).

Table 1 Numerical results of matching surfaces with different resolutions in time and space: low: $12 \times 25$, ${\text {deg}}= \overline{{\text {deg}}}= 5$, $T = 5$; middle: $25\times 49$, ${\text {deg}}= 7$, $\overline{{\text {deg}}}= 8$, $T = 10$; and high: $50 \times 99$, ${\text {deg}}= 9$, $\overline{{\text {deg}}}= 11$, $T = 15$. Here, Iter denotes the number of iterations until convergence in the optimization process

Full size table

Remark 5

The results in Table 1 suggest that our methods are well suited for multiresolution methods, i.e., to solve the geodesic matching problem first on a coarser resolution (in time, space and degree of spherical harmonics) and then use an upsampled version of the previously obtained solution as initial guess for solving a high-resolution version of the matching problem. Our numerical framework allows for these approaches in all available parameters, and in all our experiments, this procedure seems to allow for as moderate improvements in the speed of the optimization. See Fig. 6 for an example of a multiresolution geodesic in spatial resolution.

4.2 Comparison to the SRNF Framework

Finally, we aim to compare the results obtained with our method to the results using the inversion of linear paths in the SRNF space. The SRNF metric corresponds to the split metric (4) with constant (0, 1/2, 1, 0), see “Appendix B.” To demonstrate this correspondence, we consider four pairs of boundary surfaces. We calculated the length of the linear path between each pair of surfaces under the split (0, 1/2, 1, 0) and the length of the image of the linear path under the SRNF framework. The relative errors between the lengths for different time step sizes are shown in Table 2 and demonstrate that these two metrics indeed coincide.

Table 2 Comparisons between the lengths of linear paths with respect to the split $(0,\frac{1}{2}, 1,0)$ metric and the lengths of the SRNF representations of the linear paths with respect to the $L^2$ metric. $L_l$: the length of linear path; $L_{L_2}$: the length of the SRNF representation of the linear path with respect to the $L^2$ metric

Full size table

Since the image of the SRNF map is not convex in $L^2$, the linear interpolation between two SRNFs may not have a preimage under the SRNF map. Also, even for functions that are in the image of SRNF map, the inverse does not have an analytic expression; in fact, such an expression does not exist in general, since the SRNF map is not injective. As a way to overcome this difficulty, Laga et al. [25] introduced a numerical method to calculate an approximated inversion of any path between two given SRNFs. In practice, this has been used to approximate the geodesic by inverting the linear path between the given SRNFs. We want to remark here that the algorithm of [25] could also be used to invert a geodesic in the image of the SRNF map. However, calculating geodesics in the image of the SRNF map is a nontrivial process, which to the best of our knowledge has not yet been attempted. We would expect that this procedure would lead to minimizers thatrecover the minimizers obtained in the present framework. In Fig. 9, we consider two pairs of surfaces and calculate the geodesic between each pair of the boundary surfaces under the split (0, 1/2, 1, 0) metric with ${\text {deg}}= \overline{{\text {deg}}}= 7, T = 13$ in the space of unparametrized surfaces ${\text {Imm}}(S^2,{\mathbb {R}}^3)/{\text {Diff}}_+(S^2)$. The comparisons of these geodesics with the approximated inversions of the linear paths between the boundary surfaces are shown in Fig. 9. One can see that in the last row for the geodesic between the human body surfaces, the arms are shrinking at the beginning and then stretching, which may be not a desired deformation for some applications. However, by adjusting the coefficients of our metric, we could obtain geodesics with the natural behavior, see Fig. 10 for geodesics with respect to different choices of coefficients.

In Table 3, we compare the lengths of geodesics for four pairs of surfaces in the space of parametrized surfaces ${\text {Imm}}(S^2,{\mathbb {R}}^3)$, the lengths of the approximated inversions (with seven time steps) under the split (0, 1/2, 1, 0) metric and the $L^2$ differences between the SRNFs of the boundary surfaces. One can see from the table that for each pair of surfaces, the length of the geodesic is much closer to the $L^2$ difference than the length of the approximated inversion of the straight line between the SRNFs of the boundary surfaces. Note that the $L^2$-difference is a lower bound for the geodesic distance that will, in general, be strictly smaller than the true geodesic distance, as the image of the SNRF-representation is not a totally geodesic (open) subspace of the space of all $L^2$-functions.

Table 3 Lengths of deformations with respect to the $(0,\frac{1}{2},1,0)$ metric between boundary surfaces with the maximal spherical harmonic degree of 7 and time step size of 25. $L_l$: the length of the linear path between boundary surfaces; $L_g$: the length of geodesic as calculated in our numerical framework; $L_i$: the length of approximated inversion from SRNF straight line; and $L^2$-Diff: the $L^2$ difference between the SRNFs of these boundary surfaces

Full size table

5 Conclusion

In this article, we have introduced a family of elastic metrics on the space of parametrized surfaces in 3D space using a corresponding family of metrics on the space of vector-valued one-forms. For this class of metrics, we have provided a numerical framework for the computation of geodesics on the space of both parametrized and unparametrized surfaces. This new class of metrics generalizes a previously studied family of elastic metrics and includes, in particular, the Square Root Normal Field (SRNF) metric, which has been proven successful in various applications. In the numerical experiment, provided in Sect. 4, we have demonstrated our framework by showing several examples of geodesics and compared our results with earlier results obtained from the SRNF framework. Our framework does not require a numerical inversion of the SRNF map and thus overcomes some of the difficulties of previous work. Furthermore, it allows to choose the constants of the metric in a data-driven way, which has potential importance in many applications. In future work, we plan to further demonstrate the viability of the proposed method in applications to real data. In addition, we are currently working toward developing a generalization of the SRNF map that will allow us to approximate the geodesic distance for our general class of metrics and will thus speed up the computation by choosing a better initial guess for the parametrization of the boundary surface.

Notes

See the article [20] for an example of a path of closed surfaces that connects two distinct shapes, such that the whole path has the same SRNF.

References

Abe, K., Erbacher, J.: Isometric immersions with the same gauss map. Math. Ann. 215(3), 197–201 (1975)
Article MathSciNet Google Scholar
Allen, B., Curless, B., Popović, Z.: The space of human body shapes: reconstruction and parameterization from range scans. ACM Trans. Graph. 22(3), 587–594 (2003)
Article Google Scholar
Bauer, M., Bruveris, M., Charon, N., Møller-Andersen, J.: A relaxed approach for curve matching with elastic metrics. To appear in ESAIM: COCV (2018)
Bauer, M., Bruveris, M., Michor, P.W.: Overview of the geometries of shape spaces and diffeomorphism groups. J. Math. Imaging Vis. 50(1–2), 60–97 (2014)
Article MathSciNet Google Scholar
Bauer, M., Harms, P., Michor, P.W.: Sobolev metrics on shape space of surfaces. J. Geom. Mech. 3(4), 389–438 (2011)
Article MathSciNet Google Scholar
Bauer, M., Harms, P., Michor, P.W.: Sobolev metrics on shape space, ii: weighted sobolev metrics and almost local metrics. J. Geom. Mech. 4(4), 365–383 (2012)
Article MathSciNet Google Scholar
Bauer, M., Klassen, E., Preston, S.C., Su, Z.: A diffeomorphism-invariant metric on the space of vector-valued one-forms. arXiv:1812.10867 (2018)
Bronstein, A.M., Bronstein, M.M., Kimmel, R.: Numerical Geometry of Non-Rigid Shapes. Springer, Berlin (2008)
MATH Google Scholar
Bruveris, M., Michor, P.W., Mumford, D.: Geodesic completeness for sobolev metrics on the space of immersed plane curves. In: Guralnick, R. (ed.) Forum of Mathematics, Sigma, vol. 2. Cambridge University Press (2014)
Celledoni, E., Eslitzbichler, M., Schmeding, A.: Shape analysis on Lie groups with applications in computer animation. J. Geom. Mech. 8(3), 273–304 (2015)
Article MathSciNet Google Scholar
Cervera, V., Mascaro, F., Michor, P.W.: The action of the diffeomorphism group on the space of immersions. Differ. Geom. Appl. 1(4), 391–401 (1991)
Article MathSciNet Google Scholar
Cury, C., Glaunes, J.A., Colliot, O.: Template estimation for large database: a diffeomorphic iterative centroid method using currents. In: International Conference on Geometric Science of Information, pp. 103–111. Springer (2013)
Fletcher, R.: Practical Methods of Optimization. Wiley, Amsterdam (2013)
MATH Google Scholar
Grenander, U., Miller, M.I.: Computational anatomy: an emerging discipline. Q. Appl. Math. 56(4), 617–694 (1998)
Article MathSciNet Google Scholar
Hasler, N., Stoll, C., Sunkel, M., Rosenhahn, B., Seidel, H.P.: A statistical model of human pose and body shape. In: Hauser, H., Benes, B (eds.) Computer Graphics Forum, vol. 28, pp. 337–346. Wiley Online Library (2009)
Heeren, B., Rumpf, M., Wardetzky, M., Wirth, B.: Time-discrete geodesics in the space of shells. In: Hauser, H., Benes, B (eds.) Computer Graphics Forum, vol. 31, pp. 1755–1764. Wiley Online Library (2012)
Jermyn, I., Kurtek, S., Laga, H., Srivastava, A.: Elastic shape analysis of three-dimensional objects. Synth. Lect. Comput. Vis. 7, 1–185 (2017)
Article Google Scholar
Jermyn, I.H., Kurtek, S., Klassen, E., Srivastava, A.: Elastic shape matching of parameterized surfaces using square root normal fields. Comput. Vis. ECCV 2012, 804–817 (2012)
Google Scholar
Kilian, M., Mitra, N.J., Pottmann, H.: Geometric modeling in shape space. In: ACM Transactions on Graphics (TOG), vol. 26, p. 64. ACM (2007)
Klassen, E., Michor, P.W.: Closed surfaces with different shapes that are indistinguishable by the srnf. arXiv:1910.10804. (2019)
Kühnel, L., Sommer, S., Arnaudon, A.: Differential geometry and stochastic dynamics with deep learning numerics. Appl. Math. Comput. 356, 411–437 (2019)
MathSciNet MATH Google Scholar
Kurtek, S., Klassen, E., Ding, Z., Jacobson, W., Jacobson, S., Avison, J.L.J.M., Srivastava, A.: Parameterization-invariant shape comparisons of anatomical surfaces. IEEE Trans. Med. Imaging 30(3), 849–858 (2011)
Article Google Scholar
Kurtek, S., Needham, T.: Simplifying transforms for general elastic metrics on the space of plane curves. arXiv preprint arXiv:1803.10894 (2018)
Kurtek, S., Srivastava, A., Klassen, E., Laga, H.: Landmark-guided elastic shape analysis of spherically-parameterized surfaces. In: Hauser, H., Benes, B (eds.) Computer Graphics Forum, vol. 32, pp. 429–438. Wiley Online Library (2013)
Laga, H., Xie, Q., Jermyn, I.H., Srivastava, A.: Numerical inversion of srnf maps for elastic shape analysis of genus-zero surfaces. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2451–2464 (2017)
Article Google Scholar
Mio, W., Srivastava, A., Joshi, S.: On shape of plane elastic curves. Int. J. Comput. Vis. 73(3), 307–324 (2007)
Article Google Scholar
Praun, E., Hoppe, H.: Spherical parametrization and remeshing. In: ACM Transactions on Graphics (TOG), vol. 22, pp. 340–349. ACM (2003)
Srivastava, A., Klassen, E., Joshi, S.H., Jermyn, I.H.: Shape analysis of elastic curves in euclidean spaces. IEEE Trans. Pattern Anal. Mach. Intell. 33(7), 1415–1428 (2011)
Article Google Scholar
Srivastava, A., Klassen, E.P.: Functional and Shape Data Analysis. Springer, Berlin (2016)
Book Google Scholar
Su, J., Kurtek, S., Klassen, E., Srivastava, A.: Statistical analysis of trajectories on Riemannian manifolds: bird migration, hurricane tracking and video surveillance. Ann. Appl. Stat. 8(1), 530–552 (2014)
Article MathSciNet Google Scholar
Su, Z., Klassen, E., Bauer, M.: Comparing curves in homogeneous spaces. In: Slovák, J. (ed.) Differential Geometry and its Applications, vol. 60, pp. 9–32. Elsevier (2018)
Tumpach, A.B.: Gauge invariance of degenerate riemannian metrics. Notices of the AMS 63(4), 342–350 (2016)
Tumpach, A.B., Drira, H., Daoudi, M., Srivastava, A.: Gauge invariant framework for shape analysis of surfaces. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 46–59 (2015)
Article Google Scholar
Younes, L.: Computable elastic distances between shapes. SIAM J. Appl. Math. 58(2), 565–586 (1998)
Article MathSciNet Google Scholar
Younes, L., Michor, P.W., Shah, J.M., Mumford, D.B.: A metric on shape space with explicit geodesics. Rendiconti Lincei-Matematica e Applicazioni 19(1), 25–57 (2008)
Article MathSciNet Google Scholar
Zhang, Z., Su, J., Klassen, E., Le, H., Srivastava, A.: Video-based action recognition using rateinvariant analysis of covariance trajectories. arXiv:1503.06699 (2015)

Download references

Acknowledgements

The authors thank Anuj Srivastava and all the members of the Florida State statistical shape analysis group for helpful discussions during the preparation of this manuscript. In addition, we are grateful to Sebastian Kurtek and Alice Barbara Tumpach for discussion about the implementation of the minimization over the diffeomorphism group.

Author information

Authors and Affiliations

Department of Mathematics, Florida State University, Tallahassee, USA
Zhe Su, Martin Bauer & Eric Klassen
Department of Mathematics, Brooklyn College and CUNY Graduate Center, New York, USA
Stephen C. Preston
Information Technology, Mathematics and Statistics, Murdoch University, Perth, Australia
Hamid Laga
The Phenomics and Bioinformatics Research Centre, University of South Australia, Adelaide, Australia
Hamid Laga

Authors

Zhe Su
View author publications
You can also search for this author in PubMed Google Scholar
Martin Bauer
View author publications
You can also search for this author in PubMed Google Scholar
Stephen C. Preston
View author publications
You can also search for this author in PubMed Google Scholar
Hamid Laga
View author publications
You can also search for this author in PubMed Google Scholar
Eric Klassen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Martin Bauer.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

M. Bauer was partially supported by NSF-Grant 1912037 (collaborative research in connection with NSF-Grant 1912030). S. C. Preston was partially supported by Simons Foundation Collaboration Grant for Mathematicians No. 318969. E. Klassen was partially supported by Simons Foundation Collaboration Grant for Mathematicians No. 317865.

Appendices

Appendix A: The Geodesic Equation

In the following, we give the geodesic equation on the space of immersions ${\text {Imm}}(M, {\mathbb {R}}^3)$ with respect to the pullback of the metric (3) on the space of 1-forms. In this “Appendix,” we will assume that the domain M is a compact orientable surface without boundary, because we will need to use the Hodge decomposition. We will view $\alpha \in \varOmega ^1_+(M,{\mathbb {R}}^3)$ as a vector-valued 1-form with components $( \alpha ^1, \alpha ^2,\alpha ^3)$, where each $\alpha ^i$ is a 1-form on M in the usual sense. Then, metric (3) can be rewritten as

$$\begin{aligned} G_{\alpha }(\xi , \xi )&= \int _M {{\,\mathrm{tr}\,}}(\xi \varLambda _{\alpha }\xi ^T) \, \varphi _{\alpha } \, \mu \\&= \sum _{i=1}^3 \int _M \langle \xi _x^i, \varLambda _{\alpha } \xi _x^i\rangle \, \varphi _{\alpha } \, \mu \end{aligned}$$

where $\xi = (\xi ^1, \xi ^2,\xi ^3)\in T_{\alpha }\varOmega ^1_+(M,{\mathbb {R}}^3)$, $\varLambda _{\alpha } = (\alpha ^T\alpha )^{-1}$ is the induced Riemannian metric on 1-forms on M, and $\varphi _{\alpha } = \sqrt{\det {(\alpha ^T\alpha )}}$ is the induced volume form on M. As such, all computations can be done one component at a time.

If $F = (f^1, f^2, f^3)$ is a vector-valued function with each $f^i:M\rightarrow {\mathbb {R}}$ real-valued, then $\beta = dF$ is a vector-valued 1-form with $\beta ^i = df^i$. The Hodge decomposition tells us that every 1-form $\xi $ may be written as

$$\begin{aligned} \xi = df + \gamma , \end{aligned}$$

where $\delta \gamma = 0$ and $\delta :\varOmega ^1(M, {\mathbb {R}})\rightarrow C^{\infty }(M)$ is the codifferential operator.

The space ${\text {Imm}}(M,{\mathbb {R}}^3)$ is formally a submanifold of $\varOmega ^1_+(M,{\mathbb {R}}^3)$, and thus by general submanifold geometry, we know that the geodesic equation on ${\text {Imm}}(M,{\mathbb {R}}^3)$ will be given by

$$\begin{aligned} \frac{D}{dt} \frac{d}{dt} \alpha = \gamma , \qquad \alpha = d\varPhi , \qquad \delta \gamma = 0. \end{aligned}$$

Since $\delta \gamma = 0$, we know that $\star \gamma $ is an exact form, where $\star $ denotes the Hodge star operator. Then, there is a function p, unique up to a constant, such that $dp = \star \gamma $. We obtain

$$\begin{aligned} \varDelta p = \delta dp = \delta \!\star \!\gamma = \star d\left( \frac{D}{dt} \frac{d}{dt} \alpha \right) . \end{aligned}$$

In coordinates (u, v) on M, the operator $\star d$ is given by:

$$\begin{aligned} \star d (f \, du + g \, dv) = \frac{g_u - f_v}{\varphi }, \end{aligned}$$

where $\varphi $ is the volume form on M. From the geodesic equation on $\varOmega ^1_+(M,{\mathbb {R}}^3)$ with respect to the metric (3), as calculated in our previous paper [7], we know that the covariant derivative is given by

$$\begin{aligned}&\frac{D}{dt} \frac{d\alpha }{dt} = \alpha _{tt} - \alpha _t(\alpha ^T\alpha )^{-1} \alpha ^T_t \alpha - \alpha _t \alpha ^+\alpha _t + (\alpha _t\alpha ^+)^T\alpha _t \\&\quad - \tfrac{1}{2} {{\,\mathrm{tr}\,}}(\alpha _t(\alpha ^T\alpha )^{-1}\alpha ^T_t)\alpha + {{\,\mathrm{tr}\,}}(\alpha _t\alpha ^+) \alpha _t. \end{aligned}$$

Since $d\alpha _{tt} = 0$, we obtain

$$\begin{aligned} \varDelta p= & {} \star d\Big (-\alpha _t(\alpha ^T\alpha )^{-1} \alpha ^T_t \alpha \Big ) - \alpha _t \alpha ^+\alpha _t + (\alpha _t\alpha ^+)^T\alpha _t \\&- \tfrac{1}{2} {{\,\mathrm{tr}\,}}(\alpha _t(\alpha ^T\alpha )^{-1}\alpha ^T_t)\alpha + {{\,\mathrm{tr}\,}}(\alpha _t\alpha ^+) \alpha _t\Big ). \end{aligned}$$

Let $L = \alpha _t \alpha ^+$. Applying the Hodge star operator to both sides, we see that $\varPhi $ is a geodesic on ${\text {Imm}}(M, {\mathbb {R}}^3)$ if and only if we have

$$\begin{aligned} \star \varDelta p = \star \star d(\varOmega d\varPhi ) = d\varOmega \wedge d\varPhi \end{aligned}$$

where $\varOmega = LL^T + L^2 - L^T L +\frac{1}{2}{{\,\mathrm{tr}\,}}(L^TL) - {{\,\mathrm{tr}\,}}(L)L$. Here, we emphasize that p and $\varPhi $ are actually vector-valued functions, so these computations are done componentwise for each $i\in \{1,2,3\}$. In other words, we have

$$\begin{aligned} \star \varDelta p_i = \sum _{j=1}^3 d\varOmega _{ij} \wedge d\varPhi _j, \qquad i\in \{1,2,3\}. \end{aligned}$$

Appendix B: Proofs

Proof of Theorem 2

In the following, we prove the correspondence between our split metric on the space $\varOmega _+^1(M,{\mathbb {R}}^3)$ and the SRNF metric on the space of surfaces. Let $M_+(3,2)$ be the space of $3\times 2$ matrices with rank 2. Using the pointwise property of our metric, we will focus on the corresponding split metric on the matrix space $M_+(3,2)$. For $a\in M_+(3,2)$ and $v\in T_mM_+(3,2)$, we decompose v into four parts

$$\begin{aligned} v = v_m + \frac{1}{2}{{\,\mathrm{tr}\,}}(a^+v)a + v^{\perp } + v_0, \end{aligned}$$

where

$$\begin{aligned} v_m&= \frac{1}{2}a(a^Ta)^{-1}(a^Tv + v^Ta) - \frac{1}{2}{{\,\mathrm{tr}\,}}(a^+v)a \\ v^{\perp }&= v - a(a^Ta)^{-1}a^Tv \\ v_0&= \frac{1}{2}a(a^Ta)^{-1}(a^Tv - v^Ta). \end{aligned}$$

The corresponding split metric on $M_+(3,2)$ is then of the form:

$$\begin{aligned}&G^{{\mathfrak {a}},{\mathfrak {b}},{\mathfrak {c}},{\mathfrak {d}}}_{a}(v, v) \nonumber \\&\quad = {\mathfrak {a}}\langle v_m, v_m\rangle _a + {\mathfrak {b}}\left\langle \frac{1}{2}{{\,\mathrm{tr}\,}}(a^+v)a, \frac{1}{2}{{\,\mathrm{tr}\,}}(a^+v)a\right\rangle _a \nonumber \\&\qquad + {\mathfrak {c}}\langle v^{\perp }, v^{\perp }\rangle _a + {\mathfrak {d}}\langle v_0, v_0\rangle _a, \end{aligned}$$

(13)

Now consider the projection $\pi :M_+(3,2)\rightarrow {\text {Sym}}_+(2), \ a\mapsto a^Ta$. This projection is a Riemannian submersion, where $M_+(3,2)$ carries metric (13) with choices of constants (1, 1, 1, 1) and the space ${\text {Sym}}_+(2)$ is equipped with the following metric:

$$\begin{aligned} \langle h, k\rangle _g^{{\text {Sym}}} = \frac{1}{4}{{\,\mathrm{tr}\,}}(g^{-1}hg^{-1}k)\sqrt{\det (g)}. \end{aligned}$$

The horizontal bundle with respect to the projection $\pi $ is given by

$$\begin{aligned} {\mathcal {H}}_a = \{u\in M(3,2)\,|\,ua^+\in {\text {Sym}}(n)\}, \end{aligned}$$

and the differential $d\pi $ induces an isometry

$$\begin{aligned} d\pi _a: {\mathcal {H}}_a \rightarrow T_{\pi (a)}{\text {Sym}}_+(m). \end{aligned}$$

It is easy to check that $v_m$ and $\frac{1}{2}{{\,\mathrm{tr}\,}}(a^+v)a$ are horizontal vectors.

Let $g = \pi (a) =a^Ta$. By computation, we have

$$\begin{aligned} {{\,\mathrm{tr}\,}}(a^+v) = \frac{1}{2}{{\,\mathrm{tr}\,}}(g^{-1}d\pi _av) \end{aligned}$$

and

$$\begin{aligned} d\pi _a(v_m)&= a^Tv_m + v_m^Ta = a^Tv + v^Ta - {{\,\mathrm{tr}\,}}(a^+v)a^Ta\\&= d\pi _av - \frac{1}{2}{{\,\mathrm{tr}\,}}\left( g^{-1}d\pi _a v\right) g. \end{aligned}$$

Therefore, the first term in (13) becomes

$$\begin{aligned}&\langle v_m, v_m\rangle _a\\&\quad = \left\langle d\pi _a(v_m), d\pi _a(v_m)\right\rangle _{\pi (a)}^{{\text {Sym}}}\\&\quad =\left\langle d\pi _av - \frac{1}{2}{{\,\mathrm{tr}\,}}\left( g^{-1}d\pi _a v\right) g, d\pi _av - \frac{1}{2}{{\,\mathrm{tr}\,}}\left( g^{-1}d\pi _a v\right) g\right\rangle _g^{{\text {Sym}}} \\&\quad = \left\langle d\pi _av, d\pi _av\right\rangle _g^{{\text {Sym}}} - {{\,\mathrm{tr}\,}}\left( g^{-1}d\pi _a v\right) \left\langle d\pi _av, g\right\rangle _g^{{\text {Sym}}}\\&\qquad + \frac{1}{4}{{\,\mathrm{tr}\,}}^2\left( g^{-1}d\pi _a v\right) \left\langle g, g\right\rangle _g^{{\text {Sym}}} \\&\quad = \frac{1}{4}{{\,\mathrm{tr}\,}}\left( g^{-1}d\pi _avg^{-1}d\pi _av\right) \sqrt{\det (g)}\\&\qquad -\frac{1}{8}{{\,\mathrm{tr}\,}}^2\left( g^{-1}d\pi _a v\right) \sqrt{\det (g)} \end{aligned}$$

and the second term becomes

$$\begin{aligned} \left\langle \frac{1}{2}{{\,\mathrm{tr}\,}}(a^+v)a, \frac{1}{2}{{\,\mathrm{tr}\,}}(a^+v)a\right\rangle _a&= \frac{1}{2}{{\,\mathrm{tr}\,}}^2(a^+v) \sqrt{\det (a^Ta)} \\&= \frac{1}{8}{{\,\mathrm{tr}\,}}^2(g^{-1}d\pi _av)\sqrt{\det (g)}. \end{aligned}$$

For the third term in (13), we consider the corresponding unit normal map on the space of matrices given by

$$\begin{aligned} n: M_+(3,2)&\rightarrow {\mathbb {R}}^3\\ a&\mapsto \frac{a_1\times a_2}{|a_1\times a_2|} = \frac{a_1\times a_2}{\sqrt{\det (a^Ta)}}, \end{aligned}$$

where $a_1$ and $a_2$ are the first and the second columns of a, respectively. For any tangent vector $u = \begin{pmatrix} u_1&u_2 \end{pmatrix}$ at a, the differential of n at a is

$$\begin{aligned} dn_a(u) = \frac{u_1\times a_2+ a_1\times u_2 - (a_1\times a_2){{\,\mathrm{tr}\,}}(a^+u)}{\sqrt{\det (a^Ta)}}. \end{aligned}$$

It is easy to check that $aa^+v$ is in the kernel of the differential $dn_a$, i.e.,

$$\begin{aligned} dn_a(v) = dn_a(v^{\perp } + aa^+v) = dn_a(v^{\perp }). \end{aligned}$$

Note that ${{\,\mathrm{tr}\,}}(a^+v^{\perp }) = 0$, $aa^+a_1 = a_1$ and $aa^+a_2 = a_2$. Using the following identity for three-dimensional vectors b, c, d, e:

$$\begin{aligned} (b\times c)\cdot (d\times e) = b^Tdc^Te - b^Tec^Td \end{aligned}$$

and the formula for the inverse of $a^Ta$:

$$\begin{aligned} (a^Ta)^{-1} = \frac{1}{\det (a^Ta)}\begin{pmatrix} a_2^Ta_2 &{} -a_1^Ta_2\\ a_1^Ta_2 &{} a_1^Ta_1 \end{pmatrix}, \end{aligned}$$

we have

$$\begin{aligned}&\langle dn_av, dn_av\rangle _{{\mathbb {R}}^3} = \langle dn_av^{\perp }, dn_av^{\perp }\rangle _{{\mathbb {R}}^3}\\&\quad = \frac{1}{\det (a^Ta)}\langle v^{\perp }_1\times a_2+ a_1\times v^{\perp }_2, v^{\perp }_1\times a_2+ a_1\times v^{\perp }_2\rangle _{{\mathbb {R}}^3}\\&\quad = \frac{1}{\det (a^Ta)}\big [\left( v_1^Tv_1 - v_1^Taa^+v_1\right) a_2^Ta_2\\&\qquad - 2\left( v_1^Tv_2 - v_1^Taa^+v_2\right) a_1^Ta_2 + \left( v_2^Tv_2 - v_2^Taa^+v_2\right) a_1^Ta_1\big ], \end{aligned}$$

where $v^{\perp }_1, v^{\perp }_2$ are the first and the second columns of $v^{\perp }$ and $v_1, v_2$ are the first and the second columns of v, respectively. It follows that

$$\begin{aligned}&\left\langle v^{\perp }, v^{\perp }\right\rangle _a\sqrt{\det (a^Ta)} = {{\,\mathrm{tr}\,}}(v^{\perp }(a^Ta)^{-1}(v^{\perp })^T)\det (a^Ta)\\&\quad = \left( {{\,\mathrm{tr}\,}}(v(a^Ta)^{-1}v^T) - {{\,\mathrm{tr}\,}}(aa^+v(a^Ta)^{-1}v^T)\right) \det (a^Ta)\\&\quad = \left( {{\,\mathrm{tr}\,}}(v^Tv(a^Ta)^{-1}) - {{\,\mathrm{tr}\,}}(v^Taa^+v(a^Ta)^{-1})\right) \det (a^Ta)\\&\quad = {{\,\mathrm{tr}\,}}\left( \begin{pmatrix} v_1^T(I - aa^+)v_1 &{} v_1^T(I - aa^+)v_2\\ v_1^T(I - aa^+)v_2 &{} v_2^T(I - aa^+)v_2 \end{pmatrix}\begin{pmatrix} a_2^Ta_2 &{} -a_1^Ta_2\\ a_1^Ta_2 &{} a_1^Ta_1 \end{pmatrix}\right) \\&\quad = \left( v_1^Tv_1 - v_1^Taa^+v_1\right) a_2^Ta_2 - 2\left( v_1^Tv_2 - v_1^Taa^+v_2\right) a_1^Ta_2 \\&\qquad + \left( v_2^Tv_2 - v_2^Taa^+v_2\right) a_1^Ta_1\\&\quad = \langle dn_av, dn_av\rangle _{{\mathbb {R}}^3}\det (a^Ta), \end{aligned}$$

that is,

$$\begin{aligned} \left\langle v^{\perp }, v^{\perp }\right\rangle _a = \langle dn_av, dn_av\rangle _{{\mathbb {R}}^3}\sqrt{\det (g)}. \end{aligned}$$

Therefore, the split metric (13) on $M_+(3,2)$ can be rewritten as

$$\begin{aligned}&G^{{\mathfrak {a}},{\mathfrak {b}},{\mathfrak {c}},{\mathfrak {d}}}_{a}(v, v)\\&\quad = {\mathfrak {a}}\left( \frac{1}{4}{{\,\mathrm{tr}\,}}\left( g^{-1}d\pi _avg^{-1}d\pi _av\right) -\frac{1}{8}{{\,\mathrm{tr}\,}}^2\left( g^{-1}d\pi _a v\right) \right) \sqrt{\det (g)}\\&\qquad + \frac{{\mathfrak {b}}}{8}{{\,\mathrm{tr}\,}}^2\left( g^{-1}d\pi _a v\right) \sqrt{\det (g)}+ {\mathfrak {c}}\langle dn_av, dn_av\rangle _{{\mathbb {R}}^3}\sqrt{\det (g)}\\&\qquad + {\mathfrak {d}}\langle v_0, v_0\rangle _a. \end{aligned}$$

Now it is easy to see that the first three terms give rise to the formula of the full elastic metric on the space of surfaces for $A = {\mathfrak {a}}/4, B = ({\mathfrak {b}}-{\mathfrak {a}})/8, C = {\mathfrak {c}}$ and the SRNF metric corresponds to the split metric (4) with constants $(0 ,\frac{1}{2}, 1, 0)$. $\square $

Proof of Theorem 3

We first perform the computation in spherical coordinates $(\theta , \phi )\in [0, 2\pi ]\times [0, \pi ]$. Denote the usual spherical coordinate orthonormal basis by

$$\begin{aligned} e_1&= \langle \sin {\phi } \cos {\theta }, \sin {\phi } \sin {\theta }, \cos {\phi }\rangle , \\ e_2&= \langle \cos {\phi } \cos {\theta }, \cos {\phi } \sin {\theta }, -\sin {\phi }\rangle , \\ e_3&= \langle -\sin {\theta }, \cos {\theta }, 0\rangle . \end{aligned}$$

We have the following formulas for the partial derivatives:

$$\begin{aligned} \partial _{\phi } e_1&= e_2, \qquad \partial _{\phi } e_2 = -e_1, \qquad \partial _{\phi } e_3 = 0, \nonumber \\ \partial _{\theta } e_1&= \sin {\phi } e_3, \qquad \partial _{\theta } e_2 = \cos {\phi } e_3, \nonumber \\ \partial _{\theta } e_3&= -\sin {\phi } e_1 - \cos {\phi } e_2. \end{aligned}$$

(14)

We also note that the covariant derivatives are given by

$$\begin{aligned} \nabla _{e_2}e_2&= 0,\quad&\nabla _{e_2}\ e_3&= 0\nonumber \\ \nabla _{e_3}e_2&= \cot {\phi }\, e_3,\quad&\nabla _{e_3}e_3&= -\cot {\phi }\, e_2. \end{aligned}$$

(15)

Write

$$\begin{aligned} U(\theta , \phi ) = u(\theta , \phi ) e_2(\theta , \phi ) + v(\theta , \phi ) e_3(\theta , \phi ). \end{aligned}$$

For a real parameter t, we consider the following map $W:S^2 \rightarrow {\mathbb {R}}^3$ given in coordinates by

$$\begin{aligned} W(\theta , \phi )&= e_1(\theta , \phi ) + tU(\theta , \phi )\\&= e_1(\theta , \phi ) + t u(\theta , \phi ) e_2(\theta , \phi ) + t v(\theta , \phi ) e_3(\theta , \phi ). \end{aligned}$$

Then, $\eta = W/|W|$.

Note that in order for $\eta $ to be a diffeomorphism, we require that the Jacobian determinant be nonzero; it is given by

$$\begin{aligned} {\text {Jac}}(\eta ) = \frac{1}{\sin \phi }\left|\frac{\partial \eta }{\partial \phi }\times \frac{\partial \eta }{\partial \theta }\right|. \end{aligned}$$

Observe that

$$\begin{aligned} \eta _{\phi }&= \frac{1}{|W|} \left( W_{\phi } - \frac{W\cdot W_{\phi }}{|W|^2}\, W\right) = \frac{1}{|W|} P_{W^{\perp }}(W_{\phi }),\\ \eta _{\theta }&= \frac{1}{|W|} \left( W_{\theta } - \frac{W\cdot W_{\theta }}{|W|^2}\, W\right) = \frac{1}{|W|} P_{W^{\perp }}(W_{\theta }). \end{aligned}$$

Since $\eta _{\phi }$ and $\eta _{\theta }$ are both perpendicular to W, we know that $\eta _{\phi }\times \eta _{\theta }$ is parallel to W; thus, we obtain the formula

$$\begin{aligned} {\text {Jac}}(\eta )&= \frac{1}{\sin \phi |W|^2}\big |P_{W^{\perp }}(W_\phi )\times P_{W^{\perp }}(W_\theta )\big |\\&= \frac{1}{\sin \phi |W|^3}\left|W\cdot \big (P_{W^{\perp }}(W_\phi )\times P_{W^{\perp }}(W_\theta )\big )\right|\\&= \frac{1}{\sin \phi |W|^3}\left|W\cdot \big (W_\phi \times W_\theta \big )\right|, \end{aligned}$$

using the cyclic invariance of the scalar triple product and the fact that $W\times P_{W^{\perp }}(V) = W\times V$ for any vector V.

Since $W = e_1+tU$ for the vector field $U = ue_2+ve_3$, it is straightforward to compute using (14)-(15) that

$$\begin{aligned} W_{\phi }&= e_2+tU_{\phi } = e_2 +t\nabla _{e_2}U - tue_1,\\ \frac{W_{\theta }}{\sin \phi }&= e_3+\frac{t}{\sin \phi }U_{\theta } = e_3 +t\nabla _{e_3}U - tve_1, \end{aligned}$$

Let $m_1 = u_{\phi }, m_2 = v_{\phi }, m_3 = \frac{u_{\theta }-v\cos {\phi }}{\sin {\phi }}, m_4= \frac{v_{\theta }+u\cos {\phi }}{\sin {\phi }}$. We have by (15) that

$$\begin{aligned} \nabla _{e_2}U = m_1e_2+m_2e_3,\qquad \nabla _{e_3}U = m_3e_2+m_4e_3, \end{aligned}$$

which we abbreviate by

$$\begin{aligned} M :=\nabla U = \begin{pmatrix} m_1 &{} m_2\\ m_3 &{} m_4 \end{pmatrix}. \end{aligned}$$

Thus, the Jacobian is nonzero if and only if the following determinant is nonzero:

$$\begin{aligned} D = \left|W\cdot \big (W_\phi \times W_\theta \big )\right|= \begin{vmatrix} 1&tu&tv\\ -tu&1+ tm_1&tm_2\\ -tv&tm_3&1+tm_4 \end{vmatrix}. \end{aligned}$$

(16)

Expanding the determinant (16) along the first column, then it is given by

$$\begin{aligned} D = \det (1+tM) + t^2\langle JU, (1+tM)JU\rangle , \end{aligned}$$

where $J = \begin{pmatrix} 0 &{} -1\\ 1 &{} 0 \end{pmatrix}$.

Let ${\overline{M}} = \tfrac{1}{2} (M+M^T)$ denote the symmetrization of M, and let $\lambda _1\le \lambda _2$ denote the real eigenvalues of ${\overline{M}}$. Then, ${{\,\mathrm{tr}\,}}{M} = {{\,\mathrm{tr}\,}}{{\overline{M}}}$ and $\det {M} = \det {{\overline{M}}} + \tfrac{1}{4} (m_2-m_3)^2$, so that

$$\begin{aligned} \det (1+tM) \ge \det (1+t{\overline{M}}) = (1+\lambda _1t)(1+\lambda _2t). \end{aligned}$$

Since J is a rotation, we have

$$\begin{aligned} \langle JU, (1+tM)JU\rangle&= \langle JU, (1+t{\overline{M}})JU\rangle \\&\ge (1+\lambda _1t)|JU|^2\\&= (1+\lambda _1t)|U|^2. \end{aligned}$$

Thus,

$$\begin{aligned} D\ge (1+\lambda _1t)(1+\lambda _2t+|U|^2t^2). \end{aligned}$$

For sufficiently small t, we know $(1+\lambda _1t)$ is positive, and since $\lambda _1\le \lambda _2$, we obtain

$$\begin{aligned} D\ge (1+\lambda _1t)^2 \end{aligned}$$

Thus, $1+\lambda _1t>0$ is a sufficient condition for positivity of D, and this happens as long as $|t|< \frac{1}{|\lambda _1|}$. It is easy to compute that

$$\begin{aligned} \lambda _1 = \frac{m_1+m_4 - \sqrt{(m_1-m_4)^2 + (m_2 + m_3)^2}}{2}. \end{aligned}$$

In particular, $m_1+m_4 = {{\,\mathrm{tr}\,}}{(\nabla U)} = {\text {div}}{U}$, and by the divergence theorem, we know the integral of $m_1+m_4$ over $S^2$ is zero, and in particular $m_1+m_4$ is either identically zero or changes sign on $S^2$. Since t is nonnegative, we therefore are concerned about the most negative that $\lambda _1(x)$ can be:

$$\begin{aligned} 1 + \lambda _1(x) t \ge 1 + t\inf _{x\in S^2} \lambda _1(x) = 1 - t \sup _{p\in S^2}(-\lambda _1(x)) \ge 0, \end{aligned}$$

which is equivalent to

$$\begin{aligned} t < \frac{2}{\sup \limits _{p\in S^2} -(m_1+m_4) + \sqrt{(m_1-m_4)^2 + (m_2+m_3)^2}}. \end{aligned}$$

This is clearly (9). $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Su, Z., Bauer, M., Preston, S.C. et al. Shape Analysis of Surfaces Using General Elastic Metrics. J Math Imaging Vis 62, 1087–1106 (2020). https://doi.org/10.1007/s10851-020-00959-4

Download citation

Received: 15 October 2019
Accepted: 09 April 2020
Published: 14 May 2020
Issue Date: October 2020
DOI: https://doi.org/10.1007/s10851-020-00959-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Shape Analysis of Surfaces Using General Elastic Metrics

Abstract

Similar content being viewed by others

Elastic Metrics on Spaces of Euclidean Curves: Theory and Algorithms

Elastic Shape Analysis of Surfaces with Second-Order Sobolev Metrics: A Comprehensive Numerical Framework

Elastic Shape Analysis of Surfaces and Images

1 Introduction

2 Mathematical Framework and Background

2.1 The General Elastic Metric and the SRNF Framework

2.2 Immersions and Vector Valued One-Forms

Theorem 1

Proof

Theorem 2

Proof

Remark 1

Proposition 1

Proof

3 A Numerical Framework for the General Elastic Metric

Remark 2

3.1 Geodesics in the Space of Surfaces

Remark 3

3.2 Geodesics in the Space of Unparametrized Surfaces

Theorem 3

Proof

Remark 4

3.3 Geodesics in the Space of Unparametrized Surfaces Modulo Rigid Motions

4 Experiments

4.1 Geodesics and Karcher Mean

Remark 5

4.2 Comparison to the SRNF Framework

5 Conclusion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

Appendix A: The Geodesic Equation

Appendix B: Proofs

Proof of Theorem 2

Proof of Theorem 3

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation