Keywords

1 Introduction

Computational anatomy aims at modeling the temporal evolution and cross-sectional variability of anatomical shapes. The deformations between shapes are obtained by applying non-rigid registration algorithms that seek the smallest transformation - in a sense that will be defined precisely - of the ambient space to match two shapes. In the diffeomorphic registration setting, the deformations are modeled by diffeomorphisms, that provide invertible and folding-free transformations.

In the Large Deformation Diffeomorphic Metric Mapping (LDDMM) framework, the space of diffeomorphisms is endowed with a right invariant metric and deformations of interest are obtained by geodesic flows from the identity transformation. They are parameterized by their initial velocity fields, which are tangent vectors at the identity deformation, defined by initial control points and dual momenta. A transformation is then computed by integration of differential equations. Registration is performed by solving an optimization problem on the initial momenta and control points with a gradient descent. These numerical schemes efficiently implement an exponential and logarithm map on a subspace of diffeomorphisms.

However in practice, the optimization problem is relaxed to enforce smooth deformations and results in inexact matching. Therefore we can think of the exponential map as not going “far enough”. In this work we introduce a modified exponential map that accounts for a residual error due to registration, indifferent to the choice of the regularization parameter.

The choice of the metric and regularization parameter affects the geometric structure of the space of deformations under consideration. Many convergence results depend on the curvature of this space, e.g. [9], and especially on its covariant derivative. This gradient being null in locally symmetric spaces, they form a very convenient setting to perform statistics on shapes. In order to assess how far from symmetric our structure is, it would therefore be valuable to develop a procedure to measure this gradient. In this paper we build on a specific parallel transport scheme.

Fig. 1.
figure 1

Pole Ladder

In the statistical analysis of temporal deformations, parallel transport along geodesics is commonly used to perform inter-subject normalization, that is the vector transport of velocity fields from each subject’s space to a common atlas’ space. An approximation based on Jacobi fields was proposed in [7, 11]. A numerical implementation named Pole Ladder (PL) was proposed in [5] and relies only on the computation of exponential and logarithm maps. Following the Shild’s ladder, it consists in the construction of geodesic parallelograms. The progression between two shapes S and \(S'\) is transported to T by:

  • first computing a “midpoint” M on the geodesic between S and T. It is seen as the diagonal of the geodesic parallelogram;

  • then extending the geodesic from \(S'\) to M by the same length to obtain \(T''\);

  • similarly extending the geodesic from \(T''\) to T to obtain the parallel deformation of the template T;

See Fig. 1 for a schematic representation.

For large deformations that typically arise in inter-subject registration, this procedure is usually iterated. We therefore expect numerical errors to grow linearly in the number of steps, and lose crucial numerical accuracy. The accuracy of this scheme was analyzed in [9] and shown to be a third order scheme in general, and exact in symmetric spaces where it is equivalent to a single transvection, that is, a composition of two symmetries, in our case symmetries with respect to M and T.

In many cases, it is well known that numerical errors can be drastically reduced in the presence of a symmetry. This is for instance the case to diagonalize a symmetric matrix versus an arbitrary one. Using the Stationary Velocity Fields (SVF) framework for registration, a symmetric variant of Pole Ladder built on a Lie Group intrinsic symmetric structure was proposed in [3]. This procedure is recalled in Sect. 2.2. In this work, we build on this idea, but rely on LDDMM to implement a more general involution that accounts for the registration residual. This elementary algorithmic component constructs the symmetric shape of an original shape with respect to another one. It is presented in Sect. 2.3. In Sect. 2.4, we introduce the basic properties that symmetries must verify in an affine symmetric space and discuss whether these are fulfilled by our implementation. From a theoretical perspective, deviations from these properties are due to a non zero covariant derivative of the curvature tensor with the LDDMM metric. Conversely, we may interpret transvection errors as estimates of the numerical curvature gradient that encompass all the approximations due to the implementation. The numerical experiments of Sect. 3 show that there is an optimal regularisation parameter for which the space of deformations can best be approximated by a symmetric space.

The paper is organised as follows: in Sect. 2.1, we recall the LDDMM framework following [2] and in Sect. 2.2 the Pole Ladder procedure. We then introduce in Sects. 2.3 and 2.4 the main contributions of the paper, namely accounting for residuals, defining symmetries and their properties: centrality, involutivity and transvectivity. In Sect. 3 we present the numerical experiments and comment on the results.

2 Background and Method

2.1 The LDDMM Framework

In this work we consider shapes represented by 3D meshes. However, the methodology seamlessly applies to images. In order to define a practical finite dimensional parameterization of a subspace of diffeomorphisms G acting on the ambient space \({\mathbb R}^d\), we consider time-varying velocity fields \(v_t(x) = \sum _{k=1}^{N_c} K(x, c^{(t)}_k) \mu ^{(t)}_k \) obtained by convolution of a Gaussian kernel \(K(x,y) = \exp (- \frac{\Vert x-y \Vert ^2}{\sigma ^2})\) over \(N_c\) control points \(c^{(t)} = [c_k^{(t)}]_k\) and momenta \(\mu ^{(t)} = [\mu _k^{(t)}]_k\).

The set of such fields forms a pre-Hilbert space with scalar product between \(v = \sum _k K(\cdot ,c_k)\mu _k\) and \(v' = \sum _k K(\cdot ,c_k')\mu _k'\) defined by

$$\begin{aligned} \textstyle {{<}v, v'{>}}_H = \sum _i\sum _j K(c_i,c_j')\mu _i^T \mu _j'. \end{aligned}$$
(1)

Diffeomorphisms are then defined as flows of velocity fields from \(\phi _0 = Id\). This amounts to integrating the ordinary differential equation (ODE) \(\partial \phi _t(\cdot ) = v_t[\phi _t(\cdot )]\) between 0 and 1. The scalar product on velocity fields induces a right-invariant Riemannian metric on the obtained subspace of diffeomorphisms, and geodesics of this metric are parameterized by control points and momenta that satisfy the following Hamiltonian equations:

(2)

\(\phi _t\) and \(v_t\) are thus uniquely determined by their initial control points \(\mu \), this dependence will be explicitly written \(\phi _t^{c,\mu }\) and \(v_t^{c,\mu }\). In practice the interval [0, 1] is discretized with n time steps and the ODE is solved with an iterative Euler forward or Runge-Kutta 2 method. This implements an exponential map at identity. The registration problem between a template shape T and a target mesh S optimizes the following criterion over initial control points and momenta \(c,\mu \):

$$\begin{aligned} C(c, \mu ) = \Vert S - \phi _1^{c,\mu }(T) \Vert _2^2 + \alpha ^2 \cdot \Vert v_0^{c,\mu } \Vert _H^2. \end{aligned}$$
(3)

For simplicity, we measure the distance between shapes by the \(L_2\) distance between nodes of the meshes. \(\Vert \cdot \Vert _H\) is the norm defined by the scalar product of Eq. 1, which is actually the metric on G. The resulting \(v_0\) is a logarithm at identity of \(\phi _1\). In fact the metric is scaled by a factor \(\alpha \), and this impacts the geometry of the underlying space as will be demonstrated. It allows to smoothly interpolate between solutions that belong to two paradigms:

  • \(\alpha \rightarrow 0\): exact matching between shapes;

  • \(\alpha \rightarrow \infty \): point distribution models (PDM), no deformation.

2.2 Symmetric Pole Ladder for Parallel Transport

In the context of computational anatomy, the aforementioned registration framework is used to represent a subject-specific temporal deformation between shapes S and \(S'\), and to transport this deformation to a common atlas or template T along the geodesic segment [ST]. The anatomical shapes are modeled as points in a manifold \({\mathcal V}\) under the action of the space of diffeomorphisms G described above. We suppose here that this manifold is equipped with an affine connection, which defines parallel transport and the \({\text {Exp}}\) map. Locally it further defines the \({\text {Log}}\) map.

Algorithm 1 presents a symmetric variant of Pole Ladder [5] introduced in [3] to approximately perform parallel transport in \({\mathcal V}\).

figure a

A Taylor expansion at the midpoint M of the error between the vector transported by Pole Ladder and exact parallel transport of the vector \(u_S = {\text {Log}}_S(S')\) is derived in [9]. We denote by \(u = \varPi _S^M u_S\) the exact transport to M, and \(u' = \varPi _T^M {\text {Log}}_T(T')\) where \(T'\) is obtained by Pole Ladder. Let also \(v = {\text {Log}}_M(T)\). Then

$$\begin{aligned} u' - u = \frac{1}{12}\left( (\nabla _v R)(u,v)(5 u - 2v) + (\nabla _{u} R)(u,v)(v -2 u) \right) + O(\Vert v+u\Vert ^5). \end{aligned}$$
(4)

In fact this scheme is exact in an affine locally symmetric space, where \(\nabla R = 0\). In this case, using the local symmetries \(s_X\) at point X, we have \(T' = s_T \circ s_M(S')\) meaning that Pole Ladder is equivalent to a transvection. As local symmetries are affine mappings, the following diagrams commute:

figure b

Thus, \(\varPi _T^M \circ (ds_M)_S= -\varPi _S^M\) [10, Prop. 4.3]. So with the previous notations, \((ds_M)_S u_S = -\varPi _M^T u\) and \((ds_M)_S u_S = Log_T(T") = - \varPi _M^T u'\) yielding \(u' = u\).

Fig. 2.
figure 2

Midpoint with residuals

2.3 Accounting for Residuals to Improve Centrality and Symmetry

In practice the registration is never exact due to the regularization. Thus, we propose to decompose the space of shapes into a deformation part encoding the orbit of the template, and a Euclidean space of residual displacement fields

$$\begin{aligned} \textstyle S = \phi _1(T) + \delta , \end{aligned}$$
(5)

where \(\delta \) is a displacement field between corresponding points of the two meshes. Of course different possibilities exist to compute the residual, to transport it, and to apply it to different shapes, but our experiments suggest that this very simple formulation may be sufficient, so that we will not detail other approaches in this paper. With this decomposition, a midpoint between T and S is defined by (Fig. 2):

$$\begin{aligned} M = \phi _{\frac{1}{2}}(T) + \frac{1}{2} \delta = {\text {Exp}}_T\Big (\frac{1}{2} {\text {Log}}_T(S)\Big ) + \frac{1}{2} \delta . \end{aligned}$$
(6)

Unfortunately this formulation is not symmetric in T and S, and registering S on T and shooting from S results in a different midpoint in general. We will see however that using residuals decreases the distance between the midpoints obtained with the two initial points.

Fig. 3.
figure 3

Symmetry with residuals

Similarly, a symmetry is defined by inverting the geodesics and the residuals (Fig. 3):

$$\begin{aligned} s_T(S) = {\text {Exp}}_T \Big (- {\text {Log}}_T(S)\Big ) - \delta . \end{aligned}$$
(7)

The first desired consistency property which we refer to as centrality is the compatibility of the midpoint with the symmetry, namely \(s_M(S) = T\). Note that this construction of a central midpoint and a local geodesic symmetry is possible in arbitrary affine connection manifolds for close enough points. However, these symmetries are affine mappings if and only if the space is locally symmetric [10, Prop. 4.2].

2.4 Involutivity and Transvectivity to Measure Curvature Gradient

By construction our symmetry verifies \(s_T(T) = T\) for all \(T \in {\mathcal V}\), and if the \(\log \) and \(\exp \) maps are exact, \(s_T \circ s_T = Id\). We will evaluate the exactitude of this property, called involutivity. Note that in a Lie Group of transformations, natural symmetries may be defined at \(\phi \) by \(\psi \mapsto \phi \circ \psi ^{-1} \circ \phi \), which, at \(\phi = Id\) is in fact the inversion. Involutivity in this case reduces to inverse consistency: \((\psi ^{-1})^{-1} = \psi \iff \psi \circ \psi ^{-1} = Id\). However, in our framework, both types of errors are different because the metric exponential differs from the one defined by the canonical Cartan-Shouten connection, which is the only one compatible with the group operations.

Finally, in an affine globally symmetric space, symmetries must verify the following property that we will call transvectivity [10, Prop. 5.3]:

$$\begin{aligned} \forall M,S \in {\mathcal V}\;\; s_M \circ s_S = s_T \circ s_M, \;\; \text {where} \; T=s_M(S). \end{aligned}$$
(8)

We want to evaluate the exactitude of this property, and use the deviation to this ideal case as a proxy to measure the gradient of the curvature of the space in the directions of interest. At this point the role of \(\alpha \) becomes clearer, it allows to form a continuum of decompositions of the space of diffeomorphisms under consideration:

  • \(\alpha \rightarrow 0, \; \delta \rightarrow 0\): the Riemannian space of deformations where the metric is not compatible with the Cartan-Shouten connection. This space is not symmetric, which shows in the transvectivity error.

  • \(\alpha \rightarrow \infty , \; v_0 \rightarrow 0\): the shapes are considered in the ambient space with Euclidean norm, this space is of course symmetric. This is the PDM framework.

We saw in Sect. 2.2 that Pole Ladder was doing the transvection \(s_T \circ s_M\). With the same notations and using a Taylor expansion from [9], the deviation to parallel transport when applying the opposite transvection \(s_M \circ s_S\) is:

$$\begin{aligned} u'' - u = -\frac{1}{12}\left( (\nabla _v R)(u,v)(5 u + v) + (\nabla _{u} R)(u,v)(v + 2 u) \right) + O(\Vert v+u\Vert ^5). \end{aligned}$$
(9)

Thus when measuring the transvectivity error \(\Vert s_M \circ s_S(S') - s_T \circ s_M(S') \Vert _2\), we in fact measure \(\Vert {\text {Exp}}_T(\varPi _M^T u") - {\text {Exp}}_T(\varPi _M^T u') \Vert _2\) where

$$\begin{aligned} u' - u'' = \frac{1}{12}\left( (\nabla _v R)(u,v)(10 u - v) - 4 (\nabla _{u} R)(u,v)u \right) + O(\Vert v+u\Vert ^5). \end{aligned}$$
(10)

The transvectivity error thus provides a practical way to measure the gradient of the curvature of the space even in the absence of any closed-form expression. This is noticeable in regards to the complexity of the curvature tensor itself in Mario’s formula [8] and it may lead in the future to new ways of estimating the curvature and its gradient.

3 Experiments and Application to Cardiac Shapes

In this section we assess the consistency of our numerical implementation of the symmetry compared to its theoretical properties. We compare the symmetry with residuals to standard symmetry without residuals (\(\delta =0\)) for different values of the parameter \(\alpha \). We used Deformetrica [1] for all our experiments. We also compare the parallel transport obtained by Pole Ladder with both types of symmetry to the one implemented in Deformetrica [6] using the fanning scheme.

We use a database of cardiac shapes from 138 subjects [3] for which the shapes at two time-points are available: at end-diastole (S) and at end-systole (\(S'\)). We use a population atlas as template T. Four types of errors are first measured. The first is the distance between midpoints when shooting from T or from S. The three others are, where M is the midpoint obtained by shooting from T:

  • \(\Vert s_M(T) - S \Vert _2\): the centrality error (Fig. 4a)

  • \(\Vert s_M \circ s_M(S') - S' \Vert _2\): the involutivity error (Fig. 4b)

  • \(\Vert s_T \circ s_M(S') - s_M \circ s_S(S') \Vert _2\): the transvectivity error (Fig. 4c).

Mean results for extreme values of \(\alpha \) are given in Table 1. The average registration error of T on each subject’s S, as well as the norm of the deformation and inverse consistency (by registering S on T) for each regularisation parameter are also given for reference.

Table 1. Mean errors measured on cardiac shapes, in millimeters
Fig. 4.
figure 4

The three types of errors measured

Fig. 5.
figure 5

The four errors for different values of \(\alpha ^2\), with and without residuals in the computation of the midpoint and the symmetries.

These results illustrate the two contributions of this paper. Firstly, using residuals in the symmetry considerably improves the numerical accuracy of the computation of a midpoint. As we can see on Fig. 5a, the distance between midpoints computed by shooting from T or from S is reduced when using the residuals. This error compares well with the inverse consistency error in general and is even significantly lower for \(\alpha \ge 1\).

Moreover, this increase in numerical accuracy is also visible on the centrality and involutivity errors. Indeed, for centrality (Fig. 5b), the error when using residuals is consistently smaller than without residuals, and this gain becomes larger as \(\alpha \) grows. It is also remarkable that this error is significantly lower than the registration error for \(\alpha \ge 1\). The same behavior is observed for the involutivity on Fig. 5c. This means that for reasonable values of \(\alpha \) we obtain reliable implementations of the midpoint and symmetry.

Secondly, the LDDMM space endowed with the right invariant metric is not symmetric, and the transvectivity error reflects the covariant derivative of the curvature. Figure 5d gives further details: pushing registration with a small \(\alpha \) generates larger errors, and there is an optimal \(\alpha \in [1;2]\) for which the space is closest to being symmetric. Using residuals, this error decreases as the space flattens to a Euclidean space and deformations tend to the identity.

Finally, in order to evaluate the result of symmetric Pole Ladder and compare it to the fanning scheme, we compute the local area strain (LAS) between end-diastole and end-systole at every landmark of the mesh. For two corresponding points \(m_i,m_i'\) that belong to \(k_i\) triangular cells, we compute the mean of the difference of area of each of these cells between S and \(S'\).

$$\begin{aligned} LAS_i^{S-S'}= \frac{1}{k_i} \sum _{j=1}^{k_i} \frac{(a_j - a_j')}{a_j} \end{aligned}$$
(11)

This feature is commonly used by clinicians to characterise the cardiac motion [4]. Here we use it to test the isometric property of the parallel transport scheme: we measure the area strain between the original subject’s meshes S and \(S'\), and compare it to the one measured between the atlas T and the deformed atlas \(T'\) obtained by the parallel transport algorithm. We report in Fig. 6a the square root of the sum of squared differences over all landmarks:

$$\begin{aligned} ASE^2 = \sum _{i} (LAS_i^{S-S'} - LAS_i^{T-T'})^2 \end{aligned}$$
(12)
Fig. 6.
figure 6

(b) Area strain maps for one patient computed between S and S’ and represented on S (top left), and computed between T and T’ and represented on T, where T’ is obtained with the different methods and \(\alpha =1\).

As \(\alpha \) grows, both the temporal and subject-to-atlas deformations decrease, thus generating less area changes, which explains the growing errors. Furthermore, we can see on Fig. 6b that the area strain is dominated by a bending artifact of the valve sections at the borders. Using residuals reduces this effect in this example. Although more suitable methods may exist to directly map scalar functions from one shape to another, these results emphasize the contribution of this paper: using residuals improves the symmetry and parallel transport with Pole Ladder.

4 Conclusion

We introduced residuals from registration errors to compute midpoints and symmetries between shapes. This results in improved numerical consistency for the centrality and involutivity properties. Furthermore the transvectivity error reflects the curvature of the underlying Riemannian space of deformations, and allows to estimate how far from symmetric this space is, depending on the regularisation parameter. Performing the same experiments in the framework of SVF with a similar implementation would yield very interesting comparison as the space is naturally symmetric, and accounting for residuals would thus provide a more consistent method for parallel transport. This gain could be even more interesting when considering registration between images. Indeed images are not in the template’s orbit in practice and residuals would encode intensity bias, which is a key source of error in image registration.