1 Introduction

Simulations stand as a powerful support to understand complex physiological phenomena, and as a potential way to enrich real databases with large amounts of realistic data [1, 2]. However, strong differences may exist between the simulations obtained from distinct models, and between simulated and real data. For cardiac biomechanical models, personalization often focuses on matching global parameters such as volumes or pressures [3], although some works went further by matching finer traits such as myocardial deformation patterns [4, 5].

Nonetheless, matching the data from different but comparable sources can be performed a-posteriori, and falls under the umbrella of domain adaptation techniques [6, 7]. Simple examples in medical imaging include the global adaptation of data distributions, illustrated on real cardiac meshes from different magnetic resonance protocols [8] or real vs. simulated fluoroscopy images to train a transducer localization algorithm [1]. The merging of databases can be made more sample-specific after reducing the dimensionality of the data. Recent works investigated the estimation of joint spaces that merge heterogeneous data features [9, 10]. We are more interested in adapting the data from one database to another one, as pursued globally in [1, 8]. A mathematically sound framework for this was developed in [11] with diffusion maps, a spectral embedding technique relevant for clustered data samples.

Table 1. Summary of populations and models used, with 11 cases shared between the two populations and simulated with both models.

Here, we demonstrate the interest of this strategy to quantify differences between cardiac simulations from two different biomechanical models, assessed through myocardial deformation patterns. Such simulations build upon different input segmentations, modeling strategies, and personalization. We specifically explore linear and non-linear ways to formulate correspondences between the two populations and therefore to perform domain adaptation, and benchmark them against simple data mapping using mesh registration.

2 Methods

2.1 Data and Pre-processing

We processed healthy left ventricular deformation patterns generated using two distinct cardiac biomechanical models personalized to two different populations, as summarized in Table 1. Population #2 consists of all the cases from the cMAC-STACOM 2011 challenge [12]. Population #1 consists of 11 of these cases, as the 4 remaining cMAC-STACOM cases could not be loaded to the segmentation software CVI42, and 9 other cases from our clinical collaborators at CHU Saint Etienne, France. The 11 cases simulated with both models were used to define the mapping between the two databases as described in Sect. 2.2. We therefore focused the analysis on myocardial deformation parameters that are common to the output of the two models (radial, circumferential and longitudinal strain).

These data are originally defined on subject-specific meshes, and were matched to a given template for each database before the population analysis (here, an arbitrarily chosen mesh in each population). For population #1, correspondences between meshes were obtained by mesh parameterization along the radial, circumferential, and longitudinal directions taking advantage of the hexahedral elements. For population #2, such parameterization is more challenging due to the tetrahedral elements and the biventricular segmentations. Correspondences between meshes were therefore obtained by registering left ventricular surface meshes using the currents representation and the large deformation diffeomorphic metric mapping framework, with the open-source software Deformetrica [14], and finally propagating the transformation to the left ventricular volumetric meshes. For both populations, data from each individual were finally interpolated at the cells and vertices of the template mesh using kernel ridge regression and the estimated mesh correspondences as in [13].

A similar pipeline was used to match the template meshes from the two populations, and therefore transport the data from one population to the other without domain adaptation, for benchmarking purposes.

Fig. 1.
figure 1

Left: pairwise comparison of cases in each population, using the Euclidean distance between deformation patterns—treated as column vectors, after concatenating the radial, circumferential and longitudinal components. Colored dots point out the 11 cases shared between the two databases. Right: eigenvalues normalized by the first eigenvalue. (Color figure online)

2.2 Dimensionality Reduction and Domain Adaptation

Let’s denote \(\mathbf {y}_{m,k} \in \mathcal {Y}_m\) the high-dimensional input data associated to subject \(k \in [1,K_m]\) for the dataset \(m \in \{1,2\}\) (in our case, the deformation patterns defined over the left ventricular meshes for each individual—treated as column vectors, after concatenating the radial, circumferential and longitudinal components). For each dataset, standard dimensionality reduction is applied to the \(K_m\) samples, and provides the low-dimensional coordinates \(\mathbf {x}_{m,k} \in \mathcal {X}_m\). Here, we tested our methods with the Isomap algorithm [15], a non-linear dimensionality reduction algorithm that builds a neighborhood graph between the high-dimensional input samples, and looks for a low-dimensional space in which the Euclidean distance approximates the geodesic distance in the high-dimensional space.

The small population size may limit the generalization ability of the representation, but may be less critical compared to recently popular techniques such as auto-encoders, which may require larger populations. Also, domain adaptation via dimensionality reduction was thoroughly investigated for the diffusion maps algorithm [11], but this algorithm is more relevant for clustered data samples. In any case, our approach is generic and can be applied to any manifold learning algorithm.

Domain adaptation from \(\mathcal {X}_1\) to \(\mathcal {X}_2\) (similar formulations for \(\mathcal {X}_2\) to \(\mathcal {X}_1\)) is performed as a linear change of basis derived from the eigenvectors learnt for each database, or as a non-linear regression based on the low-dimensional coordinates from each database. For both options, the mapping was defined using the 11 cases that are present in both populations, denoted as the “shared cases” in Figs. 1 and 2. Without lack of generalizability, we can assume that these cases correspond to the first cases in each database, and denote \(\varOmega \) the subset of indices associated to these cases.

The linear mapping from \(\mathcal {X}_1\) to \(\mathcal {X}_2\) can be formulated through the following change of basis [11]:

$$\begin{aligned} \forall ~\mathbf {x}_1 \in \mathcal {X}_1, \quad f_{1 \rightarrow 2} \left( \mathbf {x}_1 \right) = (\mathbf {\Psi }_1)^t \mathbf {\Psi }_2 \mathbf {x}_1 \quad \in \mathcal {X}_2, \end{aligned}$$
(1)

where \(\mathbf {\Psi }_1\) and \(\mathbf {\Psi }_2\) are the matrices of eigenvectors of \(\mathcal {X}_1\) and \(\mathcal {X}_2\) associated to the shared cases of indices \(\varOmega \), and \(.^t\) is the transposition operator.  

The non-linear mapping between these two spaces can be estimated through non-linear regression. We chose kernel ridge regression, which is formulated as:

$$\begin{aligned} \forall ~\mathbf {x}_1 \in \mathcal {X}_1, \quad g_{1 \rightarrow 2} \left( \mathbf {x}_1 \right) = \sum _{i \in \varOmega } k( \mathbf {x}_1 , \mathbf {x}_{1,i} ) \mathbf {c}_i \quad \in \mathcal {X}_2, \end{aligned}$$
(2)

and in practice computed using the matrix formulation \(\mathbf {C} = \left( \mathbf {K} + \mathbf {I}/\gamma \right) ^{-1} \mathbf {X}_2\), where \(\mathbf {C} = (\mathbf {c}_i)_{i \in \varOmega }\), \(\mathbf {K} = ( k( \mathbf {x}_{1,i} , \mathbf {x}_{1,j} ) )_{(i,j) \in \varOmega ^2} = \exp ( - \Vert \mathbf {x}_{1,i} - \mathbf {x}_{1,j} \Vert ^2 / \sigma ^2 )\) and \(\sigma \) controls the scale of the regression, \(\mathbf {X}_2 = (\mathbf {x}_{2,i})_{i \in \varOmega }\), \(\mathbf {I}\) is the identity matrix, and \(\gamma \) balances the contributions of the data fidelity and regularization terms.

We used a multi-scale version of this algorithmFootnote 1 [16], which is suitable for non-uniformly distributed samples, as in our case and in most real-life applications. This multi-scale algorithm starts with a scale \(\sigma \) equal to the largest distance between samples, and reduces it by a factor 2 at each iteration s to estimate the residual of the regression, until \(\sigma /2^s\) is smaller than a fraction \(\alpha \) of the average distance between each sample and its nearest neighbors (the samples density).

3 Experiments and Results

3.1 Original Pattern Comparisons

The distances between the deformation patterns in each population are displayed in the left part of Fig. 1. They already point out differences between the two populations, which are lower for population #2. Such changes come from differences in the mesh segmentation process used before the simulations (3D interpolation and remeshing from segmented 2D slices for population #1, against statistical atlas for population #2), different biomechanical models and therefore different modeling errors, and different personalization strategies—and potential errors in the personalization.

3.2 Parameters Setting

We used 5 neighbors to construct the neighborhood graph of the Isomap dimensionality reduction algorithm. This value was chosen to minimize the relative importance of the second eigenvalue and the sum of eigenvalues (the compactness of the representation, illustrated in the right part of Fig. 1), within a reasonable amount of possible neighbors (up to half the database).

Fig. 2.
figure 2

Low-dimensional coordinates before and after domain adaptation. Dots correspond to the samples that are mapped to the target population, whose samples are represented by squares. Colored dots and squares stand for the 11 cases shared between the two populations: after mapping the colored dots lay near the center of the colored squares. The black dots and squares correspond to the remaining cases that also served to obtain the low-dimensional embedding, but were not used in the domain adaptation process. The red-green-blue curves represent the main directions of variation in the original population (from −2 SD to +2 SD), before and after mapping. Top: population #2 mapped to population #1, using the linear (middle) or non-linear right mapping. Bottom: similar plots for population #1 mapped to population #2. (Color figure online)

Fig. 3.
figure 3

Radial strain patterns before and after domain adaptation (first mode of variation from −2 SD to +2 SD), or the registration process described in Sect. 2.1.

The maximum amount of dimensions for the low-dimensional space was set to 14 (one unit less than the size of the smallest studied population—population #2). Although the intrinsic dimensionality of the data may be lower (right part of Fig. 1), some artifacts were observed in our implementation of the linear mapping when computing the scalar product \((\mathbf {\Psi }_1)^t \mathbf {\Psi }_2\) with cut dimensions, which deserves further investigation.

Leave-one-out cross correlation was used to determine the regression parameters (the weight \(\gamma \) and the bandwidth at which to stop the multi-scale scheme, determined as a fraction \(\alpha \) of the samples density). The retained values were \(\gamma = 10\) and \(\alpha = 0.5\), respectively.

3.3 Transported Coordinates

As observed in Fig. 2, the cases that are shared between the two populations and that are used to compute the mapping were exactly matched (linear mapping) or almost exactly matched (non-linear mapping) in the target population space. The main modes of variation substantially changed after the mapping (a given mode in the original population is expressed as a combination of several modes of the target population after mapping).

Fig. 4.
figure 4

Differences between the original deformation patterns and the deformation patterns reconstructed from the low-dimensional coordinates after domain adaptation, across all the modes of variation (Euclidean distance on the radial-circumferential-longitudinal deformation patterns treated as column vectors—as in Fig. 1).

We also observed the limits of the linear mapping, through substantial shrinkage of the variations around zero, due to the non-guaranteed correlation between the coordinates of the shared cases in the two populations, and therefore the potentially opposing contribution of some cases to the mapping. This shrinkage effect is still present for the non-linear mapping, but much less pronounced. Besides, the curvature of the modes of variation after the non-linear mapping supports the relevance of this mapping strategy against the linear one.

3.4 Domain-Adapted Patterns

Figure 3 illustrates the deformation patterns encoded in the first mode of variations, before and after the domain adaptation. Patterns were reconstructed from low-dimensional coordinates evolving from −2 SD to +2 SD along the first dimension. Reconstruction was achieved through the multi-scale regression presented above, with \(\gamma = 1\) and \(\alpha = 0.5\).

As visible in Fig. 3, the deformation patterns from populations #1 and #2 before domain adaptation substantially differ in terms of amplitude of the deformation, and spatial distribution of the patterns. This is even more visible when the patterns in the original population are transported to the target population by registration (as described in Sect. 2.1). After domain adaptation, the mapped patterns are much closer from the target population both regarding amplitude and spatial distribution of the patterns. Subtle differences remain, which should correspond to the intrinsic differences between these two populations.

Figure 4 complements this information by quantifying differences between the original deformation patterns and the deformation patterns after domain adaptation, across all the modes of variation. With the linear mapping, patterns near the average are unchanged, due to the definition of this mapping (linear change of basis, centered around the average). Differences are not guaranteed to be concentrated on the first dimensions, as visible for dimensions 10 and 13. With the non-linear mapping, patterns near the average are also transported and therefore exhibit differences. Also, the modes of variation become curved (Fig. 2) after domain adaptation, and differences between patterns are therefore not guaranteed to be symmetric on each side of the average.

4 Conclusion

We have demonstrated the potential of domain adaptation to compare databases from distinct sources, upon the constraint that some cases are shared between the databases to define the mappings. In our tested populations, non-linear mapping stands as a relevant solution, given that the distribution of the shared cases is not guaranteed to be the same across the two populations.

In terms of application, the approach is promising to better understand differences between populations from different sources (here, from different biomechanical models), and may pave the ground for enriching a real database with simulated data or the comparison of different clinical cohorts. These perspectives will require further exploring up to which limit two different databases are actually compatible, and scaling the proposed methods to larger populations—including a substantial amount of cases shared between the two populations.