Abstract
We tackle the determination of a relevant data space to quantify differences between two databases coming from different sources. In the present paper, we propose to quantify differences between cardiac simulations from two different biomechanical models, assessed through myocardial deformation patterns. At stake is the evaluation of a given model with respect to another one, and the potential correction of bias necessary to merge two databases. We address this from a domain adaptation perspective. We first represent the data using non-linear dimensionality reduction on each database. Then, we formulate the mapping between databases using cases that are shared between the two databases: either as a linear change of basis derived from the learnt eigenvectors, or as a non-linear regression based on the low-dimensional coordinates from each database. We demonstrate these concepts by examining the principal variations in deformation patterns obtained from two cardiac biomechanical models personalized to 20 and 15 real healthy cases, respectively, from which 11 cases were simulated with both models.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
1 Introduction
Simulations stand as a powerful support to understand complex physiological phenomena, and as a potential way to enrich real databases with large amounts of realistic data [1, 2]. However, strong differences may exist between the simulations obtained from distinct models, and between simulated and real data. For cardiac biomechanical models, personalization often focuses on matching global parameters such as volumes or pressures [3], although some works went further by matching finer traits such as myocardial deformation patterns [4, 5].
Nonetheless, matching the data from different but comparable sources can be performed a-posteriori, and falls under the umbrella of domain adaptation techniques [6, 7]. Simple examples in medical imaging include the global adaptation of data distributions, illustrated on real cardiac meshes from different magnetic resonance protocols [8] or real vs. simulated fluoroscopy images to train a transducer localization algorithm [1]. The merging of databases can be made more sample-specific after reducing the dimensionality of the data. Recent works investigated the estimation of joint spaces that merge heterogeneous data features [9, 10]. We are more interested in adapting the data from one database to another one, as pursued globally in [1, 8]. A mathematically sound framework for this was developed in [11] with diffusion maps, a spectral embedding technique relevant for clustered data samples.
Here, we demonstrate the interest of this strategy to quantify differences between cardiac simulations from two different biomechanical models, assessed through myocardial deformation patterns. Such simulations build upon different input segmentations, modeling strategies, and personalization. We specifically explore linear and non-linear ways to formulate correspondences between the two populations and therefore to perform domain adaptation, and benchmark them against simple data mapping using mesh registration.
2 Methods
2.1 Data and Pre-processing
We processed healthy left ventricular deformation patterns generated using two distinct cardiac biomechanical models personalized to two different populations, as summarized in Table 1. Population #2 consists of all the cases from the cMAC-STACOM 2011 challenge [12]. Population #1 consists of 11 of these cases, as the 4 remaining cMAC-STACOM cases could not be loaded to the segmentation software CVI42, and 9 other cases from our clinical collaborators at CHU Saint Etienne, France. The 11 cases simulated with both models were used to define the mapping between the two databases as described in Sect. 2.2. We therefore focused the analysis on myocardial deformation parameters that are common to the output of the two models (radial, circumferential and longitudinal strain).
These data are originally defined on subject-specific meshes, and were matched to a given template for each database before the population analysis (here, an arbitrarily chosen mesh in each population). For population #1, correspondences between meshes were obtained by mesh parameterization along the radial, circumferential, and longitudinal directions taking advantage of the hexahedral elements. For population #2, such parameterization is more challenging due to the tetrahedral elements and the biventricular segmentations. Correspondences between meshes were therefore obtained by registering left ventricular surface meshes using the currents representation and the large deformation diffeomorphic metric mapping framework, with the open-source software Deformetrica [14], and finally propagating the transformation to the left ventricular volumetric meshes. For both populations, data from each individual were finally interpolated at the cells and vertices of the template mesh using kernel ridge regression and the estimated mesh correspondences as in [13].
A similar pipeline was used to match the template meshes from the two populations, and therefore transport the data from one population to the other without domain adaptation, for benchmarking purposes.
2.2 Dimensionality Reduction and Domain Adaptation
Let’s denote \(\mathbf {y}_{m,k} \in \mathcal {Y}_m\) the high-dimensional input data associated to subject \(k \in [1,K_m]\) for the dataset \(m \in \{1,2\}\) (in our case, the deformation patterns defined over the left ventricular meshes for each individual—treated as column vectors, after concatenating the radial, circumferential and longitudinal components). For each dataset, standard dimensionality reduction is applied to the \(K_m\) samples, and provides the low-dimensional coordinates \(\mathbf {x}_{m,k} \in \mathcal {X}_m\). Here, we tested our methods with the Isomap algorithm [15], a non-linear dimensionality reduction algorithm that builds a neighborhood graph between the high-dimensional input samples, and looks for a low-dimensional space in which the Euclidean distance approximates the geodesic distance in the high-dimensional space.
The small population size may limit the generalization ability of the representation, but may be less critical compared to recently popular techniques such as auto-encoders, which may require larger populations. Also, domain adaptation via dimensionality reduction was thoroughly investigated for the diffusion maps algorithm [11], but this algorithm is more relevant for clustered data samples. In any case, our approach is generic and can be applied to any manifold learning algorithm.
Domain adaptation from \(\mathcal {X}_1\) to \(\mathcal {X}_2\) (similar formulations for \(\mathcal {X}_2\) to \(\mathcal {X}_1\)) is performed as a linear change of basis derived from the eigenvectors learnt for each database, or as a non-linear regression based on the low-dimensional coordinates from each database. For both options, the mapping was defined using the 11 cases that are present in both populations, denoted as the “shared cases” in Figs. 1 and 2. Without lack of generalizability, we can assume that these cases correspond to the first cases in each database, and denote \(\varOmega \) the subset of indices associated to these cases.
The linear mapping from \(\mathcal {X}_1\) to \(\mathcal {X}_2\) can be formulated through the following change of basis [11]:
where \(\mathbf {\Psi }_1\) and \(\mathbf {\Psi }_2\) are the matrices of eigenvectors of \(\mathcal {X}_1\) and \(\mathcal {X}_2\) associated to the shared cases of indices \(\varOmega \), and \(.^t\) is the transposition operator.
The non-linear mapping between these two spaces can be estimated through non-linear regression. We chose kernel ridge regression, which is formulated as:
and in practice computed using the matrix formulation \(\mathbf {C} = \left( \mathbf {K} + \mathbf {I}/\gamma \right) ^{-1} \mathbf {X}_2\), where \(\mathbf {C} = (\mathbf {c}_i)_{i \in \varOmega }\), \(\mathbf {K} = ( k( \mathbf {x}_{1,i} , \mathbf {x}_{1,j} ) )_{(i,j) \in \varOmega ^2} = \exp ( - \Vert \mathbf {x}_{1,i} - \mathbf {x}_{1,j} \Vert ^2 / \sigma ^2 )\) and \(\sigma \) controls the scale of the regression, \(\mathbf {X}_2 = (\mathbf {x}_{2,i})_{i \in \varOmega }\), \(\mathbf {I}\) is the identity matrix, and \(\gamma \) balances the contributions of the data fidelity and regularization terms.
We used a multi-scale version of this algorithmFootnote 1 [16], which is suitable for non-uniformly distributed samples, as in our case and in most real-life applications. This multi-scale algorithm starts with a scale \(\sigma \) equal to the largest distance between samples, and reduces it by a factor 2 at each iteration s to estimate the residual of the regression, until \(\sigma /2^s\) is smaller than a fraction \(\alpha \) of the average distance between each sample and its nearest neighbors (the samples density).
3 Experiments and Results
3.1 Original Pattern Comparisons
The distances between the deformation patterns in each population are displayed in the left part of Fig. 1. They already point out differences between the two populations, which are lower for population #2. Such changes come from differences in the mesh segmentation process used before the simulations (3D interpolation and remeshing from segmented 2D slices for population #1, against statistical atlas for population #2), different biomechanical models and therefore different modeling errors, and different personalization strategies—and potential errors in the personalization.
3.2 Parameters Setting
We used 5 neighbors to construct the neighborhood graph of the Isomap dimensionality reduction algorithm. This value was chosen to minimize the relative importance of the second eigenvalue and the sum of eigenvalues (the compactness of the representation, illustrated in the right part of Fig. 1), within a reasonable amount of possible neighbors (up to half the database).
The maximum amount of dimensions for the low-dimensional space was set to 14 (one unit less than the size of the smallest studied population—population #2). Although the intrinsic dimensionality of the data may be lower (right part of Fig. 1), some artifacts were observed in our implementation of the linear mapping when computing the scalar product \((\mathbf {\Psi }_1)^t \mathbf {\Psi }_2\) with cut dimensions, which deserves further investigation.
Leave-one-out cross correlation was used to determine the regression parameters (the weight \(\gamma \) and the bandwidth at which to stop the multi-scale scheme, determined as a fraction \(\alpha \) of the samples density). The retained values were \(\gamma = 10\) and \(\alpha = 0.5\), respectively.
3.3 Transported Coordinates
As observed in Fig. 2, the cases that are shared between the two populations and that are used to compute the mapping were exactly matched (linear mapping) or almost exactly matched (non-linear mapping) in the target population space. The main modes of variation substantially changed after the mapping (a given mode in the original population is expressed as a combination of several modes of the target population after mapping).
We also observed the limits of the linear mapping, through substantial shrinkage of the variations around zero, due to the non-guaranteed correlation between the coordinates of the shared cases in the two populations, and therefore the potentially opposing contribution of some cases to the mapping. This shrinkage effect is still present for the non-linear mapping, but much less pronounced. Besides, the curvature of the modes of variation after the non-linear mapping supports the relevance of this mapping strategy against the linear one.
3.4 Domain-Adapted Patterns
Figure 3 illustrates the deformation patterns encoded in the first mode of variations, before and after the domain adaptation. Patterns were reconstructed from low-dimensional coordinates evolving from −2 SD to +2 SD along the first dimension. Reconstruction was achieved through the multi-scale regression presented above, with \(\gamma = 1\) and \(\alpha = 0.5\).
As visible in Fig. 3, the deformation patterns from populations #1 and #2 before domain adaptation substantially differ in terms of amplitude of the deformation, and spatial distribution of the patterns. This is even more visible when the patterns in the original population are transported to the target population by registration (as described in Sect. 2.1). After domain adaptation, the mapped patterns are much closer from the target population both regarding amplitude and spatial distribution of the patterns. Subtle differences remain, which should correspond to the intrinsic differences between these two populations.
Figure 4 complements this information by quantifying differences between the original deformation patterns and the deformation patterns after domain adaptation, across all the modes of variation. With the linear mapping, patterns near the average are unchanged, due to the definition of this mapping (linear change of basis, centered around the average). Differences are not guaranteed to be concentrated on the first dimensions, as visible for dimensions 10 and 13. With the non-linear mapping, patterns near the average are also transported and therefore exhibit differences. Also, the modes of variation become curved (Fig. 2) after domain adaptation, and differences between patterns are therefore not guaranteed to be symmetric on each side of the average.
4 Conclusion
We have demonstrated the potential of domain adaptation to compare databases from distinct sources, upon the constraint that some cases are shared between the databases to define the mappings. In our tested populations, non-linear mapping stands as a relevant solution, given that the distribution of the shared cases is not guaranteed to be the same across the two populations.
In terms of application, the approach is promising to better understand differences between populations from different sources (here, from different biomechanical models), and may pave the ground for enriching a real database with simulated data or the comparison of different clinical cohorts. These perspectives will require further exploring up to which limit two different databases are actually compatible, and scaling the proposed methods to larger populations—including a substantial amount of cases shared between the two populations.
Notes
- 1.
Source code and demo available at https://nicolasduchateau.wordpress.com/downloads/.
References
Heimann, T., Mountney, P., John, M., et al.: Real-time ultrasound transducer localization in fluoroscopy images by transfer learning from synthetic training data. Med. Image Anal. 18, 1320–1328 (2014)
Duchateau, N., Sermesant, M., Delingette, H., et al.: Model-based generation of large databases of cardiac images: synthesis of pathological cine MR sequences from real healthy cases. IEEE Trans. Med. Imaging 37, 755–766 (2018)
Molléro, R., Pennec, X., Delingette, H., et al.: Multifidelity-CMA: a multifidelity approach for efficient personalisation of 3D cardiac electromechanical models. Biomech. Model. Mechanobiol. 17, 285–300 (2018)
Wang, V.Y., Lam, H.I., Ennis, D.B., et al.: Modelling passive diastolic mechanics with quantitative MRI of cardiac structure and function. Med. Image Anal. 13, 773–84 (2009)
Chabiniok, R., Moireau, P., Lesault, P.F., et al.: Estimation of tissue contractility from cardiac cine-MRI using a biomechanical heart model. Biomech. Model. Mechanobiol. 11, 609–30 (2012)
Csurka, G. (ed.): Domain Adaptation in Computer Vision Applications. ACVPR. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58347-1
Wang, M., Deng, W.: Deep visual domain adaptation: a survey. Neurocomputing 312, 135–53 (2018)
Medrano-Gracia, P., Cowan, B.R., Bluemke, D.A., et al.: Atlas-based analysis of cardiac shape and function: correction of regional shape bias due to imaging protocol for population studies. J. Cardiovasc. Magn. Reson. 15, 80 (2013)
Sanchez-Martinez, S., Duchateau, N., Erdei, T., et al.: Characterization of myocardial motion patterns by unsupervised multiple kernel learning. Med. Image Anal. 35, 70–82 (2017)
Puyol-Antón, E., Sinclair, M., Gerber, B., et al.: A multimodal spatiotemporal cardiac motion atlas from MR and ultrasound data. Med. Image Anal. 40, 96–110 (2017)
Coifman, R.R., Hirn, M.J.: Diffusion maps for changing data. Appl. Comp. Harm. Anal. 36, 79–107 (2014)
Tobon-Gomez, C., De Craene, M., McLeod, K., et al.: Benchmarking framework for myocardial tracking and deformation algorithms: an open access database. Med. Image Anal. 17, 632–648 (2013)
Rumindo, G.K., Duchateau, N., Croisille, P., Ohayon, J., Clarysse, P.: Strain-based parameters for infarct localization: evaluation via a learning algorithm on a synthetic database of pathological hearts. In: Pop, M., Wright, G.A. (eds.) FIMH 2017. LNCS, vol. 10263, pp. 106–114. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59448-4_11
Bône, A., Louis, M., Martin, B., Durrleman, S.: Deformetrica 4: an open-source software for statistical shape analysis. In: Reuter, M., Wachinger, C., Lombaert, H., Paniagua, B., Lüthi, M., Egger, B. (eds.) ShapeMI 2018. LNCS, vol. 11167, pp. 3–13. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04747-4_1
Tenenbaum, J., De Silva, V., Langford, J.: A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–23 (2000)
Bermanis, A., Averbuch, A., Coifman, R.R.: Multiscale data sampling and function extension. Appl. Comp. Harm. Anal. 34, 15–29 (2013)
Acknowledgements
GK Rumindo was supported by the European Commission H2020 Marie Sklodowska-Curie Training Network (VPH-CaSE-642612). The authors also acknowledge the partial support from the French ANR (LABEX PRIMES of Université de Lyon [ANR-11-LABX-0063], within the program “Investissements d’Avenir” [ANR-11-IDEX-0007]). Finally, the authors thank M Sermesant (INRIA Sophia-Antipolis, France) and J Ohayon (TIMC-IMAG Grenoble, France), who contributed to the development of the cardiac simulations, as well as P Croisille and M Viallon (CREATIS, CHU Saint Etienne, France), who provided the 9 cases used in population #1.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Duchateau, N., Rumindo, K., Clarysse, P. (2019). Domain Adaptation via Dimensionality Reduction for the Comparison of Cardiac Simulation Models. In: Coudière, Y., Ozenne, V., Vigmond, E., Zemzemi, N. (eds) Functional Imaging and Modeling of the Heart. FIMH 2019. Lecture Notes in Computer Science(), vol 11504. Springer, Cham. https://doi.org/10.1007/978-3-030-21949-9_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-21949-9_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21948-2
Online ISBN: 978-3-030-21949-9
eBook Packages: Computer ScienceComputer Science (R0)