Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction

Luo, Yuan; Ahmad, Faraz S.; Shah, Sanjiv J.

doi:10.1007/s12265-016-9727-8

Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction

Review
Published: 23 January 2017

Volume 10, pages 305–312, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Cardiovascular Translational Research Aims and scope Submit manuscript

Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction

Download PDF

Yuan Luo¹,
Faraz S. Ahmad^1,2 &
Sanjiv J. Shah²

1681 Accesses
32 Citations
3 Altmetric
Explore all metrics

Abstract

Heart failure with preserved ejection fraction (HFpEF) is a heterogeneous clinical syndrome that may benefit from improved subtyping in order to better characterize its pathophysiology and to develop novel targeted therapies. The United States Precision Medicine Initiative comes amid the rapid growth in quantity and modality of clinical data for HFpEF patients ranging from deep phenotypic to trans-omic data. Tensor factorization, a form of machine learning, allows for the integration of multiple data modalities to derive clinically relevant HFpEF subtypes that may have significant differences in underlying pathophysiology and differential response to therapies. Tensor factorization also allows for better interpretability by supporting dimensionality reduction and identifying latent groups of data for meaningful summarization of both features and disease outcomes. In this narrative review, we analyze the modest literature on the application of tensor factorization to related biomedical fields including genotyping and phenotyping. Based on the cited work including work of our own, we suggest multiple tensor factorization formulations capable of integrating the deep phenotypic and trans-omic modalities of data for HFpEF, or accounting for interactions between genetic variants at different omic hierarchies. We encourage extensive experimental studies to tackle challenges in applying tensor factorization for precision medicine in HFpEF, including effectively incorporating existing medical knowledge, properly accounting for uncertainty, and efficiently enforcing sparsity for better interpretability.

Extracting Phenotypes from Patient Claim Records Using Nonnegative Tensor Factorization

Genome-wide association and multi-trait analyses characterize the common genetic architecture of heart failure

Article Open access 14 November 2022

Deep Phenotyping in Cardiovascular Disease

Article 26 November 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Heart failure is a common and morbid condition affecting over 5.7 million Americans [1] and defined by fatigue, shortness of breath, and exercise intolerance. Heart failure is typically divided into two subtypes: heart failure with preserved ejection fraction (HFpEF) and heart failure with reduced ejection fraction (HFrEF). Patients in these groups tend to have different demographics, comorbidities, and responses to therapy. Several large, randomized controlled trials in patients with HFrEF have shown therapeutic benefit for a range of neurohormonal medications and intracardiac devices; however, large clinical trials have not demonstrated similar clinical benefit in patients with HFpEF [2, 3]. The heterogeneity in the pathogenesis and in the clinical phenotypes of HFpEF may have contributed to lack of large, positive clinical trials [4].

Recent studies have identified the centrality of chronic systemic inflammation in the pathogenesis of HFpEF [5]. Patients with HFpEF tend to be older females with several comorbidities, including obesity, hypertension, diabetes, coronary artery disease, chronic obstructive pulmonary disease, and chronic kidney disease. The combination of older age and these comorbidities may contribute to the systemic inflammation that in turn affects multiple signaling cascades and organ systems, including the heart, lungs, skeletal muscles, and kidneys [4]. The culmination of these pathways leads to different manifestations of the clinical syndrome of HFpEF, including unique combinations of comorbidities, changes in cardiac remodeling and mechanics, biomarker profiles, and clinical symptoms [3, 4, 6]. Understanding these combinations may be informative to the design of future trials testing targeted therapeutic approaches.

Unsupervised machine learning has been previously used to identify clusters, or “phenogroups”, of patients with HFpEF using demographic and physical characteristics and laboratory, electrocardiographic, and echocardiographic data [7]. Layering in genetic data may elucidate the mechanistic underpinnings of different HFpEF phenotype groups or even lead to additional refinement in the classification of patients with HFpEF. Prior studies have demonstrated genetic differences in cardiac geometry and mechanics [8–10], risk for new onset heart failure [11, 12], and mortality after heart failure diagnosis [13]. Additionally, linking epigenetic signatures to specific HFpEF phenotypic subgroups may provide additional mechanistic understanding of pathogenesis and identify future targets for therapy [14]. Identifying methodologies for a trans-omic approach, including with detailed phenotypic data, is therefore essential to better subtyping patients with HFpEF and identifying the mechanistic underpinnings of the syndrome.

Precision medicine aims to utilize information from multiple modalities—including phenotypic, genomic, and environmental measurements—to develop an individualized and comprehensive view of a patient’s pathophysiologic progression, to identify unique subtypes of the patient, and to administer personalized therapies [15]. Existing efforts are often based on only a selected set of biomarkers. The rapid growth of phenotypic, genetic, medication prescription, and environmental data for HFpEF patients poses technical challenges for subtyping them, due to the large volume of data, diversity of data types, and uncertainty from noise and missing data. However, the rapid growth of multiple data modalities, when linked to the right patients, may provide a prismatic view of the patients’ pathophysiologic evolution and offers a basis for meaningful subtyping of these patients. Figure 1 shows multiple data modalities for HFpEF patients, including deep phenotyping and trans-omic data. One of the example datasets with linked evaluation on multiple modality measurements is the Multi-Ethnic Study of Atherosclerosis (MESA) dataset [16], which is curated by a medical research study involving more than 6000 men and women from six communities in the USA. In particular, over 6000 patients in MESA were genotyped using Affymetrix 6.0, in addition to routinely collected laboratory tests and exam measurements. In addition, the advent of RNAseq and epigenetic data will likely offer trans-omic evidence to HFpEF patient subtyping and identify individualized therapy targets. We will use the scenario of MESA dataset containing both a high density of phenotypic variables and genome-wide genetic variants as an illustrative example in this review.

The Problem of Complex, Multimodal Data in Precision Medicine

The lack of positive, large-scale HFpEF clinical trials may be due to distinct systemic and myocardial signaling in HFpEF (compared to HFrEF) and the underlying heterogeneity of HFpEF. A precision medicine approach, leveraging multiple modalities and sources of information, including deep phenotyping and trans-omic data, may better define subtypes of HFpEF that are more homogeneous in their responses to specific targeted therapies. With the rapid development of next-generation sequencing and sophisticated phenotyping tools such as comprehensive cardiovascular imaging, the linked data for HFpEF patients from various modalities are becoming increasingly complex, defined as follows:

Data complexity: the data objects themselves are becoming more complex. They are becoming larger in scale and higher in dimension (e.g., millions of genetic loci identified by whole genome sequencing). The features (especially phenotypic features) are usually heterogeneous, sparse, and time-evolving.
Relation complexity: the relationships between multiple modalities of electronic health record (EHR) data are becoming more complex. Such relationships can link RNA expression to phenotypic abnormalities or link epigenetic signature changes (e.g., DNA methylation, histone modifications) to upregulation or downregulation of genes (e.g., α-MHC gene and SPR-Ca²⁺ ATPase gene). Relations also hold between features in the same measurement modality. For example, some phenotypic variables can be grouped into echocardiographic measurements (e.g., global longitudinal strain, left ventricular end-diastolic volume) or electrocardiogram (ECG) parameters (e.g., PR interval, QRS-T angle).

Recent advances in machine learning have opened avenues towards more effective mining and modeling from EHRs to facilitate translational research [17]. However, clinicians often regard existing machine learning models as hard-to-interpret black boxes. Traditional machine learning algorithms usually treat phenotypic variables as independent features instead of exploring clinically meaningful groups of phenotypic variables that together can characterize HFpEF subtypes (e.g., younger patients with moderate diastolic dysfunction and relatively normal BNP as a distinct HFpEF archetype [7]). It is also difficult for conventional machine learning algorithms to model patient physiologic temporal progressions for disease/syndrome subtyping. Patients are often monitored in physiological time series in which vital measurements and laboratory test values fluctuate as time progresses (e.g., there is significant intra-person variation in blood pressure measurements due to setting, method of measurement, time of day, and health status). The fact that these physiologic time series are sampled at irregular time intervals and may contain missing data further complicates complexity of feature modeling. Intuitively, the temporal trends, and in general relations as features, are more expressive and informative, but their extraction is often difficult and often involves manual work such as pre-specifying rules or patterns [18] and matching against clinical time series [19]. In contrast, independent measurements (e.g., individual blood pressure measurements) have been widely used because they are simple to extract and have robust statistical properties. However, these independent measurements are less informative and interpretable than relational features. In fact, modeling relational features are usually ignored by machine learning algorithms that mostly adopt a flat patient-by-feature matrix view (patients as rows and features as columns). Because of complexity, the data required to capture HFpEF characteristics, the traditional vector- or matrix-based representations (e.g., nonnegative matrix factorization [20], topic modeling [21]) are not flexible enough to capture all the degrees of freedom contained in the data. Although theoretically one can add interactions as additional features or embed graphical models to account for feature interactions, the problem quickly becomes intractable for large feature dimensionality (e.g., at the genome scale). Our previous research in cancer subtyping and intensive care unit (ICU) mortality prediction shows that using the relational features and independent raw features jointly can take advantage of both in order to improve the interpretability and accuracy of the machine learning model [22–24].

Tensor Factorization: a Potential Solution for Multimodal Data in HFpEF

Tensor modeling has emerged as a promising solution for the computational challenges of precision medicine. A tensor is a multidimensional array where each modality spans one axis (denoted as a mode in tensor terminology). In matrix representation, one may have to concatenate multiple data modalities into a single second dimension of the matrix, thus disallowing explicit representation of interactions among these modalities. Tensors, as natural generalizations of vectors and matrices, are becoming increasingly popular for representing multimodality data. Figure 2 shows the tensor for modeling interactions among patients, phenotypic measurements, and genetic variants. Various tensor factorization models with such parsimonious structures and accompanying computational tools have been integral in the analysis and process of big tensor data (see Kolda et al. [25] and Cichocki et al. [26] for further reading). These factorization models not only reduce dimensionality but also help discover latent groups in each data modality and identify group-wise interactions. In addition, specifically designed tensor factorizations can also integrate additional domain-specific prior knowledge to constrain the tensor structure [27, 28]. Following our illustrative MESA dataset, Fig. 2 shows a visualization of two types of factorization—Tucker [29] and CANDECOMP/PARAFAC (CP) [30]—in order to integrate the phenotypic and genetic measurements and model their relations for the subtyping of HFpEF. The Tucker factorization [29] (top panel in Fig. 2) decomposes the data tensor χ into three factor matrices specifying groups in each mode and a core tensor G specifying levels of interaction between the groups from different modes. In general, the number of groups in each mode is less than the dimensionality of that mode, and the core \( \mathrm{tensor}\kern0.3em \mathcal{G} \) can be regarded as a compression of χ. The CANDECOMP/PARAFAC (CP) factorization [30] (bottom panel in Fig. 2) \( \mathrm{decomposes}\kern0.3em \mathcal{X} \) as a weighted sum of rank-1 sub-tensors, each of which is the outer product (S, S _ijk = α _i β _j γ _k) of a patient factor vector (α), an intervention factor vector (β), and a biomarker factor vector (γ). The weights λ _r , r = 1 … R indicate relative importance of sub-tensors. When interpreting the Tucker factorization regarding HFpEF subtyping, the factor \( \mathrm{matrix}\kern0.3em \mathcal{A} \) corresponds to HFpEF subtypes, the factor \( \mathrm{matrix}\kern0.3em \mathcal{B} \) corresponds to groups of phenotypic variables that characterize HFpEF subtypes, and the factor \( \mathrm{matrix}\kern0.3em \mathcal{C} \) corresponds to groups of genetic variants that characterize HFpEF subtypes. With CP factorization, the factor vectors α _i ’s correspond to HFpEF subtypes, the factor vectors β _i ’s correspond to groups of phenotypic variables that characterize HFpEF subtypes, and the factor vectors γ _i’s correspond to groups of genetic variants that characterize HFpEF subtypes. Compared to Tucker, the structural hypothesis of CP requires the same number of groups for each mode. The simplified structures in CP allow easier linkage from phenotypic variable groups and genetic variant groups to HFpEF subtypes (simply linking those that are in the same sub-tensor). On the other hand, the structural flexibility by Tucker factorization may offer more accurate data fitting but typically requires more intensive computation [23]. In practice, care needs to be taken when trading off model flexibility with simplified interpretation and computation [31].

When modeling HFpEF patients subtyping using tensor factorization, certain types of features can in fact display a hierarchical structure. Although genetic variants, such as single nucleotide polymorphisms (SNPs) and copy number variations (CNVs), are the most primitive components in trans-omic features, other trans-omic data such as epigenetics and pathways can arguably provide invaluable information. It is widely acknowledged that viewing SNPs and CNVs as independent features and fitting them to linear models lose critical information such as the interaction between proteins encoded by the affected genes [32, 33]. Decades of trans-omic research have resulted in evidence of protein interaction, transcription factor regulation and signaling, and other molecular pathways. Much of the data are curated and archived as public databases such as STRING [34], KEGG [35], InterPro [36], Aceview [37], and Pfam [38]. These databases provide information sources for regulatory or interaction pathways involving genes affected by SNPs or CNVs. Thus, we can build a tensor that account for higher-order relations between SNPs and CNVs as follows. For a particular patient, we scan through genetic variants, such as SNPs or CNVs, and use interval tree search [39] to identify relevant genes whose chromosomal regions contain those of the genetic variants. Next, we query the pathway databases to identify pathways or gene sets that involve the identified genes. Then, the tensor entry, indexed by the patient, the pathway, and the genetic variant, is increased by the genotype of the variant (0, 1, or 2 corresponding to none, single-allelic, or bi-allelic variant). The SNPs and CNVs may be of high dimensionality; thus, one may need to aggregate the SNP and CNV counts according to the affected genes to avoid impractically large tensor. The tensor constructed this way falls into the category of subgraph augmented tensor, and in particular, pathways or gene sets can be precisely represented as graphs or subgraphs. Pathways as a mode of the tensor help to put the genetic variants in the context of functional relations between genes. Genetic variants help to link correlated pathways in order to render a comprehensive view of the HFpEF pathophysiology (Fig. 3). Our previous research showed that subgraph augmented tensor can be efficiently factorized and the groups of pathways, which functionally link the related genetic variants, can be linked to patient groupings [23].

The tensor formulations in Figs. 2 and 3 are alternative schemes that focus on exploring the interactions between different feature types and exploring hierarchical structures of features of the same type, respectively. Both Tucker and CP factorization seem to have broader adoptions in non-genomic biomedical fields, perhaps due to the relative ease of imposing probabilistic and other regularizations. Although CP produces summation of rank-1 sub-tensors (Fig. 2) and leads to simplified interpretation, Tucker provides a more flexible and sometimes more realistic factorization by allowing varying number of groups in different modalities. The choice between these two alternatives depends on data availability, outcome to track, and focus of hypothesis and opens questions in the clinical domain of HFpEF that deserves extensive experimental studies and characterizations. Although to our knowledge no prior research studies have applied tensor factorization to subtype HFpEF patients, a substantial body of research on applying tensor factorization to handle multiple modalities of biomedical data has emerged over the past decade. We refer the reader to general reviews [40, 41] for tensor modeling application in biomedical domains. Below, we provide a more detailed discussion on the applications of tensor modeling in cardiovascular medicine.

In cardiovascular disease, prior studies have investigated the interactions between heart failure-related diagnoses and administered medications to heart failure patient groupings. Ho et al. [42] studied the problem of heart failure onset prediction with clinically meaningful sub-tensors. They build a patient-diagnosis-procedure tensor and derive patient clusters on specific diagnoses and medications by applying CP while enforcing sparsity constraints. In a follow-up study, Ho et al. [43] investigated the Centers for Medicare and Medicaid Services (CMS) claims data to predict high-cost (above 75th percentile) beneficiaries by using phenotypes within chronic diseases including hypertension, arthritis, heart failure, and diabetes as features (generated by tensor factorization). They build a patient-diagnosis-procedure tensor and apply CP-APR factorization to decompose it as summations of rank-1 bias tensors and rank-R interaction tensors with sparsity constraints on the factor matrices of interaction tensors, in order to explicitly account for interactions among groups of the same modality. Wang et al. [44] studied the problem of predicting the onset risk of patients with heart failure. They applied tensor modeling to generalize sparse logistic regression to multiple modalities on EHR data, such as comorbidity diagnosis codes and medications, and called their model High-Order Sparse Logistic Regression (HOSLR). They reported that HOSLR not only achieved good prediction accuracy on newly diagnosed heart failure but also discovered interesting predictive patterns capturing the interaction between diagnosis and medications. Wang et al. [28] studied the problems of detecting sub-phenotypes of hypertension, type 1 and 2 diabetes, and heart failure based on EHR data. Their tensor formulation incorporated medical knowledge via customized regularization terms. Medical knowledge guidance is a subset of columns in the target factor matrix, and the resultant factor matrix is required to be close to the target on the pre-specified subset of columns. They also constrained that the columns of the factor matrix should be close to pairwise orthogonal to ensure distinct phenotypes.

Applying Tensor Factorization to HFpEF: Potential Challenges

The advent of precision medicine initiatives in HFpEF, coupled with the welcome growth of new modes of data in cardiovascular medicine, produces not only opportunities but also challenges when moving towards tensor modeling. Although tensor factorization naturally integrates multiple modalities or hierarchies of features, common factorization schemes such as Tucker and CP usually lack the machinery to incorporate existing medical knowledge as probabilistic priors or to evaluate extracted and grouped relations as clinical evidence from a Bayesian perspective (e.g., posterior probability and confidence interval). Confidence intervals and prior and posterior probabilities are the most basic primitives for statistical decision making, but few tensor-based approaches have adopted them in clinical decision support. Our preliminary data show that mining and grouping relation subgraphs lead to improved accuracy and better interpretability in diagnostic reasoning but call for a Bayesian formulation to incorporate existing medical knowledge, provide confidence estimation, and further improve prediction accuracy to practical level [22, 23].

To account for uncertainty, multiple authors proposed probabilistic Tucker and/or CP factorizations to incorporate priors on tensor structural parameters. Those priors can specify dependence between environmental exposures and SNP level differences [45], or probability of gene sequence conditioned on the composing nucleotides and chromosomal positions [46, 47]. In addition, probabilistic CP was shown to improve EEG classification accuracy when missing data is present [48]. The above Bayesian formulations allow incorporating existing medical knowledge as probability priors and reliably estimating the posterior probabilities and confidence intervals of any findings from the model. In Tucker factorization in Fig. 2, the vectors {β ₁ … β _M} in the factor \( \mathrm{matrix}\kern0.2em \mathcal{B} \) that correspond to phenotypic subtyping criteria and outcome risk predictors can be used to integrate existing medical knowledge. We can select a subset \( \left\{{\beta}_1\dots {\beta}_{M^{\prime }}\right\} \) where M ^′ < M. Upon initialization, the existing knowledge that comes from diagnosis guidelines or other clinical guidelines is encoded in a guidance vector \( {\beta}_m\in \left\{{\beta}_1\dots {\beta}_{M^{\prime }}\right\} \) where positive entries indicate relevant feature dimensions. For example, we can have a guidance vector corresponding to a HFpEF subtype of “obese, diabetic patients with a high prevalence of obstructive sleep apnea who have the worst left ventricle (LV) relaxation”, where the disease-related entries are set to positive values (e.g., close to one) and the remaining entries are zero.

The efficient enforcement of sparsity constraints represents another challenge in applying tensor factorization to HFpEF patients. In tensor factorization, it is desirable to have sparse factor representations for improved interpretability. In the case of using CP tensor factorization to integrate phenotypic variables and genetic variants, we need sparse phenotypic factor vectors and sparse genetic variant factor vectors so that each time we specify a group interaction (i.e., only a small subset of phenotypic variables and a small subset of genetic variants are linked). To achieve this goal, Morup et al. proposed a sparse nonnegative Tucker decomposition approach by using a specially designed penalty to regulate number of nonzero entries in the factor vectors [49]. However, it is computational expensive due to sparse optimization after factorization. More recently, the approach called Tensor Truncated Power (TTP) [50] shows promise compared to sparse Tucker tensor factorization by incorporating an efficient truncation step in the iteration step of computation of factors. More work still needs to be done in order to generalize this approach to accommodate Bayesian tensor factorization under Tucker or SP schemes.

Conclusion

HFpEF is a heterogeneous clinical syndrome that may benefit from improved subtyping in order to inform the design of future clinical trials and to identify responders to therapies. Modern medicine has accumulated multiple modalities of clinical data for HFpEF patients ranging from deep phenotypic to trans-omic data. Precision medicine with phenotypic and trans-omic data from multiple domains appears to be feasible and may result in meaningful, clinically relevant HFpEF subtypes with significant differences in the underlying etiology, pathophysiology, and risk of adverse outcomes. By integrating the multiple modalities of data for HFpEF, by properly accounting for interactions between genetic variants at different omic hierarchies, by integrating existing medical knowledge as priors, and by utilizing Bayesian inference to provide uncertainty estimates, tensor factorization is a promising machine learning technique that could be helpful for HFpEF subtyping and contribute to the development of novel targeted therapies. However, applying tensor factorization for precision medicine in HFpEF faces a number of challenges, including effectively incorporating existing medical knowledge, properly accounting for uncertainty, and efficiently enforcing sparsity for better interpretability. The successful application of tensor factorization for the development of precision medicine approaches in the diagnosis and treatment of HFpEF is contingent on answering all these challenges.

References

Mozaffarian, D., Benjamin, E.J., Go, A.S., et al. (2015).Heart disease and stroke statistics—2016 update. A Report From the American Heart Association.
Yancy, C. W., Jessup, M., Bozkurt, B., et al. (2013). 2013 ACCF/AHA guideline for the management of heart failure: a report of the American College of Cardiology Foundation/American Heart Association Task Force on Practice Guidelines. Journal of the American College of Cardiology, 62(16), e147–e239.
Article PubMed Google Scholar
Samson, R., Jaiswal, A., Ennezat, P. V., Cassidy, M., & Le Jemtel, T. H. (2016). Clinical phenotypes in heart failure with preserved ejection fraction. Journal of the American Heart Association., 5(1), e002477.
Article PubMed PubMed Central Google Scholar
Shah, S. J., Kitzman, D. W., Borlaug, B. A., et al. (2016). Phenotype-specific treatment of heart failure with preserved ejection fraction. A Multiorgan Roadmap., 134(1), 73–90.
Google Scholar
Paulus, W. J., & Tschöpe, C. (2013). A novel paradigm for heart failure with preserved ejection fraction: comorbidities drive myocardial dysfunction and remodeling through coronary microvascular endothelial inflammation. Journal of the American College of Cardiology, 62(4), 263–271.
Article PubMed Google Scholar
Vaduganathan, M., Michel, A., Hall, K., et al. (2016). Spectrum of epidemiological and clinical findings in patients with heart failure with preserved ejection fraction stratified by study design: a systematic review. European Journal of Heart Failure, 18(1), 54–65.
Article PubMed Google Scholar
Shah, S.J., Katz, D.H., Selvaraj, S., et al. (2014). Phenomapping for novel classification of heart failure with preserved ejection fraction. Circulation :CIRCULATIONAHA. 114.010637.
Kapuku, G. K., Ge, D., Vemulapalli, S., Harshfield, G. A., Treiber, F. A., & Snieder, H. (2008). Change of genetic determinants of left ventricular structure in adolescence: longitudinal evidence from the Georgia cardiovascular twin study. American Journal of Hypertension, 21(7), 799–805.
Article CAS PubMed Google Scholar
Tang, W., Devereux, R. B., Li, N., et al. (2009). Identification of a pleiotropic locus on chromosome 7q for a composite left ventricular wall thickness factor and body mass index: the HyperGEN Study. BMC Medical Genetics, 10(1), 1.
Article Google Scholar
Vasan, R. S., Glazer, N. L., Felix, J. F., et al. (2009). Genetic variants associated with cardiac structure and function: a meta-analysis and replication of genome-wide association data. Journal of the American Medical Association, 302(2), 168–178.
Article CAS PubMed PubMed Central Google Scholar
Smith, N. L., Felix, J. F., Morrison, A. C., et al. (2010). Association of genome-wide variation with the risk of incident heart failure in adults of European and African ancestry a prospective meta-analysis from the cohorts for heart and aging research in genomic epidemiology (CHARGE) consortium. Circulation. Cardiovascular Genetics, 3(3), 256–266.
Article CAS PubMed PubMed Central Google Scholar
Larson, M. G., Atwood, L. D., Benjamin, E. J., et al. (2007). Framingham Heart Study 100 K project: genome-wide associations for cardiovascular disease outcomes. BMC Medical Genetics, 8(1), 1.
Google Scholar
Morrison, A. C., Felix, J. F., Cupples, L. A., et al. (2010). Genomic variation associated with mortality among adults of European and African ancestry with heart failure the cohorts for heart and aging research in genomic epidemiology consortium. Circulation. Cardiovascular Genetics, 3(3), 248–255.
Article CAS PubMed PubMed Central Google Scholar
Berezin, A. (2016). Epigenetics in heart failure phenotypes. BBA Clinical., 6, 31–37.
Article PubMed PubMed Central Google Scholar
Kohane, I. S. (2015). Ten things we have to do to achieve precision medicine. Science, 349(6243), 37–38.
Article CAS PubMed Google Scholar
Bild, D. E., Bluemke, D. A., Burke, G. L., et al. (2002). Multi-ethnic study of atherosclerosis: objectives and design. American Journal of Epidemiology, 156(9), 871–881.
Article PubMed Google Scholar
Winslow, R. L., Trayanova, N., Geman, D., & Miller, M. I. (2012). Computational medicine: translating models to clinical care. Science Translational Medicine, 4(158), 158rv111–158rv111.
Article Google Scholar
Luo, Y., Uzuner, Ö., Szolovits, P. (2016). Bridging semantics and syntax with graph algorithms—state-of-the-art of extracting biomedical relations. Briefings in Bioinformatics.
Moskovitch, R., & Shahar, Y. (2015). Classification of multivariate time series via temporal abstraction and time intervals mining. Knowledge and Information Systems., 45(1), 35–74.
Article Google Scholar
Lee, D. D., & Seung, H. S. (1999). Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755), 788–791.
Article CAS PubMed Google Scholar
Steyvers, M., & Griffiths, T. (2007). Probabilistic topic models. Handbook of latent semantic analysis., 427(7), 424–440.
Google Scholar
Luo, Y., Sohani, A. R., Hochberg, E. P., & Szolovits, P. (2014). Automatic lymphoma classification with sentence subgraph mining from pathology reports. Journal of the American Medical Informatics Association, 21(5), 824–832.
Article PubMed PubMed Central Google Scholar
Luo, Y., Xin, Y., Hochberg, E., Joshi, R., Uzuner, O., Szolovits, P. (2015). Subgraph augmented non-negative tensor factorization (SANTF) for modeling clinical narrative text. Journal of the American Medical Informatics Association :ocv016.
Luo, Y., Xin, Y., Joshi, R., Celi, L., Szolovits, P. (2016). Predicting ICU mortality risk by grouping temporal trends from a multivariate panel of physiologic measurements.
Kolda, T. G., & Bader, B. W. (2009). Tensor decompositions and applications. SIAM review., 51(3), 455–500.
Article Google Scholar
Cichocki, A. (2014). Tensor networks for big data analytics and large-scale optimization problems. arXiv preprint arXiv :14073124.
Han, D., Wang, S., Jiang, C., et al. (2015). Trends in biomedical informatics: automated topic analysis of <em>JAMIA</em> articles. Journal of the American Medical Informatics Association, 22(6), 1153–1163.
Article PubMed PubMed Central Google Scholar
Wang, Y., Chen, R., Ghosh, J., et al. (2015). Rubik: knowledge guided tensor factorization and completion for health data analytics. Paper presented at: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
Tucker, L. R. (1966). Some mathematical notes on three-mode factor analysis. Psychometrika, 31(3), 279–311.
Article CAS PubMed Google Scholar
Carroll, J. D., & Chang, J.-J. (1970). Analysis of individual differences in multidimensional scaling via an N-way generalization of “Eckart-Young” decomposition. Psychometrika, 35(3), 283–319.
Article Google Scholar
Mørup, M. (2011). Applications of tensor (multiway array) factorizations and decompositions in data mining. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery., 1(1), 24–40.
Google Scholar
Vogelstein, B., Papadopoulos, N., Velculescu, V. E., Zhou, S., Diaz, L. A., & Kinzler, K. W. (2013). Cancer genome landscapes. Science, 339(6127), 1546–1558.
Article CAS PubMed PubMed Central Google Scholar
Nik-Zainal, S., Alexandrov, L. B., Wedge, D. C., et al. (2012). Mutational processes molding the genomes of 21 breast cancers. Cell, 149(5), 979–993.
Article CAS PubMed PubMed Central Google Scholar
Franceschini, A., Szklarczyk, D., Frankild, S., et al. (2013). STRING v9. 1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Research, 41(D1), D808–D815.
Article CAS PubMed Google Scholar
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M., Tanabe, M. (2011). KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Research :gkr988.
Hunter, S., Jones, P., Mitchell, A., et al. (2011). InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Research :gkr948.
Thierry-Mieg, D., & Thierry-Mieg, J. (2006). AceView: a comprehensive cDNA-supported gene and transcripts annotation. Genome Biology, 7(1), 1.
Article PubMed Google Scholar
Finn, R.D., Bateman, A., Clements, J., et al. (2013). Pfam: the protein families database. Nucleic Acids Research :gkt1223.
Luo, Y., & Szolovits, P. (2016). Efficient queries of stand-off annotations for natural language processing on electronic medical records. Biomedical Informatics Insights., 8, 29–38.
Article PubMed PubMed Central Google Scholar
Luo, Y., Wang, F., Szolovits, P. (2016). Tensor factorization toward precision medicine. Briefings in Bioinformatics.
Cong, F., Lin, Q.-H., Kuang, L.-D., Gong, X.-F., Astikainen, P., & Ristaniemi, T. (2015). Tensor decomposition of EEG signals: a brief review. Journal of Neuroscience Methods, 248, 59–69.
Article PubMed Google Scholar
Ho, J. C., Ghosh, J., Steinhubl, S. R., et al. (2014). Limestone: high-throughput candidate phenotype generation via tensor factorization. Journal of biomedical informatics., 52, 199–211.
Article PubMed Google Scholar
Ho, J.C., Ghosh, J., Sun, J. (2014). Marble: high-throughput phenotyping from electronic health records via sparse nonnegative tensor factorization. Paper presented at: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining.
Wang, F., Zhang, P., Qian, B., Wang, X., Davidson, I. (2014). Clinical risk prediction with multilinear sparse logistic regression. Paper presented at: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining.
Kessler, D.C., Taylor, J., Dunson, D.B. (2014). Learning phenotype densities conditional on many interacting predictors. Bioinformatics :btu040.
Yang, Y., Dunson, D.B. (2015). Bayesian conditional tensor factorizations for high-dimensional classification. Journal of the American Statistical Association. (just-accepted).
Zhou, J., Bhattacharya, A., Herring, A. H., & Dunson, D. B. (2015). Bayesian factorizations of big sparse tensors. Journal of the American Statistical Association., 110(512), 1562–1576.
Article CAS Google Scholar
Rai P, Wang Y, Guo S, Chen G, Dunson DB, Carin L. (2014). Scalable Bayesian low-rank decomposition of incomplete multiway tensors. Paper presented at: ICML.
Mørup, M., Hansen, L. K., & Arnfred, S. M. (2008). Algorithms for sparse nonnegative Tucker decompositions. Neural Computation, 20(8), 2112–2131.
Article PubMed Google Scholar
Sun, W., Lu, J., Liu, H., Cheng, G. (2015). Provable sparse tensor decomposition. arXiv preprint arXiv :150201425.

Download references

Acknowledgements

Dr. Ahmad is supported by the National Heart, Lung, and Blood Institute of the National Institutes of Health (NIH) under Award number T32HL069771. Dr. Shah is supported by the NIH R01 HL107577 and R01 HL127028 and American Heart Association grants #15CVGPSD27260148 and #16SFRN28780016.

Author information

Authors and Affiliations

Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 11th Floor, Arthur Rubloff Building, 750 N. Lake Shore Drive, Chicago, IL, 60611, USA
Yuan Luo & Faraz S. Ahmad
Division of Cardiology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Faraz S. Ahmad & Sanjiv J. Shah

Authors

Yuan Luo
View author publications
You can also search for this author in PubMed Google Scholar
Faraz S. Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Sanjiv J. Shah
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuan Luo.

Additional information

Associate Editor Daniel P. Judge oversaw the review of this article

Rights and permissions

Reprints and permissions

About this article

Cite this article

Luo, Y., Ahmad, F.S. & Shah, S.J. Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction. J. of Cardiovasc. Trans. Res. 10, 305–312 (2017). https://doi.org/10.1007/s12265-016-9727-8

Download citation

Received: 21 November 2016
Accepted: 23 December 2016
Published: 23 January 2017
Issue Date: June 2017
DOI: https://doi.org/10.1007/s12265-016-9727-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction

Abstract

Similar content being viewed by others

Extracting Phenotypes from Patient Claim Records Using Nonnegative Tensor Factorization

Genome-wide association and multi-trait analyses characterize the common genetic architecture of heart failure

Deep Phenotyping in Cardiovascular Disease

Introduction

The Problem of Complex, Multimodal Data in Precision Medicine

Tensor Factorization: a Potential Solution for Multimodal Data in HFpEF

Applying Tensor Factorization to HFpEF: Potential Challenges

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Tensor Factorization for Precision Medicine in Heart Failure with Preserved Ejection Fraction

Abstract

Similar content being viewed by others

Extracting Phenotypes from Patient Claim Records Using Nonnegative Tensor Factorization

Genome-wide association and multi-trait analyses characterize the common genetic architecture of heart failure

Deep Phenotyping in Cardiovascular Disease

Introduction

The Problem of Complex, Multimodal Data in Precision Medicine

Tensor Factorization: a Potential Solution for Multimodal Data in HFpEF

Applying Tensor Factorization to HFpEF: Potential Challenges

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation