Construction and optimization of multi-platform precision pathways for precision medicine

Tran, Andy; Wang, Andy; Mickaill, Jamie; Strbenac, Dario; Larance, Mark; Vernon, Stephen T.; Grieve, Stuart M.; Figtree, Gemma A.; Patrick, Ellis; Yang, Jean Yee Hwa

doi:10.1038/s41598-024-54517-8

Construction and optimization of multi-platform precision pathways for precision medicine

Article
Open access
Published: 21 February 2024

Volume 14, article number 4248, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Construction and optimization of multi-platform precision pathways for precision medicine

Download PDF

Andy Tran^1,2,3,
Andy Wang⁴,
Jamie Mickaill^1,5,
Dario Strbenac^1,2,3,
Mark Larance²,
Stephen T. Vernon^2,6,
Stuart M. Grieve^2,7,
Gemma A. Figtree^2,6,
Ellis Patrick^1,2,3,8 &
…
Jean Yee Hwa Yang^1,2,3,8

1311 Accesses
5 Altmetric
Explore all metrics

Abstract

In the enduring challenge against disease, advancements in medical technology have empowered clinicians with novel diagnostic platforms. Whilst in some cases, a single test may provide a confident diagnosis, often additional tests are required. However, to strike a balance between diagnostic accuracy and cost-effectiveness, one must rigorously construct the clinical pathways. Here, we developed a framework to build multi-platform precision pathways in an automated, unbiased way, recommending the key steps a clinician would take to reach a diagnosis. We achieve this by developing a confidence score, used to simulate a clinical scenario, where at each stage, either a confident diagnosis is made, or another test is performed. Our framework provides a range of tools to interpret, visualize and compare the pathways, improving communication and enabling their evaluation on accuracy and cost, specific to different contexts. This framework will guide the development of novel diagnostic pathways for different diseases, accelerating the implementation of precision medicine into clinical practice.

The Future of Precision Medicine: Potential Impacts for Health Technology Assessment

Article Open access 13 July 2018

Reconciling evidence-based medicine and precision medicine in the era of big data: challenges and opportunities

Article Open access 19 December 2016

The Landscape of PPM Informatics and the Future of Medicine

Discover the latest articles, news and stories from top researchers in related subjects.

Introduction

In recent years, the medical field has seen rapid developments in various high-throughput biotechnologies, allowing the collection of biological data on a variety of “omics” platforms, at an increasingly scalable and affordable level¹. For example, the cost of whole exome sequencing for a single sample has dramatically declined over the last two decades from around $20 million (USD) in 2006 to around $1000 (USD) in 2018². This new access to a plethora of information is leading a revolution in precision medicine, by providing an insight into the biological mechanisms behind different diseases. Indeed, we already see modern omics data being used to help personalize cancer treatments³, among other diseases⁴. However, uptake of these technologies in clinical practice has been slow, and consistency in their implementation and interpretation remains a challenge^5,6,7,8.

With so many novel technologies as potential diagnostic platforms⁹, much of current research aims to build a model for a cohort of patients using a single platform in isolation^10,11,12, or integratively with other data^13,14. However, this is separated from the reality of a clinical application, where a range of diagnostic platforms/tests are available, and a variety of other factors need to be considered, such as health economics and time. In particular, with highly heterogeneous cohorts, a clinician may not necessarily need or want to perform such a test on all their patients, as there may be cheaper or more effective alternatives for some patients. Some recent research has aimed to identify clinical features to make cost-effective diagnoses under time and resource constraints¹⁵. Nonetheless for complex diseases, the specific order of testing, evaluation and clinical decision making is an important consideration, along with approaches for the integration of clinical, imaging and omics data¹⁶. For instance, genomic testing is rapidly transitioning into a “standard of care” to guide treatment plans for rare childhood diseases^17,18 and cancer^19,20.

Here, we present a framework (MultiP) to construct a multi-platform precision pathway given a range of available platforms to diagnose a disease (Fig. 1). The MultiP pathways mimic a clinical diagnostic pathway, where given the results of a diagnostic test, a confident decision may be made, or the patient may be referred to collect more data from a different platform. The data-driven construction of the pathways ensures a consistent implementation of evidence-based diagnostic decisions. The software provides a variety of tools and visualizations to ensure that the constructed pathways are interpretable, an important consideration for any clinical pathway model. We can evaluate and optimize the pathways for accuracy, cost and optionally other factors at the population level. We demonstrate the application of the MultiP framework on two complex diseases: coronary artery disease (CAD) and stage III melanoma. The framework is implemented in the R package ClassifyR, available on Bioconductor.

Results

The development of an “uncertain” class enables multi-stage classification

In a diagnostic setting, typical machine learning classifiers aim to classify all patients into either the positive or negative class²¹, which is a limited representation of the reality in the clinic. When given the results of a diagnostic test, a clinician may be able to make a diagnosis with high confidence, either positive or negative, or if they are “not sure”, they may refer the patient to take further tests to obtain more data. To capture this aspect of decision making, we introduce an “uncertain” class to the typical binary classification problem (turning it into a ternary classification problem), which allows us to perform a multi-stage classification, using the multiple modes of data available to us.

To build this multi-stage classification, we develop a confidence score for each patient and platform combination, representing the confidence in which we can make a diagnosis for that patient using that platform. We achieve this in a similar way to our previous work by Patrick and colleagues²², where a patient-specific accuracy rate is calculated by aggregating the predictions at a patient-level in repeated cross-validation. This can be imagined as having many different clinicians (models from each repeat in the cross-validation) to diagnose a new patient, and the confidence score is equivalent to the degree of agreement among the different clinicians. This captures the uncertainty of the diagnostic process in an unbiased and data-driven way. See “Methods” (“Confidence score” in “MultiP algorithm”) for full details.

We allow the user to customize the confidence threshold, as different contexts may require different stringencies. Based on this threshold, individuals can either be classified (if the model can make a high-confidence decision) or progressed (if the model cannot make a high-confidence decision). In the latter case, the individual proceeds to the next stage, where data from another platform will be collected. This process repeats until the final platform (all possible tests have been performed), where a decision must be made. See “Methods” (“Construction” in “MultiP algorithm”) for full details.

Our MultiP framework is implemented as a part of the ClassifyR package²³, available on Bioconductor. ClassifyR formalizes a framework for performing and evaluating classification in R using repeated cross-validation and includes many in-built feature selection and classification approaches. Many components of the ClassifyR framework can be customized with a single line of code, including the feature selection, classification algorithm and cross validation parameters, making it ideal for implementing MultiP. This flexibility provides the versatility needed to cater to the varied contexts of different diseases and populations. The full list of parameters that can be adjusted in the MultiP framework are summarized in Table 1. See “Methods” (“Implementation”) for full details.

Table 1 Summary of parameters for MultiP.

Full size table

Clinical precision pathways made transparent by MultiP

When applying a machine learning model that can impact treatment decisions and people’s lives as well as the value and cost to the health system, it is imperative that the model is not treated as a black box, to ensure a robust and equitable implementation^24,25. In MultiP, we ensure that the constructed pathways are interpretable, by providing a range of tools and visualizations to dissect the biomarkers driving the models. We demonstrate the utility and interpretability of MultiP on its ability to detect coronary artery disease (CAD) in the BioHEART-CT cohort, using four platforms: clinical, metabolomics, lipidomics and proteomics. A summary of the clinical characteristics of the cohort is presented in Supplementary Table 1, and for full details about the cohort and data, see “Methods” (“Datasets”).

For a single pathway, the flow chart provides an overall visualization of the progression of individuals at the population-level (Fig. 2A). In our example, it can be seen that 78% of individuals can be confidently classified with just clinical information including standard modifiable risk factors, meaning that these individuals would not need any of the more expensive data to be collected. The major clinical unmet need is evident, where individuals without standard risk factors are in the subclinical phase of atherosclerotic development and at risk of heart attack, not detectable by traditional approaches. And equally, some individuals may appear to be at high risk based on clinical data alone (such as high cholesterol or smoking), but have distinct resilience, without the development of CAD. Knowledge of the latter may allow for avoidance of life-long pharmacotherapy.

A more detailed look at the population can be seen in our strata plot that displays this data at the individual sample level (Fig. 2B). At each stage of the pathway, the individuals are split by their true class, and the accuracy of the classifier is plotted for each individual. This can allow the user to assess the performance of each stage of the pathway and identify potential cohort heterogeneity. For instance, we see that the Lipidomics model of our MultiP pathway has a high accuracy for non-CAD individuals, but low accuracy for CAD individuals (i.e. high specificity and low sensitivity). This suggests that this platform may have a bias for categorizing individuals as non-CAD, which could be investigated further. To assess the robustness of our pathways, we constructed pathways using a range of confidence thresholds, which have comparable balanced accuracies (Table S2). As expected, we see that increasing the confidence threshold causes patients to require more tests to be performed (Fig. S1), increasing the balanced accuracy but also the cost (Table S2).

To dissect the models created for each stage and identify the biomarkers that are driving the decision-making process, we produce feature importance plots (Figs. S2, S3). Reassuringly, we see that in each of the models, the key features all have well-established links to CAD. Namely, the Lipidomics model (Fig. S2A) is driven by Ceramide²⁶, Hydroxylated acylcarnitine²⁷ and Sulfatide²⁸; the Metabolomics model (Fig. S2B) is driven by Riboflavin²⁹, 2-arachidonoylglycerol³⁰ and DMGV³¹; and the Proteomics model (Fig. S2C) is driven by PON1³², IGFALS³³ and SERPINC1³⁴.

To examine the characteristics between classified and progressed individuals, we also provide a cohort summary of the classified and progressed group of individuals at each stage of the pathway, to explore the differences in individual cohorts that are classified or progressed at each stage, potentially revealing cohort heterogeneity (Fig. S4). For instance, we see that the lipidomics model (Fig. S4B) confidently classifies considerably more females than males, indicating a potential sex-bias in the data and/or model. This suggests that the different subpopulations (eg. sex) may be more appropriately classified with separate models, as has been well-studied in the context of CAD³⁵.

Criterion-guided optimization of precision pathways outperforms baseline models

To implement a multi-platform framework, an important consideration would be the “order of the platforms”, to represent the order that clinicians would perform diagnostic tests for individuals. In our MultiP framework, we train a precision pathway by first constructing a series of possible precision pathways for all possible orders of the platforms. Here the users can specify if any platforms must be used first, such as clinical data. Secondly, the final precision pathway is selected by comparing and assessing them based on accuracy at the population level.

To assess the performance of our constructed precision pathways, we compare their balanced accuracy to a range of baseline models. Here we consider two types of baseline models, the first group are models built on a single omics platform with clinical data, and the second group considers a model built on all platforms combined (Fig. 3), see “Methods” (“Baseline comparison”) for full details about the construction of baseline models. We see that the precision pathways perform considerably better than single platform classifications, confirming the common belief that these different platforms contain complementary information, and demonstrating the value of integrating different platforms to make clinical diagnoses.

The precision pathways not only maintain a high accuracy compared to the combined data set (as they are closer to the top of the plot), but only use a fraction of the data and thus a fraction of the cost (as they are closer to the left of the plot). This suggests that the combined data set, with all platforms for all patients, contains a large proportion of redundant information, i.e. there are many patients that can be confidently classified with little data. This highlights the value of a precision pathway to use only the important tests to reach confident diagnoses, while minimizing the cost of healthcare with limited sacrifice in accuracy.

MultiP pathways can be optimized on multiple criteria

Classical machine learning approaches typically optimize their models based on accuracy alone²¹. However for a clinical implementation, there are a range of practical factors that need to be considered as well, such as cost or time. The MultiP framework allows users to incorporate additional criteria, and choose a weighting for the importance of each criterion when determining an optimal precision pathway. See “Methods” (“Evaluation” in “MultiP algorithm”) for full details. A variety of tools and visualizations are provided to assist the user to compare the candidate models under these criteria.

Users can view a summary table of the constructed precision pathways, with their accuracy and cost at each level (Fig. 4A). They are also given an overall score, based on their rankings for each of the criteria, aggregated by the user-chosen weightings. This allows for an unbiased selection of an optimal precision pathway. A bubble plot is also produced which allows users to compare the candidate precision pathways based on accuracy and cost (Fig. 4B). Here, the ideal precision pathway would have a high accuracy and low cost and we see that there are three well-performing and economical pathways: C-L-P-M, C-L-M-P and C-M-L-P (where C = Clinical, L = Lipidomics, M = Metabolomics, P = Proteomics). However, the final choice of optimal pathway is a tradeoff between accuracy and cost.

MultiP pathways are transferable across different cohorts

A major barrier to the implementation of new protocols for diagnostic purposes, such as omics technologies, is that molecular signatures are often cohort-specific and do not transfer well between different populations³⁶. We further demonstrate the applicability and transferability of our framework for another complex disease, the prognosis of stage III melanoma. We achieve this by using MultiP to construct a pathway to classify patients into a “good prognosis” (survival > 4 years) or “poor prognosis” (survival < 1 year), trained on data from The Cancer Genome Atlas (TCGA)³⁷. We then assess the performance of this model on an independent dataset generated by the Melanoma Institute of Australia (MIA)^38,39,40. See “Methods” (“Datasets”) for details about the cohorts.

Between these two cohorts, the data corresponding to the same molecular modality is sometimes generated from different technology platforms. In this situation, the mRNA and microRNA data are generated using a count-based RNA-seq in the TCGA dataset and using a fluorescence-based microarray in the MIA dataset. To ensure transferability between the models, we use the log ratios between pairs of features as the input into the MultiP framework, as this has been demonstrated to be more appropriate for transferability⁴⁰. See “Methods” (“Transferability analysis”) for full details.

We find that the models at each level are driven by well-known markers, suggesting that the constructed precision pathway is reasonable. In particular, we see that in the mRNA-level data (Fig. S5B), the prediction for a poor prognosis is driven by higher levels of CCL21 and HAMP, both previously linked to the metastasis of melanomas^41,42. The prediction for a good prognosis is driven by higher levels of DNAH2, a known modulator of cell homologous recombination repair which may have a protective effect⁴³. In the microRNA model (Fig. S5C), we observe that predictions for poor prognosis are driven by hsa-miR-205 and hsa-miR-518b, both known to be dysregulated in melanomas^44,45. And a good prognosis is driven by hsa-miR-944 and hsa-miR-487a, known suppressors of cancer promoting genes^46,47.

We then apply the precision pathway trained on the TCGA cohort to classify the individuals in the external MIA cohort (Fig. 5). We find that the overall precision pathway maintains a good performance in balanced accuracy and F1 score (Table 2). However, we note that between the two cohorts, there was a considerable tradeoff between sensitivity and specificity. This is likely due to the small sample size available (65 in TCGA and 30 in MIA) resulting in overfitting to some subpopulations in the data.

Table 2 Summary of performance metrics for model trained on TCGA dataset evaluated on TCGA dataset (with repeated cross-validation) and MIA dataset.

Full size table

Discussion

The increasing number of diagnostic platforms carry tremendous potential for clinical applications, but this brings the challenge of how to optimally use such data. Here, we present MultiP, a versatile framework to automatically construct precision pathways for a variety of contexts, using a range of available platforms. By defining a confidence score, we quantitatively capture the uncertainty in the diagnostic process, allowing for unbiased and consistent diagnosis. We provide a range of tools and visualizations to allow users to interpret the constructed pathways, and compare candidate pathways on different criteria. We have demonstrated the applicability of the framework in two distinct contexts: the diagnosis of CAD, and the prognosis of stage III melanoma on different cohorts.

We observed that these precision pathways have a similar performance to models constructed on a complete set of data, where data on all platforms are used for all patients. This implies that there are many patients for which a confident and accurate diagnosis can be made with minimal information, suggesting that it is not necessary to perform all tests for these patients. By following a precision pathway, we ensure that only the informative tests are performed, alleviating the huge economic burden of the healthcare system with minimal loss of accuracy.

The MultiP framework is implemented in the ClassifyR package, granting it access to a vast library of classification models and parameters in a single line of code. This flexibility allows MultiP to cater to a wide range of contexts, as different diseases, populations and platforms would require tailored models. As further functionalities are incorporated into the ClassifyR package, such as new classifiers and multiview methods, MultiP will also expand in its applicability.

Despite the vast functionality and potential for MultiP to build diagnostic pathways, there remains a few limitations and scope for future work to improve its performance in a clinical application. Notably, cohort heterogeneity is a challenging issue, where there may be subpopulations in the cohort that would benefit from different classification models. The current MultiP framework uses the entire training cohort to build an ensemble model for all pathways, which may not accurately diagnose underrepresented subpopulations in the data. A workaround to this is to train a separate model at each stage of the pathway, so that the classifier used is more closely tailored to the subpopulation that is progressed, however these models will suffer from smaller sample size to train on. Another issue arising from cohort heterogeneity is that there may be different subpopulations that are better classified using a different order of platforms, whereas the current implementation forces the same order of platforms for all patient pathways. A solution could be that at each level, to identify potential subpopulations that are progressed and test different orders of platforms. However, this opens up exponentially more models that need to be constructed and tuned, quickly increasing the computational burden to construct the pathway.

In summary, our MultiP framework is the first to our knowledge that builds clinical pathways using multiple platforms and incorporating health economics. Considering the practical, legal, and economical barriers to implementing modern technologies for clinical diagnoses, our framework provides a data-driven tool to build and implement evidence-based pathways in an unbiased way. We hope that this can serve as a foundation for future studies to translate omics research from benchtop to bedside, accelerating the progress towards precision medicine.

Methods

MultiP algorithm

Confidence score

The MultiP framework trains individual models for each platform using repeated cross-validation (with default parameters of 2 folds and 50 repeats). For an individual patient on a single platform, this will create many models (equal to the number of repeats) where the patient is in the test set, each with a predicted class. The final predicted class is then chosen based on the majority prediction across the many models. If the predictions are perfectly split across the two classes, the final predicted class is randomly chosen.

The confidence score is defined as the agreement of the predicted class of these models. More precisely, if the predicted classes are split in the ratio $p:1-p$, then the confidence score is defined as $2|p-0.5|$. That is, if all models predict the same class ($p=0$ or $p=1$), then the confidence score is 1, but if there is a perfect 50–50 split among the predictions ($p=0.5$), then the confidence score is 0. For each patient in each platform, we have now calculated a final predicted class and a confidence score.

In our framework, we used default parameter values as stated in Table 1, and performed model building using the runTests function in ClassifyR²³.

Construction

For a specific sequence of platforms and a user-defined confidence threshold, the pathway is constructed as follows:

1.
All patients start at the first platform.
2.
At the current platform, classify the patients, whose confidence score for that platform exceeds the threshold, with their final predicted class.
3.
For the patients whose confidence score does not exceed the threshold, they are considered “uncertain” and then progressed onto the next platform.
4.
Repeat steps 2 and 3 until the final platform.
5.
At the final platform, classify all patients based on the final predicted class for that platform.

We use a default confidence threshold of 0.9, however the optimal threshold may vary greatly across contexts, as different diseases, populations and platforms would have different accuracies and confidence.

Evaluation

When a pathway is constructed, each patient is assigned a predicted class. By comparing these predictions to their true class, we can calculate any classification metric, for example: accuracy, balanced accuracy, F1 score, specificity, sensitivity/recall or precision.

To evaluate a list of candidate pathways, we assign a ranking to each one in each criteria, such as accuracy and cost. A weighted average of these rankings, based on user-defined weights, is taken to be the final score used to determine the optimal pathway. We choose default weights of 0.5 for accuracy and 0.5 for cost.

Cross-validation

To build ensemble models for classification, and to estimate out-of-sample performance of the models, MultiP uses a cross-validation framework. In $k$-fold cross validation, the cohort of patients is randomly split into $k$ folds (of approximately equal size). For a given platform, a model is then trained on $k-1$ folds and then tested on the remaining fold, on which the model performance is evaluated. This ensures that the testing data is independent of the model training, so the performance metrics are representative of an out-of-sample performance. This process is applied across the $k$ folds, where each fold is taken as the testing set and the remaining $k-1$ folds form the training data, producing an out-of-sample prediction for each patient in the data. This framework can be repeated $r$ times, to give $r$ out-of-sample predictions for each patient, which provides an estimate of the variability of the predictions.

The MultiP framework can be applied in one of two ways: (i) on a single data set which uses cross validation to estimate the out-of-sample performance, or (ii) trained on one data set and applied to an independent set. In the first case, the $r$-repeat $k$-fold cross validation framework creates $r$ out-of-sample predictions for each patient in each platform. These predictions are considered as the output of an ensemble classifier which is then used to calculate the confidence scores and simulate the clinical pathway. This case was demonstrated with the BioHEART-CT cohort. In the second case, the $r$-repeat $k$-fold cross validation framework creates $rk$ models (in each of $r$ repeats, a model is trained on each of the $k$ combinations of $k-1$ folds). These models can then be applied on the independent data set to create $rk$ predictions, which is then used to calculate the confidence scores and simulate the clinical pathway. This case was demonstrated with the melanoma example, training on the TCGA cohort and testing on the MIA cohort.

The default is to use a diagonal linear discriminant analysis (DLDA) classifier with two-fold cross-validation, as this creates a larger diversity in the ensemble models, and to use 50 repeats, as this was found to be sufficient to produce stable results in our experiments. However, the most appropriate classifier and cross-validation parameters will vary based on the data and context.

Datasets

BioHEART-CT

This study has been described in detail previously⁴⁸ and we analyze the Discovery 1000 patients, which is the first 1000 patients of the BioHEART-CT study who have completed deep imaging and molecular phenotyping. The study was approved by the Northern Sydney Local Health District Human Research Ethics Committee (HREC/17/HAWKE/343) and all participants provided informed written consent. All methods were performed in accordance with relevant guidelines and regulations. The deep imaging (CTCA images) were acquired on a 256-slice scanner using standard clinical protocols, overseen and dual-reported by accredited cardiologists and radiologists. CTCAs were analyzed using the validated 17-segment Gensini score⁴⁹ to identify those with CAD (Gensini > 0) and without CAD (Gensini = 0). Data from molecular phenotyping include proteomics, lipidomics⁵⁰ and metabolomics⁵¹. For demonstration purposes, the cost of each platform was chosen to be: Clinical = $30, Lipidomics = $50, Metabolomics = $15, Proteomics = $75. The data normalization steps have been described previously⁵⁰. In brief, features with more than 50% missing values were removed, and the remaining missing values were imputed with k-nearest neighbors. Only patients with complete information on all platforms were retained for analysis. Patients on statin medications were also excluded from analysis, as this would have an undesired confounding effect on the molecular signatures.

The Cancer Genome Atlas (TCGA)

The SKCM (Skin Cutaneous Melanoma) data set was downloaded from TCGA using the R package curatedTCGAData⁵². The RNASeq2GeneNorm and miRNASeqGene assays were taken to represent the mRNA and microRNA platforms respectively. The cohort was filtered down to those with stage III cancers to match with the MIA dataset. A “Good” prognosis was defined to be survival greater than 4 years from the date of tumor banking, and a “Poor” prognosis was defined to be death less than 1 year from the date of tumor banking. Patients who do not match a “Good” or “Poor” prognosis are excluded from analysis. The T-stages of the patients were reclassified into T0, T1, T2, T3 and T4, where patients with missing or undetermined T-stage were excluded.

Melanoma institute of Australia (MIA)

This data collection includes data presented in Mann et al.³⁸ and Jayawardana et al.³⁹ and is accessible at Melanoma Explorer⁵³. In brief, mRNA was assayed using Sentrix Human-6 v3 Expression BeadChips (Illumina, San Diego, CA) and microRNA expression profiling was performed using Agilent Technologies' microRNA platform (version 16, Agilent Technologies, Santa Clara, CA). Similarly to the TCGA dataset, a “Good” prognosis was defined to be survival greater than 4 years from the date of tumor banking, and a “Poor” prognosis was defined to be death less than 1 year from the date of tumor banking. Patients who do not match a “Good” or “Poor” prognosis are excluded from analysis.

Baseline comparison

To evaluate the performance of pathways generated by MultiP, we compare against two categories of baseline models. Single platform models are built on each individual platform with clinical data and the combined model was built on all platforms integratively. The integration of different platforms for classification was implemented with the crossValidate function in ClassifyR with the parameter multiViewMethod = "merge". The same cross-validation parameters as the MultiP pathways were used to ensure a fair comparison.

Transferability analysis

To build a transferable model between the TCGA and MIA datasets, we first perform library size normalization and then filter the features in each platform to those that are common in both datasets. In the TCGA data (the training data), we filter the microRNA features to those with standard deviation greater than 5, to keep the number of features reasonable for the next step while retaining important features.

As the data was collected from different platforms, with values on different scales, we calculate the log-ratios between each pair of features, using the method described by Wang and colleagues⁴⁰. We then standardize these log ratios at the patient-level, shifting the mean to 0 and scaling the variance to 1. Pairs with very low standard deviation (< 0.1) are removed from analysis to ensure model stability. By performing normalization at the patient-level, we ensure that the models will be applicable to future incoming data.

Implementation

The implementation of MultiP is made available through the ClassifyR package on Bioconductor, using the statistics software R. The code to reproduce the analysis is available on Github at https://github.com/SydneyBioX/MultiP.

Data availability

The TCGA data that support the findings of this study are publicly available. For the BioHEART-CT data, data requests can be made through the BioHEART data committee via email (michael.gray@sydney.edu.au).

Code availability

The code for running the above methods and evaluation are available at https://github.com/SydneyBioX/MultiP.

References

Stark, R., Grzelak, M. & Hadfield, J. RNA sequencing: The teenage years. Nat. Rev. Genet. 20, 631–656. https://doi.org/10.1038/s41576-019-0150-2 (2019).
Article CAS PubMed Google Scholar
Schwarze, K., Buchanan, J., Taylor, J. C. & Wordsworth, S. Are whole-exome and whole-genome sequencing approaches cost-effective? A systematic review of the literature. Genet. Med. 20, 1122–1130 (2018).
Article PubMed Google Scholar
D’Adamo, G. L., Widdop, J. T. & Giles, E. M. The future is now? Clinical and translational aspects of ‘Omics’ technologies. Immunol. Cell Biol. 99, 168–176 (2021).
Article PubMed Google Scholar
Karczewski, K. J. & Snyder, M. P. Integrative omics for health and disease. Nat. Rev. Genet. 19, 299–310. https://doi.org/10.1038/nrg.2018.4 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wafi, A. & Mirnezami, R. Translational-omics: Future potential and current challenges in precision medicine. Methods 151, 3–11 (2018).
Article CAS PubMed Google Scholar
National Academies of Sciences, Engineering and Medicine, Institute of Medicine, Board on Health Care Services & Committee on Diagnostic Error in Health Care. Improving Diagnosis in Health Care (National Academies Press, 2016).
Google Scholar
Hickner, J. et al. Primary care physicians’ challenges in ordering clinical laboratory tests and interpreting results. J. Am. Board Fam. Med. 27, 268–274 (2014).
Article PubMed Google Scholar
Johansen Taber, K. A., Dickinson, B. D. & Wilson, M. The promise and challenges of next-generation genome sequencing for clinical care. JAMA Intern. Med. 174, 275–280 (2014).
Article PubMed Google Scholar
De Maria Marchiano, R. et al. Translational research in the era of precision medicine: Where we are and where we will go. J. Pers. Med. 11, 216 (2021).
Article PubMed PubMed Central Google Scholar
Biesecker, L. G. & Green, R. C. Diagnostic clinical genome and exome sequencing. N. Engl. J. Med. 370, 2418–2425. https://doi.org/10.1056/nejmra1312543 (2014).
Article PubMed Google Scholar
Sharma, A., Das, P., Buschmann, M. & Gilbert, J. A. The future of microbiome-based therapeutics in clinical applications. Clin. Pharmacol. Ther. 107, 123–128 (2020).
Article CAS PubMed Google Scholar
Zhang, L. et al. Clinical and translational values of spatial transcriptomics. Signal Transduct. Target. Ther. 7, 111 (2022).
Article CAS PubMed PubMed Central Google Scholar
Khan, S., Ince-Dunn, G., Suomalainen, A. & Elo, L. L. Integrative omics approaches provide biological and clinical insights: Examples from mitochondrial diseases. J. Clin. Investig. 130, 20–28 (2020).
Article CAS PubMed PubMed Central Google Scholar
Soenksen, L. R. et al. Integrated multimodal artificial intelligence framework for healthcare applications. NPJ Digit. Med. 5, 149 (2022).
Article ADS PubMed PubMed Central Google Scholar
Erion, G. et al. A cost-aware framework for the development of AI models for healthcare applications. Nat. Biomed. Eng. https://doi.org/10.1038/s41551-022-00872-8 (2022).
Article PubMed PubMed Central Google Scholar
Bewicke-Copley, F., Arjun Kumar, E., Palladino, G., Korfi, K. & Wang, J. Applications and analysis of targeted genomic sequencing in cancer studies. Comput. Struct. Biotechnol. J. 17, 1348–1359 (2019).
Article CAS PubMed PubMed Central Google Scholar
Stark, Z. & Ellard, S. Rapid genomic testing for critically ill children: Time to become standard of care?. Eur. J. Hum. Genet. 30, 142–149 (2022).
Article PubMed Google Scholar
Owen, M. J. et al. Rapid sequencing-based diagnosis of thiamine metabolism dysfunction syndrome. N. Engl. J. Med. 384, 2159–2161 (2021).
Article PubMed PubMed Central Google Scholar
Pritchard, D., Goodman, C. & Nadauld, L. D. Clinical utility of genomic testing in cancer care. JCO Precis. Oncol. 6, e2100349 (2022).
Article PubMed PubMed Central Google Scholar
Steuten, L., Goulart, B., Meropol, N. J., Pritchard, D. & Ramsey, S. D. Cost effectiveness of multigene panel sequencing for patients with advanced non-small-cell lung cancer. JCO Clin. Cancer Inform. 3, 1–10 (2019).
Article PubMed Google Scholar
James, G., Witten, D., Hastie, T. & Tibshirani, R. An Introduction to Statistical Learning: With Applications in R (Springer Science & Business Media, 2013).
Book Google Scholar
Patrick, E. et al. A multi-step classifier addressing cohort heterogeneity improves performance of prognostic biomarkers in three cancer types. Oncotarget 8, 2807–2815 (2017).
Article PubMed Google Scholar
Strbenac, D., Mann, G. J., Ormerod, J. T. & Yang, J. Y. H. ClassifyR: An R package for performance assessment of classification with applications to transcriptomics. Bioinformatics 31, 1851–1853. https://doi.org/10.1093/bioinformatics/btv066 (2015).
Article CAS PubMed Google Scholar
Reyes, M. et al. On the interpretability of artificial intelligence in radiology: Challenges and opportunities. Radiol. Artif. Intell. 2, e190043. https://doi.org/10.1148/ryai.2020190043 (2020).
Article PubMed PubMed Central Google Scholar
Salahuddin, Z., Woodruff, H. C., Chatterjee, A. & Lambin, P. Transparency of deep neural networks for medical image analysis: A review of interpretability methods. Comput. Biol. Med. 140, 105111 (2021).
Article PubMed Google Scholar
Anroedh, S. et al. Plasma concentrations of molecular lipid species predict long-term clinical outcome in coronary artery disease patients. J. Lipid Res. 59, 1729–1737 (2018).
Article CAS PubMed PubMed Central Google Scholar
Su, X., Han, X., Mancuso, D. J., Abendschein, D. R. & Gross, R. W. Accumulation of long-chain acylcarnitine and 3-hydroxy acylcarnitine molecular species in diabetic myocardium: identification of alterations in mitochondrial fatty acid processing in diabetic myocardium by shotgun lipidomics. Biochemistry 44, 5234–5245 (2005).
Article CAS PubMed Google Scholar
Li, G. et al. Circulating sulfatide, A novel biomarker for ST-segment elevation myocardial infarction. J. Atheroscler. Thromb. 26, 84–92 (2019).
Article PubMed PubMed Central Google Scholar
Balasubramaniam, S., Christodoulou, J. & Rahman, S. Disorders of riboflavin metabolism. J. Inherit. Metab. Dis. 42, 608–619. https://doi.org/10.1002/jimd.12058 (2019).
Article CAS PubMed Google Scholar
Pacher, P. et al. Modulation of the endocannabinoid system in cardiovascular disease: Therapeutic potential and limitations. Hypertension 52, 601–607 (2008).
Article CAS PubMed Google Scholar
Ottosson, F. et al. Dimethylguanidino valerate: A lifestyle-related metabolite associated with future coronary artery disease and cardiovascular mortality. J. Am. Heart Assoc. 8, e012846 (2019).
Article PubMed PubMed Central Google Scholar
Petrič, B., Kunej, T. & Bavec, A. A multi-omics analysis of PON1 lactonase activity in relation to human health and disease. OMICS 25, 38–51 (2021).
Article PubMed PubMed Central Google Scholar
Prentice, R. L. et al. Novel proteins associated with risk for coronary heart disease or stroke among postmenopausal women identified by in-depth plasma proteome profiling. Genome Med. 2, 48. https://doi.org/10.1186/gm169 (2010).
Article CAS PubMed PubMed Central Google Scholar
Barrachina, M. N., Calderón-Cruz, B., Fernandez-Rocca, L. & García, Á. Application of extracellular vesicles proteomics to cardiovascular disease: Guidelines, data analysis, and future perspectives. Proteomics 19, e1800247 (2019).
Article PubMed Google Scholar
Chan, A., Jiang, W., Blyth, E., Yang, J. & Patrick, E. treekoR: Identifying cellular-to-phenotype associations by elucidating hierarchical relationships in high-dimensional cytometry data. Genome Biol. 22, 324 (2021).
Article PubMed PubMed Central Google Scholar
Altenbuchinger, M. et al. Molecular signatures that can be transferred across different omics platforms. Bioinformatics 33, 2790 (2017).
Article CAS PubMed PubMed Central Google Scholar
Watson, I. R. et al. Abstract 2972: Genomic classification of cutaneous melanoma. Cancer Res. 75, 2972–2972. https://doi.org/10.1158/1538-7445.am2015-2972 (2015).
Article Google Scholar
Mann, G. J. et al. BRAF mutation, NRAS mutation, and the absence of an immune-related expressed gene profile predict poor outcome in patients with stage III melanoma. J. Investig. Dermatol. 133, 509–517 (2013).
Article CAS PubMed Google Scholar
Jayawardana, K. et al. Determination of prognosis in metastatic melanoma through integration of clinico-pathologic, mutation, mRNA, microRNA, and protein information. Int. J. Cancer 136, 863–874 (2015).
Article CAS PubMed Google Scholar
Wang, K. Y. X. et al. Cross-platform omics prediction procedure: A statistical machine learning framework for wider implementation of precision medicine. NPJ Digit. Med. 5, 85 (2022).
Article PubMed PubMed Central Google Scholar
Liu, Y. et al. Identification of immune-related prognostic biomarkers based on the tumor microenvironment in 20 malignant tumor types with poor prognosis. Front. Oncol. https://doi.org/10.3389/fonc.2020.01008 (2020).
Article PubMed PubMed Central Google Scholar
Hussain, B. et al. High endothelial venules as potential gateways for therapeutics. Trends Immunol. 43, 728–740 (2022).
Article CAS PubMed PubMed Central Google Scholar
Chang, L. et al. DNAH2 facilitates the homologous recombination repair of Fanconi anemia pathway through modulating FANCD2 ubiquitination. Blood Sci. 3, 71–77 (2021).
Article PubMed PubMed Central Google Scholar
Dahmke, I. N. et al. Curcumin intake affects miRNA signature in murine melanoma with mmu-miR-205-5p most significantly altered. PLoS One 8, e81122 (2013).
Article ADS PubMed PubMed Central Google Scholar
Mueller, D. W., Rehli, M. & Bosserhoff, A. K. miRNA expression profiling in melanocytes and melanoma cell lines reveals miRNAs associated with formation and progression of malignant melanoma. J. Investig. Dermatol. 129, 1740–1751 (2009).
Article CAS PubMed Google Scholar
Shen, J. et al. Novel insights into miR-944 in cancer. Cancers 14, 4232 (2022).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Chang, R.-M. et al. miRNA-487a promotes proliferation and metastasis in hepatocellular carcinoma. Clin. Cancer Res. 23, 2593–2604 (2017).
Article CAS PubMed Google Scholar
Kott, K. A. et al. Biobanking for discovery of novel cardiovascular biomarkers using imaging-quantified disease burden: Protocol for the longitudinal, prospective, BioHEART-CT cohort study. BMJ Open 9, e028649 (2019).
Article PubMed PubMed Central Google Scholar
Gensini, G. G. A more meaningful scoring system for determining the severity of coronary heart disease. Am. J. Cardiol. 51, 606 (1983).
Article CAS PubMed Google Scholar
Zhu, D. et al. Lipidomics profiling and risk of coronary artery disease in the BioHEART-CT discovery cohort. Biomolecules 13, 917 (2023).
Article PubMed PubMed Central Google Scholar
Vernon, S. T. et al. Metabolic signatures in coronary artery disease: Results from the BioHEART-CT study. Cells 10, 980 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ramos, M. et al. Multiomic integration of public oncology databases in bioconductor. JCO Clin. Cancer Inform. 4, 958–971 (2020).
Article PubMed Google Scholar
Strbenac, D. et al. Melanoma explorer: A web application to allow easy reanalysis of publicly available and clinically annotated melanoma omics data sets. Melanoma Res. 29, 342–344. https://doi.org/10.1097/cmr.0000000000000533 (2019).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors thank the staff at the Royal North Shore Hospital, in particular staff at North Shore Radiology and staff at NSW Health Pathology for their support. The authors also thank their colleagues at the University of Sydney, School of Mathematics and Statistics, Charles Perkins Centre, and Kolling Institute for their intellectual engagement.

Funding

The following sources of funding for each author, and for the manuscript preparation, are gratefully acknowledged: AT is supported by an Australian Commonwealth Government Research Training Program Stipend Scholarship; GF is supported by a National Health and Medical Research Council Practitioner Fellowship (grant number APP11359290), Heart Research Australia, and the New South Wales Office of Health and Medical Research. JYHY and EP are supported by the AIR@innoHK programme of the Innovation and Technology Commission of Hong Kong.

Author information

Authors and Affiliations

School of Mathematics and Statistics, The University of Sydney, Camperdown, NSW, Australia
Andy Tran, Jamie Mickaill, Dario Strbenac, Ellis Patrick & Jean Yee Hwa Yang
Charles Perkins Centre, The University of Sydney, Camperdown, NSW, Australia
Andy Tran, Dario Strbenac, Mark Larance, Stephen T. Vernon, Stuart M. Grieve, Gemma A. Figtree, Ellis Patrick & Jean Yee Hwa Yang
Sydney Precision Data Science Centre, The University of Sydney, Camperdown, NSW, Australia
Andy Tran, Dario Strbenac, Ellis Patrick & Jean Yee Hwa Yang
Westmead Medical Institute, Westmead, NSW, Australia
Andy Wang
School of Computer Science, The University of Sydney, Camperdown, NSW, Australia
Jamie Mickaill
Kolling Institute of Medical Research, St Leonards, NSW, Australia
Stephen T. Vernon & Gemma A. Figtree
Department of Radiology, Royal Prince Alfred Hospital, Camperdown, Australia
Stuart M. Grieve
Laboratory of Data Discovery for Health Limited (D24H), Science Park, Hong Kong SAR, China
Ellis Patrick & Jean Yee Hwa Yang

Authors

Andy Tran
View author publications
You can also search for this author in PubMed Google Scholar
Andy Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jamie Mickaill
View author publications
You can also search for this author in PubMed Google Scholar
Dario Strbenac
View author publications
You can also search for this author in PubMed Google Scholar
Mark Larance
View author publications
You can also search for this author in PubMed Google Scholar
Stephen T. Vernon
View author publications
You can also search for this author in PubMed Google Scholar
Stuart M. Grieve
View author publications
You can also search for this author in PubMed Google Scholar
Gemma A. Figtree
View author publications
You can also search for this author in PubMed Google Scholar
Ellis Patrick
View author publications
You can also search for this author in PubMed Google Scholar
Jean Yee Hwa Yang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.Y. conceived the design of the project. A.T. led the computational development described in this work with contributions from A.W., J.M. and D.S., with supervision from E.P. and J.Y. G.F. and S.G. led the BioHEART-CT study with contributions from S.V. M.L. led the proteomics component and S.G. led the imaging component of the BioHEART-CT study respectively.

Corresponding author

Correspondence to Jean Yee Hwa Yang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure S1.

Supplementary Figure S2.

Supplementary Figure S3.

Supplementary Figure S4.

Supplementary Figure S5.

Supplementary Legends.

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tran, A., Wang, A., Mickaill, J. et al. Construction and optimization of multi-platform precision pathways for precision medicine. Sci Rep 14, 4248 (2024). https://doi.org/10.1038/s41598-024-54517-8

Download citation

Received: 24 April 2023
Accepted: 13 February 2024
Published: 21 February 2024
DOI: https://doi.org/10.1038/s41598-024-54517-8
Springer Nature Limited

Construction and optimization of multi-platform precision pathways for precision medicine

Abstract

Similar content being viewed by others

Explore related subjects

Introduction

Results

The development of an “uncertain” class enables multi-stage classification

Clinical precision pathways made transparent by MultiP

Criterion-guided optimization of precision pathways outperforms baseline models

MultiP pathways can be optimized on multiple criteria

MultiP pathways are transferable across different cohorts

Discussion

Methods

MultiP algorithm

Confidence score

Construction

Evaluation

Cross-validation

Datasets

BioHEART-CT

The Cancer Genome Atlas (TCGA)

Melanoma institute of Australia (MIA)

Baseline comparison

Transferability analysis

Implementation

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation