Keywords

1 Introduction

In recent years, clinical practice has seen a dramatic increase in the utilization of targeted and untargeted metabolomics (TM and UM, respectively) [1, 2]. Metabolomics has been termed “the stethoscope of the twenty-first century” with broad applications in many fields of medicine: In oncology, it is utilized for biomarkers; in neurology, for severity stratification of neurodegenerative disorders; in endocrinology for diabetes modulators; in rheumatology and cardiology where certain metabolites may improve prognostication of chronic diseases (e.g., osteoarthritis and atherosclerosis); and in gastroenterology, where metabolomics may assist in differentiating between Crohn’s disease and ulcerative colitis [3]. UM has been used in multifactorial disorders to assist in risk assessment and early diagnosis, such as the observation that increased plasma levels in three out of five aromatic and branched-chain amino acids (isoleucine, leucine, valine, tyrosine, and phenylalanine) confers a fivefold increased likelihood for developing type 2 diabetes [4] or that increased plasma levels of glycocholate, taurocholate, and glycochenodeoxycholate are associated with nonalcoholic fatty liver disease (NAFLD). In contrast, decreased plasma levels of free carnitine, butyrylcarnitine, methyl-butyryl carnitine, and cysteine-glutathione are seen in nonalcoholic steatohepatitis (NASH) [5]. Similarly, breast cancer showed a pattern of increased total choline-containing substances and decreased glycerophosphocholine in the plasma [6], correlating malignancy to a glycerophosphocholine-to-phosphocholine ratio switch [7].

Naturally, UM can highlight alterations in complex “metabolic disorders” and refine our understanding of metabolic flux in health and disease. The untargeted global assessment can further provide a high-resolution cellular homeostasis map and multivariant perturbation metrics that can assist in prognostication, the need for intervention, and the effect of a therapeutic modality. From a clinical perspective, a rare disease is a primary focus, and areas of influence for untargeted metabolomics include improving diagnostic rates, allowing affordable high-throughput disease screening, and identifying novel disease biomarkers that can be translated to the clinic. UM can contribute significantly toward a facilitated diagnosis and direct targeted treatment, thus enabling a reduction of the traditionally high morbidity and mortality associated with the under-recognition and undertreatment of metabolic disorders.

2 Metabolomics Joins the Diagnostic Front Seat

Clinicians justifiably consider exome sequencing (ES) to be the panacea of all diagnostic dilemmas, particularly when routine laboratory studies and tissue biopsies fail to establish a diagnosis. Indeed, ES has ushered in a new diagnostic era. Thanks to the reduced cost of sequencing, massive utilization of ES is now widely available, turning it into first-tier clinical testing for diagnostic evaluation of developmental delay and other congenital anomalies [8,9,10]. ES is perhaps most heavily relied upon in the neonatal intensive care unit (ICU), in which a third of admitted patients are due to genetic causes [11]. ES is also heavily relied upon for primary mitochondrial disorders (PMD) [12], for which there is a notorious lack of biomarkers with adequate specificity [13]. This is further exemplified by the staggering finding of normal muscle respiratory chain enzymes in 10–20% of cases with mitochondrial myopathy undergoing invasive diagnostic procedures [14].

Nonetheless, as intriguing as it may appear, a diagnosis relying solely on ES without additional laboratory results lowers the test’s pretest probability and increases the likelihood of false-positive results [15]. The American College of Medical Genetics and Genomics (ACMG) recommends that effort should be made to avoid using the pathogenicity of a variant as the sole evidence of a Mendelian disease but rather should be used in conjunction with other clinical information [16]. Perhaps, the best demonstration of the chasm between the diagnostic promise of ES and reality is variants of uncertain significance (VUS) [17], perceived as a “challenge” in ~1/3 of the articles reporting ES results [18]. With inconclusive molecular data, generating a snapshot of metabolites during illness creates a functional bioassay for candidate metabolic pathways. It provides an independent tool for ruling in or ruling out suspected diagnoses. Following the ACMG variant interpretation algorithm [16], metabolomic data fall into the functional data category by providing “well-established functional studies to show deleterious [or non-deleterious] effect,” which is considered strong supportive information for the pathogenic or benign nature of a variant. Metabolomic data, therefore, can push the needle from the neutral zone (a VUS) to the actionable zone [19, 20]. It is, nevertheless, of utmost importance to report only significant variations in metabolites to avoid reporting fluctuations stemming merely from dietary changes, environmental factors, drug exposure, and normal daily changes in metabolism. However, the definition of significance is still laboratory dependent [21]. It is also of equal importance to provide detailed clinical information and the nutritional and therapeutic status of the patient to the performing laboratory to optimize the data analysis. Given that a VUS is a common finding in ES, estimated to occur in 30–80% of clinically indicated ES tests [22, 23], and comprises ~30% of variants found by targeted sequencing for suspected inborn errors of metabolism (IEMs) [24], a variant validation and classification tool alongside molecular diagnostics is critical. UM serves as a valuable biochemical, functional validation instrument that can be integrated into clinical care.

2.1 Monoamine Synthesis

The utilization of UM for VUS reclassification was demonstrated by Atwal et al. in a diagnostic odyssey of an 11-month-old boy presenting with intellectual disability and hypotonia with episodes of generalized stiffening [25]. Initially diagnosed with cerebral palsy, exome sequencing showed two missense VUS in DCC, which encodes aromatic-l-amino acid decarboxylase (AADC), a key enzyme in monoamine synthesis. AADC deficiency (AADCD) results in a severe neurometabolic disorder due to the combined deficiency of serotonin, dopamine, norepinephrine, and epinephrine. The onset of the disease is typically in the first months of life. However, the phenotypic spectrum of disease is broad and includes hypotonia, oculogyric crises, ptosis, dystonia, hypokinesia, developmental delays, and autonomic dysfunction [26]. Diagnosis is typically made by abnormal monoamine metabolites in CSF, followed by molecular confirmation or enzyme analysis in plasma. Elevation of 3-O-methyldopa (3-OMD), also termed 3-methoxytyrosine (3-MT), a catabolite of dihydroxyphenylalanine (l-DOPA), indicating a blockage in the conversion of l-DOPA into dopamine. For the patient described in Atwal et al., UM showed a plasma level of 3-MT 69 times higher than the control population.

Further confirmation of the UM findings was achieved by demonstrating absolute levels of 3-MT in CSF > three-fold the upper limit of normal, along with undetectable levels of the dopamine and serotonin derivatives, homovanillic acid (HVA), and 5-hydroxy indoleacetic acid (5-HIAA), respectively. Neurotransmitter analysis in CSF has been considered the gold standard for diagnosing AADC deficiency. Yet, this work presents that UM can attain similar results in plasma without requiring an invasive procedure. At times of atypical presentation and a wide differential diagnosis, UM can cast a wide net in a single test and may render specific diagnostic tests and procedures, each for a single suspected disease, inessential. The presentation can be atypical for the late-onset subgroup of patients with AADCD, including milder symptoms of hypotonia, dystonia, and fatigue [27]. As symptoms of AADC uniformly present in the first years of life [28, 29], a differential diagnosis may include “dopa-responsive dystonia,” with a trial of l-DOPA initiated as part of the diagnostic workup [30]. In light of this treatment approach [31], an additional work examined the ability of UM to differentiate an AADCD metabolic profile from other l-DOPA-treated conditions, as 3-MT will be elevated in both cases [32]. Two patients with AADCD before initiation of l-DOPA showed similar elevations of 3-MT as five patients with non-AADCD pathology (Z-scores of +5.88 and +7.65, respectively). However, reduced levels of dopamine 3-O-sulfate (D3OS) and vanillylmandelic acid (VMA) downstream of the AADC enzyme were seen in AADCD patients. In contrast, non-AADCD patients had elevated levels of D3OS and 3-methoxytyramine sulfate (3-MTS), showing a surplus of dopamine following treatment with L-DOPA. As these results were obtained from plasma and not from CSF, these data again demonstrate the robustness of the metabolic profile of AADCD obtained noninvasively.

2.2 Ornithine Metabolism

Diagnosis by variant reclassification was also made in a 7-year-old male with global developmental delay (GDD), ADHD, epilepsy, and ectodermal abnormalities [33]. The boy was found to have a novel, presumed splice-site VUS in ODC1. This gene encodes ornithine decarboxylase, converting ornithine into putrescine, the first step of the spermidine pathway. UM showed excessive N-acetylputrescine, a metabolite of putrescine, confirming a gain-of-function variant in this gene, consistent with a diagnosis of Bachmann-Bupp syndrome (MIM 619075), an autosomal dominant disorder of neurodevelopment characterized by GDD, macrocephaly, white matter and callosal abnormalities, spasticity, seizures, and ectodermal defects (alopecia, cutaneous vascular malformations) [34, 35].

2.3 NAD(P)HX Repair System

VUS reclassification can also direct treatment for metabolic disorders amenable to the intervention. The NAD(P)HX repair system is a highly conserved two-enzyme system that restores damaged NAD(P)H [36]. In an acidic and hyperthermic environment, NADH and NADPH can both undergo hydration into NAD(P)HX nonenzymatically [37]. Without the repair system, the inactive NAD(P)HX accumulates and depletes the NAD+ pool under cellular stress. Biallelic pathogenic variants in either of the genes coding for the two enzymes, NAXD and NAXE, cause a rapidly progressive neurometabolic disorder triggered by inflammatory stress (mostly febrile illness), bearing high mortality in the first decade of life (MIM 618321 and 617186, respectively) [38,39,40]. A case of a fever-triggered encephalomyopathy crisis in a 16-year-old adolescent demonstrated a small deletion encompassing the first two exons of NAXD, in trans to a missense VUS in the exon 1-intron 1 splice donor, suspected to alter both splicing and the mitochondrial localization of NAXD, which contains the mitochondrial targeting sequence in its first exon [41]. Plasma UM during metabolic crisis demonstrated NAD+ depletion and led the medical team to initiate niacin therapy. Under therapy, clinical status improved, and baseline metabolomics demonstrated repletion of NAD+. Not only did UM biochemically support the pathogenicity of the variant, but it also provided monitoring data for the targeted treatment. These results were further validated in a case of NAXE deficiency, for which evidence of depletion of NAD+ derivatives was presented during a crisis. Niacin treatment appeared to prevent metabolic decompensation during a subsequent febrile illness while on niacin supplementation (Fig. 1).

Fig. 1
An illustration features a defective N A D, P, H X repair system that leads to reduced N A D positive availability. Metabolomic profiles reveal N A D positive depletion with inflammation indicated by low 1-methylnicotinamide and high quinolinate. Niacin supplementation corrects these issues.

NAD(P)HX repair system deficiency and plasma untargeted metabolomics. Top: NAD+ utilization (red) and repletion by biosynthesis (green) pathways are shown. When the repair system is defective (left, black), correction of spontaneously converted NADH to NADHX back to NADH (blue) is impaired, lowering available NAD+ and, thus, lowering utilization products while increasing upstream biosynthesis markers (quinolinate). Bottom: Plasma metabolomic profiles of deficiency in the NAD(P)HX repair system when patients are under inflammatory stress. Two patient profiles are shown, one for each repair system enzyme (NAXD, NAXE), with the effect of NAD+ depletion on selected analytes (red bars) and after supplementation with niacin (green bars). Taken together, in both NAXD and NAXE deficiencies, these results point to NAD+ depletion during inflammatory stress that is amenable to correction with substrate repletion. Bottom: Left panel—At the time of acute inflammatory stress, the red bars show the metabolomic profile from a patient with NAD(P)HX dehydratase deficiency (NAXD, EC 4.2.1.93), demonstrating the absence of 1-methylnicotinamide, marked deficiency in N1-methyl-2-pyridine-5-carboxamide, and increased quinolinate (red bars). At 11 months’ post-crisis, the patient underwent another UM profiling (green bars), showing a reversal of these alterations while under niacin treatment. Bottom: Right panel—Shown are the same key molecules in plasma from a patient with NAD(P)HX epimerase deficiency (NAXE, EC 5.1.99.6) demonstrating similar trends during an acute inflammatory crisis (red bars). Repeat UM profiling at 9 months’ post-crisis while on niacin supplementation showed a reversal of these alterations (green bars)

2.4 Riboflavin Metabolism

Diagnosis and treatment guidelines were also provided in a case of a 2-year-old male diagnosed with hypoplastic macrocytic anemia (hemoglobin of 5.5 g/dL and mean corpuscular volume of 107.1 fL, range 10.5–14.0 g/dL and 76–90 fL, respectively) who later developed ataxia and nystagmus in the context of respiratory syncytial virus (RSV) infection [42]. Brain magnetic resonance imaging (MRI) showed enhancement of cranial nerves III and V, diffuse intramedullary T2 hyperintensity of the entire cord, and cauda equina nerve root thickening and enhancement. ES showed a pathogenic variant and a novel VUS predicted to cause an in-frame single amino acid deletion (p.Phe153del) in SLC52A2, an intestinal basolateral and a blood-brain barrier riboflavin transporter (MIM 607882). UM showed elevated C6 (hexanoylcarnitine), C8 (octanoylcarnitine), C10 (decanoylcarnitine), and C10:1 (decenoylcarnitine), supporting the pathogenicity of the in-frame deletion variant, in addition to elevations in 2-hydroxyglutarate, methyl succinate, and ethylmalonate, common secondary alterations in short- and medium-chain fatty acid oxidation defects. These perturbations can also be detected on TM, yet the value of UM was nicely demonstrated by showing additional perturbations in metabolic pathways that are related to riboflavin deficiency. Riboflavin is the precursor of flavin adenine dinucleotide (FAD), a necessary cofactor for kynurenine-3-monooxygenase, the de novo NAD+ biosynthesis pathway member; in the patient, the proximal kynurenine was increased, and the distal picolinate was decreased. Reduction of 5,10-methylene tetrahydrofolate into 5-methyltetrahydrofolate is also dependent on FAD, leading to inhibition of the one-carbon cycle and recycling of sulfur-containing amino acid, ultimately resulting in elevations in sarcosine, dimethylglycine, methionine, and glycine. These values normalized upon initiation of high-dose riboflavin (70 mg/kg/day), with a resolution of the anemia and nystagmus and improvement in ataxia. While ataxia and nystagmus are common findings, the highly suggestive upper body proximal muscle weakness with prominent neck flexion and the early symptom of dysphagia were not seen; however, macrocytic anemia is a rare finding [43], making a diagnosis based on clinical suspicion very difficult. Indeed, SLC52A2 deficiency, also called Brown-Vialetto-Van Laere syndrome 2 (MIM 614707), is a difficult diagnosis to make, with an average time to diagnosis of more than 2 years, a significant delay for a disorder manifesting in the first decade of life [43]. For example, normocytic anemia was misdiagnosed in a toddler as pure red cell aplasia overlooking an insidious weakness and areflexia and delaying the diagnosis of SLC52A2 deficiency for 3 years [44].

2.5 Histidine Metabolism

Not only for VUS can interpretation by UM assist in diagnostic dilemmas. Deficiency of urocanase (also called urocanate hydratase), an enzyme participating in the histidine deamination breakdown pathway, was previously considered to be associated with intellectual disability [45, 46]. Intellectual disability (ID) is a relatively common clinical finding, with a 0.8–3.7% prevalence in the pediatric population [47, 48]. Common disorders can manifest independently in rare diseases. A connection can be falsely established due to (a) small cohorts (an inherent problem of rare disorders), (b) consanguinity increasing independently the prevalence of IEM [49, 50] and ID [51, 52], and (c) publication bias, resulting from a top-down sequencing of patients presenting with a multitude of symptoms. In a recent work, plasma and urine UM showed a significant increase in both cis- and trans-urocanate and imidazole propionate in two asymptomatic patients with biallelic pathogenic variants in UROC1 and normal intellect and with no other significant metabolic alterations [53] (Fig. 2). UM helped expand our understanding of the benign nature of urocanase deficiency (MIM 276880) and the need to pursue a genetic diagnosis for ID in patients with urocanase deficiency, per standard guidelines [10].

Fig. 2
2 metabolomic maps of histidine catabolism in a patient. Green arrows exhibit upstream accumulations in plasma. Blue circles indicate downstream deficiencies in urine. The indications are as follows. Red, excess. Blue, deficient. Pink, moderate excess. Black, within range, Gray, unmeasured.

Metabolomic map of the histidine catabolism pathway in a patient with urocanate hydratase (EC 4.2.1.49, urocanase) deficiency in (a) plasma and (b) urine. The affected arm of the pathway, highlighted in green arrows in both plasma and urine, shows the accumulation of analytes immediately upstream to the enzymatic block, trans-urocanate and cis-urocanate. These accumulations then result in the subsequent accumulation of imidazole propionate (in equilibrium with trans-urocanate) and, to a lesser degree, 1-methylhistidine. In urine, the deficiency of the downstream analytes, hydantoin-5-propionate and glutamate, are also shown (blue circles). The color, diameter, and shading of each circle are proportional to the Z-score. Red circles indicate analytes in excess (Z-score > +2); blue circles represent deficient analytes (Z-score < −2); pink circles indicate analyte excess with a Z-score > +1.5 < +2. Black circles indicate analytes within −1.5 ≤ Z-score ≤ +1.5. Gray circles indicate analytes that are not measured in this assay. (Adapted from Glinton et al., 2018. Urocanate hydratase (urocanase) is indicated by the black “no entry” sign)

2.6 The Diagnostic Rate Among Inborn Error of Metabolism

A direct comparison of TM (plasma amino acids [PAA], acylcarnitine profile [ACP], and urine organic acids [UOA]) to UM showed that the latter has an overall ~sixfold higher diagnostic rate for IEM (7.1% vs. 1.3%) [19]. That result may not be unexpected, yet the power of UM was demonstrated by the variety of diagnoses it allows over TM. Diagnoses included disorders of synthesis of neurotransmitters, cholesterol, and peroxisomal biogenesis. γ-aminobutyric acid-(GABA)-transaminase deficiency (MIM 613163). This early infantile epileptic encephalopathy with a movement disorder and hypersomnolence due to GABA catabolism defect, was diagnosed in a 1-year-old patient with hypotonia and movement disorder and compound heterozygous VUS in ABAT, which encodes GABA-transaminase. UM showed 2-pyrrolidinone, succinamic acid, and succinimide elevations that were not seen on UOA. These three metabolites are reliably detected in both CSF and plasma, in contrast to CSF GABA alone, the gold standard for diagnosis but prone to false-negative results if not handled appropriately [27]. Specifically, 2-pyrrolidinone is a butyrolactam spontaneously converted from GABA when the latter is not broken down to succinic semialdehyde (SSA) by GABA-transaminase. 2-pyrrolidinone is converted to succinimide and succinamic acid [54], making these three metabolites alternative biomarkers. The same metabolic profile was seen in a 6-year-old with global developmental delay (GDD) and movement disorder who was also homozygous for a VUS and two other patients presenting with encephalopathy, seizures, cortical blindness, motor developmental delay, hypotonia, strabismus, ataxia, and intellectual disability [19, 54].

A diagnosis of Smith-Lemli-Opitz (MIM 270400), a disorder of cholesterol biosynthesis, was made in a typically presenting 10-year-old male with microcephaly, hypotonia, developmental delays, and a congenital cardiac defect. In that patient, 7-dehydrocholesterol was elevated, while cholesterol was decreased. Initially, a diagnosis could be ascertained by comparing this profile to the near identical profile of a molecularly confirmed 3-year-old boy [19] and later confirmed molecularly. In a different study, such a diagnosis was excluded in a 28-year-old patient with developmental delays by demonstrating normal 8-dehydrocholesterol and 7-dehydrocholesterol, downgrading a VUS to likely benign [55].

The latter study presented 16 cases of either homozygous or compound heterozygous VUS in which metabolomics assisted in variant reclassification after ES was nondiagnostic. Noteworthy in this study are (1) a case of a patient presenting with early-onset recurrent nephrolithiasis, urosepsis, and transfusion-dependent anemia, showing an excess of orotic acid in the urine (a metabolite not easily detected in UOA); thus, the two missense VUS were reclassified to pathogenic and likely pathogenic, and treatment with uridine monophosphate was initiated; and (2) a case of psychomotor retardation and retinitis pigmentosa with increased urinary 5-oxoproline excretion, and despite negative ES, the finding that prompted targeted sequencing of the GSS gene, demonstrating homozygosity for a deep intronic variant for this highly heterogeneous condition, 5-oxoprolinase deficiency (MIM 260005).

UM performed on plasma from a 17-year-old with agenesis of the corpus callosum, autism spectrum disorder, lactic acidosis, hyperammonemia, and electrolyte abnormalities revealed reduced pantothenate, carnitine, and carnitine derivatives due to SLC5A6 deficiency, confirmed by biallelic VUS identified by subsequent exome sequencing [19]. SLC5A6 is a cellular cotransporter of pantothenate, biotin, and α-lipoic acid in the intestine and blood-brain barrier [56]. SLC5A6 deficiency leads to early-onset neurodegenerative disorder and also includes failure to thrive, acquired microcephaly, movement disorder, immunodeficiency, gastrointestinal dysfunction, and osteopenia. Early treatment with high-dose pantothenate, biotin, and α-lipoic acid seems to improve outcomes [57, 58]. UM also confirmed the diagnosis of medium-chain acyl-CoA dehydrogenase deficiency, argininemia, and X-linked glycerol kinase deficiency (GKD, MIM 307030), in which molecular diagnoses were inconclusive or missing, among other conditions [19].

To directly analyze the contribution of UM to the interpretation of variants identified in ES, Alaimo et al. examined clinical samples with both ES and UM testing that were sent for diagnostic purposes in a cohort of 170 patients [20]. This cohort was similarly primarily pediatric, with>90% presenting with neurological symptoms. The researchers identified 145 variants in 74 patients in their cohort in 73 genes associated with an IEM. Based on the metabolomic data, the 12.3% diagnostic rate in this cohort facilitated the reclassification of 27 variants (19%). Of the reclassified variants, 24 VUS were reclassified as either likely pathogenic (n = 15) or likely benign (n = 9), while an additional three variants were upgraded from likely pathogenic to pathogenic. Classifications were done according to the ACMG guidelines [16]. For 17 additional pathogenic variants, the study presented confirmatory metabolic perturbations. One case upgraded a homozygous VUS to a likely pathogenic variant, making the diagnosis of guanidinoacetate methyltransferase (GAMT) deficiency likely in the patient and allowing the clinician to focus on the treatment for this disorder of cerebral creatine deficiency. Moreover, for autosomal recessive conditions in which sequencing revealed only heterozygous variants, UM ruled out the suspected condition in ~60% of cases by showing a normal metabolite profile for the metabolic pathway in question, thus excluding the possibility of a biochemically symptomatic carrier possessing a second disease-causing allele not detected due to the constraints of ES.

2.7 Automation of Data Interpretation

Similarly to ES diagnostic capabilities, no discussion of UM clinical diagnostic capabilities will be complete without discussing the interpretation of the (untargeted) data. The ability to correctly diagnose a condition based on UM heavily depends on the art of data interpretation. Rather than the convoluted manual analysis of metabolomic data (with or without genomic information), automated bioinformatic tools provide means for pattern recognition and thus hold a great promise of an improved matching between datasets and a particular disease or pathway. One approach applies the clique problem in a metabolomic graph. A clique is defined as an interconnected group of metabolites (nodes), and small highly connected cliques are extracted based on computational analysis bounding cliques’ p values [59]. Across 539 plasma samples, this connect-the-dots (CTD) approach reproduced accurate diagnosis of 16 different IEMs [60]. The top-down bioinformatic approach seeks to identify causative genes by providing a likelihood scoring system based on heuristic algorithms predicting the effects they expect to exert on the -omic dataset. Several metrics achieved prioritization of candidate genes based on UM. In cross-omics, a metabolomic study performed using dried blood spots (DBS) [61], candidate genes were prioritized based on their distance from the perturbed metabolite (where each reaction accounts for 1 step), with a narrowing process of including metabolites that participate in only limited amount of reactions (“uniqueness”) and limiting metabolites to only those with a significant Z-score (“significance”) to generate gene-specific metabolite sets. Successful prioritization was achieved by considering metabolites up to four reactions away from the primary reaction, uniqueness of up to 15 reactions, and significance of >+3 or <−3. In metPropagate [62], each protein is assigned a rank based on its associated metabolite enrichment, and dynamic ranking is propagated with the protein’s functional linkage network. This approach successfully prioritized causative genes (within the top 20th centile) in 20% of IEMs in the study group and 82% (9/11) in the test group of neurometabolic disorders. This algorithm outperformed Exomiser [63], a causative variant prioritization tool based on human phenotype ontology data and standard pathogenicity prediction (variant prevalence, conservation, and information from model organisms and inheritance patterns). In another approach, Reafect [64] assigned a score based on cumulative pathway perturbations, including metabolites with sub-significant Z-scores, correctly predicting the causative gene in the top fifth centile in 80% of the 76 patients harboring 36 different IEMs. When combined with the deleteriousness predictive score, Combined Annotation Dependent Depletion (CADD) [65], specificity further increased. Another promising approach is the implementation of a siamese neural network, weighing in the tandem computational metabolic network (for predicting metabolite flux) and machine learning (ML, for matching metabolic networks to diseases) trained by the ML algorithm [66]. The model used a single simulated profile for each disease with real data points from only 2% of the diseases to outperform a generic algorithm that prioritizes causative genes based on distance from real data profiles (using the L1 Manhattan metrics). Limitations to ML include the quality of the training set and the generalizability of metabolic profiles to novel diseases. Nonetheless, implementing predictive ML-based algorithms promises to reduce the number of patient-derived samples required for disease discovery (smaller cohorts), a vital prerequisite in the world of rare diseases such as IEMs [59].

3 Metabolomic Fingerprinting: Identifying Diseases’ Biometrics and Finding New Disease Biomarkers

Biomarkers are crucial for the effective screening, designation, and diagnostic confirmation or exclusion of diseases by UM. Biomarkers are also critical in monitoring an affected patient’s metabolic status and treatment efficacy. Clinical biomarkers for IEMs are abundant [67], although most are derived from TM. Many biomarkers are assessed by routine biochemical tests, such as ammonia, lactate, uric acid, and cholesterol, or from TM, such as UOA (e.g., trimethylamine in fish odor syndrome) or PAA (high phenylalanine in PKU). To increase specificity, either identification of disease-specific biomarkers can assist in the diagnosis (or ruling out) of a disease, such as allo-isoleucine in MSUD or argininosuccinic acid in argininosuccinate lyase (ASL) deficiency, or the identification of several non-pathognomonic metabolites, such as low levels of lysine, ornithine, and arginine in plasma amino acids suggestive of LPI, or the elevation of propionylcarnitine on ACP, methylmalonic acid in UOA, low methionine on PAA, and homocysteine in the blood, indicative of an intracellular cobalamin utilization defect. UM can instigate both strategies. By its untargeted nature, UM has the potential to uncover biomarkers that were not known to be associated with an IEM or not visible by targeted testing.

3.1 Peroxisome Biogenesis

Peroxisome biogenesis disorders in the Zellweger spectrum (PBD-ZSD) are a heterogeneous group of genetic disorders caused by mutations in genes responsible for normal peroxisome assembly and functions [68]. The majority of cases are due to biallelic pathogenic variants in PEX1 (61%), followed by PEX6 (15%) and PEX12 (8%), with additional contribution from at least another 10 PEX genes [69], an important group of proteins essential for the assembly of peroxisomes and the recognition and transport of cytoplasmic proteins that contain peroxisomal targeting sequence [70]. The severe end of PBD-ZSD includes neuronal migration defects with leukodystrophy, neonatal onset seizures, hypotonia, failure to thrive, liver dysfunction, bony stippling respiratory insufficiency, cataracts, sensorineural hearing loss, and renal cortical microcysts; the mild end of the spectrum includes developmental delays and intellectual disability that can be mild and slowly progressive retinopathy and sensorineural hearing loss [68, 69, 71]. Severe cases can be screened by elevation of the very-long-chain fatty acid (VLCFA), C26:0-lysophosphatidylcholine (DBS, plasma), phytanic and pristanic acids (plasma); reduction in plasmalogens (plasma, erythrocyte membranes); increase in pipecolic acid (plasma, urine); and increase in the bile acids, dihydroxycholestanoate, and trihydroxycholestanoate (plasma, urine) [69]. Intermediate and mild cases may show only subtle alterations, and in combination with subtle clinical symptoms, screening for PBD-ZSD may not be performed. In a cohort of 19 mild to intermediate PBD-ZSD pediatric patients, Wangler et al. showed a distinct PBD-ZSD plasma metabolome (Fig. 3), with elevated levels of long-chain dicarboxylic acids, pipecolic acid, and the bile acid derivative 7α-hydroxy-3-oxo-4-cholestenoic acid and with reductions in phosphatidylcholines, phosphatidylethanolamines, and plasmalogens [71]. Pipecolic acid and the lysophospholipid 1-lignoceroyl-GPC (24:0) were most strikingly elevated with a Z-score of +3.7. Less anticipated changes included dicarboxylic acids of 16–22 carbons. UM also proposed novel biomarkers, with a reduction in nine sphingomyelin species. Reduction of several sphingomyelins in the clinical UM database, excluding PBD-ZSD, was observed in only 2% (interestingly, one of which was diagnosed with bifunctional protein deficiency, which can mimic PBD-ZSD). Moreover, the plasma elevations in pipecolic acid and reductions in the sphingomyelins attenuated with age, an expected finding correlating milder phenotypes among older surviving patients.

Fig. 3
An illustration of plasma metabolomic profile in P B D-Z S D. Early diagnosis, less than 10 years, reveals significant lipid abnormalities. Late diagnosis, greater than 10 years, indicates milder perturbations. Blue circles in the metabolomic tree indicate lipid deficiencies in younger patients.

Plasma metabolomic footprint of peroxisome biogenesis disorder-Zellweger spectrum disorder (PBD-ZSD). Top: Left panel. Early diagnosis (age <10 years) of PBD-ZSD shows a more significant metabolic phenotype, with enrichment of lysoplasmalogens, plasmalogens, sphingolipid metabolism, and phosphatidylcholines, reaching statistical significance (colored bars) for patients when compared to control population. Top: Right panel. The relative deficiencies of most of these lipids in PBD-ZSD in children under the age of 10 years are represented by the blue circles in the metabolomic tree of lipid analytes (lipidomics). Bottom: Left panel. Later diagnosis of PBD-ZSD (age >10 years) exhibits a more subtle metabolic phenotype. Bottom: Right panel. The metabolomic fingerprint of PBD-ZSD in individuals >10 years of age showed mild perturbations in which reductions of the plasmalogens and lysoplasmalogens did not reach statistical significance. The lipidomic tree shows considerably fewer perturbations when compared to plasma obtained from a younger patient. Trees were rendered using Cytoscape (https://www.cytoscape.org). (Figure adapted from Wangler et al. 2018)

3.2 Urea Cycle

The urea cycle is the principal mechanism for the clearance of waste nitrogen resulting from protein turnover, the sole source of endogenous production of arginine, ornithine, and citrulline, and a principal component of the nitric oxide (NO) production pathway [72]. The urea cycle is also connected with TCA anaplerosis via the alternative synthesis of fumarate. UCDs include eight different IEMs resulting from defects in any one of the six enzymes or two transporters involved in the hepatic removal of ammonia as waste nitrogen by its conversion to urea and excretion by the kidneys [73]. Mortality and morbidity primarily contribute to neurological damage resulting from hyperammonemia (HA) and the elevation of other neurotoxic intermediates of metabolism [74]. A typical presentation of a UCD includes neonatal hyperammonemic crisis; however, nontypical presentations of later-onset HA, acute liver dysfunction, intellectual disability, or insidious pyramidal signs of the lower extremities with minimal HA crises (in arginase deficiency) have been described [27]. It has been suggested that arginase deficiency, with its unique presentation compared to other UCDs, exerts a neurotoxic effect by the guanylation of glycine, with the excess arginine serving as a guanidine donor rather than acute HA crises [75]. In a cohort of 13 patients with argininemia, Burrage et al. demonstrated an increase in additional guanidine compounds, namely, N-acetylarginine, homoarginine, argininate, and 2-oxoarginine, to a greater degree than guanidinoacetate (GA) [76]. Guanidine compound (GC) toxicity is believed to be derived from the observed in vitro neurotoxicity: diminished response to the inhibitory GABA and glycine neurotransmitters [77], promotion of non-apoptotic cell death and axonal hypersprouting [78], and inhibition of Na+/K+-ATPase activity and glutamate uptake, and decrease in antioxidant defense in the rat brain [79]. Other toxic effects of GC include ethanol-induced liver injury, stimulated osteoclastogenesis, generation of reactive oxygen species (ROS), and modulation of cerebral cortex potentials [79]. In GAMT deficiency, GA is the main GC accumulating and phenotypically exerting a greater degree of intellectual disability, refractory epilepsy, and dystonia. At the same time, pyramidal signs are less dominant as compared to arginase deficiency. While these results may indicate an important role of GA in the developing brain, another important modifier between the two disorders is the creatine level, which is normal in arginase deficiency and low in GAMT deficiency [80]. Peripheral administration of polyethylene glycol amalgamated to (PEGylated) arginase, now in advanced stages of development, will help elucidate further pathomechanistic insights by the peripheral reduction of arginine excess without restoring urea cycle function in the liver.

The same study by Burrage and colleagues also examined a UM profile of OTC deficiency, a severe X-linked disorder bearing high morbidity and mortality among males and affected females. In this UCD, morbidity is attributed to HA crises, and biomarkers are scarce, making this disorder “unscreenable.” In this study, 83% of patients (10/12) with a history of hyperammonemia (excluding females with no such history) showed a significant elevation (Z-score >+2) of either orotate or uridine. Other biomarkers of increased pyrimidine metabolism (due to the shunting of the cabamoylphosphate from the dysfunctional urea cycle), β-ureidopropionate, and uracil were not significantly elevated. For screening purposes, the sensitivity of these markers is ~60% (10/17), and the specificity is also low, given shared pathway perturbation with other UCD and pyrimidine metabolism defects; however, these results point to an important consideration for biochemical testing, indicating such abnormalities in cases of uncertain diagnosis given the sensitive nature of this disorder. The authors are aware of a case of a 1.5-year-old female presenting with fulminant liver failure and nonspecific liver biopsy for which only uracil was elevated in urine on traditional metabolic screening. Based on that result, a presumptive diagnosis of OTC deficiency was made, and targeted treatment was promptly provided; molecular testing later confirmed the suspected diagnosis. Both citrullinemia and ASL deficiency did not show unique metabolic fingerprints beyond the accumulation of citrulline and argininosuccinic acid, respectively [76 ].

3.3 Pyruvate Kinase

Pyruvate kinase deficiency (PKD, MIM 266200) is the most common form of inherited anemia due to glycolytic defects. It results in a spectrum of hemolytic anemia that can result in infantile-onset transfusion-dependent anemia or a milder form of compensated anemia [81]. A diagnostic gap exists for PKD due to the unsatisfactory performance of activity assays, a genetic composition complicating molecular diagnosis (variants in regulatory elements or effector genes such as KLF1), and the confounding effect of frequent blood transfusion on these methods [82]. In a cohort of 16 patients with PKD (against 32 controls), van Dooijeweert et al. showed three groups of metabolites in DBS that differed between the two cohorts: glycolytic products phosphoenolpyruvate and 2- and 3-phosphoglycerate, as one might expect, along with polyamines, such as spermine, spermidine, N1-acetylspermidine, and putrescine, which are associated with red blood cell (RBC) membrane integrity, and acylcarnitines such as methylmalonylcarnitine and propionylcarnitine which are involved in turnover and repair of the RBC membrane [83]. Principal core analysis showed a separation of metabolic profiles between the two groups. Interestingly, the mildly affected patients with no history of transfusion dependence or splenectomy more closely resembled the control group, followed by transfusion dependence (severe phenotype), probably due to the frequent retrieval of donor RBC, and the splenectomized patients (moderate phenotype) were furthest away from control. Based on these findings, a machine learning algorithm trained on a subset of the cohort could predict the disease in 94% of cases.

3.4 Glucose Transporter 1

UM can also reveal a metabolic fingerprinting of disease-modifying treatments, aiding the monitoring of both therapeutic efficacy and disease progression. The brain glucose transporter GLUT1 facilitates the diffusion of glucose to brain tissue to compensate for insufficient passive diffusion, given the elevated requirement for glucogenic energy production by that tissue. Heterozygous pathogenic variants in SLC2A1, which encodes GLUT1, result in a brain energy failure syndrome (GLUT1DS) caused by impaired glucose transport and result in a spectrum of phenotypes including epileptic encephalopathy, intellectual disability, acquired microcephaly, ataxia, action limb dystonia, chorea, and tremor in its severe form and paroxysmal ataxia in its mild end. Diagnosis is made by demonstrating a low CSF concentration of glucose in the presence of plasma normoglycemia and/or identification of a pathogenic variant in SLC2A1 [84]. Aside from probable energy depletion, it was suggested that depletion of glycolysis intermediates might play a role in the disorder’s pathogenesis; however, it is unknown to what extent, as in vitro models include near-complete abrogation of the transporter [85]. The sole therapeutic option available is a classic ketogenic diet (CKD). By altering the brain energy fuel into mainly ketone bodies, this therapy dramatically affects seizures and can improve cognitive outcomes. However, diet discontinuation was reported in up to 10% of patients due to side effects [86], and low compliance led experts to recommend alternative low carbohydrate diets (such as the modified Atkins diet) for adolescents and adults [87] and even exploring amylopectin-based diet of the low glycemic index [88]. Monitoring CKD is based on serum β-hydroxybutyrate levels, as measuring urine ketones—a convenient and qualitative measure of acetoacetate—has a limited role in diet monitoring [87]. A more comprehensive picture created by UM performed on six treated patients with GLUT1DS [89] showed elevations in β-hydroxybutyrate, β-hydroxybutyrylcarnitine, β-methyladipate, and N-acetylglycine. Other elevated derivatives were α-ketobutyrate, β-hydroxylaurate, 10-nonadecenoate, margarate, 15-methylpalmitate, and α-aminoheptanoate. Furthermore, pathway analysis using Kruskal-Wallis analysis (comparing pathway metabolite perturbations vs. non-pathway metabolite perturbations) showed involvement of long-chain fatty acids, phospholipids, acylcarnitines, and, to a lesser degree, sphingolipids, mono-hydroxy fatty acids, and polyunsaturated fatty acids. In accordance with fatty acid utilization, free carnitine levels were low, while carnitine-bound metabolites were elevated. CSF UM of three patients prior to CKD onset revealed low levels of glycerol 3-phosphate, an intermediary metabolite in lipid metabolism, and an increased level of isocitrate, which can indicate a TCA dysfunction [89]. These results supply a CKD metabolic profile with a more complete ketosis map, which can assist in diet fine-tuning, e.g., increasing the fat-to-carbohydrate ratio to increase overall ketosis or examining the effect of decreasing the ratio in reduced diet tolerability. The results can also guide the need for supplemental carnitine due to increased secondary excretion in ketosis, which, although it is considered a benign supplementary agent, can also exacerbate GI-related symptoms in patients under this GI-unfriendly diet and can also independently elevate plasma trimethylamine N-oxide (TMAO) levels, an atherosclerotic agent [90, 91].

3.5 Serine Metabolism

Serine biosynthesis defects occur due to deficiency in either of the three enzymes converting 3-phosphoglycerate into serine, phosphoglycerate dehydrogenase (PGDH), phosphoserine aminotransferase (PSAT), and phosphoserine phosphatase (PSP), and result in a severe neurometabolic disorder including severe intellectual disability, ataxia, nystagmus, epilepsy, hypertonia/spasticity, microcephaly, and poor growth. Deficiencies in enzymes of the serine de novo biosynthesis pathway result in low plasma and CSF levels of serine. Low serine impedes the synthesis of sphingolipids (from serine and palmitoyl-CoA), phosphatidylserine, d-serine (an agonist to the ionotropic glutamate receptor N-methyl-d-aspartate (NMDA)), and 5,10-methylenetetrahydrofolate (by converting serine to glycine) [92]. This multi-pathway effect could explain neurodevelopmental abnormalities associated with this disorder; however, a more direct association has not been clinically demonstrated. In four children with two serine biosynthesis defects, PGDH deficiency (three children) and PSAT deficiency (one child), Glinton et al. demonstrated an abnormal phospholipid profile at the time of diagnosis and, upon treatment, normalization thereof. Before initiation of treatment, serine and glycine were both decreased in all patients, and multiple phospholipids were reduced, including species of the mono- and di-unsaturated 18-carbon phosphatidylcholine and phosphatidylethanolamine in three or four of the patients [93]. Multiple sphingolipid species were reduced in two patients, most notably sphingomyelin. Except for phospholipids, no other compounds were found to be altered (either reduced or elevated) consistently in all patients evaluated [93]. The majority of these abnormalities normalized under supplementation. These results emphasize the need for early treatment. De novo synthesis of phosphatidylcholine (PC) and phosphatidylethanolamine (PE) from choline, ATP, cytidine triphosphate (CTP), and diacylglycerol by choline kinase, phosphocholine cytidylyltransferase, and choline transferase (also referred to as the Kennedy pathway) may, in fact, be inadequate at the time of rapid growth or neuronal differentiation in utero and require supplementation of PE and PC from phosphatidylserine pool (by mitochondrial phosphatidylserine decarboxylase). Another possibility is serine palmitoyltransferase (SPT) substrate promiscuity, leading to the condensation of alanine and palmitoyl-CoA into 1-deoxy-sphinganine (instead of 1-dehydro-sphinganine when serine is sufficient). Indeed, decreased sphingomyelin and increased 1-deoxy-sphingomyelin have been reported in targeted metabolomics applied for primary serine biosynthesis defects and secondary serine deficiency due to mitochondrial disorders [94]. Interestingly, a DBS sample from one patient at 38 hours of life showed markedly reduced serine, which could have served as abnormal NBS, as discussed above [93].

3.6 Pentose Phosphate Pathway and Polyol Metabolism

The pentose phosphate pathway is an important cytosolic pathway that converts glucose into ribulose-5-phosphate and produces reduced nicotinamide adenine dinucleotide phosphate (NADPH) for restoration of the antioxidant glutathione. Its oxidation product, ribulose-5-phosphate, can serve as a nucleotide building block or be further converted into phosphorylated mono-sugars that can serve as glycolytic intermediates [95]. The nonoxidative portion includes four enzymes, ribulose 5-phosphate isomerase and reductase, transaldolase (TALDO), and transketolase (TKT). Common features of TALDO deficiency (MIM 606003) are hepatosplenomegaly, anemia, thrombocytopenia, renal tubulopathy, heart abnormalities, and cholestatic liver dysfunction that can develop into cirrhosis [96]. For TKT deficiency (MIM 617044), manifestations include short stature, developmental delays, and congenital heart defects [27]. These manifestations are nonspecific; further, considering the potential reversibility of these two disorders (despite no currently available treatment), high-yield screening can facilitate diagnosis and improve outcomes. While TALDO deficiency was indicated by UM screening by identification of sedoheptulose, a single polyol that can point out an abnormality within the nonoxidative portion of PPP, Shayota et al. successfully demonstrated high-specificity multiple polyol alterations in plasma and urine metabolomics for the diagnosis of these two enzyme deficiencies (see Table 1) [97]. Two novel biomarkers were shown for both disorders (erythronate and ribonate). Secondary alterations in purine and pyrimidine metabolites, as seen in this work, are most probably due to defect in ribulose-5-phosphate processing.

Table 1 TM versus UM findings in two disorders of the nonoxidative portion of the pentose phosphate pathway [97]

3.7 Metabolomics of Muscular Diseases

The muscular disease has a broad differential diagnosis and multiple pathological mechanisms. An important etiological group is the mitochondrial myopathies, with can have early- or late-onset and acute or subacute course and are progressive in nature [14]. Buzkova et al. compared the metabolomic profiles of mitochondrial myopathies and ataxias (lumping the sporadic inflammatory disorder inclusion body myositis, which also affects mitochondrial functioning) in comparison to non-mitochondrial neuromuscular diseases [98]. This study utilized TM of 94 metabolites but is included herein, given its broad application and interest in disease fingerprinting. The first group demonstrated alterations in the transsulfuration pathway, including elevated cystathionine (1.9–4.1-fold increase) and a less consistent reduction in taurine. In contrast, the second group was characterized by normal levels of cystathionine, depletion of nicotinamide (−1.7-fold change), and increased creatine (2.1-fold change). Alterations were also found in carbohydrate metabolism leading the authors to propose a quad-biomarker set of elevations in sorbitol, alanine, myoinositol, and cystathionine, producing an area under the curve similar to fibroblast growth factor-21 (FGF-21), lactate, and pyruvate, to distinguish the mitochondrial origin of myopathy (with an overall 76% sensitivity, 95% specificity). A particular mitochondrial myopathy, mitochondrial encephalomyopathy, lactic acidosis, and stroke-like episodes (MELAS, MIM 540000) showed elevations in carbohydrate derivatives (sorbitol, glucuronate, myoinositol, and sucrose), decreased arginine, and an increase in transsulfuration intermediates (cystathionine, γ-glutamyl-cysteine S-adenosylmethionine, and glutamate) with a decrease in adenosine, guanidinoacetate, and betaine. In another UM study of MELAS patients, Sharma et al. demonstrated novel amino acid, acylcarnitine, and fatty acid biomarkers [99]: N-lactoyl attached to the branched-chain amino acids leucine, isoleucine, or valine or to the aromatic amino acids phenylalanine and tyrosine; β-hydroxy acylcarnitines of even-length C10:0 to C16:0 (C10:0, C12:0, C14:0, C16:0); and β-hydroxy fatty acids of even-length C8:0 to C14:0. These biomarkers were associated with the degree of diseased severity (per Karnofsky performance score). Interestingly, β-OH-C16:0 carnitine showed a severity correlation similar to the well-accepted growth/differentiation factor-15 (GDF-15) [100], while β-hydroxy acylcarnitines and β-hydroxy fatty acids correlated with ventricular lactate levels, and the N-lactoyl-amino acids correlated with urine heteroplasmy. Table 2 summarizes selected metabolic pathways in which more than 10% of metabolites are significantly altered in the individual disorders (two-sample t-test for significance).

Table 2 Plasma metabolic pathways in which ≥10% of metabolites are altered compared to control in selected mitochondrial myopathies and ataxias, compared to non-mitochondrial myopathies [98]

4 Future Perspectives

We shall end our discussion of bedside UM with a futuristic perspective of UM in the service of autism spectrum disorder (ASD). The autism spectrum is a range of neurodevelopmental conditions exhibiting persistent deficits in reciprocal social interaction and restricted, repetitive patterns of behavior, interests, or activities. Historically, ASD gained an independent psychiatric status from schizophrenia only in the second half of the twentieth century (the word autism was initially coined to describe severe schizophrenia) [101] and became a spectrum only at the end of that century. We now appreciate a 60–80% heritability of ASD [102, 103], with an estimated genetic or genomic etiology in 30–40% of cases [104]. Within the group of ASD, due to a known genetic variant, an estimated 3–5% are due to an IEM [105]. Examples include [106] amino acidopathies (PKU, homocystinuria, S-adenosylhomocysteine hydrolase deficiency, MSUD, UCD), organic acidurias (PA, L-2-hydroxyglutaric aciduria), cholesterol biosynthesis defects (Smith-Lemli-Opitz syndrome), disorders of neurotransmitter synthesis or degradation (SSA deficiency; SSADH deficiency), disorders of purine metabolism (ADSLD, Lesch-Nyhan syndrome), cerebral creatine deficiency syndromes (GAMT deficiency, creatine transporter defect), disorders of folate transport and metabolism (cerebral folate deficiency, MTHFR deficiency), lysosomal storage disorders (mucopolysaccharidosis type III, neuronal ceroid lipofuscinoses (NCL), Niemann-Pick disease type C), CTX, MELAS, Wilson disease, and several types of neurodegeneration with brain iron accumulation, among others. Moreover, multiple lines of evidence have suggested biochemical alterations in children with ASD compared to peers. Alterations can derive from the large intestine microbiome, showing differential fecal content for isopropanol, p-cresol, and short-chain fatty acids. Mitochondrial dysfunction can result in complex I, IV, or V deficiencies in children with ASD (up to 7%). Alterations were also demonstrated in some cases of glutathione reduction in the CNS [107]. While attempts to find unifying markers of autism demonstrated interesting candidates, they failed to show consistency across multiple UM platforms and ASD cohorts. Glutaric acid, arginine, histidine (and its catabolites), taurine, β-alanine, and succinic acid were most consistently elevated, along with a reduction of creatine and creatinine [107]. In a cohort of 52 pregnant women whose children developed ASD, compared to 62 control pregnant women in an NMR-based UM, differences were found in glycosphingolipid metabolism, N-glycan and pyrimidine metabolism, bile acid pathways, and C21-steroid hormone biosynthesis but with only a mild perturbation in each metabolite [108]. The latter two candidates are of interest due to the involvement of cholesterol metabolism, as ASD is highly prevalent among patients with Smith-Lemli-Opitz syndrome, a disorder of cholesterol synthesis. Abnormal bile acid synthesis can cause perturbations in taurine, and pyrimidine synthesis defects can alter β-alanine. However, a lesson learned from these works is that ASD may represent shared neurodevelopmental outcomes of quite different neurometabolic processes, complicating the search for biomarkers, as the case separation from control may be shadowed by intragroup variability [109]. Instead, a machine learning approach to recognize a pattern of altered seemingly unrelated metabolites in a study group could allow the discovery of heterogeneous biomarkers, indicating a high-risk status for ASD, Alzheimer’s disease, Parkinson’s disease, or other conditions, and, once defined, may also be then targeted for treatment and/or monitoring. Such a learning process was performed on UM data from a cohort of 500 children with ASD (against 200 controls) [110]. The group was halved into study and discovery groups. The study group yielded 34 groups sharing metabolic phenotypes with a specificity of ≥95%. Those groups were then clustered into six metabolic groups, each with an abnormal ratio of either α-ketoglutarate, 4-hydroxyproline, glycine, lactate/pyruvate, ornithine, or succinic acid compared to other metabolites. These clusters were then validated in the discovery group. When screening for the clusters together, this assay had 53% sensitivity and 91% specificity for indicating “high risk” for ASD. These outcomes demonstrate the potential for a higher level of complexity in data interpretation. Identification of such abnormalities by the physician ordering an “ASD risk stratification” test may be a difficult task. Still, computer-assisted data analysis could flag such abnormal results and promote a revolution in the field of developmental neuroscience.

While UM offers an opportunity for discovery both inside and outside clinical settings, its use in the clinical lab supports broad screening for inborn errors of metabolism well beyond the newborn screen, supports the development of metabolic profiles for a disease that may be monitored during treatment, and, when integrated with genomics, provides a precision medicine approach to the diagnosis of rare disease.