1 Introduction

Accumulating evidence shows that engaging in physical activity (PA) has multiple salutary effects on human health. Acute exercise exerts profound effects on many biological pathways (Sakaguchi et al., 2019; Schranner et al., 2020). However, metabolic profiles that define physical fitness and/or regular exercise participation are not established.

Metabolomics simultaneously quantifies numerous metabolites with low molecular weights, which are the final products of upstream biological processes and many other physiological functions (Wishart, 2019). It allows for the identification of biological significance underlying changes in metabolism and/or metabolic pathways, a collection of biochemical reactions catalyzed by different enzymes (Hu et al., 2020). Because metabolites are intermediates or products of cellular metabolism, it helps to understand mechanisms underlying the effects of health behaviors on human health and diseases. To this end, metabolomics can provide a comprehensive understanding of metabolic changes associated with chronic adaptations to physical activity (PA). Previous review studies have suggested that the effect of acute exercise may be the opposite of the effect of being physically active over the long term. For example, acute exercise increases oxidative stress and inflammation, while chronic adaptations to regular PA decrease oxidative stress and may also reduce the pro-inflammatory state (Anderson et al., 2016; Mastaloudis et al., 2001; Nikolaidis et al., 2011).

In this systematic review, we aimed to identify metabolites that were associated with PA, fitness, and/or sedentary time in community-dwelling adults, and to identify potential pathways implicated in the benefits of regular PA participation. We hypothesized that metabolite adaptation induced by regular PA participation would share similarities with the effect of acute exercise on TCA cycle intermediates, lipid-related metabolites, and fatty acids. A more advanced understanding of metabolomic profiles of being physically active and fit can provide valuable insights into early prevention and treatment for health conditions that are often age-related, such as cardiovascular and metabolic diseases.

2 Methods

2.1 Literature search and study selection

We followed PRISMA guidelines to conduct this systematic review (Liberati et al., 2009). One author (QT) searched literature that was written in English and published after January 1, 1999, using PubMed and Scopus databases. In the PubMed database, the search terms included (1) "Metabolomics"[Mesh] OR “Metabolite”[tw] OR “Metabolome”[tw], AND (2) "Sedentary Behavior"[Mesh] OR "Cardiorespiratory Fitness"[Mesh] OR "Exercise"[Mesh] OR “Physical activity”[tw] OR “Sedentary time”[tw] OR “Fitness”[tw]. PubMed search retrieved 421 records. In the Scopus database, the same search terms were used; TITLE-ABS-KEY((metabolomics OR metabolite* OR Metabolome) AND (sedentary time* OR fitness OR exercise OR physical activity)). Scopus search retrieved 287 records. Four additional records that were not shown in the PubMed or Scopus search were added. These four records were discovered in relevant literature or based on authors’ knowledge.

After removing 26 duplicated records, two authors (QT and RM) evaluated the remaining 677 records. Because this systematic review focused on observational studies of community-dwelling adults and biofluid-based metabolomics, we first excluded the following records after screening by title and abstract, including reviews, case reports, intervention studies, studies of acute exercise or other exposures, and studies of unique populations, studies of muscle metabolites, and animal studies (Fig. 1). We further excluded studies that met the following exclusion criteria: (1) biofluid-based metabolomics data were not collected, or only a limited number of metabolites were examined (less than 10), (2) data on physical activity (PA), fitness, and/or sedentary time were not collected, or (3) there were no reported relationships of biofluid-based metabolomics with PA, fitness, and/or sedentary time. A total of 22 studies were eligible and included in this systematic review (Fig. 1).

Fig. 1
figure 1

Flowchart of study selection. This systematic review focused on metabolomics measured from biofluids of serum, plasma, and urine

2.2 Analysis

We first evaluated the quality of all 22 studies based on study background and methodology. Three authors scored separately, discussed scores in each study, and reached a consensus on the final assessment (QT, AC, and RM). Criteria and scoring are presented in Supplementary Table 1.

We then summarized key elements of each study, including study cohort, sample size, age, sex, assessment of PA, fitness, and/or sedentary time, sample type, metabolomics technique, number of metabolites examined, and study quality scores. Details of these studies are presented in Table 1, sorted by the number of metabolites examined.

Table 1 Details of studies included in this systematic review (n = 22)

We analyzed metabolites that were associated with PA, fitness, and/or sedentary time (Fig. 2 for Venn diagram) and identified the direction of the associations (Supplementary Figs. 1 and 2 for heatmaps). Of note, when more than one study found the same metabolites, we reported the findings from the study with higher methodology scores. If there was a tie in methodology scores, we reported the findings from the study with higher overall scores.

Fig. 2
figure 2

Venn diagram of metabolites that are upregulated (red) and downregulated (blue) with higher physical activity, higher fitness, and less sedentary time. CA Carboxylic acid-containing metabolites. LPL lysophosphatidylcholines or lysophosphatidylethanolamines. PL phosphatidylcholines, phosphatidylethanolamines, phosphatidylinositol, or phosphoglycerides. *Factor Score

We further conducted the KEGG pathway analysis and SMPDB enrichment analysis via https://www.metaboanalyst.ca/. As various databases exist for metabolomics pathway analysis, we used both KEGG and SMPDB as complementary databases to capture an accurate depiction of the metabolome. Compared to KEGG, SMPDB is focused on small molecules and unique to humans (Frolkis et al., 2010). First, we identified HMDB IDs for metabolites that were associated with PA, fitness, and/or sedentary time. Only metabolites with unique HMDB IDs were considered for pathway analysis and enrichment analysis. We then entered HMDB IDs into the KEGG database for pathway analysis and the SMPDB database for enrichment analysis separately. Pathway analysis parameters were as follows: input type = HMDB ID, visualization method = scatter plot (testing significant features), enrichment method = Hypergeometric Test, Topology analysis = Relative-betweenness Centrality, Reference metabolome = use all compounds in KEGG (homo sapiens) library. Enrichment analysis parameters were as follows: input type = HMDB ID, feature type: metabolites, pathway-based = SMPDB (99 metabolite sets based on normal human metabolic pathways), only use metabolite sets containing at least 2 entries, and use all the compounds in the selected library. Significant pathways were reported at an FDR-adjusted p value < 0.05 or p value < 0.05 as appropriate.

3 Results

3.1 Study design and population

All 22 studies included in this systematic review were cross-sectional. Concerning the quality of the studies, 2 were considered fair, 15 good, and 5 excellent (Supplementary Table 1).

The age of participants ranged from 18 to 96 years. Of the 22 studies, 14 studies included both men and women, 5 examined women only, and 3 examined men only. Study populations were geographically diverse with 10 from Europe, 8 from North America, and 4 from Asia (Table 1).

3.2 Assessment of physical activity, fitness, and sedentary time

PA and fitness are correlated but also distinct in their concepts and assessment methods. In general, PA, fitness, and sedentary time can be quantified by self-report questionnaires or by objective devices or fitness tests. Compared to subjective measures, objective measures of PA can capture the whole spectrum of activity levels, especially low-intensity activities. Of 22 studies, 7 used subjective measures, such as self-report questionnaires with questions regarding the total amount of PA, intensity, duration, frequency, and type of activities; 14 studies used objective measures, such as graded treadmill test, step test, and cycle ergometer test for fitness, and accelerometers or doubly labeled water for PA and sedentary time. One study used both subjective and objective measures to assess PA (Table 1, Supplementary Table 1).

3.3 Metabolomics

We categorized metabolomics techniques as relative/semi-quantitative, mixed, and quantitative which were scored as 1, 2, and 3, respectively. Of 22 studies, 4 used semi-quantitative/relative methods, 8 used mixed methods, and 10 used quantitative methods (Table 1, Supplementary Table 1). Regarding the analytical platform, 17 studies used mass spectrometry, 6 used nuclear magnetic resonance, and 1 used immunoassay. Across 22 studies, the number of metabolites examined ranged from 11 to 337 (Table 1).

3.4 Metabolites associated with PA, fitness, and/or sedentary time

A complete list of metabolites associated with PA, fitness, and/or sedentary time is summarized in a heatmap (Supplementary Figs. 1 and 2). In this section, we specifically focused on metabolites that were consistently associated with at least two parameters (PA, fitness, and/or sedentary time).

Metabolites associated with lower levels of PA/fitness and/or more sedentary time were from classes of carbohydrates (glucose, mannose), carboxylic acid-containing metabolites (lactate, 2-Oxoisopentanoate, 4-Methyl-2-Oxopentanoate), estrogens (2-Methoxyestradiol, 2-Methoxyestrone, 4-Methoxyestradiol, estradiol, estrone, Unconjugated Oestrone), fatty acids (linoleic acid, MUFA, MUFA: total FA ratio, omega 3 FA, omega 6 FA, PUFAs, saturated FA, total FAs), glycoproteins (Alpha-Acid Glycoprotein), ketone bodies (Acetoacetate), proteins/peptides (Gamma-Glutamylvaline), and triglycerides (HDL-TG, IDL-TG, LDL-TG, M-HDL-TG, S-HDL-TG, S-LDL-TG, S-VLDL-TG, total triglycerides, total VLDL-TG, triglyceride, VLDL-TG (M, L, XL), XS-VLDL-TG, XXL-VLDL-TG).

In classes of amino acids (AA), cholesterol, and lipoprotein particles, some metabolites were upregulated among active and fit individuals (i.e. AA: glutamine, glycine, ornithine, betaine; cholesterol: HDL2-C, HDL-C), while some metabolites were downregulated (AA: alanine, isoleucine, leucine, methionine, phenylalanine, proline, tyrosine, valine, 2-hydroxybutyrate, alpha-hydroxyisovalerate, pyruvate; cholesterol: LDL-C, remnant-C, S-VLDL-C, total cholesterol, VLDL-C, XL-VLDL-C).

3.5 Pathway analysis

The KEGG pathway analysis revealed three pathways that were related to PA, fitness, and sedentary time (FDR p < 0.05), including aminoacyl-tRNA biosynthesis, valine, leucine, and isoleucine biosynthesis, and valine, leucine, and isoleucine degradation (Table 2). The SMPDB enrichment analysis showed several top enrichment pathways related to PA, fitness, and sedentary time, including urea cycle, glucose and alanine cycle, and glycine and serine metabolism (Fig. 3; Supplementary Table 2).

Table 2 KEGG pathway analysis
Fig. 3
figure 3

The SMPDB Enrichment analysis

4 Discussion

Overall, multiple metabolites appear to be affected by levels of PA, fitness, and sedentary time. We found that metabolite differences with habitual PA levels share several similarities with those observed after acute exercise or exercise interventions. These similarities include changes in lipids, branch-chain amino acids (BCAAs), and TCA cycle intermediate metabolites, fatty acids, and ketone bodies (Kelly et al., 2020; Sakaguchi et al., 2019; Schranner et al., 2020). While it is difficult to fully understand the role of every metabolite in the metabolic adaptation to PA, the enrichment analysis associated with PA, fitness, and sedentary time identified six specific metabolic pathways; urea cycle, arginine biosynthesis (NO cycle), alanine, aspartate, and glutamate metabolism (TCA cycle), glycine, serine, and threonine metabolism, and valine, leucine, and isoleucine biosynthesis/ degradation.

4.1 Theme 1 Valine, leucine, and isoleucine biosynthesis/degradation

BCAAs are metabolized in muscle tissue by mitochondrial dehydrogenase and branch-chain ketoacid dehydrogenase to produce the branch-chain keto acids (Riddle et al., 2016; Shimomura et al., 2004; Thalacker-Mercer et al., 2020; Valerio et al., 2011). The generated ketoacids are used by the muscle to generate ATP via Kreb’s cycle or transported to the liver for oxidation. BCAA catabolism is an important metabolic process that aids in protein synthesis and energy production. BCAAs are sensed by mTOR as biomarkers of energy and substrate availability (Holecek, 2018). Through this mechanism, BCAAs promote protein synthesis through the S6Kinase and increase mitochondrial biogenesis by stimulating SIRT1 (Nie et al., 2018). In this review, we found that the majority of BCAAs (leucine, isoleucine, valine) were decreasing with higher levels of PA and fitness and less sedentary time, which was consistent with previous findings of exercise training decreasing BCAAs in serum (Pitkanen et al., 2002). Our findings are also in line with recent findings that participants with high muscle quality had lower levels of BCAAs in plasma compared to those with low muscle quality (Moaddel et al., 2016). Notably, in a preliminary study of 18 participants, higher circulating levels of BCAAs were observed in biopsy specimens collected from vastus lateralis of participants with high muscle quality compared to low muscle quality (Gonzalez-Freire et al., 2018). This may suggest that a decrease of BCAAs in circulation may correlate with a corresponding increase in vastus lateralis where it can be used for energy and protein synthesis. It has been hypothesized that PA upregulates the BCAA catabolic pathway, critical in energy production and protein synthesis, and may play an intermediary role between exercise and health benefits (Xiao et al., 2016). Consistent with this hypothesis, fit individuals had lower levels of glutamate, an intermediate of BCAA catabolism. Additionally, the dehydrogenation product of leucine and acetoacetate is ketogenic. The concomitant decrease in branched-chain α-keto acids (BCKAs) metabolites 4-methyl-2-oxopentanoate, 3-methyl-2-oxovalerate and alpha-hydroxyisovalerate is consistent with decreased BCAAs in circulation. Notably, isoleucine and 4-methyl-2-oxopentanoate (BCKA), derived from valine, are also lower with higher levels of PA and fitness, and less sedentary time. BCAAs are converted to BCKAs in the mitochondria and subsequently used in the production of acylcarnitines. BCKAs have been suggested as a predictive biomarker for type 2 diabetes and impaired fasting glucose (Menni et al., 2013).

4.2 Theme 2. Aminoacyl-tRNA biosynthesis

Transfer RNAs (tRNA) are central nucleic acid molecules, and their interaction with aminoacyl-tRNA synthetase protein enzymes defines the specificities of proteins for the genetic code (Borchers & Pieler, 2010). Aminoacyl-tRNA synthetases (ARSs) are ubiquitously expressed enzymes essential for the first step in protein synthesis and can also play non-catalytic roles in diverse biological processes (Boczonadi et al., 2018; Lee et al., 2004; Ognjenovic & Simonovic, 2018; PouralijanAmiri et al., 2020). In this review, ARSs are identified as a significant pathway associated with higher PA and fitness, and less sedentary time, with 6 common significant metabolites (Glutamine, Methionine, Valine, Alanine, Isoleucine, Leucine). This pattern is consistent with the role of aminoacyl-tRNA synthetase in protein synthesis. Further, the ARSs pathway has been identified in several studies as a significant pathway for high-intensity interval training (HIIT) (Castro et al., 2019; Robinson et al., 2017). Another study of a combined 8-week continuous endurance training and HIIT program identified ARSs and carbohydrate and amino acid metabolism as the most common pathways associated with baseline serum metabolite levels (Castro et al., 2019). These findings are also consistent with recent studies demonstrating that essential amino acids and resistance exercise are stimulators of skeletal muscle protein synthesis in human and animal models (Drummond et al., 2009).

4.3 Theme 3. Arginine biosynthesis (urea cycle, NO cycle)

Several metabolites in arginine metabolism, including glutamine, arginine, citrulline, and ornithine, tended to be higher with higher PA, whereas glutamate levels were lower. The increase in arginine, citrulline and ornithine suggests a role in both the nitric oxide (NO) cycle and urea cycle. The concomitant increase in arginine and citrulline suggests an increase in circulating NO levels with higher levels of PA and fitness, consistent with NO’s role as a powerful vasodilatory. NO is a signaling molecule involved in many physiological and pathological processes. Previous studies have also shown that exercise training stimulates NO synthesis in endothelial cells (Nosarev et al., 2014), stimulates the release of NO (Lewis et al., 1999; Nosarev et al., 2014; Tsukiyama et al., 2017), and increases NO in skeletal muscle in response to exercise (Dyakova et al., 2015). While electron paramagnetic resonance spectroscopy is used for the direct measurement of NO, indirect methods, such as measurement of NOS activities and/or activation of molecular targets, or products of reactions of NO, are predominantly used (Csonka et al., 2015). Recently, the global arginine bioavailability ratio (arginine to [ornithine + citrulline] ratio) has been identified as a proxy measure of NO synthetic capacity in vivo (Baranyi et al., 2015, Ali-Sisto et al. 2018, Moaddel et al., 2018). Future studies are needed to explore ratio measures, which may offer a window to new therapeutic targets. The concomitant increase in arginine, ornithine, and creatinine also suggests increasing levels of urea via the urea cycle. This is consistent with previous findings reporting an increase in circulating urea levels after prolonged exercise and suggesting new protein synthesis is paralleled by high rates of protein degradation, possibly due to the chronic microdamage accumulation during muscle contraction (Haralambie & Berg, 1976).

4.4 Theme 4. Glycolysis and TCA cycle (Alanine, aspartate, and glutamate metabolism)

Glucose levels were lower with higher levels of PA and fitness, and/or less sedentary time. This finding is consistent with previous studies showing decreased glucose after acute exercise (Sakaguchi et al., 2019; Schranner et al., 2020) and chronic improvements in insulin sensitivity and superior baseline glycemic control with regular exercise (Bird & Hawley, 2016). These findings further support the notion that increasing fitness is beneficial in the prevention and control of diabetes, in addition to reducing the risk of cardiovascular mortality (Riddell & Perkins, 2009). The end-products of glycolysis, pyruvate, and lactate, were also decreasing with higher PA (Gray et al., 2014), consistent with previous findings that glycolytic pathway metabolites decreased after exercise training (Schranner et al., 2020). It is important to highlight that since there is an upregulation of aerobic metabolism, there may be less need to access the extra energy required for glycolysis. Pyruvate drives several major biosynthetic pathways intersecting the citric acid cycle (Gray et al., 2014). It is worth noting that the lactate/pyruvate ratio is an indicator of mitochondrial function (Parikh et al., 2015). Whether this ratio is related to PA or exercise requires further investigation.

Several studies have shown that citrate synthase activity in human muscles increased after exercise training and acute exercise (Bruce et al., 2004; Carter et al., 2001; Coggan et al., 1993; Howarth et al., 2004; Leek et al., 2001). Interestingly, while glucose, pyruvate, lactate, and alanine were found lower with higher PA, citrate was not significantly associated with PA, fitness, or sedentary time in three out of four studies reporting citrate (Kujala et al., 2019; Pang et al., 2019; Xiao et al., 2016). Only one study reported a modest increase in serum levels with higher PA (Kujala et al., 2013). This is consistent with recent reports that prolonged exercise training attenuates the exercise-induced increase in TCA cycle intermediates in vastus lateralis (Howarth et al., 2004). It is also possible that these changes could be detected in muscles but may not be detected in circulation because of the integrity of mitochondria. These data suggest that TCA cycle intermediate metabolites increase at the start of training but then stabilize when mitochondrial biogenesis has coped with the increased demand for regular PA.

4.5 Theme 5. Lipids

Several lipids were downregulated in individuals with high levels of PA and fitness, including specific lipoprotein particles, cholesterol and triglycerides, and fatty acids. Several LDL particles were found lower with higher PA and fitness, including cholesterol from LDL and VLDL particles. Correspondingly, with higher PA and fitness and less sedentary time, we observed a concomitant decrease in cholesterol, and triglyceride-containing particles (Lu et al., 2011). Conversely, higher PA and fitness and less sedentary time were associated with higher levels of cholesterol from HDL particles, which was consistent with previous reports that ApoA1 was upregulated with higher PA, fitness, and less sedentary time (Gu et al., 2018; Kujala et al., 2013, 2019; Pang et al., 2019). ApoA1, a major apolipoprotein in HDL, has both antioxidant and anti-inflammatory effects and can stimulate endothelial production of nitric oxide (Lu et al., 2011). The increase in HDL and ApoA1 with higher PA and fitness is further supported by the increase in cholesterol esters, phosphatidylcholines, and sphingomyelin because these lipids are present in HDL. There is evidence that HDL particles contribute to the efflux of cholesterol from the arterial endothelium which prevents the initial phase of the atherosclerotic process. The findings of this review of the literature are consistent with the results of exercise intervention studies showing that aerobic exercise improves the lipid profile, particularly increasing HDL-C (Mann et al., 2014; Wang & Xu, 2017). A recent aerobic exercise intervention study reported a small but statistically significant reduction in total cholesterol (TC), low-density lipoprotein cholesterol (LDL-C), and triglycerides ranging in a span of 0.08 mmol/L to 0.10 mmol/L, as well as an increase in HDL-C about 0.05 mmol/L (Patel et al., 2017).

4.6 Theme 6. Estrogen

In our review, six studies examined estrogen metabolites in women. Parent estrogens, including estrone, conjugated estrone, unconjugated estrone, estradiol, and testosterone, were downregulated in physically active and fit women. This pattern was detected in both premenopausal and postmenopausal women and findings were consistent in urine and serum samples. Most estrogen metabolites in three estrogen metabolism pathways (2-, 4-, and 16-Hydroxylation) were consistently downregulated, such as 2-Hydroxyestrone-3-methyl-ether, 2-methoxyestrone, 2-methoxyestrone, 4-hydroxyestrone, 4-methoxyestrone, 16-Epiestriol, and 17-Epiestriol. Relative to parent estrogen, increased hydroxylation at 2-, 4-, and 6-sites was associated with higher PA and fitness or less sedentary time. Collectively, these data suggest that high levels of PA may increase estrogen hydroxylation and metabolism and subsequently lower urinary or serum estrogen levels. Notably, the 2-Hydroxyestrone/16α-Hydroxyestrone ratio, lower levels of which predict breast cancer risk, is found higher in women with higher levels of PA and fitness, and less sedentary time. Findings on the effect of long-term exercise training on estrogen metabolites are somewhat mixed. Some studies reported that a 12-month exercise intervention increased the 2-Hydroxyestrone/16α-Hydroxyestrone ratio in healthy premenopausal women (Smith et al., 2013), and increased fitness by exercise intervention was associated with decreased levels of 2-pathway and 2-Hydroxyestrone concentrations in postmenopausal women (Matthews et al., 2018). Others did not find a significant effect of 12-month exercise training on 2-Hydroxyestrone, 16α-Hydroxyestrone, or 2-Hydroxyestrone/16α-Hydroxyestrone ratio in either postmenopausal women (Atkinson et al., 2004) or premenopausal women (Campbell et al., 2007). These discrepancies may be due to variations in sample characteristics, such as menopause status and hormone treatment, exercise intervention components, or measurements of estrogen metabolites. Future studies are needed to understand whether being physically active and fit affects the estrogen metabolomic profile in a meaningful direction in both men and women and whether menopause status or aging affects such profile.

As hypothesized, the metabolomic profiles of high levels of PA and fitness share some consistencies with metabolite changes to acute exercise but also differ in several aspects. First, while there is a strong signature of increased mitochondrial biogenesis and enhancement of energy metabolism immediately after acute exercise, blood metabolites that convey these signatures become less evident as individuals assume an active lifestyle. Second, while there is an overall increase in TCA intermediates, lactate, and pyruvate during and immediately after acute exercise, the TCA cycle appears to be more efficient in individuals with regular PA participation.

This study has several strengths. First, to address the diversity of metabolomics and other measures, we evaluated the quality of each study and summarized metabolite findings based on scoring. Second, to comprehensively capture the effect of PA on metabolites, we considered three measures, including PA, fitness, and sedentary time. Third, we conducted both pathway analyses and enrichment pathway analyses to gain an understanding of the related pathways involved. Limitations also exist. First, due to limited data on the relationship between metabolomics and sedentary time, it was difficult to compare patterns across three measures. Second, this review only focused on metabolites measured in biofluids.

5 Conclusion

The metabolomic profiles of physically active, fit, and less sedentary individuals may indicate a favorable biosynthesis of macromolecules, likely in skeletal muscle. This is supported by enhanced oxidative phosphorylation and glycolysis and improved lipid and estrogen metabolism, arginine biosynthesis, and an efficient TCA cycle. Notably, these findings are based on observational studies and only some of the findings reported here are confirmed in intervention studies. Future research on intervention studies is needed to confirm these findings. Whether metabolic changes are in the causal pathways to the benefits of being physically active and fit on health and longevity warrants future investigations.