Introduction

As an important organelle in eukaryotic cells, mitochondria play a pivotal role in cellular activity. Mitochondria provide energy for normal cell activities and thus is referred to as cell energy factory (Frazier et al. 2019); mitochondria are the centres of cellular material metabolism (Lozoya et al. 2018; Schell et al. 2017) and function as important nodes in cell signalling, such as activating longevity and mediating apoptosis (Gureev et al. 2019; Zhao et al. 2016). However, mitochondrial DNA (mtDNA) is not encapsulated by nucleosomes (Dobson et al. 2000) and mitochondria are important sites for generating reactive oxygen species (ROS) during cellular emergency (Chouchani et al. 2014), making mtDNA susceptible to ROS attack. Additionally, many DNA repair systems that repair nuclear DNA damage are not active in mitochondria (Van Houten et al. 2016). Thus, mtDNA is more frequently damaged and mutated than nuclear DNA. Studies have shown that the frequency of mtDNA mutations in Saccharomyces cerevisiae is approximately 10-fold that of nuclear DNA mutations (Lynch et al. 2008). As basic mitochondrial functions depend on the encoded products of mitochondrial genes (Yan et al. 2019), mtDNA mutations often lead to functional defects, resulting in mitochondrial diseases (Ryzhkova et al. 2018). Therefore, the importance and vulnerability of mitochondrial function reinforce the great significance of studies on mitochondria.

Mitochondria have always been a significantly important subject of study in the field of life sciences. On the one hand, a large amount of basic data that substantiate previous conclusions have been obtained. For example, mtDNA mutational hotspots, which are closely related to disease and ageing, have been extensively examined (Hirose et al. 2016; Niemann et al. 2017; Spangenberg et al. 2019), mitochondrial protein maps have been accurately drawn (Calvo et al. 2016; Sickmann et al. 2003), and the localizations of many important signalling molecules in mitochondria have been identified (Calvo et al. 2016). On the other hand, new findings about mitochondria are constantly being reported. Recently, the relationship between mitochondrial metabolism and epigenetic modification of chromatin gained attention (Matilainen et al. 2017; Mayorga et al. 2019). Additionally, studies by Benador et al. (2018) demonstrated that a specific set of mitochondria in cells can adsorb fat droplets and provide energy to synthesize and store lipid molecules. Sun et al. (2017) found that the mitochondrial “pseudomembrane potential” maintained by exogenous positive charges can induce autophagy in cells.

Although many specific signalling pathways, metabolic pathways, and metabolites affected by mitochondria have been characterized, quantitative data are scarce. Additionally, data from distinctive studies are lacking crosswise comparison. Thus, studies are needed to relatively or absolutely quantify metabolic pathways and metabolites affected by mitochondria at the global cellular level. Mitochondrial mutants are the preferred material for these studies; however, it is difficult for researchers to obtain important mitochondrial mutants for large-scale functional studies. First, the loss of important mitochondrial genes is often accompanied by inactivation of mammalian cells. Second, the current commercial gene editing technology of clustered regularly interspaced short palindromic repeats-associated protein 9 (CRISPR-Cas9) has not been widely applied to mtDNA editing. The main limitation is that the CRISPR-cas9 system cannot be simply and efficiently transferred into mitochondria and the mitochondrial relatively narrow space is not conducive to operation of the CRISPR-cas9 system (Bacman et al. 2018; Gammage et al. 2018). Moreover, human cells contain thousands of copies of mitochondria, with dozens of copies in S. cerevisiae (Solieri 2010; St John 2016). Thus, genetic manipulation of mtDNA involves a relative proportion of edited mtDNA compared to normal mtDNA, making the analyses complex.

Heavy ion beam (HIB) irradiation is a typical ionising radiation (Feng et al. 2006). Compared with traditional mutagenesis methods, HIB irradiation mutagenesis possesses the advantages of high mutation rate, large mutation spectrum, and significant mutant traits (Feng et al. 2006; Du et al. 2017; Guo et al. 2019a). It has been widely used to develop advantageous mutants with great application potential and research value (Du et al. 2014; Li et al. 2018; Luo et al. 2016; Zhang et al. 2018a). Particularly, HIB irradiation mutagenesis showed an excellent performance in the establishment of a mutant library (Du et al. 2017; Guo et al. 2019a). In this study, a group of HIB irradiation-induced S. cerevisiae mitochondrial respiration-deficient mutants were selected as the research material. As a facultative anaerobic eukaryote, S. cerevisiae can grow at a low rate by anaerobic respiration when mitochondrial function is severely impaired. This enabled us to obtain detailed information on the effects of impaired mitochondrial function on cells. Multi-omics studies were conducted at the mutant population level. Genomic, transcriptomic, and metabolomic analyses were combined to ensure high throughput and reliability. The research methods and strategies focus on measuring the extent to which metabolic pathways are most dominated by mitochondria and the ultimate phenotypic effects of this dominance.

Materials and methods

Strain culture and mitochondrial respiration ability measurement

S. cerevisiae BY4743 (American Type Culture Collection (ATCC), Manassas, VA, USA; collection number: ATCC4040005) is amphidiploid (MATa/α, budding) and was derived from type strain S288c. The strains used in this study were a wild-type S. cerevisiae BY4743 strain and eight genetically stable respiration-deficient mutants of BY4743. These mutants (RD-1, RD-2, RD-3, RD-4, RD-5, RD-6, RD-7, and RD-8) were previously screened following moderate-energy carbon ion beam irradiation (Guo et al. 2019a). Carbon ion beam irradiation was generated by accelerating 12C6+ to approaching light speed using the Heavy Ion Research Facility in Lanzhou (HIRFL) at the Institute of Modern Physics at the Chinese Academy of Sciences. The average does rate was about 40 Gy min−1. As it reached the sample surface, the retained energy was approximately 76.37 MeV u−1, and the expected range in water was about 16 mm. More details on mutagenesis, screening, and mutant evaluation were described in our previous report (Guo et al. 2019a). Each strain was inoculated into yeast extract peptone dextrose medium (YEPD) at a ratio of 1:100, and then incubated in a shaker (200 rpm) at 30 °C. In addition, triphenyl tetrazolium chloride (TTC) medium was used to detect the weak mitochondrial respiration of each mutant. The principle, protocols, and validity of the TTC method have been described previously (Guo et al. 2019a; Mao et al. 2009).

Genomic analysis

The genome sequences of nine strains involved in this study were obtained from our previous work (Guo et al. 2019a). Specific mutations in each mutant strain were identified. In this study, the BAM format of sequence files were sorted, indexed, and then loaded into Integrative Genomics Viewer (http://software.broadinstitute.org/software/igv/, v.2.3). Next, the mitochondrial genomes of the nine strains were visually presented in a X-Y scheme. The mutation sites in the nuclear genomes of eight mutants were drawn using Circos software based on Perl programming language (Krzywinski et al. 2009).

RNA-seq

The S. cerevisiae cells (~ 108) were collected at logarithmic phase and then snap-frozen in liquid nitrogen. Total RNA extraction, library construction, clustering and sequencing, data cleaning, quality control, transcript assembly, and fragments per kilobase of exon per million fragments mapped (FPKM) calculation were performed as described previously (Guo et al. 2019b). The transcripts were picked and searched in the genome browser of BioCyc database to identify the specific genes included in the transcripts. The genes with a fold change (FC) of FPKM > 1.5 or < 0.67 and corresponding p value < 0.05 were identified as differentially expressed genes (DEGs) between the wild-type and mutant strains. A portion of genes from DEGs were subjected to reverse transcriptase quantitative polymerase chain reaction (RT-qPCR) to verify the results of RNA sequencing (RNA-seq). The detailed manipulations were identical to the protocols introduced in our previous study (Guo et al. 2019b). The primer sequences are shown in Supplemental Table S1.

GO and KEGG analysis

All genes in the reference genome were annotated based on Goatools (https://github.com/tanghaibao/Goatools) and KOBAS (Wu et al. 2006). Next, the gene sets obtained from genomic or transcriptomic analysis were enriched with different gene ontology (GO) or Kyoto Encyclopaedia of Genes and Genomes (KEGG) terms. The enrichment factor and enrichment significance (q value) were calculated based on R programming language.

Lipidome detection

The centrifugally harvested cells at logarithmic phase were snap-frozen in liquid nitrogen after washing twice with phosphate-buffered saline. The cell precipitates were thawed at 4 °C. Next, the same weight of the cell precipitates of the wild-type strain, randomly selected five mutant strains, and quality control (QC) sample were weighed. The QC sample consists of the same weight of cell precipitates from the wild-type strain and five mutant strains. The seven groups of cell precipitates were prepared to extract lipids after resuspension in phosphate-buffered saline. Six repeats were used for each group.

Total lipids were extracted as described previously (Ejsing et al. 2009). Ultra-high-performance liquid chromatography-electrospray ionization mass spectrometry (UHPLC-ESI-MS) analysis was carried out as follows. First, the extracted lipid was reconstituted in 200 μL of isopropanol solution and vortexed, and the insoluble matter was removed by centrifugation. The sample was then separated by UHPLC and ion trap mass spectrometry (ITMS) (Q Exactive Plus, Thermo Scientific, Waltham, MA, USA). Data were collected using ESI in positive and negative ion modes, respectively. The raw data from UHPLC-ITMS was manipulated based on LipidSearch™ software version 4.1 (Thermo Scientific) for peak identification, lipid identification (secondary identification), peak extraction, peak alignment, and lipid quantification. The LC-MS conditions and data procurement parameters conformed to the established pattern in our previous study (Zhang et al. 2018b).

Data reliability

The average read coverage was ~ 200× for the whole-genome sequence data. RNA-seq was repeated at least three times. The FPKM and DEGs were determined by three repeated tests showing the closest transcriptional pattern. RT-qPCR was performed in at least triplicate and repeated three times independently. Lipidomic detection was performed using six independent repeats.

Availability of supporting data

The raw sequence data obtained by genomic sequencing and transcriptomic sequencing have been deposited in the Genome Sequence Archive in the BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, respectively, under the accession numbers CRA001107 and CRA001812, which are publicly accessible at http://bigd.big.ac.cn/gsa.

Results

Large structural variations co-presented in the mitochondrial genomes of the eight mitochondrial respiration-deficient mutants

The TTC method was used to qualitatively show the mitochondrial respiration intensity in each mutant (Fig. 1a). The lawns of the mutant strains appeared white when incubated in TTC medium, suggesting that the mutants did not effectively convert TTC to red triphenylformazan as in the wild-type strain, which was a sign of severe dysfunction in the mitochondrial electron transport respiratory chain (Guo et al. 2019a; Mao et al. 2009). The extremely weak mitochondrial respiration intensity compared to the wild-type strain indicated the phenotypic commonality of the eight mutants.

Fig. 1
figure 1

Comparison of mitochondrial respiration intensity and mitochondrial genomes between mutant and wild-type strains. After 3 h of incubation in TTC medium, the lawns of the wild-type strain stained red, while the mutant lawns remained white (a), indicating that the mutant respiratory function was severely impaired, so that few [H] (reduced hydrogen) in the electron-transferred respiratory chain were not enough to reduce the TTC to produce red triphenylformazan. Under the same sequencing system, the mitochondrial genome of the wild-type strain was covered by sufficient reads with an average read coverage of > 2000. However, a large range of regions in the mitochondrial genome of each mutant showed nearly no read coverage, and these regions contained important mitochondrial genes (b)

At the molecular level, the eight mutants were found to have very large deletions in the mitochondrial genomic DNA (Guo et al. 2019a). Moreover, these regions contained important mitochondrial coding genes. For example, COX1, COB, and COX2 were deleted in seven mutants, while COX3 was deleted in five (Fig. 1b). These mitochondrial genes code for important components of the electron transport respiratory chain. Additionally, we investigated nuclear genes that are likely to decrease mitochondrial respiratory function due to the nuclear genome mutations. A nuclear genome mutation may alter biological functions in one of the following situations. First, the mutation was annotated to the class of non-synonymous coding, frame shift, or codon change plus codon insertion. Second, the mutation located at 1–2000 base pairs (bp) upstream of the transcription start site and the transcriptional level of potentially affected genes was changed. To reduce the false-negative rate of judgment, p < 0.2 was selected as the significant difference threshold to identify potentially effected genes with altered transcript abundance. Finally, three nuclear genes likely to decrease mitochondrial respiratory function were identified in three respective mutants (Supplemental Table S2). Thus, the mitochondrial respiratory dysfunctions in only ≤ 3 mutants may have been contributed by functional variances of mitochondrial respiration-related nuclear genes. Based on comprehensive analysis, the large structural variation (SV) in the mitochondrial genome was the shared characteristic of all investigated mutants and the main cause of mitochondrial dysfunction. The mitochondrial dysfunctions in ≥ 5 mutants were caused by large SVs in the mitochondrial genomes.

Low co-occurrence frequency (≤ 3) among the eight mutants for nuclear genome variations

In the nuclear genome, the average number of mutation sites for each mutant was ~ 14 (Guo et al. 2019a). The distribution patterns between mutants were distinct (Fig. 2a). Most mutation sites were unique to the nuclear genome of any one mutant. Only four mutation sites were shared by any two mutants (Fig. 2a). The genes potentially affected by nuclear genome mutations were identified as described above. Mutations in both coding regions and promoter regions were considered. Whether mutation in the promoter region was likely to change the biological function, was judged by the variance in mRNA expression abundance of the corresponding gene. Finally, an average of 11 nuclear genes were regarded as potentially affected by nuclear genome mutations in each mutant, with only a few very small intersections (Fig. 2b). YHR201C was co-identified in the RD-6 and RD-7 strains, while YKL068W and YKR079C were co-identified in the RD-1 and RD-2 strains. Therefore, the maximum co-occurrence frequency of genes affected by the nuclear genome mutations was two among the eight mutants (Fig. 2b). The identified genes were evaluated by KEGG pathway annotation. A total of 24 KEGG pathways were annotated for these genes potentially affected by the nuclear genome mutations (Table 1); fifteen of them existed only in one of the mutants (Table 1 and Fig. 2c). For example, only a lipid metabolism-related pathway (steroid biosynthesis) was found in RD-1 (Table 1). Additionally, eight of them showed a co-occurrence frequency of two; purine metabolism co-existed in the RD-4, RD-6, and RD-7 mutants, which hold the highest co-occurrence frequency of three among the eight mutants (Table 1 and Fig. 2c).

Fig. 2
figure 2

Very low co-occurrence frequency of nuclear genome mutation (≤ 3 mutants) in the mutant population. a Scatters in the first circle represents the accumulative mutation frequency of the eight mutants at a nuclear genome position, in which near the centre and far from the centre represented values of 1 and 2, respectively; the distribution of mutation sites in the nuclear genomes of eight mutants (from RD-1–8) was displayed in the highlighted form in the circles 2–8 of the Circos diagram. b Potentially affected genes by nuclear genome mutations and their intersections in eight mutants. The area of the legend represents a gene, while the shaded part indicates genes potentially affected by mutations located in the promoter regions. c Annotated KEGG pathway for genes potentially affected by the nuclear genome mutations and their intersection in eight mutants. The maximum co-occurrence frequency of the nuclear genome mutation sites, its potentially affected genes, and thus the possibly altered KEGG pathways in the eight mutants were 2, 2, and 3, respectively

Table 1 Potentially affected KEGG pathways by mutations in the nuclear genome among the mutant population

DEG distribution pattern among the eight mutants

The distribution pattern of all DEGs among the eight mutants is shown in Fig. 3. The average number of DEGs was 775 in each mutant (Fig. 3a). A total of 1548 genes were down-regulated and 1296 genes were up-regulated in the investigated mutant population (Fig. 3b–d). As the co-occurrence frequency increased, the number of DEGs decreased rapidly, both for up-regulated and down-regulated genes (Fig. 3b–d). A total of 506 DEGs showed a co-occurrence frequency of ≥ 4, of which 349 were down-regulated and 157 were up-regulated (Fig. 3b–d). Sixteen, 51, 69, 134, and 236 DEGs co-occurred in any of the 8, 7, 6, 5, and 4 mutants, respectively (Fig. 3b–d).

Fig. 3
figure 3

Distribution pattern of the DEGs among the eight mutants. The DEG number of each mutant (vs. wild-type strain) are shown in a DEG number corresponding to a specific co-occurrence frequency among eight mutants are shown in b. c, d Up-regulated and down-regulated DEGs, respectively; the eight colors (from top to bottom) represent different co-occurrence frequencies (from 8 to 1); the thickness of the legend line represents a DEG

KEGG enrichment analysis of DEGs with co-occurrence frequencies ≥ 4 and ≥ 5 among the eight mutants

For mutations in the nuclear genomes, the mutation sites, genes potentially affected by the mutations, or possibly altered KEGG pathways showed a maximum co-occurrence frequency of three among the eight mutants (Fig. 2). However, the phenotype of mitochondrial respiratory dysfunction and the large deletions in the mitochondrial genome co-occurred in all investigated mutants (Fig. 1). Based on this genetic background, DEGs with a co-occurrence frequency > 3 among the eight respiratory-deficient mutants were selected for analysis to conservatively identify the most significant metabolic pathways affected by mitochondrial dysfunction and exclude the interference caused by unrelated nuclear genes.

The DEGs with co-occurrence frequencies ≥ 4 and ≥ 5 were subjected to their respective KEGG pathway enrichment analysis (Fig. 4a, b). All 20 secondary enrichment terms were selected because the biological pathways related to mitochondria are very extensive, making it difficult to obtain a few relatively prominent KEGG enrichment terms at the third-class level. The q value and enrichment factor were used as the first and second indicators for judging the significance of enrichment, respectively. Lipid metabolism showed the lowest q value for DEGs with co-occurrence frequencies of both ≥ 4 and ≥ 5. In addition, the enrichment factor of lipid metabolism was in the top three among all enriched terms (Fig. 4a, b). Based on the overall consideration of the q value and enrichment factor, lipid metabolism was the most significant enrichment term.

Fig. 4
figure 4

KEGG pathway enrichment results for DEGs with co-occurrence frequencies ≥ 4 and ≥ 5 among the eight mutants. Considering the q value and enrichment factor, lipid metabolism was the most significantly enriched KEGG term for DEGs with co-occurrence frequencies of both ≥ 4 (a) and ≥ 5 (b). Each DEG enriched in lipid metabolism showed the same up- or down-regulated pattern among nearly all eight mutants (c). The randomly selected RNA-seq results and corresponding RT-qPCR results were linearly fitted. The determination coefficient was approximately 0.9, indicating that the RNA-seq results agreed well with RT-qPCR results (d)

Similar expression patterns of the identified lipid metabolism-related DEGs among nearly all eight mutants at the transcriptional level

Based on KEGG pathway enrichment of the DEGs with a co-occurrence frequency ≥ 4 among the eight mutants, genes enriched in the term of lipid metabolism were collected. In Fig. 4c, the FC in the expression abundance of these genes in the eight mutants (vs. wild-type strain) was enumerated. All these lipid metabolism-related genes showed the same up-/down-regulated expression pattern among nearly all eight mutants at the transcriptional level (Fig. 4c). As the annotation results on nuclear genome mutations suggests that only a lipid metabolism-related pathway was potentially affected in one of the mutants (Table 1), the similar change pattern of the identified lipid metabolism-related DEGs among the eight mutants can be conservatively attributed to the population commonality of mitochondrial dysfunction. Additionally, RT-qPCR was performed to determine the FC in the expression abundance of some lipid-related genes (YBR183W, YHR190W, YOL011W, YJR073C, YLL012W, and YIL009W) in the eight mutants (vs. wild-type strain) (Supplemental Table S3). The transcriptome and RT-qPCR results were evaluated by linear correlation analysis (Fig. 4d). The coefficient of determination was ~ 0.9, supporting the reliability of transcriptomic analysis results.

The third-class metabolic pathways that may be affected by the identified lipid metabolism-related DEGs were analyzed (Fig. 5a). Fatty acids are one of the simplest fats and a component of many complex lipids. Both the fatty acid synthesis and degradation pathways contained up-regulated as well as down-regulated DEGs. Additionally, the glycerophospholipid metabolism pathway contained the most DEGs (Fig. 5a). Thus, the potentially affected catalytic enzymes by the identified lipid metabolism-related DEGs were presented in the glycerophospholipid metabolism network (Fig. 5b). It can be seen that catalytic enzymes that may be affected by the DEGs were dispersed throughout the metabolic network; crosstalk was common among various glycerophospholipids. Taking phosphatidylethanolamine (PE) as an example, almost all of the DEGs affecting the catalytic enzymes in the metabolic network located upstream or downstream of PE metabolism (Fig. 5b). Therefore, it was challenging to infer the change pattern (up-regulated or down-regulated) of a certain lipid class based on these DEGs.

Fig. 5
figure 5

Roles of identified lipid metabolism-related DEGs in the third-class lipid metabolism-related KEGG pathways. a The third-class metabolic pathways that may be affected by the identified lipid metabolism-related DEGs. b The potentially affected catalytic enzymes by the identified lipid metabolism-related DEGs in the glycerophospholipid metabolism network. The red mark indicated a down-regulation, while the green mark indicated an up-regulation

Similar change patterns of the identified differentially expressed lipid species among nearly all investigated mutants revealed by lipidomic analysis

Five randomly selected mutants and the wild-type strain were evaluated using lipidomic analysis. A total of 36 lipid extracts were detected by UHPLC-ESI-MS. Additionally, five QC samples were set at intervals to evaluate the stability of the analytical system during the detection process. Figure 6 a and b show the base peaks of UHPLC-ESI-MS of five QC samples under positive and negative ion modes, respectively. The response intensity and retention time of each peak in the QC samples showed high reproducibility, suggesting the stability of the analytic system and reliability of the experimental data.

Fig. 6
figure 6

Stable and reliable UHPLC-ESI-MS system and identified lipid classes. a, b The base peak of UHPLC-ESI-MS for five QC samples in positive ion and negative ion modes, respectively. The response intensity and retention time of each chromatographic peak exhibited good reproducibility, indicating the high stability and reliability of the analytical system. c Identified lipid classes and number of lipid species in each lipid class

Based on the established lipid detection system, a total of 648 lipid species in 25 lipid classes were identified (Fig. 6c). These 25 classes belong to several major lipid categories, such as fatty acids [fatty acyls (FA)], glycerides [diacylglycerol (DG), triglyceride (TG)], sphingolipids [ceramide (Cer), glucocerebroside (CerG1), sphingomyelin (SM), sphingosine (So)], glycolipids [monogalactosyl diacylglycerol (MGDG), mulfoquinovosyl diacylglycerol (SQDG)], and glycerophospholipids [cardioplipin (CL), phosphatidic acid (PA), phosphatidylcholine (PC), PE, phosphatidylglycerol (PG), phosphatidylinositol (PI), phosphatidylserine (PS)]. Additionally, principal component analysis (PCA) was performed based on the whole lipidomic dataset, and a satisfactory PCA model was obtained (Fig. 7a; Supplemental Table S4). In the PCA plot, QC samples were distributed at the center, and repeated tests of each strain were closely clustered together (Fig. 7a). This result also illustrates the accuracy and precision of the lipidomic detection and corresponding analysis. Additionally, the wild-type strain was in the third quadrant and was distinctly separated from the mutants; RD-1 and RD-2 were closely located in the first and second quadrants, respectively; RD-3, RD-4, and RD-5 were clustered in the fourth quadrant (Fig. 7a). Overall, the mutant strains shifted to the right relative to the wild-type strain.

Fig. 7
figure 7

Distinction of different groups of samples by PCA, PLS-DA, and OPLS-DA models. a Principal component analysis of five QC samples, six wild-type strain samples, and 30 mutant samples based on the entire lipidome data. Two main composite variables (t[1] and t[2]) were used to reflect information for the original variables. The PCA model parameters R2X (mutant/control) and Q2 represent the model interpretation rate and model prediction ability, respectively, and were obtained by 7-fold cross-validation. The relationship between lipid species intensity and groups (mutant/wild-type strain) was fitted using the PCA, PLS-DA, and OPLS-DA models. bf The score plots of three models for five randomly selected mutants. The detailed model parameters are shown in supplemental Table S4

Further, the relationship between the expression intensities of the 648 identified lipid species and any mutant (vs. wild-type strain) was fitted using three models: PCA, partial least squares discrimination analysis (PLS-DA), and orthogonal PLS-DA (OPLS-DA). The score plots of the three models for each mutant/control are shown in Fig. 7b–f. The mutant and control were well distinguished by all three models. The model evaluation parameters (R2Y and Q2) obtained through 7-cycle cross-validation are listed in Supplemental Table S4. Both R2Y and Q2 were close to 1, indicating that the obtained models were stable and reliable. The PLS-DA and OPLS-DA models were tested using a displacement test. The intercept of the Q2 regression line and Y-axis was less than 0.05 (Supplemental Table S4), suggesting that the data had not been over-fitted.

The FC and corresponding p values of the identified lipid species are shown in Fig. 8a for each mutant (vs. wild-type strain). The lipid species with p < 0.05 and FC > 2 or < 0.5 accounted for a large proportion of all identified lipid species. To identify the differentially expressed lipid species (DELS) with great biological significance, the variable importance in projection (VIP) > 1 was selected as one of the screening thresholds, which was obtained according to the valid OPLS-DA model, and was used to measure the influence intensity and explanatory ability of each lipid expression pattern on the classification and discrimination of each group of samples. In addition, the screening threshold of FC was adjusted to > 1.5 or < 0.67 to relieve the strictness of screening conditions. Finally, the DELS were judged by a p value < 0.05, FC > 1.5 or < 0.67, and VIP > 1. Based on this, a total of 126 DELS were collected and the relative FCs of these lipid species in the examined mutants (vs. wild-type strain) was enumerated (Fig. 8b). At the metabolic level, all DELS showed the same pattern of up-/down-regulation among nearly all investigated mutants, which is consistent with the transcriptional data and can be also conservatively attributed to the population commonality of mitochondrial dysfunction.

Fig. 8
figure 8

Expression pattern of DELS identified by UHPLC-ITMS in the five randomly selected mutants. a Volcano plots highlighting the lipid species with p < 0.05, FC > 2 or < 0.5 in mutants. b DELS after considering the VIP value in the OPLS-DA model, in which those with VIP ≥ 1, p value ≥ 0.05, and FC ≤ 1.5 and ≥ 0.68 were filtered out; a heatmap corresponding to the lipid species was used to indicate the frequency of mutants with the same up- or down-regulation pattern. Overall, all DELS showed the same up-/down-regulated pattern among nearly all mutants examined. c Distribution of DELS in different classes (or sets). d Statistical results for FC in expression intensity of DELS included in different classes (or sets); the content in the brackets following lipid class (or set) name represents the number of the values involved in the statistics, which were obtained by the number of identified DELS × the number of investigated mutants. Square filled with a specific color indicated the average value

Up-regulated lipid classes contained TG (32 lipid species), PC (15 lipid species), lysophosphatidylcholine (LPC; 10 lipid species), lysophosphatidylethanolamine (LPE; 7 lipid species), PI (5 lipid species), DG (4 lipid species), lysophosphatidylinositol (LPI; 2 lipid species), PA (2 lipid species), PE (1 lipid species), PS (1 lipid species), and SQDG (1 lipid species) (Fig. 8b). Down-regulated lipid classes contained PE (18 lipid species), PC (14 lipid species), PS (8 lipid species), PI (3 lipid species), LPE (1 lipid species), LPI (1 lipid species), and lysophosphatidylserine (LPS; 1 lipid species) (Fig. 8b). The number of up-regulated lipid species was 1.74-fold higher than that of down-regulated lipid species (Fig. 8b, c). The average log2(FC) of all identified DELS in all detected mutants was 0.54 (Fig. 8b, d). The average log2(FC) of up-regulated lipid species in all detected mutants was 1.49, which was 1.33-fold higher than the average −log2(FC) of down-regulated lipid species (Fig. 8b, d). TG, PC, PE, LPC, PS, LPE, and PI were the major lipid classes showing differential expression. Each of these lipid classes contained > 8 DELS, and the total lipid species involved accounted for 90.55% of all DELS (Fig. 8b, c). TG, LPC, LPE, and PI were mainly up-regulated; PE and PS were mainly down-regulated; PC was up-/down-regulated, and the average log2(FC) was ~ 0.233 (Fig. 8b, d). Additionally, the differentially expressed lipid classes mainly belonged to two major categories, glycerides and glycerophospholipids.

Discussion

Generally, mitochondrial respiratory function is only reduced to a certain extent in mitochondrial diseases (Yu et al. 2017). In this study, large SVs of the mitochondrial genome and severely impaired mitochondrial respiratory function in S. cerevisiae essentially reflect the limiting state in human cells in many mitochondrial diseases. In terms of fate determination, mitochondria play different roles in S. cerevisiae and human cells. When mitochondrial respiratory function is severely impaired, S. cerevisiae cells can sustain low-rate activity through anaerobic respiration, while most human cells are inactivated (Guo et al. 2019a). However, severely impaired mitochondrial function leads to energy shortage and abnormal mitochondrial components, which block biochemical reactions in S. cerevisiae mitochondrial respiration-deficient mutants. These cascade reactions are consistent between S. cerevisiae cells in this study and mammalian cells with decreased mitochondrial function. Therefore, the S. cerevisiae respiration-deficient mutant selected for this study is a suitable model for human cells in the mitochondrial disease state, demonstrating the high reference value of this study.

The frequency of mtDNA mutations is much higher than that of nuclear DNA (Guo et al. 2019a; Lynch et al. 2008). This characteristic makes it relatively easy to obtain stable S. cerevisiae mitochondrial mutants by mutagenesis and directed screening. To study mitochondrial function, S. cerevisiae respiration-deficient mutants were obtained by HIB irradiation mutagenesis in our previous study (Guo et al. 2019a). However, HIB irradiation mutagenesis inevitably and randomly led to mitochondrial dysfunction-unrelated mutations in the nuclear genome, interfering with the study on the effects of mitochondrial dysfunction. Thus, this study adopted the following strategies to eliminate interference from the mitochondrial dysfunction-unrelated nuclear genome mutations. First, population studies were conducted based on a mitochondrial respiratory-deficient mutant group; the co-occurrence frequency was used to determine whether any phenotypic or molecular change was caused by a loss of mitochondrial function, as mitochondrial mutation is common to the whole group and mutation in the nuclear genome is typically individual. The second was to use whole-genome sequencing to obtain a clear population genetic background. Determining the threshold of co-occurrence frequency requires a clear genetic background of the population, and whether there are nuclear genome mutations that can interfere with the focused scientific questions under study can be quantitatively judged. In the present study, all of the nuclear genome mutations, and thus potentially affected genes and metabolic pathways, showed a co-occurrence frequency of ≤ 3 in the investigated mutant population. Therefore, DEGs with a co-occurrence frequency of ≥ 4 and ≥ 5 were selected for their respective KEGG enrichment analysis to identify the metabolic pathway significantly affected by mitochondrial dysfunction at the transcriptional level. The enrichment results for DEGs ≥ 4 and ≥ 5 were similar, and lipid metabolism was the most significantly enriched KEGG term. Any identified DEG related to lipid metabolism showed the same up-/down-regulated pattern among nearly all eight mutants. Any identified DELS also exhibited a common up-/down-regulation pattern among nearly all examined mutants. This illustrates the rationality of our threshold selection. Additionally, genomic analysis revealed that only a lipid metabolism-related pathway was likely to be affected by the nuclear genome mutations in the RD-1 strain. Therefore, changes in lipid species that co-existed in nearly all investigated mutants were only attributed to mitochondrial dysfunction in this study.

The investigation on mitochondrial function in this study was based on a multi-omics analysis, making the results more detailed and rigorous. Genomic analysis revealed the properties of the study material and provided a clear genetic background. Significant SVs in the mitochondrial genome were common to the entire S. cerevisiae mutant population investigated in this study. The nuclear genome of each mutant showed an average of only 14 mutation sites, and the average number of genes potentially affected by the mutations was 11. Additionally, nuclear genome mutations were unique to each other with very few intersections. This genetic background determined that the interference from the mitochondrial dysfunction-unrelated nuclear genome mutations can be eliminated by setting high thresholds of co-occurrence frequency of differentially expressed molecules among mutant population. Therefore, the mitochondrial respiration-deficient mutant population used in this study is a suitable model for conducting mitochondrial function studies. Gene transcription is an important process for producing phenotypic characteristics. In this study, we quantitatively compared the extent to which each metabolic change was affected by mitochondrial dysfunction at the transcriptional level, and lipid metabolism was considered the most significantly enriched pathway. However, it is difficult to accurately characterize and quantify changes in gene products corresponding to genetic variation only based on transcriptional results. The changes are determined by the complex cellular metabolic network (Logue and Morrison 2012; Volterra et al. 2014). In this study, the metabolisms of most lipid classes interact with each other. The analysis on glycerophospholipid metabolism network showed that the change pattern of a glycerophospholipid species is a comprehensive effect caused by the potential changes both in incoming and outgoing, as well as dense crosstalk from various other metabolites in the metabolic network. The fatty acid synthesis and degradation pathways were simultaneously affected both by the up- and down-regulated DEGs. This suggests that, in this study, most of the DELS were more inclined to be an anfractuous comprehensive effect caused by the relevant DEGs. Meanwhile, transcription is only one process involved in gene product formation. The relationship between the number of transcripts and abundances of gene products is not linear and is also affected by translation, biological enzyme catalysis, and many other biological processes. Therefore, metabolomic analysis was performed based on transcriptomic analysis, which further accurately and quantitatively described changes in gene products caused by genomic changes. Significant changes in lipid species in all investigated mutants confirmed the predictions of transcriptomic analysis. Additionally, the DELS showed the same up-/down-regulated patterns among nearly all examined mutants, which was consistent with the results of transcriptomic analysis. Overall, the genomic, transcriptomic, and metabolomic results were mutually causal and validated.

Mitochondria are one of the major organelles directly involved in lipid metabolism. Fatty acids are the simplest type of fat and are components of many complex lipid molecules. In the presence of intact mitochondrial function, acetyl-CoA produced in mitochondria is an important carbon source for fatty acid synthesis (Moreau et al. 2006; Yang et al. 2019). Transport of acetyl-CoA to the outside of mitochondria depends on the “citrate pyruvate cycle” that occurs in mitochondria (Mishra et al. 2016; Yang et al. 2019). By continuously condensing acetyl-CoA to the carboxy terminus, the carbon chain of palmitic acid or other saturated fatty acids can be prolonged in mitochondria (Van Vranken et al. 2018). In addition, the beta oxidation of fatty acids is mostly completed in mitochondria. Fatty acyl-CoA produced by the activation of fatty acids is transported to mitochondria, which is oxidatively decomposed to produce energy, carbon dioxide, and water (Adeva-Andany et al. 2019). Additionally, changes in the mitochondrial energy supply, various signalling pathways related to mitochondria, and mitochondrial metabolite-mediated epigenetic modifications certainly affect numerous cellular pathways, including lipid metabolism. Although mitochondria play an important role in lipid metabolism, few quantitative indicators have been identified for measuring the relative and absolute effects of mitochondria on lipid metabolism. This study demonstrated that the most enriched KEGG term of the DEGs was lipid metabolism when mitochondrial function was deficient. Further, the results of lipidomic analysis quantified the effect of mitochondrial dysfunction on lipid metabolism. In detail, the DELS accounted for more than 20% of the total lipid species identified; the average log2(FC) of each DELS in each mutant was 0.54. For the entire examined mutant population, the number and average |log2(FC)| of the up-regulated lipid species were 1.74- and 1.33-fold higher than those of down-regulated lipid species, respectively. The lipid class with the most obvious variance was TG, which appeared to be only up-regulated. TG is an important fat for energy storage and energy supply in S. cerevisiae cells. This conclusion is consistent with previous findings showing that decreased mitochondrial number or function will increase the risk of diseases such as atherosclerosis and fatty liver (Peng et al. 2019; Wieckowski et al. 2019).

PE and PC, the most abundant phospholipid classes in the plasma membrane (Patel and Witt 2017; van der Veen et al. 2017), mainly presented an obvious change in relative proportion. In particular, PE species showed an obvious decrease in content among the entire mitochondrial respiration-deficient mutant population. This phenomenon is consistent with the fact that PE synthesis via the PS decarboxylase (PSD) pathway or the acylation of LPE is closely related to mitochondria (Horvath and Daum 2013; Vance 2018). The PSD pathway occurs in mitochondria; in several types of cultured cells, more than 80% of PE appears to be derived from the PSD pathway (Vance 2018). The production of PE by the acylation of LPE is active in yeasts, in which the majority of LPE acyltransferase activity is localized to the mitochondria-associated membranes (Vance 2018). For PC species, 15 of them were up-regulated and 14 of them were down-regulated. This change pattern for PC species was common among the entire investigated population, and the average log2(FC) was ~ 0.233. Thus, the ratio of PE and PC decreased. Therefore, mitochondrial dysfunction corresponded to an increased fat content and a decreased ratio between PE and PC. Previous data showing that changes in the PC and/or PE content of various tissues are implicated in metabolic disorders such as atherosclerosis, insulin resistance, obesity, and cancer (Fahrmann et al. 2016; Patel and Witt 2017; van der Veen et al. 2017, 2019). At the cellular level, phospholipids in biological membranes typically play a key role in cellular biological function and membrane molecular organization (Arashiki and Takakuwa 2017; Pennington et al. 2019; Renne and de Kroon 2018). For example, PE, a phospholipid class showing an obvious down-regulation in our study model, has attracted much interest due to its wide-ranging effects on cells. In the mammalian cell model, PE has been confirmed to be involved in signal transduction, such as activation of many cell surface signalling proteins and the initiation of autophagy process (Patel and Witt 2017; Vance 2018). The other remarkable activities of PE include: assisting in the folding of certain membrane proteins (Patel and Witt 2017); promoting membrane curvature, enhancing the membrane fusion, and regulating the membrane fluidity (Dawaliby et al. 2016; Vance 2018). It is also required for the activity of several of the respiratory complexes (Patel and Witt 2017). In the S. cerevisiae cell model, studies revealed that the lack of PE can result in a failure of the permease targeting to plasma membrane (Opekarova et al. 2002). PE made in the inner mitochondrial membrane is also essential for yeast cytochrome bc1 complex function (Calzada et al. 2019).

In the present study, multi-omics analysis was combined with the investigation on co-occurrence frequency of mutations and differentially expressed molecules among the entire investigated mutant population, which were demonstrated to be effective for eliminating interference from the mitochondrial dysfunction-unrelated nuclear genome mutations. In conclusion, the similar up-/down-regulated change patterns of the identified lipid metabolism-related DEGs and DELS among the nearly all examined mutants can be conservatively attributed to population commonality (large deletions in the mitochondrial genome and remarkable mitochondrial dysfunction) but not individual characteristics (unique mitochondrial dysfunction-unrelated nuclear genome mutations in each mutant). Based on this, lipid metabolism was revealed as one of the most vulnerable cellular pathways affected by mitochondrial dysfunction. The change in the spectrum of lipid species attributed to mitochondrial dysfunction was quantitatively mapped. The affected lipids have characteristics of many kinds and deep degree. Specifically, mitochondrial dysfunction mainly up-regulated energy storage-related fat and decreased ratio between PE and PC. Additionally, the established strategies in this study provides an important reference for high-throughput mitochondrial DNA mutation-related functional studies at the population level. Particularly, the co-occurrence frequency was consistent throughout genomic, transcriptomic, and metabolomic analyses.