Introduction

Citrus is the most economically important evergreen subtropical fruit crop in the world, with more than 9 million hectares grown and about 125 million tons of fruits produced in 2017 (FAO 2017; Navarro et al. 2015; Willer and Lernoud 2016). All citrus fruits are an essential source of vitamin C, contributing to the nutritional supply of humans, together with polyphenols and carotenoids. Conventional breeding in citrus is hindered by several biological barriers, such as sexual incompatibility, nucellar polyembryony, long juvenility, pollen and ovule sterility. Accordingly, most citrus cultivars derive from somatic mutations, which lead to a poor and narrow genetic diversity.

Biotechnological approaches circumvent these difficulties, improving breeding efficiency and expediting genetic improvement processes. Among the current strategies used to this purpose, somatic hybridization via protoplast fusion represents the most powerful method to overcome sexual barriers and to generate novel and unobtainable germplasm resources (Grosser and Gmitter 1990; Guo and Deng 2001; Fu et al. 2009; Eeckhaut et al. 2013; Guo et al. 2013; Tarwacka et al. 2013; Liu and Xia 2014). In the past two decades, a large number of intergeneric and interspecific citrus somatic hybrids has been obtained and used in scion and rootstock improvement; they have been used in breeding programs for specific agronomic traits, such as seedlessness and disease resistance (Cai et al. 2007; Fu et al. 2009, 2011; Grosser and Gmitter 2011; Dambier et al. 2011; Guo et al. 2013). Nevertheless, since most citrus cultivars are diploids, the resulting somatic hybrids are mainly allotetraploids, containing the complete nuclear genomes of both parents (Grosser et al. 2000). Unfortunately, the increased chromosome number in the hybrid and the incompatibility of genome recombination may influence the genetic integration of the desired traits into the cultivated varieties (Osborn et al. 2003); accordingly, tetraploidy has often been associated with several unfavourable agronomic traits. On this basis, the use of tetraploid varieties in commercial citrus orchards has found a very limited application. However, several diploid plants with morphology similar to the leaf-parent (non-embryogenic) have also been generated after somatic hybridization experiments (Grosser et al. 1996; Liu et al. 1999). Molecular characterization of these leaf-parent-type plants revealed that they are diploid cybrids, which contain nuclear DNA genome of the leaf parent, and either mtDNA and cpDNA of the embryogenic callus/suspension parent or randomly inherited from both parental species (Moreira et al. 2000; Guo et al. 2004, 2006). In contrast to sexual hybridization, where the cytoplasmic genomes are maternally inherited, somatic hybridization generates novel nuclear-cytoplasmic combinations. Since higher plants features are genetically controlled by the interaction among nuclear, plastidial and mitochondrial genomes (Suzuki et al. 2012; Luo et al. 2013; Ng et al. 2014), the possibility of having new and original combinations of nuclei and organelles may have interesting applications in the creation of novel genotypes with potential agronomic values (Grosser et al. 1996; Guo et al. 2004, 2013; Cabasson et al. 2001; Xiao et al. 2014; Aleza et al. 2016; Fang et al. 2016; Omar et al. 2017). Plastids and mitochondria fulfil important metabolic functions that greatly affect plant growth and productivity. Many metabolic and developmental pathways involved in plant adaptation intersect within the plastid (Rolland et al. 2012), such as phytohormone- and lipid-centred processes and plant biotic defence. Likewise, heat tolerance, cytoplasmic male sterility and oxidative stress response (Jacoby et al. 2012) are controlled by mitochondria. The peculiar genetic structure of cybrids makes them the most appropriate candidates for studies on plant adaptive mechanisms and for the analysis of nuclear-cytoplasmic interactions.

Plant organellar replacement has been associated with changes in the corresponding transcriptome, metabolome, and proteome. Differential proteomic analysis is a powerful molecular approach for describing the rearrangement of the protein repertoire as a result of different genetic, endogenous and physiological factors (Zhao et al. 2013). Through the use of proteomic approaches, several advances have been made in understanding molecular events associated with plant physiology and function, e.g. mineral deficiency (Liang et al. 2013), abiotic stress response (Kosová et al. 2011; Zhang et al. 2013), disease tolerance (Rampitsch and Bykova 2012; Chetouhi et al. 2015), heavy metal hyper-accumulation (Visioli and Marmiroli 2013), flood tolerance (Komatsu et al. 2013) and fruit development/ripening (Palma et al. 2011).

A specific diploid cybrid was obtained by somatic hybridization experiments included in a selective breeding program aimed at improving citrus lemon quality traits (Abbate et al. 2012; Fatta Del Bosco et al. 2013) and enhancing resistance toward the most pathogenic fungal disease for lemon, i.e. “malsecco” caused by Phoma tracheiphila (Tusa et al. 2000; Scarano et al. 2003). In the present work, the selected cybrid was studied in comparison to its parents and the corresponding somatic allotetraploid hybrid by using integrated physiological, proteomic and metabolite-oriented analyses in order to highlight the role of the cytoplasmic genome rearrangement and to provide useful information on protein expression associated with beneficial and/or valuable (pharmaceutical, nutraceutical and agronomic) traits.

Materials and methods

Plant materials

Citrus plants were obtained by somatic protoplast fusion between heterozygous inbred lines of ‘Valencia’ sweet orange (Citrus sinensis L. Osbeck) (V) and ‘femminello’ lemon (Citrus limon L. Burm) (F). The protoplasts were isolated from embryogenic callus of sweet orange and leave of lemon. The allotetraploid somatic hybrid ‘Valencia’ + ‘femminello’ (V + F) and the diploid ‘femminello’ lemon cybrid (C2N) were developed by the polyethylene glycol (PEG) method, as previously described (Tusa et al. 1990, 2000; Grosser et al. 1996; Abbate et al. 2012) and characterized by isozymes (Tusa et al. 1990), RFLP (Grosser et al. 1996) and SSR (Scarano et al. 2003). Here, five nuclear (nuSSR), eleven chloroplast (cpSSR) and nine mitochondrial (mtSSR) loci (Cheng et al. 2003, 2005; Jannati et al. 2009; Froelicher et al. 2011) (Online Resource 1: Table S1) were used to characterize the cytoplasmic background of investigated plants. DNA was extracted from young fresh leaves (Lodhi et al. 1994), and its quality and concentration was verified by a NanoDrop spectrophotometer (Thermo Scientific). PCR reactions were performed following the procedures reported in Mercati et al. (2013). The fragments were separated by capillary electrophoresis using an ABI PRISM 3500 Genetic Analyzer (Applied Biosystems).

Citrus cybrid, somatic hybrid and parental plants were grafted onto the same rootstock (Citrus aurantium L.) and subjected to identical, standard cultivation practices in the experimental field station of IBBR–CNR in Collesano, Palermo (Italy). The plant material was collected in June 2015. For all experiments, 30 healthy and fully-expanded leaves from the last finished sprouting branches were harvested. The effects of tree and branch position were minimized by random sampling. Leaves were collected at noon (under full sun) and immediately frozen in liquid nitrogen. Three different specimens for each genotype were analysed. Leaf samples were stored at − 80 °C until use.

Protein extraction

For each sample, whole protein extracts were obtained from 100 mg of collected leaves. Eight different extraction methods were tested, using the procedures described by Puglia et al. (1995) and Wang et al. (2006) with or without extensive modifications (Online Resource 2: Table S2). All the biomass samples were washed two times with 1 ml of 10 mM Tris–HCl pH 7.5, 5 mM EDTA, 1 mM DTT, 0.5 mM PMSF, 4 mg/ml leucopeptin, 0.7 mg/ml pepstatin, 5 mg/ml benzamidin by vortexing, and then centrifuged (15,000×g, 10 min, at 4 °C). All the other pre-wash or wash steps (Table S2) were performed using 2 ml of solvents/buffers by vortexing and following centrifugation (15,000×g, 4 min, at 4 °C). Cells were disrupted by sonication on ice in 1 ml of extraction buffer (output control 4, 8 × 15 s, Vibra Cell, USA). All protein extracts were boiled for 5 min, and rapidly cooled down on ice for 15 min; then they were treated with DNase (100 μg/ml) and RNase (50 μg/ml) in ice, for 15 min. Cell debris and non-broken cells were separated by centrifugation (15,000×g, for 15 min, at 4 °C). In order to remove salts and low molecular mass contaminants, which may interfere with CyDye-labeling of proteins and their 2D-gel electrophoretic separation, extracts were dialyzed against distilled water (for 3 h, at 4 °C) and/or treated with 1 vol of phenol/chloroform/isoamyl alcohol (25/24/1, v/v; Sigma-Aldrich) for 5 min at room temperature mixing by vortex. After centrifugation (15,000×g, 5 min, at 4 °C), they were recovered from interface and organic phases by discharging the aqueous phase. Finally, proteins were precipitated using one of the two methods reported in Table S2, at − 20 °C, overnight, and protein precipitate was recovered by centrifugation (15,000×g, 10 min, at 4 °C). Before resuspension, if needed, protein pellets were further washed with 1 ml of methanol and/or acetone, centrifuged (15,000×g, 20 min, at 4 °C), and then solved in 50–100 µl of isoelectric focusing (IEF) buffer (30 mM Tris pH 8.5, 7 M urea, 2 M thiourea, 4% w/v CHAPS, pH 8.4).

2D-differential in gel electrophoresis (2D-DIGE) and imaging

For Two-Dimensional Differential in Gel Electrophoresis (2D-DIGE) analysis, leaf proteins from three replicates for each genotype (i.e. V, F, V + F and C2N) were used. Protein samples were labelled for 2D-DIGE analysis using the CyDye™ DIGE minimal labeling kit (GE Healthcare, Sweden) as previously described (Palazzotto et al. 2015). In particular, after determination of protein concentration using Bradford reagent, 50 μg of each protein sample were labelled with 200 pmol of CyDye on ice, in the dark, for 30 min, performing dye-swapping to minimize dye-specific labelling effects according to manufacturer’s recommendations. Labelling reaction was quenched by the addition of 0.5 μl of 10 mM lysine; incubation was continued on ice for 10 min, in the dark. For IEF, DeStreak rehydration solution (GE Healthcare, Sweden) containing 0.5% v/v immobilized pH gradient (IPG) buffer (GE Healthcare, Sweden) and 20 mM DTT (Sigma, Australia) was added to each mix to reach a 340 μl final volume. IEF was performed as previously described (Palazzotto et al. 2015), using non-linear 3–10 pH range, 18 cm IPG strips (GE Healthcare, Sweden) in an Ettan IPGphor III apparatus (GE Healthcare, Sweden). After IEF, IPG strips were incubated with an equilibration buffer (6 M urea, 30% v/v glycerol, 2% w/v SDS, and 0.05 M Tris–HCl, pH 6.8) containing 2% w/v DTE for 10 min. Protein thiol groups were then blocked by further incubation with equilibration buffer containing 2.5% w/v iodoacetamide. Focused proteins were then separated by using 12% SDS–PAGE, at 10 °C, in an Ettan Daltsix (GE Healthcare, Sweden) apparatus, with a maximum setting of 40 μA per gel and 110 V. The obtained 2D-gels were digitalized by using Typhoon FLA 9500 instrument (GE Healthcare, Sweden) following manufacturer’s instructions, and setting photomultiplier tube at 740, 600 and 700 V for Cy3, Cy5 and Cy2 channels, respectively. 2D-gel image analysis was performed using Image Master 2-D platinum (version 7.0) 2D-DIGE enabled software (GE Healthcare, Sweden), according to manufacturer’s instructions. Protein spots showing at least 1.5-fold change in their normalized volume (increased for over-representation or decreased for down-representation), with a statistically significant ANOVA value (P ≤ 0.05), were considered as differentially represented; they were selected for further mass spectrometry (MS) analysis for protein identification. To this aim, 2D-gels were also stained with ammoniacal silver, as described in Palazzotto et al. (2016). The relative protein abundance profiles of identified protein spots were illustrated through heatmap clustering by using the R package ggplot2 (https://cran.r-project.org/web/packages/ggplot2/index.html).

Protein identification by mass spectrometry

Protein spots were carefully cut from the gels, triturated, in gel-reduced with dithiothreitol, S-alkylated with iodoacetamide, and subsequently in-gel digested with trypsin (sequencing grade, Roche). Resulting digests were desalted by μZip-TipC18 (Millipore) before nanoLC–ESI–LIT–MS/MS analysis (Gallo et al. 2016) using a LTQ XL mass spectrometer (ThermoFisher, USA) equipped with a Proxeon nanospray source connected to an Easy-nanoLC (ThermoFisher). Peptides were resolved on an Easy C18 column (100 mm × 0.075 mm, 3 μm) (Proxeon, Denmark). Mobile phases (LC-MS grade) were 0.1% v/v formic acid (solvent A) and 0.1% v/v formic acid in acetonitrile (solvent B), running at a total flow rate of 300 nl/min. A linear gradient was initiated 20 min after sample loading; solvent B ramped from 5 to 35% over 10 min, and from 35 to 95% over 2 min. Spectra were acquired in the range of m/z 400–2000. Peptides were analysed under collision-induced dissociation data-dependent product ion scanning procedure, enabling dynamic exclusion (repeat count 1 and exclusion duration 60 s) over the three most abundant ions. Mass isolation window and collision energy were set to m/z 3 and 35%, respectively.

Raw data from nanoLC–ESI–LIT–MS/MS analysis were searched by MASCOT search engine (version 2.2, Matrix Science, UK) against an updated NCBI non-redundant database (October 2015) with the aim to identify proteins from gel spots. Database searching was performed by using Cys carbamidomethylation and Met oxidation as fixed and variable protein modifications, respectively, a mass tolerance value of 1.8 Da for precursor ion and 0.8 Da for MS/MS fragments, trypsin as proteolytic enzyme, and a missed cleavage maximum value of 2. Other MASCOT parameters were kept as default. Protein candidates assigned according to at least two sequenced peptides with an individual peptide expectation value < 0.05 (corresponding to a confidence level for peptide identification > 95%) were considered confidently identified. Definitive peptide assignment was always associated with manual spectra visualization and verification.

GC/MS analysis

Freshly collected leaves (around 500 mg) were cut and placed in a special vial for headspace. The sample preparation was performed with a MultiPurpose Sampler MPS apparatus (Gerstel, Mulheim an der Ruhr, Germany) equipped with a cooled agitator. The temperature of the syringe was set to 40 °C, flush time 30 s; the sample was heated to 40 °C, for 10 min, under stirring (speed 250 rpm).

The headspace analysis was performed in an Agilent 7000C Triple Quadrupole GC/MS system, fitted with a fused silica Agilent HP-5 capillary column (10 m × 0.25 mm i.d.; 0.25 μm film thickness), coupled to an Agilent Mass Selective Detector MSD 5973; ionization voltage 70 eV; electron multiplier energy 2000 V. Helium was the carrier gas (1 ml/min). Column temperature was initially kept at 40 °C for 5 min, then gradually increased to 250 °C, at a rate of 2 °C/min, held for 15 min, and finally raised to 270 °C, at 10 °C/min. Transfer line temperature was 295 °C. Most constituents were identified by gas chromatography by comparison of their retention indices (I) with those from the literature (Hijaz et al. 2016) or with those of standard compounds. The retention indices were determined in relation to a homologous series of n-alkanes (C8–C32) under the same operating conditions. Further identification was made by comparison of their mass spectra with counterparts stored in NIST 02 and Wiley 275 libraries or with mass spectra from the literature (Sparkman 2005) and from a homemade library. Component relative concentrations were calculated based on GC peak areas, without using correction factors.

Polyphenol extraction

Freshly collected leaves (around 50 g) were lyophilized for 24 h and then extracted with 96% v/v ethanol in a Büchi B-811 extraction system, for 1 h. In order to remove lipophilic metabolites, ethanolic extracts were dried and fractioned with a Biotage Isolera One automatic MPLC system, equipped with a cartridge SNAP Ultra C18 (HP Sphere C18 25 µm, 120 g). Mobile phases were water (solvent A) and methanol (solvent B), which were used in a linear gradient from 0 to 100% of solvent B; eluates were monitored with an UV detector in the range 200–400 nm. Fractions containing polyphenols were lyophilized, weighted and stored at 4 °C until analysis. Samples for further HPLC analysis were prepared dissolving 1 mg of the obtained extract with methanol (1 ml).

Reversed-phase HPLC–ESI–Q-TOF–MS/MS analysis of polyphenols

HPLC analysis was performed with an 1260 Infinity system (Agilent) equipped with a reversed-phase Phenomenex Luna C18 (2) column (150 × 4.6 mm, particle size 3 µm) bearing a Phenomenex C18 security guard column (4 × 3 mm). The flow-rate was 0.5 ml/min and the column temperature was set to 30 °C. The eluents were 0.1% v/v formic acid (solvent A) and 0.1% v/v formic acid in acetonitrile (solvent B). The following gradient was used: 0–5 min, 5% solvent B isocratic; 5–15 min, linear gradient from 5 to 15% solvent B; 15–20 min, 15% solvent B isocratic; 20–25 min, linear gradient from 15 to 30% solvent B; 25–35 min, 30% solvent B isocratic; 35–45 min, washing and reconditioning of the column. Injection volume was 25 µl. The eluate was monitored through MS by measuring total ion current. Mass spectra were obtained on an Agilent 6540 UHD accurate-mass Q-TOF spectrometer equipped with a Dual AJS ESI source working in negative mode. N2 was used as desolvation gas at 300 °C and a flow rate of 8 l/min. The nebulizer was set to 45 psig. The sheath gas temperature was set at 400 °C and a flow of 12 l/min. A potential of 2.6 kV was applied to the capillary for negative ion mode. The fragmentor was set to 75 V. MS spectra were recorded in the range of m/z 150–1000; MS/MS spectra were recorded in negative mode with a collision energy of 50 V.

Leaf gas exchange measurements

Measurements of net photosynthesis (A, μmol m−2 s−1), leaf conductance (gs, mmol m−2 s−1), intercellular CO2 concentration (Ci, μmol m−1) and the ratio between internal and external CO2 concentration (Ci/Ca) were carried out using a portable infra-red gas analyzer (HCM-1000 Walz, Germany) under ambient light and CO2 conditions. Measurements were taken on five fully expanded mature leaves of three plants per genotype, between 09:30 and 11:30 h am. The incident photosynthetic photon flux density (PPFD) was 1500 μmol m−2 s−1, cuvette air temperature ranged from 28.2 to 30.1 °C and reference CO2 concentration was between 387 and 407 µmol mol−1. Gas exchange data were reported on a leaf area basis. The data were analysed by one-way ANOVA comparing among studied genotypes, and mean values were separated by Tukey’s honest significant difference (HSD) test (P < 0.05).

Results

Preliminary investigations towards comparative analyses

Molecular characterization by microsatellites confirmed that V + F and C2N were the hybrid and cybrid, respectively (Online Resource 3: Table S3), as reported also in previous studies (Grosser et al. 1996; Scarano et al. 2003). In addition, cytoplasmic markers distinguished parental genomes, highlighting that the chloroplasts and mitochondria were inherited from both parents in V + F and C2N genotypes. Eight protein extraction procedures based on existing methods (Puglia et al. 1995; Wang et al. 2006) with or without extensive modifications were also comparatively evaluated (Table S2), to optimize a suitable protocol for further proteomic analysis based on 2D-DIGE technology and compatible with subsequent silver staining for spot picking and MS-based protein identification. According to the experimental evidences described in Online Resource 4 (Fig. S1) and Online Resource 5 (Fig. S2), method n. 3 resulted the most effective and, therefore, was chosen to prepare leaf protein extracts for subsequent comparative proteomic analysis.

Comparative analysis of the leaf proteome from different citrus cultivars

In order to identify up- or down-regulated molecular pathways in the C2N genotype, a differential proteomic analysis was performed, combining 2D-DIGE and nanoLC–ESI–LIT–MS/MS experiments. Protein extracts from leaf biomass samples of C2N, V + F and the parental cultivars F and V were comparatively analysed, with the former used as pivotal condition. In order to obtain a simplified representation of the quantitative behaviour of all proteome patterns a heatmap clustering was generated using the mean abundance values of a total of 52 protein species with a P-value < 0.05 (ANOVA test) (Online Resource 6: Fig. S3). This analysis revealed three major clusters highlighting a distinct proteome profile of C2N genotype, clearly distinguishing it from its parents and the resultant allotetraploid somatic hybrid. Among the above described group of protein spots, those showing differential abundance in the C2N proteome were selected for protein identification as reported in Table 1 and in Online Resource 7 (Fig. S4). In particular, taking into count that some of the spots were shared between the three datasets of differentially represented protein species resulting from the three paired comparisons (C2N vs. F, C2N vs. C, and C2N vs. V + F), a total of 20 spots were univocally assigned to a protein identification with the following abundance profile: (i) 18 spots over-represented in C2N in comparison with F; (ii) 15 over-and 1 down-represented in the C2N in comparison with V; (iii) 17 over- and 1 down-represented in the C2N in comparison with V + F (Table 1). Based on putative function deduced after BLAST interrogations against KEGG2 database (Kanehisa and Goto 2000), the identified protein species were clustered into 6 functional groups, which included carbon (20%), energy (20%), nucleotide (15%), amino acid (10%) and protein (10%) metabolism, and components with unknown function (20%) (Table 1). Interestingly, 9 out 20 identified proteins were assigned to cytoplasmic organelles, i.e. mitochondria and chloroplasts. This analysis definitely revealed that the C2N proteome has a peculiar pattern, which highly differs from that of both parents and somatic hybrid. Indeed, except for a component involved in photosynthesis (spot 25), 2 species related to protein metabolism (spots 26 and 35) and 4 species with unknown function (spots 15, 17, 31 and 32), the other differentially represented components were up-regulated in C2N genotype, when compared to parents (Table 1; Online Resource 6: Fig. S3). Similarly, differentially represented proteins in the C2N genotype were always over-represented when compared to the allotetraploid somatic hybrid, with the exception of 3 (spot 31, 32 and 35) out of the above mentioned components (Table 1, Online Resource 6: Fig. S3). The overall protein accumulation pattern in C2N suggested a general induction of metabolic and molecular processes therein, which thereof implies a cybrid heterosis compared to its parents and the corresponding allotetraploid somatic hybrid. It is noteworthy that 8 proteins over-expressed in the C2N proteome profile, when compared to the V + F allotetraploid hybrid, are involved in carbon and energy metabolism (Table 1, Online Resource 6: Fig. S3). Among them, worth mentioning are: (i) ribulose bisphosphate carboxylase/oxygenase (rubisco) activase (RCA, spot 9), a key enzyme involved in photosynthesis and CO2 assimilation (Portis et al. 2008); (ii) photosystem II oxygen-evolving enhancer protein 2 (PsbP, spot 25); (iii) the plastidial glyceraldehyde-3-phosphate dehydrogenase A (GAPCp, spot 34), a key glycolytic enzyme; (iv) two subunits (δ and ε) of the ATP synthase complex (spot 24 and 29); (v) the mitochondrial malate dehydrogenase enzyme (mMDH, spot 10), involved in tricarboxylic acid (TCA) cycle. An accumulation pattern was also observed for peroxidase 3-like protein (spot 8) and two different 14-3-3-like proteins (spot 19 and 20).

Table 1 Differentially represented proteins in C2N as deduced by combined 2D-DIGE and nanoLC–ESI–LIT–MS/MS analyses

Analysis of volatile and phenolic compounds

Leaf headspace composition from each sample was evaluated through GC/MS analysis (Hijaz et al. 2016). Fourteen volatile compounds were identified (Table 2); among them, monoterpenic hydrocarbons were the main class, in which limonene was observed as the most abundant volatile constituent in C2N and V + F, while sabinene was the main metabolite in femminello and Valencia (Table 2; Online Resource 8: Table S4). Interestingly, normalizing with the amount of limonene identified in the Valencia genotype, femminello, V + F and C2N showed 2.9-, 6.9- and 11.2-fold higher production, respectively. Similarly, the analysis for sabinene showed a ratio of 1:2.5:3.3:3.4 for V, F, V + F and C2N, respectively (Online Resource 8: Table S4). The distribution of sabinene and limonene content in each genotype and the corresponding ratio are reported in Fig. 1.

Table 2 Composition of the leaf headspace and relative amount (peak area %) of 14 volatile compounds detected in each analysed genotype
Fig. 1
figure 1

a Distribution of principal metabolites (sabinene and limonene) in the four genotypes reported in this study, as deduced by GC/MS leaf’s headspace analysis. The values are expressed in peak area %. b Relative amount (peak area/mg of leaf) of principal metabolites (sabinene and limonene) in femminello (F), V + F and C2N, as normalized to that of Valencia (V) (see Online Resource 6: Table S4)

Concerning the corresponding polyphenolic profile, the ethanolic extracts from lyophilized leaves were also analysed; they corresponded to 2.42, 6.38, 1.33 and 0.69% w/leaf FW for V, F, V + F and C2N, respectively. A quali-quantitative characterization of polyphenolic derivatives was performed through HPLC–ESI–Q-TOF–MS/MS (Brito et al. 2014). Fifteen compounds were detected, mainly di-glycosilated flavones and flavanones (Table 3). Although a lower amount of the ethanolic extract was measured for genotypes obtained by protoplast fusion (V + F and C2N), when compared to that of parents (V and F), the former ones presented the most variable profile from a qualitative point of view. In addition, V + F hybrid showed a quite lower flavones/flavanones ratio than the other three genotypes.

Table 3 Composition of the ethanolic extract from the leaves of each genotype

Gas exchange

To assess the effects of the cytoplasmic genome rearrangement into C2N genotype and to validate the above-mentioned biochemical evidences according to a cumulative physiological perspective, leaf gas exchange and net photosynthesis were also evaluated in all genotypes. The most relevant difference was found in the stomatal conductance of the regenerated C2N plants, which was significantly higher than that of the other genotypes (Fig. 2a). Indeed, gs in C2N leaves was more than twice that of the leaf parent F and 60% greater than the somatic hybrid (V + F). Correspondingly, internal CO2 concentration (Ci) was highest in C2N (274 µbar), 40% greater than in V + F, and the lowest Ci was measured in F (144 µbar) (Fig. 2b). The [CO2] increase in C2N resulted in a higher rate of photosynthesis (A) therein, when compared to the other genotypes (Fig. 2c), though this increase was not significant, and the lowest average value of A (2.8 µmol m−2 s−1) was found in the F parent (Fig. 2c). Finally, we also evaluated the ratio between internal and external CO2 concentration (Ci/Ca), which represents the balance between diffusion of CO2 into the leaf and controlled by photosynthetic reactions (Ehleringer and Cerling 1995). The C2N genotype showed the greatest value of Ci/Ca, significantly higher than that of F and V + F genotypes (Fig. 2d).

Fig. 2
figure 2

Leaf gas exchange parameters. a Stomatal conductance to water vapour (gs); b intercellular CO2 concentration (Ci); c photosynthetic rate (A); d Ci/Ca ratio measured in the studied samples. Values represent average measurements ± SD. Bars sharing the same letter are not significantly different according to Tukey’s HSD test (P < 0.05)

Discussion

The generation of new nuclear-cytoplasmic combinations following cybridization is an important tool plant improvement that can also inform understanding of cyto-nuclear evolutionary dynamics (Greiner and Bock 2013). Indeed, since the nucleus encodes organelle-destined proteins for the control of subcellular districts and, in return, organelles send signals to the nucleus to coordinate nuclear and organellar activities (Woodson and Chory 2008), the cybrids obtained from somatic hybridization may represent a new technological tool for transferring organelle-encoded traits to cultivars. It is well known that somatic hybridization often results in novel mitochondrial and/or chloroplast genome arrangements (Belliard et al. 1979; Vedel et al. 1986; Dambier et al. 2011; Aleza et al. 2016; Fatta Del Bosco et al. 2017). Therefore, protoplast fusion leads to de novo nuclear and cytoplasmic genome combinations and to fertile and functional hybrid development (Tusa et al. 2000; Dambier et al. 2011; Xiao et al. 2014; Fang et al. 2016).

The present study has been conducted in attempt to advance our understanding of the role that cybridization plays in citrus breeding. Different studies examined the effect of cybridization on citrus phenotype, including changes to leaf volatile compounds (Fanciullino et al. 2005), fruit quality (Bassene et al. 2008) and organoleptic characteristics (Satpute et al. 2015), resistance to citrus “malsecco” disease caused by Phoma tracheiphila (Tusa et al. 2000), alteration in photosynthesis and stress resistance (Wang et al. 2010), altered gene expression related to floral organ development (Zheng et al. 2012), modifications of carbohydrate metabolism pathway and mitochondrial proteins in a male sterile cybrid (Zheng et al. 2014). A previous study (Bassene et al. 2011) reported a genome-wide gene expression analysis in leaves of a citrus cybrid and established that mitochondrial replacement affected the expression of different nuclear genes, including particularly genes predicted to be involved in mitochondrial retrograde signaling.

In this study, we have originally investigated the object of a plant cybridization process by using an integrated approach, which allowed comparatively evaluating at molecular and physiological level a citrus diploid cybrid isolated through a selective breeding program (Tusa et al. 2000; Scarano et al. 2003; Abbate et al. 2012; Fatta del Bosco et al. 2013), its corresponding parents and an allotetraploid somatic hybrid obtained from the same interspecific fusion combination.

With the aim of elucidating molecular and metabolic pathways differentially regulated in the leaves of selected genotypes we performed a comparative proteomic analysis. Firstly, we established a method for protein extraction suitable for 2D-DIGE analysis. Citrus leaf tissue contains low amounts of proteins and high amounts of other components, such as lipids, organic acids and phenols, which vary in a specie-specific way and which may interfere with subsequent protein separation and analysis (Saravanan and Rose 2004; Audrius and Andrew 2005; Wang et al. 2006). Therefore, an ad hoc protocol was necessary. On a whole, our testing revealed that three key steps are crucial for protein recovery and removal of interfering compounds from citrus leaves: (i) three washing treatments using organic solvents, (ii) sonication in a SDS-containing buffer, with subsequent phenol/chloroform/isoamyl alcohol treatment; (iii) protein precipitation by methanol/ammonium acetate. Thus, we successfully identified 20 differentially represented proteins in the C2N genotype, nine of which are products of mitochondrial and chloroplast genomes. Hierarchical cluster analysis according to protein expression patterns showed a clear separation between C2N and other studied genotypes, underlining a private cluster for cybrid fully detached from both its parents and the somatic hybrid. Overall, our differential proteomic analysis suggests improved capabilities in terms of processes related to photosynthesis, energy metabolism and stress tolerance of the citrus cybrid, when compared to parental and somatic tetraploid hybrid genotypes as consequence of the improvement of the following molecular and biochemical processes: (i) CO2 assimilation due to RCA activity (Crafts-Brandner and Salvucci 2000; Pollock et al. 2003; Salvucci 2008; Hozain et al. 2010); (ii) light-driven oxidation of water to molecular oxygen based on the augmented levels of PsbP—which plays a crucial role in enhancing O2 generation in the oxygen-evolving complex of photosystem II (Roose et al. 2010); (iii) glycolytic capability as inferred by the augmented levels of GAPCp—an important connector between glycolysis and serine biosynthesis, ammonium assimilation and γ-aminobutyrate metabolism (Anoman et al. 2015) which has also a role in plant development (Muñoz-Bertomeu et al. 2010, 2011); (iv) reduced cofactor and ATP biosynthesis, as inferred by the augmented levels of the mMDH TCA cycle enzyme (Alqurashi et al. 2016) and chloroplastic ATP synthase complex components, which may have also a beneficial role in energetic homeostasis in consequence of biotic or abiotic stresses (Yamori et al. 2011; Rott et al. 2011; Liu et al. 2014).

On the other hand, other proteins showing increased levels in cybrid leaves, such as 14-3-3-like proteins, have already been described as ubiquitous regulators (DeLille et al. 2001) which act in the cytoplasm (Bihn et al. 1997), in the chloroplast (Sehnke and Ferl 2000), in the nucleus (Bihn et al. 1997) and in mitochondria (Sehnke et al. 2000) completing signal transduction processes in many critical physiological pathways such as: nitrogen and carbon assimilation pathways (Sehnke et al. 2000); plant cell turgor pressure (Korthout and Boer 1994; Oecking et al. 1994); the activity of the enzymes starch synthase (Sehnke et al. 2001), glutamic acid synthase, F1 ATP synthase, ascorbate peroxidase, and O-methyl transferase (Finnie et al. 1999).

Although further studies are needed to verify the hypotheses of beneficial effects on different physiological aspects of the activated pathways, the evidences here obtained through biochemical and physiological (see below) experiments are suggestive of the main molecular and macroscopic mechanisms underlying the peculiar C2N behaviour. In this context, it has to be considered that plants produce a wide spectrum of biogenic volatile organic compounds (BVOCs), such as limonene, in various tissues; their emission varies widely depending on the species, organ, developmental stage and environmental conditions. BVOCs have important roles in plant adaptation and communication. For example, BVOCs have been reported to face the negative effects of biotic/abiotic stresses in an unanticipated, new mode of defence (Holopainen and Gershenzon 2010). In this light, it is noteworthy that a correlation between photosynthetic capability and terpene biosynthesis was already established (Penuelas and Munne-Bosch 2005; Loreto and Schnitzler 2010; Tezara et al. 2014). Our results demonstrate a higher production of limonene in C2N, when compared to that of parent and somatic hybrid genotypes. Since terpene production in the leaf was already associated with photosynthetic parameters, and a high photosynthetic capacity was related to high terpene content (Tezara et al. 2014), the higher concentration of limonene observed in the cybrid was here hypothetically related to its improved photosynthetic capacity and to its tolerance to stresses, based on proteomic assessment. Indeed, isoprenoids can improve plasticity in photo-protection and can help plants to survive under adverse climatic conditions. The protective role of some isoprenoids under non-photorespiratory conditions and their location in chloroplasts further underline their role in photoprotection and stress tolerance (Penuelas and Llusià 2002). Accordingly, these compounds might be of particular relevance in plant adaptation under adverse climatic conditions; thus, a comparative evaluation of their concentration may be useful in order to identify plants with a selective advantage for growing in contrasting habitats.

The hypothesized improved photosynthetic capability of the citrus cybrid, as based on proteomic experiments, was further assayed by dedicated physiological measurements, which were also performed in comparison with V, F and V + F genotypes. In this context, it is generally accepted that stomatal conductance (gs) is an important factor controlling photosynthetic rate and related with the required mesophyll [CO2], maintaining the Ci/Ca ratio at a constant value (Wong et al. 1979; Sharkey and Raschke 1981; Kusumi et al. 2012). A high Ci/Ca ratio associated to greater net photosynthesis (A) was recorded in the C2N genotype, suggesting that the higher assimilation rates were due to greater gs, which can enhance CO2 diffusion into leaf and chloroplasts (Kusumi et al. 2012). A low value of Ci/Ca ratio, on the other hand, can result from low gs levels, high photosynthetic capacity per unit leaf area, or a combination of both factors (Farquhar et al. 1989; Condon et al. 2004; Miner et al. 2017). High leaf stomatal conductance not only allows greater diffusion of CO2 within the leaf, but allows high levels of evapotranspiration that lower leaf temperature under well-watered conditions. Cooler leaves avoid the effects of high temperature on the activity of enzymes involved in carbon fixation and reduce photorespiration. When leaf temperatures become supra-optimal, photosynthetic rates are reduced (Prior et al. 1997; Long and Ort 2010; Yamori et al. 2014). In this perspective, leaves of C2N were significantly cooler than V + F ones (data not shown).

The proteomic comparison reported in this study also highlighted some molecular effectors that may explain the eventual performances of the citrus cybrid in facing biotic stresses and in tolerating Phoma tracheiphila infection (Tusa et al. 2000). In this context, worth mentioning was the over-expression in C2N proteome of peroxidase 3-like protein, an enzyme that was already associated with the biosynthesis of syringyl lignin. Accordingly, specific tolerance mechanisms to Phoma may be activated in the cybrid due to the increase of lignin biosynthesis. Actually, it has been reported that lignin plays a crucial role in the disease tolerance of different species (Menden et al. 2007; Gayoso et al. 2010; Tronchet et al. 2010; Xu et al. 2011; Skyba et al. 2013). Indeed, a defense-induced enrichment of lignin has already been observed in wheat (Menden et al. 2007), tomato (Gayoso et al. 2010) and cotton (Xu et al. 2011). This phenomenon was related to a triggering of the activities of peroxidase and phenylalanine-ammonia lyase (PAL); the latter enzyme is involved in phenylpropanoid metabolism and biosynthesis of lignin (Huang et al. 2010; Xu et al. 2011). In addition, lignin content contributes to mechanical support and water transport in the leaf and can play an important role in drought tolerance. Carins Murphy et al. (2016) have highlighted the relationship between leaf lignin content and minor vein volume, showing that a substantial proportion of leaf tissue dedicated to veins ensures that the investment in the vascular system is appropriate for the photosynthetic capacity of the leaf. Genetically modified poplars with low lignin content were found to have lower gs values, lower hydraulic conductivity and greater vulnerability to cavitation (Coleman et al. 2008; Voelker et al. 2011). Since the rigidity and compression resistance of xylem elements are due to lignin deposition, which has been shown to increase under drought conditions (Lee et al. 2007), higher lignin content could increase hydraulic conductivity of the leaves and allow the tolerance of higher tensions within veins, a trait that is particularly relevant to maintain water transport under high evaporative demand. In low-lignin transgenic poplars it was observed a significant decrease in photosynthesis, which could be explained by a negative feedback mechanism due to the accumulation of carbohydrates not removed by lignin biosynthesis (Coleman et al. 2008; Marchin et al. 2017). Furthermore, lignin deficiency was found to interfere with the temperature-response of photosynthesis (Marchin et al. 2017).

Conclusion

This work illustrates that protein-, metabolite- and physiological-oriented approaches can be a valuable tool to investigate the impact of genome rearrangement in the genotypes obtained by protoplast fusion. These combined data provide a comprehensive picture of the molecular events characterizing the regenerated genotype and opens avenues to better understand the mechanisms underlying metabolic capacity regulation, such as oxidation process and photosynthetic capacity. Overall, the outcome of this study suggests novel insights into cellular processes that are regulated by the integration of different molecular signals from compartmentalized genomes. The knowledge of the way in which organellar activity is perceived by the nucleus and coordinated with the rest of cellular function, together with the targeted use of cybridization, may have a deep impact in citrus breeding. Primary objectives for citrus genetic improvement programs might be achieved developing alloplasmic cybrid plants capable to express specific nuclear genetic background in combination with useful organelle genomes related traits.