Introduction

Nature has bestowed on mankind a vast and diversified plant community, which unfortunately is currently exploited at an alarming rate (Chen et al. 2016). A lot of measures are being taken up nationally and internationally to conserve plant diversity and sustainably use plant resources for the welfare of mankind. One of the strategies for plant diversity conservation is the in vitro production of plants or plant parts to meet both the goals of plant propagation and their commercial utilization. Tissue culture technology has been very promising in fulfilling the goal of in vitro propagation and conservation of elite plant germplasm. This has only been possible after the discovery of totipotency of plant cells and associated process such as somatic embryogenesis. SE is an intricate in vitro multi-step biotechnological tool used to develop embryos/plants from a single or a group of somatic cells. The process has been utilized to address various problems and helps shortening the breeding period, synthetic seed development, wide hybridization via embryo rescue, scale up plant propagation etc (Lu et al. 2017).

Plant growth regulators (PGRs) play a very significant role in the azygotic embryogenesis process. Different PGRs are used depending on the nature of plant and the explants used. Among PGRs, auxin or cytokinin or both may be added to the culture medium (Imin et al. 2005; Nolan et al. 2003). Auxin is essentially used for the induction of SE and its concentration is subsequently lowered or completely omitted for further development of embryos. However, in C. roseus, auxin is critical for callus induction while the combinations of BAP and NAA are effective in inducing embryos (Mujib et al. 2014).

Somatic embryogenesis has been reported in a vast array of plants of diverse plant groups including ferns. Out of such vast plant diversity, an immensely important medicinal plant, C. roseus was selected for present study. The plant produces different alkaloids, the most important being vincristine and vinblastine, both having anti-cancerous properties. In SE transition of cells from non embryogenic to embryogenic state undergo an array of morphogical, biochemical and molecular changes, often triggered by stressful in vitro micro-environment (Schmidt et al. 1997; Feher 2015). Study of these morphological, biochemical and molecular changes is important for elucidating and understanding the molecular mechanism underlying the SE (Mordhorst et al. 1997; Mahdavi-Darvari et al. 2014; Heringer et al. 2015). SE is extensively studied at molecular level (Rupps et al. 2016; Orłowska et al. 2017) and several genes and proteins have been reported to take part in the process (Lu et al. 2017) but, still it is the least understood process (Feher 2015). Various tools and techniques are used to identify embryo specific proteins. One of the most reliable and sensitive tool that is currently used for the identification and quantification of proteins is the gel free based proteome analysis via liquid chromatography–mass spectroscopy (Karpievitch et al. 2010). Gel free proteomic approach is much more reliable and could identify those proteins that have low abundance and cannot be identified by gel-based techniques (Washburn et al. 2001). LC–MS is the bottom up approach in which the intricate protein mixtures are digested enzymatically, generating fragments of peptides which are ionized by different methods, identified on the basis of mass/charge (m/z) ratio, subsequently rolled up to proteins and further identification and quantification is made by comparing the ionized fragments with already identified proteins using various softwares like SEQUEST (Eng et al. 1994; Karpievitch et al. 2010). The present study was conducted to make a proteomic map of SE and to differentiate embryogenic callus from non-embryogenic tissue based on protein profiles in C. roseus. The study identifies a number of proteins that are exclusively present in the embryogenic calli most of which are stress related proteins. The present study shows stress as key player in the acquisition of embryogenic competence by non embryogenic/somatic cells.

Materials and methods

Seed germination and establishment of culture

The seeds of Catharanthus roseus (L.) G. Don were collected from the herbal garden of JamiaHamdard, New Delhi. The seeds were surface sterilized and placed in conical flask containing 50 ml solidified MS (Murashige and Skoog 1962), medium without any plant growth regulator (Junaid et al. 2008; Mujib et al. 2014). Seedlings were allowed to grow and various explants (leaf, hypocotyl and stem) were taken and inoculated in test tubes containing 8.0 ml solidified MS supplemented with 0.5, 1.0, 1.5,2.0, 2.5, and 3.0 mg/l 2,4-dichlorophenoxyacetic acid (2,4-D) for callus. After a few weeks, the induced callus of each explant was separately sub cultured in the same medium for proliferation. The calli obtained from different explants were transferred separately to MS, supplemented with various concentrations and combinations of N6-benzyladenine (0.5, 1.0, 1.5 and 2.0 mg/l) + naphthalene acetic acid (0.5, 1.0, 1.5 and 2.0 mg/l) Within 2–3 weeks, hypocotyl-induced embryogenic callus produced somatic embryos only while callus from other explants (leaf and stem) were observed to be non-embryogenic (no embryo formed) therefore was discontinued for protein study. The combination of 2.0 mg/l BAP and 1.5 mg/l NAA was noted to be the best treatment for somatic embryo induction. Repeated sub culturing of embryo bearing callus was conducted for subsequent embryo proliferation process. Some of the embryogenic calli with developed embryos were transferred to the MS medium added with 2.0 mg/l GA3 (gibberellic acid) for embryo maturation.

Protein extraction from tissues

The working samples of embryogenic and non embryogenic calli from hypocotyl were drawn separately for the extraction of total protein. The Isaacson et al. (2006) protocol (phenol method) was followed for the extraction of protein. Two and a half (2.5) grams of callus (pooled from three to five test tubes) of each (embryogenic and non embryogenic) was crushed to fine powder in liquid nitrogen using pre-chilled mortar pestle. The callus powder was suspended in 10 ml solution of phenylmethane sulfonylfluoride (PMSF), 4-(2-hydroxyethyl)-l-piperazine ethane sulfonic acid (HEPES), β-mercaptoethanol and sucrose containing extraction buffer in the Oak Ridge tubes. The solution was vortexed for 1.5 min and 15 ml of phenol was added to the solution and vortexed for another 1.5 min. Oak Ridge tubes containing the solution were sealed and covered with ice in tray for incubation on orbital shaker for 30 min in order to mix the solution. The solution was centrifuged at 5500 rpm at 4 °C for 10 min. The upper phenolic phase was carefully transferred to the separate tubes; 15 ml of ice-cold ammonium-acetate 0.1 M solution was added and incubated overnight at − 20 °C for protein precipitation. The proteins were pelleted by centrifuging at 10,000 rpm for 15 min at 4 °C. The pellet was washed thrice, first by ice cold methanol, incubated for 1 h. After centrifugation at 10,000 rpm for 10 min at 4 °C the pellet was resuspended in ice cold acetone and incubated for 1 h. The protein was again pelleted at 10,000 rpm for 10 min at 4 °C and the same step was repeated. Pellet obtained was carefully transferred to fresh 2 ml Eppendorf tubes and then solubilized in 6 M Guanidium chloride, dissolved in 0.1 M Tris–HClwith pH 8.5. The Guanidium chloride solution was boiled at 100 °C for 10 min later 200 µl solution was added to each 2 ml Eppendorf tube containing pellet and was vortexed for 2 min. The sample (extracted proteins) was boiled for 5 min and was again vortexed for complete solubilization and then centrifuged at 10,000 rpm for 10 min for pelleting out the debris. The supernatant (upper liquid phase) was transferred to fresh centrifuge tube and stored at − 80 °C.

Sample preparation for LC–MS

Sample was heated at 90 °C for 10 min for better lysis and then precipitated by trichloroacetic acid or TCA acetone. Precipitate was obtained, washed 3 times with cold acetone and was directly solubilized in 0.1% Rapigest (water) in 50 mM ammonium bicarbonate (ABC) (pH ~ 7). Reduction of samples was done using 5 mM Tris (2-carboxyethyl) phosphine [TCEP] (sigma) in 25 mM ABC (pH ~ 7) at 37 °C for 30 min and were alkylated using 55 mM iodoacetamide (IAA) (Sigma) in 25 mM ABC (pH ~ 7) at room temperature in dark for 30 min. Trypsin (Promega) was added in the ratio of 1:50 for overnight digestion at 37 °C. Rapigest was removed by spining at 13,400 rpm for 10 min (pellet was discarded); supernatant was collected in another tube for desalting column. Desalting of samples was done according to manufacturer’s protocol (https://assets.thermofisher.com/TFSAssets/LSG/manuals/MAN0011495_Pierce_C18_SpinCol_UG.pdf). The samples were vacuum dried by speedvac and finally reconstituted in 0.1% Formic Acid, FA. About ~ 3 µg amount of peptide was loaded on the column [PepMap RSLC C18 2 µm × 50 cm (Thermo scientific)]. The column temperature was set at 40 °C. The general reverse phase nano-Liquid Chromatography (Thermo Scientific Easy-nLC 1200) setting (Table 1) was used and the optimized LC program is given below:

Table 1 The total time, duration, percentage of B solvent and flow (nl/min) of LC program

Mobile phases for two solvent gradient elution were: A (water + 0.1% Formic acid) and B (acetonitrile + 0.1% formic acid). All solvents were of LC–MS grade (Fischer Scientific), Lock mass of 445.12003 Da was used for internal calibration. The Mass Spectroscopy, MS (Thermo Scientific Q Exactive Orbitrap), run time was 0–123 min, polarity was positive, default charge state was 2 with MS (MS1) setting as microscan 1, resolution was 70,000, automatic gain control (AGC) 3e6, maximum IT (ion transfer) time was 60 mS, number of scan range was 1, scan ranges were 350–2000 m/z and spectrum was obtained in profile mode. MS MS (MS2) setting like microscan was 1, resolution 17,500, AGC 1e5, maximum IT (ion transfer) 120 mS, loop count 10, maximum number of precursor to be plexed in single event was 1, isolation window 1.5 m/z, isolation offset was 0.0 m/z, fixed first mass 100 m/z, normalized collision energy 27, and the data dependent settings as minimum automatic gain control (AGC) target was 1.00 e2, charge exclusion was unassigned and dynamic exclusion time was 50.0 S.

About ~ 3 µg of sample was loaded on C18 reverse phase column, obtained raw data was finally analyzed in Proteome Discoverer 2.2 (Thermo Scientific).The databases were directly downloaded from http://www.uniprot.org in FASTA format. The maximum allowed missed cleavage was 3, minimum peptide length for search was 4, maximum peptide length was 144, precursor mass tolerance 10 ppm, fragment mass tolerance 0.05 Da, dynamic modification—oxidation of methionine and acetylation at N terminus of proteins, static modification was carbamido methylation, target FDR (false discovery rate) 0.01 (for decoy database search) and validation was based on q-value.

Results and discussion

On inoculation in solid MS medium added with various 2,4-D concentrations (0.5, 1.0, 1.5, 2.0, 2.5, and 3.0 mg/l) more than 90% of the explants responded in culture and produced whitish callus within 4–5 weeks. Of all the tested concentrations, 2.0 mg/l was observed to be the most effective dose for callus induction. The induced callus from different explants was separately sub cultured and placed into test tubes containing 8–10 ml MS medium for callus proliferation. After 3–4 weeks, significant amount of new callus was observed with increased (more than 2.0 g of callus) biomass (Fig. 1a, c). Callus from each source was separately transferred into two solid MS media, one supplemented with same PGR (2.0 mg/l 2,4-D) for the maintenance of callus and the other was amended with 2.0 mg/l BAP and 1.5 mg/l NAA for induction of somatic embryos. Hypocotyl-induced callus turned into embryogenic calli while calli obtained from leaf and stem became non embryogenic in nature. Within 3–4 weeks, globular embryos were formed (Fig. 1b); later other developmental stages of embryos were induced from hypocotyl embryogenic calli. Some of the embryogenic calli with developing embryos were transferred to solid MS, amended with 2.0 mg/l GA3 for embryo maturation (Fig. 1d).

Total protein was extracted separately from embryogenic callus designated henceforth as F1 and non embryogenic callus designated as F2, were subject to LC–MS analysis. Proteins were identified and quantified using Protein Discoverer software. A total of 3573 proteins were identified among which 1511 proteins were present or common in both F1 and F2. LC-MS analysis revealed that 1079 proteins were exclusively found in embryogenic callus, F1 and 982 proteins were present only in F2 i.e. non embryogenic callus (Fig. 2). The overall abundance of proteins was found high in F1 (Fig. 3).

Fig. 1
figure 1

a Hypocotyl derived callus in MS medium supplemented with 2,4-D b the induction of somatic embryos (globular stage) in MS medium supplemented with BAP and NAA. c Non embryogenic callus derived from leaf explant in MS medium supplemented with 2,4-D and d matured embryo under the effect of GA3

Fig. 2
figure 2

Venn diagram showing common and exclusive proteins of embryogenic (F1) and non embryogenic (F2) tissue; 1079 proteins were exclusively identified in F1, 938 in F2 and 1511 in both

Fig. 3
figure 3

High abundances of proteins in embryogenic (F1) callus than non embryogenic (F2) tissue

The embryogenic cells are metabolically highly active as the somatic cells while attaining embryogenic competence undergo thousands of physiological and biochemical alterations (Fehér et al. 2003; Jiménez 2005). In our study, we observed that among the core exclusive proteins in F1, 732 proteins are associated with metabolic processes while in F2 the number of proteins was only 685 indicating that the embryogenic cells are intensely active. This observation is very consistent with previous reports that suggest differentially expressed proteins are mostly related to metabolic processes in embryogenic cells (Heringer et al. 2015, 2018). The presence of dense cytoplasm, prominent nucleus, fragmented vacuole, highly active nucleus with large nucleolus, high nucleus to cytoplasm ratio and low level of hetrochromatin are some of those features confirming active embryogenic nature in other studies (Verdeil et al. 2007; Campos et al. 2017). Among the exclusive proteins found in F1, a large number was noted to regulate carbohydrate, protein and lipid metabolism. A significant number of proteins seem to be associated with transcription and translation, DNA and RNA binding, ribosome, transport and other related physiological processes. A classification of F1 exclusive protein list based on molecular function is given in Fig. 4, and also a classification on the basis of the biological processes is given in Fig. 5. The house keeping proteins of both F1 and F2 were almost the same and are not discussed here. The proteins exclusively present or expressed differentially were the main focus of this study. Out of core 1079 proteins found exclusively in the F1, 210 proteins are present in cytoplasm, 128 associated with cellular and organelle membranes and 55 proteins were found to be present in nucleus (Fig. 6). Five isoforms of ATP synthase subunit alpha (fragment), three ATP synthase subunit beta, one ATP synthase subunit d (mitochondrial), ATPase ASNA1 homolog, ATPase subunit 4, ATPase (AAA-type, CDC48 protein), and ATP/GTP-binding family proteins were identified exclusively in F1 suggesting the intense energy metabolism or very high requirement of ATP synthesis in order to fulfill the demand of energy in highly active cells. This observation was further supported by finding of translation related proteins in significant number in F1. In this study, seven 40S ribosomal proteins/isoforms, three 60S acidic ribosomal protein, two 60S ribosomal protein L6, one 60S ribosomal L6-like protein, one 60S ribosomal protein L26-1, and six other ribosomal proteins/isoforms were identified. Different translation factors like eukaryotic translation initiation factor 2c, six other proteins, indicating intense protein synthesis activity and high energy demand in F1. High protein synthesis may necessitate the need of higher number of different Heat shock Proteins (HSP) that assist in proper folding of the newly synthesized proteins. Nearly 12 isoforms of HSP70 and HSP90, six other chaparones and chaparonins were also found. Intense protein synthesis was accompanied with a large number of transport proteins suggesting high transportation of different micro and macro-molecules within and outside cells. One of these transporter proteins was a nuclear transport protein, importin, that has been earlier reported to be differentially expressed in embryogenic cells (Zhao et al. 2015). Here, in this study, two AP-2 complex subunit alpha were exclusively present in F1; AP-2 subunit alpha is a subunit of adaptor protein complex 2 (AP-2). Adaptor protein complexes are associated with vesicular transport of proteins in different membrane traffic pathways, play important role in cargo selection and vesicle formation in clathrin- dependent endocytosis (uniprot.org). Three clathrin heavy chains were also identified exclusively in embryogenic callus suggesting the cells are actively engaged in protein trafficking. As proteins are intensely synthesized in highly active cells, the older proteins need to be degraded and constituents recycled (Heringer et al. 2018), which is carried out by proteasome and ubiquitin (Guerra and Callis 2012). In this present investigation, 18 recycling and degradation proteins or isoforms of proteasome (16) and ubiquitin (2) were present exclusively in F1 (Table 2).

Fig. 4
figure 4

Molecular function wise classification of 1079 proteins exclusively found in embryogenic callus (F1). Y-axis shows the number of proteins and X-axis the molecular function. It may be noted that some proteins have more than one molecular function and calculated multiple times and hence the sum of all categories would not be equal to 1079

Fig. 5
figure 5

Biological process-wise classification of 1079 proteins found exclusively in embryogenic callus. Y-axis represents the number of proteins and X-axis represents biological processes. Some of the proteins are involved in two or more biological processes, calculated multiple times and hence the sum would not add up to total 1079

Fig. 6
figure 6

Cellular location of 1079 proteins exclusively found in F1. In the figure ‘Others’ represent the proteins whose cellular location is not assigned, includes 669 proteins. Some of the proteins are present in more than one component

Table 2 Some of the proteins/isoforms exclusively found in F1 i.e. embryogenic tissue of Catharanthus roseus

Stress related protein accumulation

As described earlier, stress has a very important role in the acquisition of embryogenic competence by non embryogenic or somatic cells in plants. Various stress responsive proteins have been reported to accumulate during SE. Glutathione S-transferase (GST) for example, is induced in embryogenic cells of several investigated plants like Cyclamen persicume (Hoenemann et al. 2012), Picea asperata (Jing et al. 2016), Zea mays (Ge et al. 2017) etc. GST plays significant role in various developmental processes including detoxification of xenobiotics, abiotic, disease and environmental stress tolerance. In C. roseus F1, more than 72 stress responsive proteins/isoforms were observed exclusively e.g.ascorbate peroxidase, catalase, glutathione S transferase, alkylhydro peroxidase, superoxide dismutase, SOD Fe N domain containing protein, pyridine nucleotide disulphideoxidoreductase, thioredoxinreductase (Table 2) etc. The presence of vast array of stress related proteins in F1 and fewer in F2 suggests that rigorous changes in molecular or biochemical environment may be the reason for the paradigm shift in morphogenetic route. This observation is getting support from previous reports, suggesting stress as a main cause to switch on/off of thousands of genes (Feher 2015; Horstman et al. 2017), cause a wide array of morphological, biochemical and molecular changes, resulting in acquisition of embryogenic competence of somatic cells (Pulianmackal et al. 2014).

In Catharanthus, we identified several isoforms of 14-3-3 protein in both embryogenic (F1) and non embryogenic (F2) tissues, many of them were exclusively found in embryogenic callus (F1) (Table 2) and only two isoforms in the non embryogenic callus. This shows its possible role in SE. Therefore, our results are consistent with the reports of the presence of 14-3-3 proteins in embryogenic cells in different plants like Saccharum (Heringer et al. 2015, 2017), Larix principisrupprechtii (Zhao et al. 2015), Zea mays (Ge et al. 2017) Araucaria angustifolia (Fraga et al. 2016). Earlier reports indicated 14-3-3 proteins’ role in plant immunity, stress responses, metabolism, transcription, signal transduction, cell cycle control, programmed cell death and protein trafficking, beside SE (Oh 2010). These are regulatory, acidic proteins, binding in phosphorylation dependent manner to target proteins like phosphothreonine and phosphoserine and thus have significant role in plant growth and development.

Proteomic analysis of Catharanthus revealed the presence of Heat Shock protein 70 (HSP70), Heat Shock protein 90 (HSP 90) and their several isoforms (more than 23) in both the types of calli i.e. F1 and F2. Fourteen HSPs were found exclusively in F1 (Table 2), and Supplementary Table 1 enlists accession numbers and other details of proteins found exclusively in the F1 callus. The significant number of HSPs in F1 suggests their possible role in induction of SE. Rest of the HSPs were expressed either in both or exclusively in F2 callus but the number was less than that of F1. A variety of heat shock proteins and their isoforms were previously reported to be abundantly accumulated in embryogenic cells of different plant genera e.g. Saccharum (Heringer et al. 2015), Larix principisrupprechtii (Zhao et al. 2015), Araucaria angustifolia (Fraga et al. 2016). These proteins accumulate abundantly under different stress conditions and their role is essentially in proper folding of other proteins (Park and Seo 2015).

In this Catharanthus study, two Late embryogenesis abundant (LEA) proteins, Late embryogenesis abundant protein accession number A0A072UGZ8, and Dehydrin accession number A0A1U9Y619; and other related proteins like Ca2+ binding, DNA binding and repair etc. were identified exclusively in F1; confirming that these proteins play a very important role in embryo induction and development during SE process. Interestingly, these proteins were observed to be accumulated early i.e. in globular stage of embryogenic callus. LEA, a large class of hydrophilic proteins however, was previously reported to abundantly accumulate at late stages of zygotic embryo development (Ikeda et al. 2006). Although the exact function of such a large class of proteins is still unknown, it is now known that these proteins protect the cellular damages during abiotic stress (Tolleter et al. 2010; Hincha and Thalhammer 2012), accumulate late during embryo development and are also induced during in vitro embryogenesis (Ikeda et al. 2006).

Proteomic study of Catharanthus tissue showed pathogenesis related proteins (PR),PR-4B accession number P29063, two chitinases, accession numbers G7IDV3 and Q9FS45, osmotin-like protein (accession number Q2MJK2) were present only in F1 (Table 2, Supplementary Table 1). These results are in agreements with previous report of the presence or accumulation of pathogen related proteins present in Cichorium embryogenic cells in high abundance as compared to non embryogenic callus (Helleboid et al. 2000). Presence of several PRs in embryogenic callus indicate that the cells are under stress and need various strategies for stress tolerance and protection from cellular damage, for which cells may have recruited the PR proteins as one way to get endurance under such conditions. PR accumulation occurs under various stresses like salicylic acid, UV light, heat, cold, wounding, heavy metal, hormone e.g. ethylene (Helleboid et al. 2000). Hence PR proteins appear to be involved in equipping the cells with endurance under stress condition.

Out of 1079 proteins exclusively found in F1 embryogenic callus, 336 proteins were uncharacterized and their function is unknown. Presence in high number of these proteins suggests their possible role in SE, but unfortunately, more information regarding these proteins is still to come. More than 334 proteins have abundance twice or more in F1 than in F2. A list of these protein accession numbers, abundance ratio of F1/F2 in decreasing order and other parameters is attached (Supplementary Table 2). Increased abundances of the 334 proteins in embryogenic callus twice or more as found in non embryogenic callus, shows their possible role the attainment of embryogenic competence. The maximum proteins among the 334 enlisted are related to energy metabolism, protein synthesis and stress related which further confirms the active nature of embryogenic cells, high ATP/energy demand and stress as the key player in attainment of embryogenic potential.

It is important to mention here that all culture conditions like nutrient concentration, light, temperature, humidity etc. were the same for both embryogenic and non embryogenic callus except the PGRs used. So we may infer from the study that the PGRs may induce stress, cause gross morphological, physical, molecular and biochemical changes, enrouting the cells to SE developmental pathway.

Conclusion and future prospects

Catharanthus roseus shows the potentiality of SE at high frequency in MS medium supplemented with BAP and NAA. SE was only induced in hypocotyl derived calli, calli from other explant sources were non embryogenic. A sizable 1079 proteins were identified in embryogenic callus exclusively having diverse functions. These proteins have functional homology with a number of proteins identified earlier from embryogenic cells in many other investigated plants. Some of the newly identified proteins were discussed with reference to previous others. The study suggests that a large number of stress related proteins accumulate in embryogenic cells of C. roseus, may activate an array of different signaling cascades in acquiring embryogenic competence. The abundance of different proteins, regulating metabolic processes indicates that the embryogenic cells are highly active compared to non embryogenic cells. The identification of SE related proteins in C. roseus is a step to understand the process of SE and further need to be consolidated at genomic and transcriptomic level. The SE related proteins isolated from diverse plant groups, could be used as important markers for the identification of embryogenic cells. Further research is however; necessary to unveil the mechanism underlying the process of SE as the process of SE has often been considered as a model to understand plant developmental pathways. The elucidation of SE proteomic maps may broaden its scope and applicability to other economically important and endangered plants that need rapid propagation and conservation.