Introduction

Cytokinins (CKs) are the class of adenine-derivative signalling molecules widely distributed in nature. Different forms of CKs have been identified in all known taxa: bacteria (Kisiala et al. 2013; Creason et al., 2014; Seo et al. 2016; Samanovic et al. 2018; Kabbara et al. 2020); fungi (Morrison et al. 2015a, b; Hinsch et al. 2015; Chanclud et al. 2016; Trdá et al. 2017); nematodes (Siddique et al. 2015); insects (Zhang et al. 2017; Body et al. 2019; Andreas et al. 2020); mammals (Seegobin et al. 2018); humans (Colombo et al. 2009; Reiter et al. 2012); algae (Stirk et al. 2003; Romanenko et al. 2016). In small amounts, CKs appear as a usual sub-product of the normal de novo biosynthesis or salvage pathways in purine metabolism (Ashihara et al. 2020). Besides, a set of cis CKs are formed as a result of tRNA degradation (reviewed in (Dabravolski 2020). In comparison with animals and fungi, plants have evolved more complex purine interconversion pathways, resulting in the production of a wide range of different CKs (reviewed in (Ashihara et al. 2020).

CKs are one of the main plant hormones, playing an essential role in the majority of plants’ physiological and metabolic reactions. Additionally, CKs control plants’ development, response to biotic and abiotic stress, influence nutrition and agronomical traits (Kieber and Schaller 2018). Modern genomics and bioinformatic resources allow us to expand our understanding of the CKs and consider them also as a universal class of signalling molecules. The CKs metabolism in plants is complex and suggested to be classified into two types: the modification of the adenine moiety and that of the side chain (Sakakibara 2006). Amongst the wide variety of plant proteins associated with CKs metabolism, the functions of phosphate-isopentenyl transferases (IPTs) and phosphoribohydrolase ‘Lonely Guy’ (LOG) in biosynthesis and Arabidopsis Histidine Kinases (AHKs), Arabidopsis Histidine phosphotransmitter (AHPs) and Response Regulators (RRs) in signalling and cytokinin dehydrogenase (CKX) in degradation have been extensively studied (Kieber and Schaller 2014) (Fig. 1). Many attempts have been made to unravel the evolution of the signalling pathway (Gruhn et al. 2014) and the entire metabolic pathway (Frebort et al. 2011). The main outcome of these studies is a commonly accepted idea that CK-related proteins originated from cyanobacteria and remained after endosymbiosis to the proto-chloroplast (Martin et al. 2015). The current model suggests that the CK regulatory mechanism originated from cyanobacteria and passed to angiosperms through Chlorophytes, Lycophytes and Bryophytes (Gruhn and Heyl 2013). It is known that all plastids (including algae) have a cyanobacterial origin (reviewed in (Keeling 2013) and (Martin et al. 2015), therefore the CKX absence in algae is intriguing.

Fig. 1
figure 1

Number of main cytokinin-related genes, presented in different taxa

CKs were shown to play an important role in the inter-species communication molecules for many bacterial, fungal and insect pathogens (Giron et al. 2013). Recently, it was shown, that the plant-pathogenic gram-positive bacteria Rhodococcus fascians (also known as Corynebacterium fascians) can produce (Pertry et al. 2009) and release a mixture of different CKs to modulate and redirect plant development (Pertry et al. 2010; Radhika et al. 2015) in favour of pathogen spreading. The human-specific gram-negative pathogen Bordetella pertussis employs a similar strategy by producing a set of CKs (Moramarco et al. 2019) that could modulate the function of the host’s immune system (Lappas 2015). Altogether, this data suggests an ancient and universal nature of the CKs (Robischon 2015).

The omnipresent FAD-binding (PF01565) domain is an important part of many catalytic proteins with wide representation in all taxa and known to be involved in immune defence, DNA repair, cellular signalling and metabolism, apoptosis, neural development and drug metabolism (reviewed in (Leys and Scrutton 2016). Moreover, recent data on organisms from extreme environments and pathogens have proved new emerging roles of FAD-binding domain-containing proteins, suggesting their adjustable and adaptive nature (reviewed in (Piano et al. 2017).

CKX is the only enzyme, responsible for the irreversible breakdown of CKs. From the time of identification in 1971 by Paces (Pačes et al. 1971), CKXs were studied exclusively in plants and their evolution was traced back only to the first primitive land conquer—Physcomitrella patens (Hedw.) (von Schwartzenberg 2006). To define the evolutionary origin of CKX, phylogenetic and structural analysis has been conducted. According to our data, we could conclude that CKX is a rather recent event of development that emerged as a part of the LOG-mediated defence mechanism, required to metabolise excess non-canonical nucleotides delivered from the environment. During growth plants are constantly interacting with other prokaryotic and eukaryotic organisms, some of them are pathogens, using CKs and CK conjugates as important virulence factors (Chanclud et al. 2016; Akhtar et al. 2020). Whilst more primitive organisms rely on simpler broad rage-specificity LOG-system (Stepchenkova et al. 2005; Sévin et al. 2017), plants require a more specific and efficient CKX-based mechanism to manage interactions with other organisms. For plants it is crucial to distinguish between two extremes: some of them could be pathogenic (should be eliminated) or symbiotic (should be promoted). The role of LOGs has remained unclear. For example, the bacterial LOG homologue (Escherichia coli, P0ADR8) was shown to catalyse irreversible hydrolysis of the N-glycosidic bond for different purine and pyrimidine substrates (AMP, dTMP, GMP, CMP, IMP, and UMP) (Sévin et al. 2017). Moreover, similar broad substrate specificity for a LOG homologue was shown for the pathogen Bordetella pertussis (Moramarco et al. 2019). Similarly, in the yeast Saccharomyces cerevisiae, the LOG homologue (YJL055W) was responsible for the detoxification of the chemotherapeutic drugs such as purine analogue 6-N-hydroxylaminopurine (HAP) and pyrimidine analogue 5-fluorouracil (5-FU) (Stepchenkova et al. 2005; Ko et al. 2008). Based on those data and known antioxidant properties of CKs (Brizzolari et al. 2016) it was suggested, that the possible function of the LOG is to protect the DNA and RNA from the incorporation of the non-canonical nucleotides (Carlsson et al. 2018). However, the suggested LOG function was neither shown nor proven in plants.

Our data suggested that D-lactate dehydrogenase, an omnipresent FAD-dependent oxidase/dehydrogenase, is the most probable source of origin for CKX. Thus, we suggested a new model of the possible neofunctionalization of this protein in multiple lineages across the tree of life. Our findings clarify and provide one possible explanation of the presence/absence of specific features of the respective proteins in algae, bacteria, metazoa and plantae.

Material and Methods

Sequences Retrieval

The sequences of 7 Arabidopsis Cytokinin dehydrogenases (CKXs) [Uniprot IDs: O22213, Q9FUJ3, Q9LTS3, Q9FUJ2, Q67YU0, Q9LY71, Q9FUJ1], Zea mais [Q9T0N8]; D-2-hydroxyglutarate dehydrogenase [O23240] and the consensus sequence of the cytokinin binding and FAD-linked oxidase (C-terminal) domains were extracted from Uniprot database and used for the following BLAST (Shiryev et al. 2007) searches in NCBI (with pBLAST, PSI and DELTA-BLAST algorithms against non-redundant protein sequences and ResSeq Select proteins databases), InterPro (Mitchell et al. 2019) (with in-build InterPro scan tool, sequence and domain architecture search modes) and Pfam (El-Gebali et al. 2019) (HMMER algorithm, sequence and domain architecture search modes) databases with P value set at ≤ 0.001 (summarised results of manually processed sequences are present in Fig. 1). All partial and fragmented sequences were eliminated. The presence of the cytokinin binding (PF09265/IPR015345) and FAD-linked oxidase (PF02913/IPR004113) domains were checked with CD-search tool (Batch mode) (NCBI) (Marchler-Bauer et al. 2017) and MOTIF search (KEGG) (Kanehisa et al. 2016) tools with E value (P ≤ 0.001). Domains, associated with the cytokinin binding domain were verified with the same tools and threshold.

Phylogenetic Analysis

Multiple sequence alignments of protein sequences were performed using MUSCLE (Edgar 2004) with default settings in Ugene software (Okonechnikov et al. 2012). Substitution models test and phylogeny analysis were carried out using the MEGA X software (Kumar et al. 2018). For the maximum likelihood tree (Whelan and Goldman 2001) the LG substitution model (Le and Gascuel 2008) was selected assuming an estimated proportion of invariant sites and four gamma-distributed rate categories to account for rate heterogeneity across sites. The gamma shape parameter was estimated directly from the data. Reliability for the internal branch was assessed using the bootstrapping method (1000 bootstrap replicates). The same settings were used in another tree reconstruction method, Neighbour-Joining (Saitou and Nei 1987), with similar results obtained.

Protein 3D Model Prediction, Search and Comparison

Structural models of the cytokinin binding domain and FAD-linked oxidase were built with SWISS-MODEL (Waterhouse et al. 2018). SWISS-MODEL is a fully automated protein homology modelling server. Effective template search algorithm combines accurate BLAST (Camacho et al. 2009) and sensitive HHblits (Remmert et al. 2012). Further, the most suitable templates are estimated based on the Global Model Quality Estimate (GMQE) (Biasini et al. 2014), describing the most likely structural similarity, and Quaternary Structure Quality Estimate (QSQE) (Bertoni et al. 2017) methods. Predicted structures were refined with the online tool 3D refine (http://sysbio.rnet.missouri.edu/3Drefine/) (Bhattacharya et al. 2016) (2020a) and verified with QMEAN (https://swissmodel.expasy.org/qmean/) (2020b) a linear combination of six structural descriptors of a given protein (Benkert et al. 2011). iPBA webserver was used for the pdb structures alignment (https://www.dsimb.inserm.fr/dsimb_tools/ipba/index.php) (2020c). The quality of the structure’s alignments was evaluated with RMSD (quality of alignment, calculated from the superimposition of protein pairs based on PB alignment) and Normalized score (the dynamic programming alignment score, calculated from the relation of the alignment score to the alignment length) (Zemla 2003; Tyagi et al. 2006, 2008). Chimera software (Pettersen et al. 2004) was used for structure visualization. To verify efficacy of our modelling approach, RaptorX (Källberg et al. 2014), Geno3D2 (Combet et al. 2002) and Robetta (Kim et al. 2004) servers were additionally used to build CKX7 structural models. Further, all generated models were compared to the original CKX7 model (2EXR). Obtained results (Supplementary table 1) suggest that SWISS-MODEL is suitable for our goals. Generated models for each tool and aligned to 2exr (pdb files) could be found in the Supplementary File 1.

Results

Sequences Identification

To track the evolutionary origin of the CKXs, we have searched for the homologous sequences from the Uniprot, Pfam, and NCBI databases using the homology-based BLAST method. Well annotated protein sequences of these proteins were used as query sequences for sequence-based homology searches (specifically, from Arabidopsis thaliana (L.) Heynh., Zea mays L., and consensus sequences from the corresponding domains, deposited in the mentioned databases).

In total, 969 sequences were identified (Supplementary Table 2) (Bacteria: 217, Eukaryota: Heterolobosea 1; Opisthokonta 3; Viridiplantae: 748). During sequence search, the truncated, partial and identical sequences were removed (Supplementary Table 3).

Interestingly, the bacterial CKXs are represented by a single-copy gene for all taxa. In contrast to bacteria, plants usually have multiple CKX genes. Moreover, only 1 protein was identified in Heterolobosea (D2V8E5) in Naegleria gruberi (Amoeba) and only 3 CKXs were identified in fungi: Basidiobolus meristosporus (A0A1Y1XXL7), Antrodiella citrinella (A0A4S4MT84) and Coprinopsis marcescibilis (A0A5C3KPW4). No homologues were found in metazoa and algae taxa.

Conservation of the Domain Architecture in the Cytokinin Dehydrogenase

Domain architecture plays a key role in understanding the functionality of proteins along with their evolutionary history. To evaluate the gain and loss of different domains in CKX proteins, we have checked the domain combinations in each clade.

The dominant domain architecture associated with the cytokinin dehydrogenase consists of N-terminal FAD-binding (PF01565) (app 200aa) and C-terminal cytokinin binding (pfam09265) (app 250aa) domains. However, some bacteria represent only a single cytokinin binding domain (Supplementary Table 3).

Phylogenetic Analysis

To understand the evolutionary history of the C-terminal cytokinin binding domain proteins the phylogenetic trees were inferred with the maximum likelihood and Neighbour-Joining methods. Some bacterial proteins have no FAD-binding domain, therefore only cytokinin binding domain sequences were extracted, aligned and used for the phylogenetic tree reconstruction. In general, both reconstruction methods (the maximum likelihood and Neighbour-Joining) have shown similar results (Fig. 2 and Supplementary Fig. 1). All CKX proteins delivered from the Viridiplantae have formed a separated cluster of closely related proteins. Further sub-divisions of this clade were well-correlated with the described localization of each studied protein as it was shown, for example, for Arabidopsis (Bae et al. 2007). As expected, CKXas of mosses, clubmosses and liverworts were located on separate branches (Fig. 2). Interestingly, amoeba (Naegleria gruberi) and fungi (Basidiobolus meristosporus, Antrodiella citrinella and Coprinopsis marcescibilis) proteins are closer to the bacterial protein’s clade, than to other eukaryotes.

Fig. 2
figure 2

Phylogeny estimation of the identified cytokinin binding domains. The Maximum Likelihood method and LG model were used; 1000 bootstrap replicates. Only branches with bootstrap value > 50 are shown. Fungal species highlighted in cyan, amoeba—magenta, Arabidopsis—blue, cyanobacteria—light green. Alignment length—323, Conserved site—1, Log-Likelihood -36,780.74, Discrete Rates—0.4791, 1.5209 (Color figure online)

Our results correlate with sequences similarity to the AtCKX7 cytokinin binding domain: all examined bacteria taxa (from 31 to 51%), from 39 to 43% for fungi and 52% for the amoeba. As expected, the similarity to the Viridiplantae sequences was above 60%. Thus, according to the results of the constructed phylogenetic trees, we could not prove the cyanobacterial origin of the cytokinin binding domain or any other photosynthetic bacteria taxa.

Structures Modelling and Comparison

Because our phylogenetic tree analysis could not answer the question regarding the origin of the cytokinin binding domain, we have analysed structural models of the CK-binding domains. In total, 36 models were built. The cytokinin binding domain of AtCKX7 was compared to the other cytokinin binding domains from other species (Table 1). It is not surprising that the viridiplantae and amoeba cytokinin binding domain exhibited the highest score. Bacteria from several taxa have shown similarly high matching scores (Chloroflexi, Actinobacteria, Deltaproteobacteria, Chloroflexi, Betaproteobacteria). Interestingly, fungi-derived cytokinin binding domains have exhibited rather low structural similarity.

Table 1 Comparison of structural models of the AtCKX7 with CKX domain from other species and Arabidopsis D2HGDH/ d-LDH

Identification of the Origin of the Cytokinin Dehydrogenase

To identify an omnipresent FAD-dependent protein, that could be related to the cytokinin binding domain we performed a pBlast search. By application of pBlast search, we have found that on the sequence level the closest to CKX protein in Arabidopsis thaliana is the FAD-linked D-2-hydroxyglutarate dehydrogenase (D2HGDH, At4g36400) (22.88%) and its ortholog D-lactate dehydrogenase (d-LDH, At5g06580) with sequence similarity at 30.37%. The closest CKX homologue from Zea mays (1W1O) has 41.85% of sequence similarity.

To find out how D2HGDH is structurally similar to the bacterial homologues, we have built structural models from the bacterial species, with the best match for the cytokinin binding domain, and compared it to the D2HGDH (Table 2). Our analysis revealed that on the amino acid level the D2HGDH exhibited a rather low sequence similarity (about 40% for all). Our comparative analysis of the different cytokinin binding domains revealed a close structural relation of the D2HGDH to the bacterial homologous. Besides, we included 2 FAD-linked oxidase domains from the green algae proteins (A0A2K3DH85 (Chlamydomonas reinhardtii and D8U4H2 (Volvox carteri f. nagariensis). As expected, the FAD-linked oxidase domain from the Arabidopsis thaliana has shown the highest matching scores to the green algae proteins. Interestingly, Betaproteobacteria (A0A069PK84—Caballeronia glathei), PVC group (A0A1Z8SV81—Planctomycetia bacterium) and Deltaproteobacteria (A0A017TCE0—Chondromyces apiculatus) demonstrated a structural similarity level comparable to the algae (Table 2). Our results of the structure comparison are supported by the phylogenetic estimation (Fig. 3 and Supplementary Fig. 2). Similar to the structural models, D2HGDH has shown the closest relation to the green algae FAD-linked oxidases, whilst cyanobacterial protein (A0A1Z4LVD4, Calothrix parasitica) was rather distant.

Table 2 Structural comparison and amino acid similarity of the Arabidopsis D2HGDH to FAD-linked oxidase from other species
Fig. 3
figure 3

Phylogeny estimation of the selected FAD-linked oxidases domains used for the structural comparison. The Maximum Likelihood method and LG model were used; 1000 bootstrap replicates. Only branches with bootstrap value > 50 are shown. Arabidopsis proteins are highlighted in blue, green algae—dark green, cyanobacteria—light green. Alignment length—265, Conserved site—6, Log-Likelihood -7931.97, Discrete Rates—0.4636, 1.5364 (Color figure online)

Taken all together, we could assume that FAD-linked oxidase is an ancient protein, rather similar in all taxa. Perhaps, this oxidase was inherited from the eukaryotes progenitor. Thus, based on our data analysis, we hypothesise, that the CKX was developed from the FAD-linked oxidase in bacteria and viridiplantae independently, as a result of an adaptation to the environmental condition. However, the suggested hypothesis needs further experimental verification.

Discussion

We have summarised the distribution of the main CK regulating genes in different taxa (Fig. 1). The dominant bacterial signalling mechanism comprises a two-component regulatory system, that could include a wide range of sensor domain/s, kinase (Ser/Thr/Tyr kinase or histidine kinase) and RRs (Stock et al. 2000). In contrast to bacteria, in the eukaryotes, this signalling system is rare and very likely to be inherited, from the endosymbiotic organelles (mitochondria and chloroplasts) (Capra and Laub 2012). The CHASE domain is the only known sensor for the CKs (Mougel and Zhulin 2001) that was suggested to emerge from the chemicals sensing pathway (Bilwes et al. 1999; Wang et al. 2017). Whilst the RR domain is ubiquitous for all taxa, the HPT domain is missing in red algae and the CHASE domain—in red algae and metazoan (Fig. 1).

CKXs are involved in irreversible degradation of the wide range of CKs, releasing adenine (or its derivative) and aldehyde. CKXs belong to FAD-dependent enzymes, containing FAD-binding N-terminal domain and C-terminal cytokinin binding domain. Due to the inhibitor properties of adenine for several metabolic pathways, the natural CKX concentration is usually low (Ashihara et al. 2018). Thus, it makes an additional complication for the study of CKX in vivo. CKXs have been studied only in viridiplantae (reviewed in (Kieber and Schaller 2014), whilst they have been also identified in bacteria, Naegleria gruberi (Percolozoa, Amoeba) and 3 fungi species (Basidiobolus meristosporus, Antrodiella citrinella and Coprinopsis marcescibilis) (Supplementary Table 2). In comparison to LOG and IPT genes (their representation in different taxa is similar), CKX is rather under-represented in all living organisms (Fig. 1). Notably, that bacteria and fungi with CKXs exhibit mainly pathogenic (Pertry et al. 2010) or closely associate with plants or animals (Mondo et al. 2017) lifestyle.

Cytokinin binding (pfam09265) domain was identified as a key part in the irreversible degradation of the wide range of the CKS in plants (Bilyeu et al. 2001). The amino acid “signature” of the adenine binding site exhibits a strong influence on substrate recognition and turnover rates (Malito et al. 2004). The proved crystal structures of the CKXs from Zea mais (Malito et al. 2004) and Arabidopsis thaliana (Bae et al. 2007) suggested a conserved catalytic mechanism for those two species. However, no functional data are available from non-plant species.

The phylogenetic data of the CKX proteins could not provide a definitive conclusion about the evolutionary origin of the CKXs. Also, our data could not support the relation to the cyanobacteria, as it was shown for other CK-related genes (Pils and Heyl 2009) or the amoeba Naegleria gruberi, where CKX genes have been delivered via the Chlamydia, that was recently reported by another group (Wang et al. 2020). The proposed role of Chlamydia is based on the close relationship between the human pneumonia-causing bacteria Protochlamydia naegleriophila with amoeba Naegleria gruberi (Casson et al. 2008) and several shared genes between Chlamydia and plant progenitor. There are several studies, proving participation of Chlamydia in the evolution of the photoautotrophic eukaryotes: 39 proteins have been identified by Becker et al. 2008 (Becker et al. 2008); 53 by Collingro et al. 2011 (Collingro et al. 2011) and 21 by Huang and Gogarten 2007 (Huang and Gogarten 2007). None of the identified genes is associated with CK signalling or CK metabolism. In comparison to the massive transfer of genes from cyanobacterial endosymbiont (about 4500 (Martin et al. 2002), many of which are associated with CK, the role of Chlamydia in CKX transfer remains very doubtful. On the contrary, it appears that CKXs in prokaryotes and eukaryotes may have developed independently. Due to the low number of available sequence samples (only one for amoeba and three for fungi), it is not possible to make any solid conclusion or assumption regarding their evolution pathway based only on the close relation to the prokaryotes on the phylogenetic tree. On the other hand, phylogeny and structural comparison of the FAD-linked oxidases from different taxa provides a strong link to a close relation between bacterial, algal and viridiplantae proteins. In this study, we have identified that bacterial, fungal, amoebal and viridiplantae CKXs are orthologs, delivered from FAD-linked oxidase. Our structural analysis reveals that FAD-linked oxidase was, most probably, inherited in the eukaryotes progenitor (Fig. 4, pathway 1). Furthermore, our data are in strong agreement with the previous report (Cristescu and Egbosimba 2009) investigating the evolutional history of the D-Lactate Dehydrogenases, where authors have defined CKX as a new clade on the common tree of the FAD-binding enzymes.

Fig. 4
figure 4

Model for the evolutionary origin of the CKX from FAD-linked oxidase

In total, based on the phylogenetic analysis and comparison of the structures we could conclude that the plant cytokinin binding domain is not originated from the cyanobacteria as was previously suggested (Gruhn and Heyl 2013) for the CK signalling pathway.

In Arabidopsis thaliana D-2-hydroxyglutarate dehydrogenase (D2HGDH, At4g36400) is a protein with mitochondrial localization, and D-lactate dehydrogenase (d-LDH, At5g06580) was identified in both, chloroplasts and mitochondria. As it was shown, both enzymes have wide substrate specificity (Engqvist et al. 2009) and participate in the dark-induced senescence and starvation by catabolism of the amino acids and chlorophyll (Engqvist et al. 2011). Until now no data available indicating the role of some CKs as substrates for the D2HGDH or d-LDH.

Another possible scenario of the FAD-linked oxidase evolution could be from the pro-mitochondrial endosymbiont, as it was suggested to be Proteobacteria (Yang et al. 1985; Martin et al. 2015). In addition to the mitochondrial subcellular localization of the D2HGDH and d-LDH, D2HGDH has shown close structural similarity to several Proteobacterial FAD-linked oxidases (Table 2), pointing to horizontal gene transfer from the endosymbiont (Fig. 4, pathway 2).Cyanobacteria have both genes (CKXs and FAD-linked oxidase) identified (Table 2 and Supplementary Table 4), suggesting the possibility that both genes could be transferred with pro-chloroplastic endosymbiont (Fig. 4, pathway 3) but CKX was lost during green algae evolution. On the next step, mitochondria- and plastid-delivered FAD-linked oxidases could be inherited by the land plants independently (Fig. 4, pathway 4)—which would explain the weak similarity between sequences. Based on the structural similarity of the FAD-linked oxidase and its omnipresent nature, evolutionary pathway 1 is more likely. Similarly, CKX is more likely to emerge independently in every taxon (Fig. 4, pathway 5), which is supported by an investigation of the evolution of the FAD-binding enzyme (Cristescu and Egbosimba 2009).

Thus, according to our results, we hypothesise that CKX is an evolutionarily recent protein that emerged from the FAD-linked oxidase. Many CKX-encoding genes were identified in bacteria, however, none of them was characterized and their substrate specificity remains unknown. It would be interesting to examine the difference in substrate preferences between the single cytokinin binding domain and the fused to the N-terminal FAD-binding domain. The natural environment of the algae does not support close interaction with any bacteria or provide a condition for the exchange of the CKs. Several plant-parasitic algae with a wide range of hosts have been described (Brooks 2004) whilst their genomes were not examined. Thus, the possibility of the CKX existence in algae remains, however further research is required to answer this question.

The CK degradation system of the evolutionarily primitive land plant Physcomitrella patens (Hedw.) is relatively simple, with high preferences in substrates to natural cis-zeatin (von Schwartzenberg et al. 2007). On contrary, the CKXs of vascular land plants (especially angiosperms) similarly to the CK signalling pathway (Kaltenegger et al. 2018), have undergone several duplication events, resulting in a high level of complexity and specificity (Niemann et al. 2018; Czajkowska et al. 2019). Such complexity could explain a high number of different CK forms, presented and metabolised in plants (Galuszka et al. 2007). We could notice a positive correlation between the complexity of the CKX and the number of potential pathogens (Czajkowska et al. 2019). From one side plants have to metabolise divers forms of non-canonical nucleotides, delivered from soil microflora and pathogens. On the other side, some forms of CKs could be toxic for bacterial or fungal invaders, so representing a defence mechanism. Until now no research has been conducted to evaluate the antibacterial/antifungal potential of the different CK conjugates. Therefore, it would be a good option for future studies.

Conclusion

Our results suggest that CKX is a recently evolved gene that resulted from the specialization of the duplicated FAD-linked oxidase. We assume CKX has emerged as a result of close interaction between different organisms (bacteria, fungi, plants and animals) probably as a pathogen defence mechanism. For the higher plants, this interaction began with land colonisation, for bacteria and fungi this interaction began as a pathogenic or commensal lifestyle. Our results suggest that FAD-linked oxidase is the most probable source of origin for the CKXs. Our findings also explain the possible reason for the absence of CKXs in algae.