Introduction

Lichens are a symbiotic partnership between a mycobiont (fungal partner) and a photobiont (photoautotrophic partner). Lichens produce a plethora of secondary metabolites, which allows them to thrive in hostile environments (Goga et al. 2020). Various studies have predicted that these compounds assist lichens with habitat expansion and resilience to various biotic and abiotic stresses (Mitrović et al. 2011; Gaya et al. 2015; Solhaug and Gauslaa 2012). These secondary metabolites are also known to have antibacterial, antiproliferative, antioxidant, antiviral, anti-inflammatory, antiherbivore allelopathic, and photoprotective effects (Goga et al. 2020). Hence, lichens are widely used in ethnomedicine for the treatment of various diseases (Crawford 2015; Yang et al. 2021). A vast majority of these secondary metabolites are small aromatic polyketides synthesized by fungal polyketide synthases (PKSs) (Kealey et al. 2021; Kim et al. 2021b; Wang et al. 2018; Armaleo et al. 2011; Yousuf et al. 2014; Schmitt et al. 2008).

PKSs have been reported from a diverse range of bacteria, fungi, and plants (Austin and Noel 2003). PKSs are multidomain or multiprotein enzyme complexes that produce a wide range of natural compounds. These enzymes share structural and functional similarities with fatty acid synthase (Korman et al. 2010). PKSs are classified into four types based on their mechanisms and structure, which include enzyme catalysis, molecular recognition, protein–protein interaction, and domain organization. These are modular Type I, iterative Type I, Type II, and Type III (Shen 2003). Among these, Type I and II PKSs are large protein complexes, while Type III is a small homodimeric protein (Weng and Noel 2012). Concurrently, based on the presence of optional domains, fungal PKSs can be classified into three subgroups. These are including non-reducing (NR), partially reducing (PR), and reducing (R) PKSs (Gerasimova et al. 2022). Recent comparative genomic research of 29 lichenized fungi revealed that the number of PKS genes varies greatly between the groups of lichenized fungi. Parmelia sp., for example, has 70 Type I PKS genes, whereas only three genes were found in Sclerophora sanguinea (Kim et al. 2021b). Similarly, Gerasimova et al. (2022) examined 23 Ascomycota lichen-forming fungus genomes and observed a significant diversity of Type I PKS genes in Bacidia rubella. Despite the difficulty of linking PKS genes to their products in lichen-forming fungi, which is often due to a lack of established gene knockout or heterologous gene expression systems, some recent studies have been successful in achieving this, such as those by Okorafor et al. (2021), Kim et al. (2021a), Kim et al. (2021b), Kealey et al. (2021), Singh et al. (2022a).

The lack of well-established gene knockout or heterologous gene expression techniques in lichens has made it difficult to conclusively link a PKS gene to its corresponding metabolic product (Kealey et al. 2021). Even though over 1000 lichen metabolites have been reported, only around ten have been effectively linked to the genes that encode them (Singh 2023). Currently, several PKS genes in lichen have been characterized using genome mining approach. For example, CgrPKS16 from Cladonia grayi is engaged in the grayanic acid depsidone biosynthesis pathway (Armaleo et al. 2011), NPPKS7 from Nephromopsis pallescens in usnic acid biosynthesis (Wang et al. 2018), and PKS PFUR17_02294 and atr1 from Pseudevernia furfuracea in lecanoric acid and atranorin production, respectively (Kim et al. 2021b; Kealey et al. 2021).

The presence of PKS genes in the genome of a lichenized fungus does not necessarily correspond to the production of secondary metabolites. A recent study identified 14 PKS genes in the genome of Endocarpon pusillum. However, no secondary metabolites were detected under conventional culture conditions. When the fungus was propagated in a vermiculite medium, however, three secondary metabolites were produced (Liu et al. 2018). Similarly, Jeong et al. (2021) demonstrated that Cladonia metacorallifera var. reagens requires fructose concentration in medium and light to produce cristazarin, whereas pH is important for Cladonia uncialis to produce usnic acid (Gunawardana et al. 2021). Certain secondary metabolites are only produced by the fungus when it is in a symbiotic relationship with an alga in a lichenized condition, such as hyperhomosekikaic acid (Yousuf et al. 2014). As a result, developing a co-culture system is critical for identifying some of the secondary metabolites produced by a lichenized fungus (Li et al. 2020).

The lichen Xanthoparmelia taractica has been reported to grow in various regions of China, Europe, and South and North America. However, no co-culture system for X. taractica and its algal partners has been established, as well as no genomic data for the lichenized fungal species is available. To address this, we established co-culture systems with the lichenized fungus (X. taractica), the endolichenic fungus (Coniochaeta fibrosae) and its algal partner (Trebouxia sp.), or either of the fungus and the alga. We also sequenced the whole genome of X. taractica, identified the PKS genes, and investigated the expression patterns of selected PKS in both co-culture and monoculture setups. We hypothesized: (1) the early phase of symbiosis will be observable in co-culture systems combining the lichenized fungus X. taractica and the alga, but not in the setup comprising endolichenic fungus C. fibrosae and the alga. This is because the former is known to form symbiotic partnerships with alga, while the latter is a non-lichenized fungal species that dwell within the lichen host, comparable to "endophytes"; and (2) when co-cultured with the alga, the expression pattern of selected PKS genes of the lichenized fungus X. taractica will vary compared to the pure culture of the fungus.

Material and method

Collection of lichen sample

In July 2019, samples of Xanthoparmelia taractica were collected from Balin Right Banner (14 98.8 m a.s.l., 44°13′45"N, 118°44′57"E), Chifeng City, Inner Mongolia Autonomous Region of China.

Isolation of fungi and alga from lichen thalli

To eliminate organic debris, lichen thalli were washed under running tap water. The thalli were then rinsed several times with sterile deionized water. Under a Leica Zoom 2000 stereomicroscope, the top cortical layer was scraped off with a sterile blade. After dissecting the medullary layer, tissues were rinsed with sterile deionized water.

For fungal isolation, pieces of medullary tissues were placed on the surface of 2% potato dextrose agar (PDA; Qingdao Hope Bio-Technology Co., Ltd., China). All Petri plates were incubated at 25 °C for 14 days. Mycelia emerging from tissue pieces were sub-cultured onto new PDA plates. The single hyphal tip isolation technique was used to establish pure cultures of fungal isolates. All the plates were incubated in darkness at 25 °C.

To isolate the alga, pieces of medullary tissues recovered as previously indicated were plated onto fresh Bold's Basal Medium (BBM) (Deason 1960). Thereafter, all Petri plates were incubated at 25 °C under a light/dark cycle (12/12 h) for 14 days. Algal colonies were sub-cultured on fresh BBM plates using the streak culture method to establish pure cultures. All the algal cultures were incubated at 25 °C under a light/dark cycle (12/12 h).

Identification of lichenized fungi and algae

Total genomic DNA from fungal and algal isolates was extracted using a modified CTAB method (Zhang et al. 2010). For molecular identification of the fungal and algal isolates, the complete internal transcribed spacer (ITS) was amplified using primer pair ITS1/ITS4 (White et al. 1990).

Each 25 μl PCR reaction included 12.5 μl of 2 × Taq Master Mix (buffer, dNTPs and Taq; Vazyme Biotech Co., Ltd, China), 9.5 μl of PCR grade H2O, 1 μl each of forward and reverse primers, and 1 μl of DNA extract. PCR amplifications were conducted with an initial denaturation at 95 °C for 5 min, followed by 30 cycles of 95 °C for 30 s, annealing temperatures 54 ºC for 60 s, 72 °C for 90 s, followed by a final elongation at 72 °C for 10 min. Positive amplifications were confirmed using agarose gel electrophoresis. All the PCR products were sequenced by Sangon Bioengineering (Shanghai) Co., Ltd. The resulting sequences were assembled using Geneious v. 10.2.2 (https://www.geneious.com).

Preliminary identification of fungal and algal isolates was done by using the BLAST algorithm (Altschul et al. 1990) available through the NCBI GenBank. Thereafter, sequences of closely related fungal and algal species were retrieved from the GenBank using the distance tree feature (Table S1-2). All datasets were aligned using MAFFT v. 7 (Katoh and Standley 2013). If needed, manually adjusted using MEGA v. 7 (Kumar et al. 2016).

Phylogenetic analyses were done using software available through the CIPRES Science Gateway (Miller et al. 2010). Appropriate evolutionary models were selected using jModelTest2 (Darriba et al. 2012). Maximum likelihood (ML) phylogenetic analyses of the datasets were done using RAxML v. 8.2.2 (Stamatakis 2014). These analyses were carried out using the default settings and bootstrap values derived from 1,000 replicated searches. Phylogenetic trees were visualized using FigTree v. 1.4.3 (http://tree.bio.ed.ac.uk/).

Co-culturing of fungi and alga

Two fungal isolates and one algal isolate were co-cultured in this study. Isolates of lichenized fungi (423E) and alga (S420) were recovered in this study. Henceforth, these isolates will now be referred to as LF-423E and AGL-S420. The isolate of endolichenic fungus, Coniochaeta fibrosae (CGMCC 3.20304), was recovered in a previous study (Si et al. 2021) but from the same lichen thallus.

The fungal and algal cultures for this co-culturing experiment were prepared following the protocol suggested by Joneson et al. (2011). Fungal and algal isolates were sub-cultured separately in 100 ml each of malt yeast broth (MY, 20 g malt extract and 2 g yeast extract/L) and BBM broth, respectively. The flasks were placed in an incubator shaker at 120 rpm, 20 °C under a light/dark cycle (12/12 h) for four weeks. After the incubation period, 50 ml of each fungal and algal culture were pipetted into three polyethene tubes and centrifuged at 12 000 rpm for 10 min. The algal pellets were re-suspended into a 50 ml blend of BBM and MY broths in a 99:1 ratio. The fungal pellets were macerated using a tissue grinder before being resuspended in the 50 ml blend of media. In suspensions containing more than one organism, the mixes were made in ratios of 1:1 or 1:1:1.

The co-culturing trial included six set-ups. These are: 1) LF-423E only; 2) LF-423E + AGL-S420; 3) AGL-S420 only; 4) CGMCC3.20304 only; 5) CGMCC3.20304 + AGL-S420; and 6) LF-423E + AGL-S420 + CGMCC3.20304.

Sterile nitrocellulose membranes were placed on the surface of BBM and MY (99:1) agar plates. 500 µl of each of the above-mentioned suspensions were aspirated onto the surface of these nitrocellulose membranes and air-dried inside a laminar flow for 1 h. Each set-up was replicated thrice. All of the Petri plates were incubated at 20 °C for 21 days with a 12/12 h light/dark cycle.

Scanning electron microscopy

The interaction between fungal and algal isolates was observed using a scanning electron microscope. The nitrocellulose membrane was harvested and sliced into 1–2 cm strips from representative plates from set-up 2, 5 and 6 of the co-culture trial. Nitrocellulose strips with the organism were fixed using Karnovsky's fixative (2.5% glutaraldehyde and 1.0% paraformaldehyde in 0.1 M cacodylate buffer, pH 7.2) at 4 °C for 24 h. Fixed nitrocellulose strips were rinsed with 0.1 M cacodylate buffer (pH 7.2) followed by post-fixing with 1% OsO4 dissolved in 0.1 M cacodylate for 2 h at 4 °C. Post-fixed samples were gradually dehydrated with 70, 80, 90, and 100% ethanol. Thereafter, all the dehydrated samples were steeped in absolute ethanol for 30 min, and this step was repeated thrice. Samples were further dehydrated using absolute acetone for 20 min twice. All samples were then dried with the Leica EM CPD300 critical point dryer. The morphological characteristics of interactions between fungi and algae were observed using a Hitachi TM-3030 scanning electron microscope.

Genome sequencing and assembly of Xanthoparmelia taractica

DNA for the whole genome sequencing of X. taractica was extracted using E.Z.N.A.® HP Fungal DNA Kit (Omega Biotek). Library preparation and sequencing were done by Shanghai Personalbio Technology Co., Ltd. (China). Construction of the library was done using a TruSeqTM DNA Sample Prep Kit (Illumina, USA). Runs were conducted with 2 × 150 base paired-end reads with insert size 400 using the Illumina NovaSeq System. The quality of the raw sequence data was checked using FastQC v. 0.11.9 (Andrews 2010). Adapter sequences were trimmed using AdapterRemoval v. 2.0 (Schubert et al. 2016). SOAPec v. 2.0 was used for quality correction of all reads based on k-mer frequency, with the k-mer value set to 17 (Luo et al. 2012). High-quality reads from each sample were used for k-mer counting using Jellyfish v. 2.0.0 (Marçais and Kingsford 2011). The genome was assembled by a hybrid approach using the software Falcon (Chin et al. 2016) and CANU (Koren et al. 2017). The draft genome assembly was polished using Pilon V.1.18 (Walker et al. 2014). The resulting genome assembly was assessed using BUSCO v.3.0.2 (Simão et al. 2015).

The repeat sequence content of the genome was detected using RepeatMasker v. 4.0.5 (Tempel 2012) and the Repbase collection of repeats for fungus v. 20150807 (Kapitonov and Jurka 2008). RECON v. 1.0.8 (Bao and Eddy 2002) and RepeatScout v. 1.0.5 (Price et al. 2005) available through RepeatModeler v. 1.0.4 (Smit et al. 2015) were used to identify de novo-based repeat sequences. The transfer RNA (tRNA) and ribosomal RNA (rRNA) genes were identified using tRNAscan-SE v. 1.3.1 (Lowe and Eddy 1997) and RNAmmer v. 1.2 (Lagesen et al. 2007), respectively. The remaining non-coding RNA sequences (ncRNAs) were identified using the Rfam database for ncRNAs (Griffiths‐Jones 2005).

Gene prediction of Xanthoparmelia taractica

The ab initio gene predictions were performed using the programs Augustus v. 3.03, glimmerHMM v. 3.0.1, and GeneMark-ES v. 4.35 (Stanke and Waack 2003; Majoros et al. 2004; Ter-Hovhannisyan et al. 2008). Homology-based gene prediction was done using Exonerate v2.2.0 (Slater and Birney 2005). The ab initio gene predictions and exonerate alignments were then combined using EvidenceModeller v1.1.1 (Haas et al. 2008) with a minimum intron length of 10 bp and weightings of CodingQuarry:1, GeneMark.hmm:1, protein exonerate:2.

Annotations of carbohydrate-active enzymes, antibiotic-related genes, and fungal virulence factors of Xanthoparmelia taractica

Carbohydrate-active enzymes (CAZymes) annotation was done using hmmscan v. 3.1b2 against CAZy database (Lombard et al. 2014). If the length of the alignment was greater than 80 then a stringent e-value cut-off was used 1e-5, with coverage of 30%. However, if the length of the HMM alignment was less than 80 then the e-value was reduced to 1e-3, with coverage of 30%. Antibiotic-related genes were identified using BLAST against the comprehensive antibiotic resistance database (McArthur et al. 2013) with an e-value cutoff of 1e-6, identity > 45%, and coverage > 70%. Fungal virulence factors were also identified using BLAST against the database of fungal virulence factors (DFVF) (Lu et al. 2012) with an e-value cutoff of 1e-6, identity > 45%, and coverage > 70%.

Gene functional annotation of Xanthoparmelia taractica

The protein sequences were functionally annotated using Diamond v. 0.9.10.111 (Buchfink et al. 2015) against the NCBI-nr database (NR) (release 2017.10.10) with an e-value cutoff of 1e-6. The Swiss-Prot annotation was conducted by BLAST against the Swiss-Prot database (release 2017.11.22) with an e-value cutoff of 1e-6. The functional annotation through orthology assignment was done by eggNOG-mapper v. 2 (Cantalapiedra et al. 2021) with the DIAMOND mapping mode (Buchfink et al. 2021) with an e-value cutoff of 1e-6. Transport proteins were annotated by BLAST against the Transporter Classification Database (Saier et al. 2014). Protein domain families were annotated by using the Pfam database-A. Genes involved in host–pathogen interactions were identified by using the pathogen-host interactions (PHI) database (Urban et al. 2021). The KEGG Ortholog (KO) and pathway were annotated by KEGG orthology identifiers available through the web-based server KAAS (KEGG Automatic Annotation Server, v. 2.1) (Moriya et al. 2007) with a representative set: for eukaryotes, KO assignment methods were: bi-directional best hit. Gene Ontology (GO) was annotated by InterProScan (Quevillon et al. 2005) against the InterPro v. 66.0 protein signature database (Finn et al. 2017). Cytochrome P450 was identified by BLASTP (2.5.0 +) against the fungal cytochrome P450 database (Park et al. 2008) with an e-value cutoff of 1e-5.

PKS gene clustering and analyses

Annotation of secondary metabolite biosynthesis gene clusters was performed using antiSMASH fungal version 6.0 (Blin et al. 2021) and the corresponding protein sequences of PKS genes from X. taractica were manually checked using BlastP. PKSs that were found in X. taractica and those from the NCBI were compared, and if there was less than a 70% similarity, they were excluded from the phylogenetic analysis. The final dataset included protein sequences of PKSs identified from X. taractica and those retrieved from the NCBI. The dataset was aligned using MAFFT and the ML trees were constructed and viewed using RaxML and FigTree, respectively.

PKS genes expression analyses

For the expression studies, the protein sequences of the X. taractica PKS genes that clustered with other PKSs from fungi with well-defined functions were considered. The β-tubulin (BT) gene was used as a reference.

Following the settings recommended by Thornton and Basu (2015), the primers for RT-qPCR were designed using PrimerQuest Tool (https://sg.idtdna.com/PrimerQuest/Home/Index). Primer Tm (°C): lowest melting temperature of 50 and optimal melting temperature of 60; Primer GC (%): minimum 50, optimum 55, and maximum 60; Primer size (nt): optimum 20 and maximum 24; GC Clamp (nt) 1; Amplicon Size (bp): smallest size 75, optimum size 100, and largest size 150. The presence of secondary structures (if any) and cross, self-dimers, and hairpin construction were verified using UNAFold (http://www.idtdna.com/UNAFold) and Beacon Designer (http://www.premierbiosoft.com/qpcr/), respectively. Table S3 lists all of the primer pairings designed in this study.

Total RNA was extracted from set-ups 1 and 2 after 21 days using TRIzol reagent (Thermo Fisher Scientific) following the manufacturer’s protocol. cDNA was synthesized using the SPARKscript II RT Plus Kit (Shandong Sparkjade Scientific Instruments Co., Ltd, China). Each 20 μl of PCR reaction included 2 × SYBR Green qPCR Mix 10 μl, Primer F (10 μM) 0.4 μl, Primer R (10 μM) 0.4 μl, cDNA 1 μl, and RNase Free water 8.2 μl. For both PKS and BT gene regions, PCR amplifications were conducted with an initial denaturation at 95 °C for 3 min, followed by 45 cycles of 95 °C for 20 s, 60 °C for 15 s, 95 °C for 5 s; melting curves at 65 °C for 1 min.

The dataset from the expression study (monoculture vs. co-culture) was statistically analysed using one-way analysis of variance (ANOVA) available through the R v. 4.2.2 (R Core Team 2022). A p-value ≤ 0.05 was considered significant.

Results

Identification of lichenized fungi and alga

Fungal isolate LF-423E recovered from the lichen X. taractica grew significantly slower than other fungal isolates, indicating that it was most likely a lichenized fungal species. Preliminary identification of the ITS sequences obtained from the fungal and algal isolates revealed that LF-423E was closely related to the fungus X. taractica, while algal strain AGL-S420 was closely related to an undescribed Trebouxia species. Maximum likelihood phylogenies using ITS sequences validated the identification of the isolates (Fig. S1 and S2). The sequences generated in this study were submitted to GenBank under the accession numbers OP279582 (LF-423E) and OP279583 (AGL-S420).

SEM of fungal and algal co-cultivation setups

In either of the co-culture settings, no lichen-like structure was observed. Slow growth was observed in all instances (Fig. 1a, b, c).

Fig. 1
figure 1

Co-culture of mycobiont (Xanthoparmelia taractica), photobiont (Trebouxia sp.), and endolichenic fungi (Coniochaeta fibrosae). Cultures of a: mycobiont Xanthoparmelia taractica; b: mycobiont Xanthoparmelia taractica + photobiont Trebouxia sp.; c: photobiont Trebouxia sp. on nitrocellulose membrane placed on the surface of Bold's basal medium and malt extract yeast (99:1) agar plates. d: Co-culture of mycobiont and photobiont. Algal cells are enclosed in hyphae and form many small aggregations on the nitrocellulose membrane. White arrows = algal cell clusters enclosed by hyphae, black arrow = fungal hyphae on the nitrocellulose membrane; e, f, g, h: Co-culture of mycobiont and photobiont. The hyphae of X. taractica were observed to penetrate algal cells in this co-culture setup. White arrow = appressoria; i, j: Co-culture of endolichenic fungi and photobiont. In this set-up the fungal hyphae didn’t penetrate the algal cells. White arrow = conidia produced by endolichenic fungus; k, l: Co-culture of mycobiont, photobiont, and endolichenic fungus. The hyphae of mycobiont penetrated photobiont, but endolichenic fungus did not. Scale bars: d = 50 μm; e-l = 5 μm

Algal cells (AGL-S420) formed several small aggregations on the nitrocellulose membrane (Fig. 1d-l). Lichenized fungus (LF-423E) mycelia produced a reticulate structure that linked algal aggregations (Fig. 1d). Fungal mycelia were also seen growing between algal cells and loosely enveloping them (Fig. 1e, f, g, h). Appressoria were detected on the surface of algal cells in some situations. However, in the majority of cases, lichenized fungal mycelia entered algal cells without the formation of appressoria.

The set-up 5 involved co-culturing of the endolichenic fungus C. fibrosae (CGMCC 3.20304) and the alga (AGL-S420). Here, algal cells also form aggregations. However, the mycelia of C. fibrosae exclusively grew on the nitrocellulose membrane, without entering or enclosing the algal cells. Spores were observed in this set-up (Fig. 1i, j).

We co-cultured lichenized fungus (LF-423E), endolichenic fungus (CGMCC 3.20304), and alga (AGL-S420) in setup 6. In this case, our observation was a combination of the preceding two observations. The mycelia of LF-423E penetrated and encased the algal cell clusters, but the mycelia of CGMCC 3.20304 exclusively proliferated and sporulated on the surface of the nitrocellulose membrane (Fig. 1k, l).

Genome sequencing and assembly of Xanthoparmelia taractica

The X. taractica assembly consists of 1336 scaffolds with an N50 of 112, 782 bp. The computed genome size was roughly 43.1 Mb, with a 43.70% GC content. The genomic completeness calculated by BUSCO was 98.6%. (Table S4). According to RepeatMasker annotation, the bulk of repeats is long terminal repeat (LTR) elements (8.19%) and DNA elements (2.69%) (Table S5). Annotated non-coding RNA included 61 copies of tRNA, 12 copies of rRNA, and 25 copies of various forms of ncRNAs (Table S6). The genome sequence was deposited at NCBI under the accession number JANQBS000000000.

Gene prediction of Xanthoparmelia taractica

This assembly was predicted to have 10, 730 ORFs with an average gene length of 1593.9 bps. The total gene length was around 17 M bps, which accounts for 39.64% of the genome. The total exon number was 30, 187 and the total exon length was around 16 M, which is 36.32% of the genome.

Annotations of carbohydrate-active enzymes, antibiotic-related genes, and fungal virulence factors of Xanthoparmelia taractica

CAZymes annotation revealed that there were no polysaccharide lyases in the genome, but there were seven carbohydrate-binding modules (Fig. 2a). Annotation of antibiotic-related genes revealed two antibiotic resistance genes and one antibiotic biosynthesis gene. BLAST against DFVF found 255 fungal virulence factors.

Fig. 2
figure 2

Carbohydrate-active enzymes (CAZymes) identification and gene functional annotation of Xanthoparmelia taractica. a: CAZymes identified in the genome which were divided into six functional classes. These were Glycosyl Transferases (GT), Polysaccharide Lyases (PL), Carbohydrate Esterases (CE), Auxiliary Activities (AA), Carbohydrate-Binding Modules (CMB), and Glycoside Hydrolases (GH); b: eggNOG classifications of assembled unigenes. The functional annotations were divided into 25 categories, corresponding to clusters of orthologous groups (COGs). A = RNA processing and modification, B = Chromatin structure and dynamics, C = Energy production and conversion, D = Cell cycle control, cell division, chromosome partitioning, E = Amino acid transport and metabolism, F = Nucleotide transport and metabolism, G = Carbohydrate transport and metabolism, H = Coenzyme transport and metabolism, I = Lipid transport and metabolism, J = Translation, ribosomal structure and biogenesis, K = Transcription, L = Replication, recombination and repair, M = Cell wall/membrane/envelope biogenesis, N = Cell motility, O = Posttranslational modification, protein turnover, chaperones, P = Inorganic ion transport and metabolism, Q = Secondary metabolites biosynthesis, transport and catabolism, R = General function prediction only, S = Function unknown, T = Signal transduction mechanisms, U = Intracellular trafficking, secretion, and vesicular transport, V = Defense mechanisms, W = Extracellular structures, Y = Nuclear structure, Z = Cytoskeleton; c: Transport proteins annotation of assembled unigenes. Unigenes were summarized into seven main categories. A = Channels/Pores, B = Electrochemical Potential-driven Transporters, C = Primary Active Transporters, D = Group Translocators, E = Transmembrane Electron Carriers, F = Accessory Factors Involved in Transport, G = Incompletely Characterized Transport Systems; d: Pathogen-host interactions (PHI) annotation of assembled unigenes. Unigenes were summarized into nine main categories. A = Chemistry target: resistance to chemical, B = Chemistry target: sensitivity to chemical, C = Effector (plant avirulence determinant), D = Enhanced antagonism, E = Increased virulence (hypervirulence), F = Lethal, G = Loss of pathogenicity, H = Reduced virulence, I = Unaffected pathogenicity; e: Kyoto Encyclopedia of Genes and Genomes (KEGG) Orthology annotation of assembled unigenes. The unigenes were summarized into eight main categories; f: Gene ontology (GO) classification of assembled unigenes. Unigenes were summarized into three main categories

Gene functional annotation of Xanthoparmelia taractica

The NCBI-nr database and the Swiss-Prot database were used to annotate 8883 and 6452 genes, respectively (Table S7). The eggNOG-mapper found 8278 genes and divided them into 25 categories. Among these, the functions of a majority of genes could not be ascertained (3247). Categories that include the highest number of genes with known functions were post-translational modification, protein turnover, chaperones (490); secondary metabolites biosynthesis (453), and carbohydrate transport and metabolism (441). The remaining categories had less than 400 genes. The lowest number of genes (4) were detected under the category nuclear structure and extracellular structures (Fig. 2b).

In the fungal cytochrome P450 database, there were 10,480 putative genes with homologs, of which 5,413 had more than 30% similarity. Whereas, in the transport categorization database, there were 1,364 homologs with 1,085 genes sharing more than 30% similarity, with the most (399) categorized as electrochemical potential-driven transporters and the fewest (7) classified as transmembrane electron carriers (Fig. 2c). Host–pathogen interactions database predicted that 2,314 genes were involved in this interaction. A majority of these genes were linked to loss of pathogenicity (222), reduced virulence (1123), and unaffected pathogenicity (989) (Fig. 2 d). Through KEGG annotation we identified 10, 055 genes that were divided into eight categories. Among these, 3457 genes were associated with BRITE functional-hierarchies, 820 with cellular processes, 568 with environmental information processing, 869 with genetic information processing, 1334 with human diseases, 1801 with metabolism, 1013 with organismal systems, and the functions for the remaining 193 remained unresolved (Fig. 2e). InterProScan summarized the genes into three main categories, with 14683 genes that were involved in biological processes. Among these, 84 were involved with secondary metabolic activities. 10,922 and 11,787 were classified as cellular components, and molecular functions, respectively (Fig. 2f).

PKS gene clustering and analyses

The antiSMASH annotation predicted 46 secondary metabolite gene clusters in the genome of X. taractica. These included nine NRPS, three NRPS-Type I PKS, nine terpene, and 25 PKS gene clusters. Among these, 28 genes represented PKS clusters. There was 27 Type I and the rest were Type III. In the BlastP (Table 1) and ML phylogenetic analyses (Fig. S3), XtPKS12 nested within a group that included a PKS with acyl transferase, acyl hydrolase, and lysophopholipase function identified from Diplocarpon rosae (PBP21839). Whereas, XtPKS18 and XtPKS22 grouped with PKS having 6-methylsalicylic acid synthase (MCJ1371537) and methylphloroacetophenone synthesis activities (QEO24563 and A0A0R8YWJ7), respectively. The remaining PKSs are grouped with uncharacterized proteins or hypothetical proteins (Fig. S3). Hence, genes XTPKS12, XTPKS18, and XTPKS22 were selected for the expression analyses.

Table 1 The list of PKS gene annotated in Xanthoparmelia taractica

PKS gene expression analyses

The PKS genes XTPKS12, XTPKS18, and XTPKS22 were expressed at low levels in X. taractica monoculture (Fig. 3). However, the expression pattern of these genes altered in the co-culture system. When X. taractica and alga were co-cultured, XTPKS12 was down-regulated (0.06 folds) while XTPKS18 and XTPKS22 were up-regulated (1.75 and 1.17 folds, respectively) compared to monoculture.

Fig. 3
figure 3

RT-qPCR expression analysis of PKS12, PKS18, and PKS22 in Xanthoparmelia taractica monoculture and with Trebouxia sp. in co-culture systems. Bars indicate standard errors, while an asterisk is equivalent to a p-value ≤ 0.05

Statistical analyses of the expression data revealed that the expression of XTPKS18 and XTPKS22 was significantly different between monoculture and coculture (p < 0.05). There was, however, no significant difference in the expression of XTPKS12 between monoculture and coculture.

Discussion

In this study, we isolated a lichenized fungus, Xanthoparmelia taractica, and its algal partner, Trebouxia sp. from lichen samples collected from China in 2019. We sequenced the whole genome of the mycobiont, X. taractica. The genome was 43.1 Mb with 10730 ORFs. We detected 27 Type I and one Type III PKS gene. Except for three PKS genes, XTPKS12, XTPKS18, and XTPKS22, the function of most of them remained unknown. We selected these three genes for downstream expression analyses using a co-culture system. The setups included the mycobiont, photobiont, and an endolichenic fungus, Coniochaeta fibrosae. The setups with the mycobiont and the photobiont showed an early stage of symbiosis. Hyphal networks of the mycobiont connected and penetrated the algal cells were observed in these setups. Moreover, XTPKS12 was down-regulated, and XTPKS18 and XTPKS22 were modestly up-regulated. Coniochaeta fibrosae did not participate in the symbiosis or had a significant effect on the functioning of the PKS genes tested.

The genome of X. taractica was around 43.1 MB in size, with a projected gene count of 10,730. There were no Xanthoparmelia genomes accessible in public repositories before the present study. So, we compared our genome size to that of other fungi from Lecanorales. We observed a significant difference in genome sizes in this order. Cetradonia linearis, for example, has the smallest genome (19.5 MB) whereas Alectoria fallacina has the largest (65.2 MB). However, there is no clear link between genome size and habitat choice. For example, Niebla homalea (50.6 MB), Ramalina intermedia (26.2 MB), and X. taractica (43.1 MB) are all rock-inhabiting lichens but with varying genome sizes. Similarly, a recent study that compared the genomes of 46 lichen-forming fungi from Lecanoromycetes also couldn't predict why the size of biosynthetic gene clusters varied between species (Pizarro et al. 2020).

Twenty-eight PKS genes were identified from the genome of X. taractica. This included one Type III PKS gene and 27 Type I PKS genes (including three NRPS-Type I PKS hybrid genes). Several recent studies indicate that lichenized fungi may have a higher number of PKS genes compared to other fungi (Song et al. 2022; Singh et al. 2022b; Gerasimova et al. 2022); however, this trend is not well established. The variety of unique secondary metabolites synthesized by lichenized fungi may be correlated with a larger number of PKS genes. These genes might have been gained by lichenized fungi as they evolved from non-lichenized ancestors (Song et al. 2022). Compared to other fungal lifestyles, lichenized fungi have a higher tolerance to harsh environments, such as high temperatures, intense UV light, and dry weather. The secondary metabolites produced by PKS genes in lichenized fungi may help the lichens adapt to unfavorable conditions (Devashree and Dikshit, 2021).

Early phases of lichenization were detected in a co-culture of X. taractica and Trebouxia sp. Similarly, to previous studies, we also found fungal hyphae entering algal cells, frequently through the development of appressoria, which is considered indicative of symbiont compatibility (Muggia et al. 2021; Ahmadjian 1964; Athukorala et al. 2014). The hyphae of lichenized fungus disintegrate the algal cells during nutrient exchange. This is analogous to the interactions between plants and pathogens. Hence, we identified 2,314 genes in X. taractica that might be involved in host–pathogen interactions. One of which (scaffold25.t62) was orthologous to MagB and GzGpa3 genes of Magnaporthe oryzae and Fusarium graminearum, respectively. A similar gene (UmGPA3) was also identified in the lichenized fungus Umbilicaria muhlenbergii that was homologous to Ustilago maydis Gpa3 and Candida albicans Gpa2 and is likely to be involved in the initial establishment of symbiosis (Wang et al. 2020). Concurrently, an in vitro resynthesis involving Usnea hakonensis and its algal partner revealed 305 fungal and 203 algal genes that promote lichenization through symbiotic interface establishment, nutrient flow, and many other mechanisms (Kono et al. 2020). Similarly, cellular ultrastructure and their biochemical components, such as wall fibrils, extracellular matrix, and lectins, can also promote lichenization (Roth and Goodenough 2021; Wang et al. 2014). This demonstrates that lichenization is a complex process triggered by a combination of factors, many of which are yet unknown.

The endolichenic fungus, C. fibrosae, was previously isolated from the same lichen thallus, from which we also recovered the mycobiont and the photobiont (Si et al. 2021). However, the outcomes from our co-culture study showed C. fibrosae doesn't influence the symbiosis between X. taractica and Trebouxia sp. In all setups that included C. fibrosae, the fungus profusely sporulated without penetrating the algal cells. However, we predict C. fibrosae is plausibly dependent on the photobiont for nutrition. This is because nutrient exchange can also happen through physical contact between organisms (without penetrating the algal cells), as seen in Mortierella elongata and Nannochloropsis oceanica (Du et al. 2019). Similarly, some lichenized fungi also adopt this mechanism for withdrawing nutrients from the photobiont, such as Coenogonium subvirescens (Honegger 2009). More in-depth studies are needed to demystify whether C. fibrosae is nutritionally dependent on Trebouxia sp. and what its mode of nutrient acquisition is.

In monoculture, the expression levels of all tested PKS genes from X. taractica were substantially low even after 21 days. When X. taractica and Trebouxia sp. were co-cultured, XTPKS18 and XTPKS22 were significantly up-regulated, while XTPKS12 was slightly down-regulated. There are a few plausible explanations for this. The life cycle stage of the mycobiont could have affected the expression of genes XTPKS12, XTPKS18, and XTPKS22. In a co-culture study of Gyalolechia flavorubescens and Trebouxia gelatinosa, several PKS genes were highly expressed at 12, 24, 48, and 72 h whereas other PKS genes were expressed after four to six weeks (Song et al. 2022). Besides this, the expression of PKS genes can also be affected by cultural conditions. This was confirmed by a transcriptomic study of secondary metabolite genes in Cladonia rangiferina where expression of PKS genes was substantially influenced by culture conditions, microbial media, and abiotic factors such as light (Sveshnikova and Piercey-Normore 2021). The expression of some PKS genes is also known to vary between isolates of the same fungus. For example, in Cladonia macilenta, PKS21 is responsible for the biosynthesis of biruloquinone, which gets exclusively upregulated in the purple isolate but not in the white variety (Kim et al. 2021a). So, in X. taractica the regulation of PKS genes could influence by one of the above scenarios or a mixture of them.

In this work, we successfully co-cultured X. taractica with the alga Trebouxia sp. and sequenced the whole genome of the lichenized fungus. We were unable to determine the involvement of the endolichenic fungal, C. fibrosae, in the lichen thallus or how this fungus obtains nutrients from the algae. Similarly, we identified 28 PKS genes in the genome of the mycobiont. For a majority of these PKS genes, their putative functions couldn’t be identified during the BlastP searches and phylogenetic analyses. This demonstrates that our present understanding of the biology and genetics of lichenized fungi is sparse. Nonetheless, several promising discoveries from recent investigations have progressed our knowledge substantially (Pizarro et al. 2020; Armaleo et al. 2019; Keuler et al. 2020; Tagirdzhanova et al. 2021; Allen and Lendemer 2022; Resl et al. 2022; Merges et al. 2021). The information gleaned from this and subsequent investigations will help us better understand lichen symbiosis.