Statement of Novelty

We confirm that this work is original and has neither been published elsewhere nor is currently under consideration for publication. In this contribution, we address actinobacterial strains as enzyme producers involved in the decomposition of cellulose and hemicellulose. We sequenced their genomes and identified their corresponding genes encoding enzymes. Since the decomposition of lignocellulosic biomass is a complex process, we have exploited genome sequencing and enzyme assays to reveal the presence of a complex enzymatic machinery required for raw lignocellulose material.

Introduction

The photosynthetically derived polymers cellulose and hemicellulose represent the most abundant renewable resources on Earth. These materials are derived from fixed carbon dioxide and they form the main building blocks of lignocellulosic biomass [1]. Agriculture, forestry, industry, and urban waste are promising clean and sustainable sources of lignocellulose. In recent years, increasing attention has been devoted to the production of bioethanol from low-cost lignocellulose contained in agri-food residues (zero waste energy) using enzymes to access the energy embedded in the C-H-bonded molecules of cellulose and hemicellulose polymers [2].

Although many Actinobacteria are as efficient at degrading lignocellulosic biomass as fungi [3, 4], their enzymatic systems are so far underrepresented in literature [5]. Both groups share almost the same morphological features including hyphal growth and spore formation [2] which allows them to proliferate in bulky substrate such as dead plant biomass. Actinobacteria have been observed in a broad spectrum of habitats which correlates with their metabolic diversity in decomposing various organic components [6].

Carbohydrate-active enzymes (CAZymes) include glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), glycosyltransferases (GTs), auxiliary activities (AAs) and non-catalytic, carbohydrate-binding modules (CBMs) [7]. Within each group, enzymes are distinguished into families on the basis of the amino acid sequence in the CAZy database (for Carbohydrate Active enZymes) (http://www.cazy.org/) [8]. GHs are enzymes responsible for cleaving glycosidic bonds and are often associated to CBMs that facilitate specific substrate binding. CBMs are typical to aerobic bacteria and fungi, and their role may ensure the efficient processing of the polymeric substrate compared to cellulosome systems of anaerobic microbial degraders of cellulose [9]. Cellulose breakdown can be either partial, resulting in the formation of cellodextrins, which are oligosaccharides formed of two or more glucose monomers, or complete, resulting in separated glucose molecules [10].

This study aims to relate the enzymatic activity and lignocellulose hydrolysis in the isolates of actinobacterial lignocellulose degraders during their growth on various lignocellulose raw materials and to compare the diversity of CAZymes within their genomes.

Material and Methods

Actinobacterial Strains and Culture Conditions

Four (hemi)cellulolytic Actinobacteria strains were isolated from soil and compost, GESEQ-4, isolated on Starch Casein Agar [11] from an agricultural soil (N 35° 30′ 46.9″, E 6° 15′ 31.812″) on August 8th, 2014, GESEQ-9 isolated on Kuster’s Agar [11] and GESEQ-13 on humic acid vitamin agar [12], from a compost supplied by Biocompost (Algeria) on April 5th, 2014, and GESEQ-35 isolated on Kuster’s Agar [11] from a forest soil (N 35° 34′ 19.81″, E 6° 11′ 40.634″) on August 8th, 2014. The inocula were grown for seven days in 250 mL flasks containing 50 mL (g L−1): glucose, 4; malt extract, 10; yeast extract, 4; (pH 7.2) at 25 °C. Four distinct substrates were tested as carbon sources: wheat straw, olive pomace, oat flakes, and sawdust. The residues were milled to a fine powder using an Analytical Mill (IKA-A 11 Basic Mill, Germany). A total of 2.0 g of each substrate was added to 100 mL of Modified Melin Norkrans Medium (MNM) consisting of (g L−1): NH4H2PO4, 2, KH2PO4, 0.6, MgSO4.7H2O, 0.5, K2HPO4, 0.4 and 10 mL L−1 of trace elements solution containing (g L−1): CaCl2⋅2H2O, 7.40; FeSO4⋅7H2O, 1.20; ZnSO4⋅7H2O, 0.66; MnSO4⋅4H2O, 0.5; CoCl2⋅6H2O, 0.1 at pH 7.0 followed by sterilization at 121 °C for 15 min. The cultures were incubated at 25 °C in the dark for 21 days. A 25 mL aliquot of hydrolysed biomass was centrifuged at 4 °C at 6000x g for 20 min (Hermle, Z 323K, Germany). The strains were deposited at the DSMZ, as GESEQ-4 (= DSM 106287), GESEQ-9 (= DSM 107958), GESEQ-13 (DSM 105759) and GESEQ-35 (= DSM 105785).

Enzyme Assays of Cellulases and Hemicellulases

The activities of cellulases (endoglucanase, cellobiohydralase and β-glucosidase) and hemicellulases (endoxylanase, β-xylosidase, α-glucuronidase, α-glucosidase, β-galactosidase, β-mannosidase and α-arabinosidase) were measured after 21 days of incubation as described previously [13, 14]. Following the protocols of the substrate supplier (Megazyme, Ireland), endo-cleaving enzymes were quantified using azo-dyed carboxymethyl cellulose and birchwood xylan. The samples were collected after centrifugation of the culture broth at 4000×g for 20 min (HERMLE Z 324, Germany) to obtain the culture supernatant which is tested for enzymatic activity. A reaction mixture containing 0.15 mL of substrate and 0.15 mL of sample was prepared. After 60 min incubation at 40 °C, the reaction was stopped by adding 0.75 mL ethanol followed by 10 s of shaking and spinning down precipitate by 10 min of centrifugation at 1000x g. The enzyme activity was measured at 595 nm based on standard curves correlating dye and reducing sugar release. The activity of cellulases and hemicellulases were determined by measuring the fluorescence of 4-methylumbelliferone (MUF)-linked substrates, as described previously [13]. Fluorescence was measured after 5 and 125 min using a 96-well microplate incubated at 40 °C. Enzyme activity is defined as the amount of enzyme that releases 1 nmol of reducing sugars per minute. All measurements were conducted in triplicate. Statistical analysis was performed using the R software package (version 3.6.2; https://www.r-project.org/). Kruskal–Wallis test with Bonferroni correction was performed for pairwise comparisons of all possible combinations of categories related to enzyme activity with different carbon sources.

Genome Sequencing and Annotation

DNA was extracted from liquid axenic cultures using UltraClean Microbial DNA Isolation Kit (MoBio) according to manufacturer's protocol, with the modification that the cells were shaken in a lysis buffer on FastPrep 5 m/s for 2 × 30 s. Sanger sequencing to confirm the taxonomy of isolates was done in the Centre for DNA Sequencing, Prague, Czech Republic, using bacterial primer 1100R. The concentration of extracted DNA was measured on a Qubit 2.0 Fluorometer (Life Technologies, USA) using the Qubit dsDNA BR kit [15]. At least 500 ng of DNA was ligated using TruSeq Nano DNA Sample Preparation Kit and sequenced on Illumina MiSeq with 2 × 250 pair-end runs. Genome assembly was done using either Velvet (1.2.10) [16] or SPAdes (3.9.0) algorithm [17]. Genes were predicted and annotated on the RAST server [18]. GH families were annotated using dbCAN database (HMMdb 5.0 release) [19]. Additional genome annotation for CBM-enzyme couples identification was performed using Prokka gene prediction [20] followed by a search with dbCAN HMMs of CAZy families combined with a BLAST search with nucleotide sequences from the CAZy homepage (update 10/2020). The raw data, genome assembly files and 16S rRNA gene sequences are available with the accession numbers GESEQ-4 (PRJNA702041, JAFFSU000000000, MF150027), GESEQ-9 (PRJNA702043, JAFFSW000000000, MF150030), GESEQ-13 (PRJNA702042, JAFFSV000000000, MF150029) and GESEQ-35 (PRJNA702046, JAFFSX000000000, MF150031). To assess the phylogenetic placement of the cultivated strains, all RefSeq genomes of the genera Streptomyces (n = 2,143) and Mycolicibacterium (n = 303) were downloaded in February 2021 using NCBI-genome-download 0.3.0 script (https://github.com/kblin/ncbi-genome-180 download). GToTree v1.5.46 [21] together with Prodigal [22], HMMER3 [23], Muscle [24], trimAI [25], FastTree2 [26] and GNU Parallel [27] were used to infer phylogenetic trees of the two genera based on a set of 74 bacterial single-copy gene HMM profiles. Trees were visualized using iTOL [28].

Results and Discussion

(Hemi)cellulose hydrolysis by the Actinobacteria is highlighted by comparative genomics and supported by biochemical measurements of the extracellular enzymes produced into the culture liquid. Genomic information collected from Actinobacteria stresses their potential for lignocellulosic biomass decomposition. All the sequenced genomes of the strains have large genomes and a high G+C content (Table 1), which are common characteristics of Actinobacteria [29].

(Hemi)cellulolytic Potential of the Actinobacterial Strains on Biomass

Maximal endoglucanase production, 29.6 U mL−1, was observed when sawdust as a sole carbon source was used with the strain Streptomyces sp. GESEQ-4. This strain also exhibited the highest arabinosidase activity, 249 U mL−1, on oat flakes (Fig. 1). Sawdust induced the highest release of endoxylanase, 29.2 U mL−1, in Streptomyces sp, GESEQ-35. The highest activities of cellobiohydrolase (258 U mL−1), β-glucosidase (994 U mL−1), β-xylosidase (626 U mL−1), β-galactosidase (1692 U mL−1), and α-glucosidase (2106 U mL−1) were obtained on wheat straw with strain GESEQ-13. Wheat straw induced the highest activities of β-mannosidase (55.0 U mL−1) and α-glucuronidase (163 U mL−1) with strain GESEQ-35.

Fig. 1
figure 1

Enzyme activities of the actinobacterial strains on biomass

For strain GESEQ-4, the enzymatic activity of β-xylosidase activity was higher in wheat straw compared to olive pomace (p = 0.039) and cellobiohydrolase in wheat straw versus oat flakes (p = 0.019). For strain GESEQ-13 the activities of α-glucosidase (p = 0.013), arabinosidase (p = 0.012), β-galactosidase (p = 0.013), 1-4-β-glucosidase (p = 0.013), β-xylosidase (p = 0.013), and cellobiohydrolase (p = 0.019) were significantly different between wheat straw versus oat flakes. For strain GESEQ-35 the activities of α-d-Glucuronidase (p = 0.039), β-galactosidase (p = 0.013), 1–4-β-glucosidase (p = 0.019) and cellobiohydrolase (p = 0.019) were significantly different between wheat straw and olive pomace, whereas the activity of endoxylanase was significantly different between wheat straw and sawdust (p = 0.039). For strain GESEQ-9 the activity of α-glucosidase differed significantly between olive pomace and sawdust (p = 0.013), and the activity of 1-4-β-glucosidase differed significantly between wheat straw and oat flakes (p = 0.039).

Genome Sequencing and Annotation

CAZyme identification in the genomes was used to describe the potential of the Actinobacteria to degrade cellulose and hemicelluloses. A set of genes encoding enzymes involved in lignocellulose degradation were predicted by annotating the CAZyme profiles (Table 2). Figure 2 comprises all the CAZymes identified in the genomes. With a larger genome size of over 10 Mbp, Streptomyces sp. GESEQ-35 encodes the highest number of CAZymes (608) and GHs (262). In general, Streptomyces strains showed better enzyme production, which was also confirmed by higher number of GHs as well as more pronounced visual growth on biomass (Fig. 3). Main cellulolytic families GH1, GH3, GH5, GH6 were detected in all the four actinobacterial genomes. GH9, GH12 and GH48 were identified in Streptomyces strains but not in Mycolicibacterium fortuitum subsp. fortuitum GESEQ-9. Cellulose degradation through lytic polysaccharide monooxygenases (LPMOs, AA10) and chitin (AA10 and GH18) is also predicted in the Streptomyces strains. Hemicellulolytic enzymes GH2, GH16 were annotated in the four genomes. Xylanases/xyloglucanases (GH10), mannanases (GH26), α-glucanases (GH31), (hemi)cellulases (GH36, GH43, GH51, GH74, and GH95) were not identified in Mycolicibacterium fortuitum subsp. fortuitum GESEQ-9. Endoxylanase (GH11) and hemicellulases (GH39) were identified in Streptomyces sp. GESEQ-4 and Streptomyces sp. GESEQ-13. (Hemi)cellulases (GH30) were identified in Streptomyces sp. GESEQ-4 and Streptomyces sp. GESEQ-35. GH54 with α-L-arabinosidase and/or β-xylosidase activity was detected in Streptomyces sp. GESEQ-4. α-1,4-galactosidase (GH27) and α-fucosidase (GH29) were encoded in both Streptomyces sp. GESEQ-4 and Streptomyces sp. GESEQ-35. Polygalacturonases (GH28), β-1,4-galactosidase (GH35 and GH42), α-1,4-galactosidase (GH36) and α-fucosidase (GH95) were encoded in the three Streptomyces strains. α-1,4-galactosidase or α-amylase (GH57) was only identified in Mycolicibacterium fortuitum subsp. fortuitum GESEQ-9 and α-1,4-galactosidase (GH97) in Streptomyces sp. GESEQ-13. Xylan esterases (CE1, CE2, CE3, CE4, and CE7) were identified in all actinobacterial strains, except for CE2 missing in Streptomyces sp. GESEQ-35 and Mycolicibacterium fortuitum subsp. fortuitum GESEQ-9. Although, the enzymatic degradation of other biomass compounds, including lignocellulosic biomass (lignin and pectin) and chitin was not verified by enzyme assays measurements, CAZyme families known in their biotransformation were detected at the genome level. In that respect, vanillyl-alcohol oxidase (AA4) and benzoquinone reductase (AA6) associated with lignin degradation were identified among the strains.

Table 1 Genome features of the actinobacterial strains
Table 2 CAZyme genomic profile of the actinobacterial strains
Fig. 2
figure 2figure 2figure 2figure 2

CAZyme profile encoded by the strains

Fig. 3
figure 3

Photos of the Actinobacterial strains grown on biomass

There are very few studies concerning bacterial degradation of lignin [3], however, recently discovered enzymes from bacteria such as Streptomyces sp. have been useful for delignification [30, 31]. Families involved in pectin degradation PL1, PL3 and PL9 and pectin methylesterase (CE8) were predicted except for GESEQ-9. Although the pectin content in plant biomass is typically lower than that of cellulose and hemicellulose, interest of pectin degradation has grown recently. Pectinases have also emerged a class of enzymes with particular interest in the degradation of pectin, along with their role in degradation of other polysaccharides [32]. α-1,4-glucosidase, α-rhamnosidase, β-amylase, or α-amylase (GH13), glucoamylase (GH15), chitinase (GH23) and α-1,4-glucosidase (GH63) were encoded in the four studied strains, while α-rhamnosidases (GH2 and GH78), were detected only in the Streptomyces strains, and α-rhamnosidase (GH106) in GESEQ-4 and GESEQ-35. In addition to lignocellulosic decomposing enzymes, GH annotation also showed the capacity of the strains to produce several other enzymes, such as amylases, proteases, chitinases, and pectinases.

Genomics, along with genome sequencing, is a powerful tool for exploring lignocellulose enzymes. To complement genomic analysis, a biochemical analysis of the enzymes in question is required to determine their catalytic efficiency on natural substrates for CAZymes [33, 34]. Annotations on CAZyme show that the isolated Actinobacteria strains contain a broad range of enzymes designed to break down both cellulose and hemicellulose. Enzymes of the CAZy family are in accordance with the hydrolysis profile of the studied actinobacterial strains. We have found that GESEQ-4 and GESEQ-35 seem to be the dominant contributors to the numbers of genes encoding enzymes in lignocellulose degradation, while GESEQ-9 and GESEQ-13 appear to be less generalistic degraders. Phylogenetic clustering in the context of whole genus shows strain GESEQ-9 to be closely related with Mycolicibacterium fortuitum (Fig. 4). Streptomyces strains GESEQ-35 and GESEQ-4 cluster together in a separate clade from GESEQ-13: these strains are not closely related to identified Streptomyces species, which have higher numbers of sequenced genomes.

Fig. 4
figure 4

Phylogenetic trees based on 74 bacterial single-copy genes for the genus Streptomyces (A, n = 2,143 RefSeq genomes) and Mycolicibacterium (B, n = 303 RefSeq genomes). Positions of strains cultivated within this study are denoted as circles. Related taxa with multiple sequenced genomes are described with species names

Despite belonging to the same genus of Streptomyces, the types and numbers of enzyme families can differ drastically between bacterial strains and species [35]. In this way, some bacterial strains might possess a more specialized set of enzymes. Therefore, it is important to consider the sequences of closely related species to expand the number of valuable carbohydrate-producing bacteria that are efficient at degrading lignocellulose. In the study of Berlemont et al., Streptomyces strains have been shown to be able to utilize variable carbon sources, while Mycobacterium strains with few GHs do have a decreased capacity for polysaccharide deconstruction because they depend on their hosts for resources [36, 37]. Streptomyces biocatalytic enzymes have been studied extensively, to the point where they are now commercially available [4]. However, the relative contributions of the various enzymes responsible for modifying plant cell walls are not yet fully appreciated. A reliable model has not yet been developed to explain the evolutionary development of actinobacterial enzyme machinery [38]. These enzymes typically include multifunctional domains, each composed of two or more CBMs. Indeed, CBM2, CBM32, CBM48 and CBM50 have been annotated in the four genomes. The genomes comprised many couples of GH, AA, CE, PL with CBMs (Supplementary Material). However, most of the other CBMs such as CBM4 and CBM6 were only encoded in Streptomyces genomes. Some CBMs were strain specific, namely CBM14 in the Mycobacterium strain, CBM30 in GESEQ-35 and CBM46 in GESEQ-4. CBM3 and CBM11 were only detected in GESEQ-13 and GESEQ-35. CBMs do not possess their own catalytic activity, but they are still highly important because they bind the catalytic domain of the enzyme to cellulose and enhance enzyme–substrate interaction [7, 39].

Conclusion

Genome sequencing indicates that the strains contain strong candidates for biotechnologically important cellulose-degrading enzymes such as GH3 and GH12. As a result, actinobacterial strains studied here can be added to a list of microorganisms that convert biomass to sugars, as seen from an industrial perspective. Genomic studies of Actinobacteria also hold the promise enabling bacterial strains to be engineered to generate novel enzymes and cocktail enzymes to degrade lignocellulosic biomass.