Introduction

The Antarctic Peninsula (Maritime Antarctica) consists of a north–south oriented chain of heavily glaciated mountains which drive strong climatic differences between its warmer and moister western coastal regions and colder eastern coast in the Weddell Sea [1]. Various lakes are present in this region, varying greatly in terms of trophic status and salinity [2, 3]. Vega Island is located off the eastern tip of the Antarctic Peninsula, in the Weddell Sea. Although there is a broad general literature on the algal communities of lakes and ponds of the Antarctic Peninsula [4,5,6,7,8], information on lakes from Vega Island is largely restricted to abiotic aspects [1, 9], excepting two recent studies exploring diatom paleo-assemblages [10] and sediment fungal communities [11].

Green algae (Viridiplantae) are noted in all studies of Antarctic lake algal communities, represented mainly by the Phylum Chlorophyta [12], with zygnematophycean green algae and other taxa phylogenetically close to higher plants (Streptophyta) being less common. De Wever et al. [8] highlighted the wide phylogenetic diversity of apparently endemic Antarctic lineages of microscopic green algae, consistent with hypotheses of strong regionalization and long-term evolutionary isolation within Antarctica, even of microbial diversity [13,14,15]. According to De Wever et al. [8], their findings, supported by molecular analyses, contrasted with most previous morphological studies, which generally concluded that Antarctic green algae were mostly represented by cosmopolitan species. While these algae are a ubiquitous group present in ecosystems globally [12], many green algae are also known to tolerate extreme conditions such as high salinity and low temperatures, particularly within the order Chlamydomonadales [16, 17]. Several new green algal species have recently been described from Antarctica [18], highlighting the presence of many psychrotrophic or psychrophilic taxa [19, 20]. Green algae are also the primary constituent of the vividly coloured snow algal communities that are well represented in polar regions [16, 21]. Snow algae can decrease the albedo of snow, accelerating the rate of snowmelt [22]. Snow algal species (e.g., Chlamydomonas nivalis) have been also reported in other Antarctic habitats such as lake sediments and soils, in both morphological [5,6,7] and molecular [23] studies.

Considerable advances in the assessment of microbial diversity in environmental samples have been made possible thanks to recent developments in molecular biology [24]. In the Antarctic continent, although a number of recent studies of green algal diversity on soil and rock substrates using molecular tools have been published [23, 25,26,27], the use of such approaches on freshwater communities currently remains limited [8]. As with other microbial groups, some algae cannot be grown in culture, while reliance on morphology alone can also be misleading as high levels of morphological conservation typify algae present in the harsh environmental conditions of Antarctica; this is particularly true with regard to green algae, whose relatively high level of cryptic species and phenotypic plasticity may hamper conclusive diagnosis based solely on morphological features [28]. Furthermore, resting stages and spores cannot usually be identified using traditional methods and, consequently, morphology-based techniques fail to adequately represent the range of microalgal diversity in the natural environment [29]. DNA metabarcoding using high throughput sequencing (HTS) represents a powerful method for the detection of rare species [25, 29]. In this study, we used metabarcoding to investigate green algal (Viridiplantae) DNA sequence diversity present in sediments obtained from three lakes on Vega Island, generating new information about the biodiversity of these poorly known lakes.

Methods

Study area

Vega Island is located in the James Ross Archipelago, of the north-east tip of the Antarctic Peninsula. It has an area of 253 km2, more than 80% covered by permanent ice. The climate of the island is cold and semi-arid, with precipitation (water equivalent) of ca. 300–500 mm year−1 [30] and a mean annual temperature of ca. − 7 °C [31]. Sediment samples were obtained from three freshwater lakes on Vega Island: Esmeralda (63° 52′ 21.4″ S; 57° 36′ 24.1″ W), Copépodo (63° 52′ 41.9″ S; 57° 35′ 43.8″ W) and Pan Negro (63° 52′ 04.6″ S; 57° 37′ 12.6″ W). The three lakes are located at Cape Lamb, at the south-west tip of Vega Island (Supplementary File 1). The cape has the largest ice-free area on the island, with a maximum elevation of 482 m a.s.l [10]. All three lakes are surrounded by moraine deposits and located in endorheic basins, which retain water from snow melt in their catchments and have no outflow. A summary of lake characteristics is given in Table 1.

Table 1 General characteristics of the three lakes sampled on Vega Island

Sampling

Sampling was conducted during the austral summer of 2016/17, from the littoral zone of each lake. One sediment core of 30 cm length was collected manually from each lake using PVC pipes (60 mm diameter × 50 cm length) previously disinfected to avoid contamination following the protocol described by [11]. The cores were kept at − 20 °C in sterilized plastic bags after collection and during transportation to the laboratory at the Federal University of Minas Gerais, Brazil. There, the cores were thawed and cut into 5 cm sections; only the top, middle and base sections of the core were used for analysis. One subsample of each core section (top, middle and base) was processed together into the same DNA extraction in order to increase DNA yield. Water physical and chemical parameters (temperature, pH, electrical conductivity and total dissolved solids) were measured in situ simultaneously with sediment sampling in each lake, using a Hanna multi-parameter probe HI 9828 (Hanna Instruments, USA).

DNA extraction, Illumina library construction and sequencing

Total DNA was extracted from ca. 0.5 g of soil using the QIAGEN Power Soil Kit (QIAGEN, Carlsbad, USA), following the manufacturer’s instructions. DNA quality was analyzed by agarose gel electrophoresis (1% agarose in 1 × Trisborate-EDTA) and then quantified using the Quanti-iT™ Pico Green dsDNA Assay (Invitrogen). The internal transcribed spacer 2 region (ITS2) of the nuclear ribosomal DNA was used as a DNA barcode for molecular species identification [32] using the universal primers ITS3 and ITS4 [33]. Library construction and DNA amplification were performed using the Herculase II Fusion DNA Polymerase Nextera XT Index Kit V2, following the Illumina 16S Metagenomic Sequencing Library Preparation Part #15044223 Rev. B protocol. Paired-end sequencing (2 × 300 bp) was performed on a MiSeq System (Illumina) by Macrogen Inc. (South Korea).

Data analysis and taxa assignment

The PLANiTS2 database [34] was used for identification of the Viridiplantae sequences obtained. Raw fastq files were filtered using BBDuk version 38.34 (BBMap – Bushnell B. – sourceforge.net/projects/bbmap/) to remove Illumina adapters, known Illumina artefacts and the PhiX Control v3 Library. Quality read filtering was carried out using Sickle version 1.33 -q 30 -l 50 [35] and to trim 3ʹ or 5ʹ ends with low Phred quality score. Sequences shorter than 50 bp were also discarded. The remaining sequences were imported to QIIME2 version 2019.10 (https://qiime2.org/) for bioinformatics analyses [36].

For assessing the sequence data obtained against the PLANiTS2 database, the pipeline was executed for merged paired-ended sequences with the following plug-ins: vsearch join-pairs [37], vsearch dereplicate-sequences, quality-filter q-score-joined [38], vsearch cluster-features-de-novo 97% identity limit and vsearch uchime-denovo. Taxonomic assignments were determined for operational taxonomic units (OTUs) using the feature-classifier [39] classify-sklearn against the PLANiTS2 database [34] trained with Naïve Bayes classifier. For comparative purposes we used the number of DNA reads as a proxy for relative abundance [40], and each OTU was considered as a different taxon. It is important to recognize that the assignment of a detected DNA sequence does not confirm the presence of the organism in a viable form, while assignment is also limited by the level of diversity coverage of the available databases [23, 41]. Taxa assignment into higher taxonomical ranks followed Leliaert et al. [12] and Algaebase [42].

Diversity analyses

Diversity analyses were performed using R software version 4.0.2 [43]. Rarefaction curves were prepared with the iNEXT (iNterpolation and EXTrapolation) package [44]. Alpha diversity was assessed using Shannon (H’) and Pielou (J’ Equitability) indices and natural logarithms. For assessment of dissimilarity between locations, the Bray–Curtis Index was used. A Venn diagram was also prepared to compare OTU occurrences across the three lakes [45].

Results

The rarefaction curves reached asymptote, indicating that the sampling effort was sufficient to represent the sequence diversity present within the DNA reads at each sampling location (Supplementary File 2). A total of 1,194,202 paired-end ITS2 DNA reads were generated in the sequencing run, of which 243,373 remained after quality filtering. The large majority of these reads represented other taxonomic groups not considered in the present study (e.g., fungi). Pan Negro Lake generated the highest number of green algal DNA reads (11,205), followed by Esmeralda (9085) and Copépodo (1595) Lakes. Esmeralda Lake contained the highest number of green algal taxa assigned (59), with Copépodo and Pan Negro Lakes each containing 30 taxa. Altogether 78 taxa were assigned in the three lakes (Table 2).

Table 2 Taxa of green algae and their respective numbers of DNA reads assigned in sediments obtained from three lakes on Vega Island, Antarctica

The highest diversity of green algae in Esmeralda Lake was corroborated by its Shannon Index (2.92), followed by Copépodo Lake (2.63). The lowest Shannon Index (1.50) and Equitability Index (0.44) were obtained in Pan Negro Lake. In this lake, the single Chlamydomonadales species Chlorococcum microstigmatum contributed 60% of the total number of DNA reads (Fig. 1), decreasing the evenness; in the other lakes, the distribution of DNA reads among the different taxa was more uniform, resulting in an Equitability Index above 0.70 (0.72 at Esmeralda Lake and 0.77 at Copépodo Lake) (Fig. 1).

Fig. 1
figure 1

The highest contributing green algae in sediments from three lakes on Vega Island, Antarctica, in terms of DNA reads (%) (the six taxa shown for each lake represent at least 60% of the total local abundance). Color bar: green = Class Chlorophyceae; gray = Class Trebouxiophyceae; blue = Class Ulvophyceae

All green algal taxa assigned in this study belonged to Phylum Chlorophyta, representing three classes (Chlorophyceae, Trebouxiophyceae and Ulvophyceae) and nine orders (Fig. 2). Of the 78 taxa assigned, 25 (32%) are not listed in the available literature and are likely first records from Antarctica. These included the genera Chlamydocapsa, Paulschulzia, Auxenochlorella, Didymogenes, Jaagichlorella, Desmococcus, Lobosphaera, Ploeotila and Desmochloris (Table 2).

Fig. 2
figure 2

Relative contribution of green algal taxonomic orders in sediments from three lakes on Vega Island, Antarctica, based on numbers of taxa and of DNA reads

The Order Chlamydomonadales (Chlorophyceae) represented the highest contribution in terms of numbers of both taxa and DNA reads in all lakes (Fig. 2). Its relative abundance was above 80% in Copépodo and Pan Negro Lakes. In Esmeralda Lake, the relative contribution of the different orders was more uniform compared to Copépodo and Pan Negro Lakes, both in terms of number of taxa and number of reads (Fig. 2).

Eleven taxa (14%) were present in all three lakes (Fig. 3). Pan Negro Lake was the most dissimilar, presenting Bray–Curtis Index values of 0.88 and 0.83 with Copépodo and Esmeralda Lakes, respectively; between these two lakes, the Bray–Curtis Index was 0.79.

Fig. 3
figure 3

Venn diagram showing the number of green algal taxa assigned in sediments obtained from three lakes on Vega Island, Antarctica. The names of the taxa detected in all three lakes are shown

Discussion

This study revealed a relatively high diversity of green algal sequences in sediments of Antarctic lakes (78 taxa), while recognizing that the assignment of a DNA sequence does not confirm the presence of the organism in a viable form [26, 44]. In general, green algal richness reported in previous studies of Antarctic algae from different habitats has ranged from 12 to 48 taxa, in studies based on morphological data [4,5,6,7] or from isolated cultures [8]. Consistent with this previous literature, our study confirmed the dominance of Chlorophyta and its “UTC” core group (Ulvophyceae, Trebouxiophyceae and Chlorophyceae) over green algae representing the Streptophyta [12].

A majority of the green algal taxa assigned here have previously been reported in Antarctic studies either using traditional morphological approaches or in metabarcoding studies [23, 42]. Amongst the potentially new Antarctic records, the majority of taxa originate in the Northern Hemisphere [42]. Incorrect taxonomic assignments (e.g., synonyms) may be a factor here. For example, Hoham & Mullet [46] consider Scotiella cryophila, assigned here as a first Antarctic record, a synonym of Chloromonas nivalis and S. antarctica, known Antarctic species [4].

The green algal diversity assigned in the sediments examined here included both typically planktonic genera (e.g., Chlamydomonas, Desmodesmus, Coenochloris, Micractinium, Koliella) [4, 7], genera typically growing on substrates or found in lichenized forms (e.g., Stigeoclonium, Stichococcus, Bracteacoccus, Prasiola, Elliptochloris, Chlorococcum, Coccomyxa, Lobosphaera, Myrmecia, Trebouxia, Desmochloris, Ulothrix, Pseudendochlonium) [5, 6, 47] and snow algal genera (e.g., Scotiella, Chloromonas, Chlamydomonas) [17, 48]. Some marine taxa (e.g., Pseudendoclonium) were also detected, which is not uncommon in Antarctic studies of this type [23] considering the common occurrence of high winds and local bird populations that can link terrestrial/freshwater and marine environments.

The three maritime Antarctic lakes studied here shared abiotic features more similar to some continental Antarctic lakes [49], especially in terms of high values of electrical conductivity (680 to 1250 µS cm−1 in our study). As they have no outflow, over time they will concentrate salts washed into them in melt water, leading gradually to increased salinity. On King George Island, located north-west of the Antarctic Peninsula, most lakes have electrical conductivity around 130 µS cm−1 [3]. As our lakes are not directly exposed to sea spray, these differences when compared to other maritime lakes from the western side of the Peninsula may also be contributed to by the low precipitation that is typical of the Vega Islands, where they are located. Copépodo Lake, in particular, showed the most extreme conditions, with the highest electrical conductivity (1250 µS cm−1) and an alkaline pH (8.1). Copépodo Lake generated the lowest number of DNA reads in our study; it is possible that these more extreme conditions may limit the diversity of green algal communities in this lake.

The use of DNA metabarcoding as applied here was effective in the detection of rare species, enhancing knowledge of potential green algal diversity in Antarctic environments. Câmara et al. [23] detected 65 distinct green algal OTUs in soil samples from two protected and non-protected sites on Deception Island (South Shetland Islands) using the same approach, of which 71% (45 taxa) were common to both sites. In contrast, in the current study, only 14% of the taxa detected were shared by all three lakes, highlighting the dissimilarity (high beta diversity) among the lakes studied. The geological and hydrological differences among lakes might have contributed to this relatively high dissimilarity. However, we recognize that the limited replication possible in the current study does not permit detailed examination of the diversity differences between the lakes, and that future studies to understand the nature of dissimilarity between such lakes will require greater replication.

Pan Negro Lake, which had the same taxon richness as Copépodo Lake, generated the highest number of DNA reads, and was the most distinct lake. The number of taxa in this lake is likely to be underestimated, as 17% of the DNA reads were identified within the Order Chlamydomonadales, and not to species level. This Order originally comprised flagellated green algae (e.g., Chlamydomonas, Chloromonas); later, molecular studies expanded it to include some coccoid taxa, such as the genus Chlorococcum [50]. This genus was particularly abundant in the lakes studied here, especially Pan Negro Lake. Chlamydomonas spp. also dominate in the phytoplankton of Antarctic hypertrophic lakes [7]; the presence of the Chlamydomonas group at high relative abundance in these lake sediments is consistent with them acting as a repository of genetic information on taxa present in the overlying water column. Members of Chlamydomonas are frequently reported in Antarctic phycological studies, with many species well adapted to low-temperature environments [20]. In the current study, besides C. nivalis, a key component of snow algal communities [16, 21], the species C. raudensis was among the most abundant in all three lakes. This psychrophilic species (also referred to in the literature as Chlamydomonas sp. UWO231) was originally described from samples collected in the permanently ice-covered Lake Bonney in the Victoria Land Dry Valleys [19]. It is very plausible that other as yet unknown species of green algae are present in the lakes of Vega Island, and not represented in current sequence databases.

Conclusions

Diverse assemblages of green algal DNA sequences were detected in sediments obtained from three lakes on Vega Island, including many assignments to taxa not previously recorded from the region. The study gives information about this part of the Maritime Antarctica and confirms the utility of DNA metabarcoding as a tool for assessing potential green algal diversity in Antarctic lakes, including the recognition of taxa not previously recorded from the continent. There is a clear need for increased effort to collect and sequence green algal voucher specimens in this region in order to reconcile molecular and morphological diversity studies, and confirm the presence of currently unrecorded or undescribed species as strongly implied by our data.