Introduction

The successful conservation and management of threatened wildlife requires accurate information on population abundance, structure and genetic variability. This kind of data can be difficult to obtain, especially when dealing with rare and elusive species. Advances in molecular ecology and data analysis offer accurate tools to study and monitor wild populations in a non-invasive way (Koelewijn et al. 2010; Arandjelovic et al. 2011; Caniglia et al. 2012; Brzeski et al. 2013; Pennell et al. 2013; Silveira Trinca et al. 2013).

After a strong decline during the 1980s, the Eurasian otter (Lutra lutra) was recovered in many European countries, and it has been recently downgraded from the ‘Vulnerable’ to the ‘Near Threatened’ category in the IUCN Red List (Ruiz-Olmo et al. 2008). In Italy, the species is considered rare, and it is still listed as ‘Endangered’ in the national Red List (Rondinini et al. 2013). The Eurasian otter disappeared from northern Italy during the 1980s (Macdonald and Mason 1983; Cassola 1986; Macdonald and Mason 1994), and its decline continued in the central and southern regions all over the 1990s while the other European populations were already recovering (Conroy and Chanin 2000). By 1990, its distribution was confined to the south with a small geographically and genetically isolated population (Spagnesi et al. 2000; Randi et al. 2003). The current otter’s presence in Italy consists of two subpopulations: a larger core in the south (Campania, Puglia, Basilicata and Calabria regions) and a small nucleus in the Abruzzo and Molise regions (south-central Italy) (Loy et al. 2004, 2010; Prigioni et al. 2007; Panzacchi et al. 2010). Based on the non-invasive genetic sampling (NGS) data, the otter population in Italy was estimated at 229–257 individuals (Prigioni et al. 2006). According to recent unpublished data, the south-central remnant population is now expanding northward, but the lack of detailed information about its structure and biology represents a serious impediment to the adoption of appropriate management actions and conservation plans. As direct observations are almost unfeasible due to the nocturnal habit of the species, NGS represents a valuable opportunity to get precious data on the status and structure of local populations. This is particularly important in Italy where the Eurasian otter shows the highest genetic differentiation among the European populations. This indicates that local adaptations might have developed there, and thus, Italian populations should be considered as an evolutionary significant unit (ESU) and should also be of special conservation interest due to their uniqueness (Randi et al. 2003; Mucci et al. 2010). The Eurasian otter is a suitable species for applying NGS due to the typical scat marking behaviour used for intra-specific communication (Chanin 1985; Gorman and Trowbridge 1989; Kruuk 2006). Eurasian otters use two main types of scent marking signs: spraints (faeces) and anal jellies (Kruuk 2006). Spraints consist mostly of food remains, while jellies are mucous secretions produced by their characteristic anal glands. Both spraints and jellies can be used for genetic analysis with different amplification success rates (Fike et al. 2004; Hájková et al. 2006; Lampa et al. 2008; Hájková et al. 2009). The main challenge in NGS studies concerns the reliability of results due to the high levels of genotyping errors and the low levels of genotyping success (Buchan et al. 2005; Waits and Paetkau 2005; Hájková et al. 2009). Many factors can influence genotyping success rates including sample type (Coxon et al. 1999; Hájková et al. 2006; Lampa et al. 2008), environmental conditions (Nsubuga et al. 2004), storage time and mediums (Frantz et al. 2003; Lampa et al. 2008), diet (Murphy et al. 2003) and DNA extraction and amplification protocols (Lampa et al. 2008). Non-invasive samples, especially faecal samples, usually contain a low quality and quantity of target DNA and high amounts of PCR inhibitors and non-target DNA resulting in low genotyping success and high error rates. Two main types of genotyping errors are linked to NGS: allelic dropout (ADO), stochastic failure of one allele of heterozygote to amplify, and false allele (FA), an amplification artefact. In both cases, the result is a mistakenly identified genotype. Genotyping errors have to be quantified and incorporated into subsequent analysis for population studies (Bonin et al. 2004; McKelvey and Schwartz 2004; Hájková et al. 2006). In this context, L. lutra is one of the most problematic species as the DNA in scats is exposed to a very quick degradation process and the DNA amplification success rate is usually very low (Jansman et al. 2001; Dallas et al. 2003; Hung et al. 2004; Prigioni et al. 2006; Ferrando 2008; Lampa et al. 2008; Hájková et al. 2009).

Every step from sample collection to DNA amplification presents some critical aspects that can potentially influence genotyping success. Our aim is to test and compare different types of non-invasive samples, sample ages, storage times, storage buffer mediums, extraction kits and nuclear microsatellite loci in order to optimize NGS protocols and choose the most appropriate microsatellite loci panel for genotyping wild Eurasian otters.

Materials and methods

Study area

The study has been conducted in the Sangro basin in central Italy (Fig. 1) which was recently re-colonized by the Eurasian otter after the species was declared extinct in the region during the 1980s (Cassola 1986). This area represents the northern boundary of the otter range in Italy. The Sangro River flows for about 122 km from 1,441 m slm to the Adriatic Sea, and the basin covers 1,545 km2. Spraints and anal jellies were collected along the Sangro River and its main tributaries. Marking sites were detected during a pilot study (from May to October 2010; Lerone et al. 2011) through linear transect sampling run according to the European standard survey method (Reuther et al. 2000).

Fig. 1
figure 1

Sampling sites and sites providing successfully genotyped individual otters in the Sangro river basin

Sample collection and storage

Sampling season extended from April to September 2011. Almost no otter signs were found under bridges or on river banks, while most of the marking sites were located on rocks in the middle of the water course. Despite the fact that temperature could be a critical issue for DNA degradation (Hájková et al. 2006), we decided to collect samples during the warm period as most of the marking sites were unreachable during winter. The exposure of samples to UV light, wind, rain, etc. was highly variable among sites. Before each sampling, old spraints and jellies were removed from each site. Sites were then checked daily until fresh samples were collected within 24 h of deposition. According to the estimated age, the samples were categorized as (i) ‘very fresh’—the moist samples collected within 24 h from deposition (48), (ii) ‘fresh’—the samples (deposed within 24 h or not) that appeared moist below the surface (13) and (iii) ‘medium/old’—the samples with uncertain deposition that did not appeared completely dry (5). The medium/old samples were collected at less responsive marking sites. The sampling started early in the morning and went on until the afternoon with the collection time recorded along with the mean daily temperature and the humidity as recorded by the nearest meteorological station in the region (Abruzzo). Samples were classified as spraint, jelly or mixed spraint (jelly in addition to prey remains; Hájková et al. 2009) and immediately stored in ethanol 96 % in 1.5 ml tubes in an ice box and later preserved at −20 °C (during the same day). A subset of 30 samples was divided in two halves with one half stored in ethanol 96 % and the other in white cell lysis buffer (WCLB) in order to test for the effect of the conservation buffer medium on PCR amplification success.

Also, the time period between sample collection and DNA extraction, defined as storage time, can influence PCR amplification success (Frantzen et al. 1998; Murphy et al. 2007; Lampa et al. 2008), and it was recorded and analyzed.

DNA extraction

As Lampa et al. (2008) reported a significant difference in the amplification success rate between two different extraction methods (Chelex® 100 and QIAmp® DNA Stool Mini Kit), DNA from 13 non-invasive samples was extracted according to the manufacturer protocols of two different kits, DNeasy Blood and Tissue Kit (Qiagen®) and ZR Genomic DNA II Kit (Zymo Research®). The 13 samples were divided into two parts each, and subsequently, each of the two parts was extracted by a different kit. The successive samples were only extracted with the better performing kit (DNeasy Blood and Tissue Kit, Qiagen®). Negative controls were included in each extraction to monitor contamination.

DNA amplification and genotyping

We analyzed otter samples with two sets of nuclear microsatellite loci. In particular, we genotyped six loci ‘Lut’ (set1: Lut453, Lut604, Lut701, Lut832, Lut833 and Lut902; Dallas et al. 1998) already used in many previous studies (Dallas et al. 2002; Kalz et al. 2006; Hájková et al. 2009) and seven loci ‘OT’ (set2: OT04, OT05, OT07, OT14, OT17, OT19, OT22; Huang et al. 2005) only recently used for L. lutra (Koelewijn et al. 2010; Quaglietta et al. 2013), but never explicitly compared to Lut loci. Due to the low genetic variability of our small otter population, we had to use both sets of loci for individual recognition. We defined successfully genotyped samples as those where all the 13 microsatellite loci revealed complete genotype. We estimated the theoretical probability of identity (PID), which is the probability that two individuals randomly sampled from a population share the same genotype, by using GeneAlEx v.6 (Peakall and Smouse 2006). All samples were initially screened for three microsatellite loci (OT07, OT14 and OT17) with four independent repeats per locus. Due to the high cost of analysis and the low quality of non-invasive samples, only samples with 50 % or more positive screening PCRs (Peatkau 2003) were further amplified four times at each of the remaining ten microsatellite loci and at ZFX/ZFY loci for genetic sexing (Mucci and Randi 2007). All samples which provided less than 50 % of positive screening PCRs were ‘negative’, and all the others were finally successfully genotyped and considered ‘positive’. Negative controls were always included to monitor for contaminations. PCRs were performed in 10-μl volumes using 2 μl of DNA solution, 1 μl of PCR buffer 10× (1.5 mM of MgCl2, 5 PRIME-Eppendorf), 1 μl of bovine serum albumin 0.2 % (BSA), 0.4 μl of 2.5 mM dNTP mix, 0.15 μl of each primer (10 μM), 0.25 units of Taq (5 PRIME-Eppendorf) and 5.25 μl of RNA-free water. In order to optimize the costs and the time spent for analyses, we ran five multiplex PCR (M1 included Lut453, OT05 and OT22; M2 included Lut833 and OT19; M3 included Lut604 and Lut 701; M4 included Lut832 and Lut902; and M5 included OT17 and OT07) and one simplex PCR for OT04. Microsatellite loci analyzed within each multiplex PCR were chosen on the basis of their allelic range using Multiplex Manager Software v.1.0 (Holleley and Geerts 2009) and then tested on tissue samples before using non-invasive samples. We decided to perform simplex PCRs for OT04 due to its low amplification rate in multiplex conditions. When further amplifications were needed to verify data, only simplex PCR was used. Amplifications were performed using the following thermal profile: a first denaturating step at 94 °C for 2 min, 45 cycles at 94 °C for 40 s, 55 °C for 60 s, 72 °C for 90 s and a final extension step at 72 °C for 10 min. PCR products were electrophoresed with GeneScan-350 ROX (Applied Biosystem) as marker ladder in an ABI 3130XL sequencer, and allele sizes were scored using the software GENEMAPPER v.4.0® (Applied Biosystems). Consensus genotypes were reconstructed using the GIMLET software v.1.3.3 (Valiere 2002) and accepted heterozygotes only if the two alleles were seen in at least two replicates and homozygotes only if the same homozygous genotype was seen at least in four replicates. Uncertain loci were additionally replicated another four times with a simplex PCR protocol. False allele and allelic dropout rates were estimated using GIMLET software v.1.3.3, and allele frequencies and unique genotypes were identified by GenAlEx v.6.

Data analysis

We compared the two subset of samples stored in different buffer mediums and extracted by different kits by using the McNemar test for paired samples. We carried out a multifactor analysis on the whole data set to test the parameters that could affect genotyping success and PCR success rate, i.e. sample type (jelly, spraint and mixed spraint), sample age (very fresh, fresh and medium), collection time (h), mean daily temperature (°C) and humidity (%) and storage time (days). Generalized linear models (GLMs) were run on both the whole sample (successfully and unsuccessfully genotyped samples) and on the subset of successful samples by using the genotyping success (positive-negative) and the PCR success rate as the response variable respectively. Despite the fact that the PCR success rate should have a more informative value, we also used the genotyping success as a dependent variable to analyze the entire data set in order to predict significant factors to be evaluated during sampling. Multivariate models were compared using the Akaike information criterion (AIC) with the lowest value indicating the best fitting model. Statistical analyses were performed using the free software R v.3.0.2.

Results

Sample collection, storage and DNA extraction

A total of 191 samples were collected including spraints (n = 137), jellies (n = 29) and mixed spraints (n = 25). Of these, 113 were very fresh, 45 fresh and 32 medium/old samples. Storage time ranged from 3 to 24 days. Out of the 30 samples stored in two different storage mediums, 14 (46.7 %) samples stored in ethanol and 6 (20.0 %) stored in WCLB provided complete genotypes. Despite the difference being poorly significant, the results of McNemar’s chi-squared test for paired data (X 2 = 3.68, p = 0.05) indicates ethanol 96 % as a slightly better storage medium for our samples. Nevertheless, this result will need further confirmation since our small sample size (n = 30) could hide both a stronger difference and an equivalence between the two buffers. Only samples stored in ethanol 96 % were included in GLM analysis.

Despite having a genotyping success for the samples extracted with DNeasy Blood and Tissue Kit (Qiagen®) and ZR Genomic DNA II Kit (Zymo Research®) of 53.84 and 15.38 % respectively, statistical testing did not show any significant difference between the two kits (McNemar’s chi-squared test for paired data X 2 = 0.5, p = 0.48), which is likely due to the limited sample size (n = 13). However, we decided to continue the DNA extraction with the better performing Blood and Tissue Kit (Qiagen®). Only samples extracted by Qiagen kit were included in GLM analysis.

DNA amplification and genotyping

We performed a total of 7 158 PCR amplifications, and we completely genotyped 35.1 % (n = 67) of the total samples. The highest genotyping success was obtained from pure jellies (79.3 %) followed by mixed spraints (28.0 %) and spraints (27.0 %). We also obtained significant differences in the genotyping success among samples of different age with very fresh samples providing the highest success rate (42.5 %) followed by fresh (28.9 %) and medium/old (18.7 %) samples (Table 1).

Table 1 Percentage of genotyping success, PCR amplification success rate, number of total PCRs performed (N PCRs), allelic dropout rate (ADO) and false alleles rate (FA) for different sample types and ages

All loci, but one (Lut453), were polymorphic, allele number ranged from one to four and the size of alleles ranged from 120 to 227 bp (Table 2). The OT loci were more variable than Lut loci as we identified a mean number of alleles per locus of 2.57 and 2.17 respectively (Table 2). In order to compare the performance of the 13 single loci, the mean values of PCR success rate and the ADO rates were reported in Table 3. The ADO rate differed significantly among different sample types (Kruskal-Wallis test: H = 11.4, p = 0.003). Jelly was the sample type with the lowest ADO rate.

Table 2 Used microsatellite loci, length of their alleles (bp) and their frequencies in studied population
Table 3 Comparison of the performance of the 13 single loci

As the allele size may influence the amplification success and ADO rates (Buchan et al. 2005; Broquet et al. 2007), we analyzed the effect of the mean molecular weight of DNA fragments on PCR success, ADO and FA rates. No relationship was found between fragment weight and allelic dropout rate (t = 1.26, p = 0.16) or false allele rate (t = −1.32, p = 0.21) while PCR success rate decreased significantly with increased median allele size (r 2 = 0.88, p = 1.4e-06).

Factors affecting genotyping and PCR success rate—multifactor analysis

Considering the genotyping success as the binary response variable (0 negative, 1 positive), the GLM multifactor analysis ran on the whole data set (positive and negative samples) showed that sample type and age as well as collection time and temperature significantly influence the genotyping success (Table 4). Jellies had a higher genotyping success followed by mixed spraints and spraints, while the genotyping success decreased with increasing mean daily temperature. Sample age and collection time had a lower influence on the response variable; nevertheless, the genotyping success was higher for very fresh samples collected earlier during the day. Storage time and mean daily humidity were not significant factors.

Table 4 Factors affecting genotyping success (binary response variable) in otter faeces genotyping according to the best model of generalized linear model (GLM) analysis. Successfully and unsuccessfully genotyped samples were included

Considering the PCR success rate as the response variable, the GLMs performed on the successfully genotyped data set highlighted a significant influence of sample type and age as well as mean daily temperature and humidity (Table 5). For the genotyping success, the PCR success rate was higher for jellies than for mixed spraints and spraints. Amplification success was also higher for very fresh samples. The PCR success rate significantly decreased with increasing mean daily humidity, while it increased with high temperatures. The collection time had no significant effect on the genotyping success.

Table 5 Factors affecting PCR success rate (response variable) in otter faeces genotyping according to the best model of generalized linear model (GLM) analysis. Only successfully genotyped samples were included

Individual identification

We analyzed the theoretical probability of identity among unrelated (PID) or sibling individuals (PIDsibs) (Fig. 2). Six microsatellite loci (Lut701, OT22, OT 17, OT19, Lut 833 and OT04) should be theoretically sufficient to distinguish the individual unrelated otters (PID = 0.001), while 13 loci were needed to distinguish sibling otters (PIDsibs = 0.002) with high probability (>99 %). All loci except OT17 resulted in Hardy-Weinberg equilibrium after the Bonferroni correction, and only nine of a total of 78 comparisons between loci resulted in linkage disequilibrium.

Fig. 2
figure 2

Probability of identity for unrelated individuals (PI) and for siblings (PIsibs) based on allele frequencies. The PI values by locus are reported on the right

Results showed the presence of at least 11 individual otters in the sampling area with a number of captures per individual ranging from 1 to 23. All individuals had a complete genotype (13 microsatellites) and were recognized as homozygotes at a particular locus if at least four independent PCR gave a coherent result. Molecular sexing was carried out according to the protocol of Mucci and Randi (2007). False allele errors for ZFX/ZFY sequences (119 and 178 bp) did not occur, while we found a mean ADO rate of 0.337 and 77 % of positive PCRs. We ascertained the presence of five males and four females and two individuals with uncertain molecular sexing for which only two successful PCR indicated homozygosity (females), which was not enough to accept the result as definite.

Discussion

Although our sampling extended during the warm period and almost no otter signs were found under bridges or on river banks causing our samples to become really exposed to environmental conditions, we succeeded in obtaining good results to characterize the otter population in the study area. The genotyping success on non-invasive samples in our study was 35.1 % which is a value falling within the range of prior studies on river otters (Dallas et al. 2003; Hung et al. 2004; Kalz et al. 2006; Prigioni et al. 2006; Arrendal et al. 2007; Ferrando et al. 2008; Lanszki et al. 2008; Hájková et al. 2009; Mowry et al. 2011). Jellies collected within 24 h from deposition had the highest genotyping success rate (79.3 %) and was consistent with previous studies which also reported a higher genotyping success for anal jelly samples (Coxon et al. 1999; Lampa et al. 2008; Hájková et al. 2009; Mowry et al. 2011). We estimated the rates of the main genotyping errors, allelic dropouts and false alleles in order to avoid unreliable results (Kalz et al. 2006; Hájková et al. 2009; Spiering et al. 2009, Bjǒrklund 2010; Guertin et al. 2012). The mean frequencies of ADO and FA among samples in our study were 14.4 and 0.3 %, respectively, with ADO rates being lower than that found in previous studies (Ferrando et al. 2008; Lampa et al. 2008; Hájková et al. 2009). Despite the extraction methods possibly influencing the genotyping success (Lampa et al. 2008), we did not find any significant differences between extractions performed with DNeasy Blood and Tissue Kit (Qiagen®) and ZR Genomic DNA II Kit (Zymo Research®). A marginally significant difference was found among the two compared storage mediums with the ethanol performing better than WCLB. Nevertheless, this topic needs further investigation since our small sample size could have influenced the result.

Besides sample type and age, the most important factors affecting the genotyping success were the mean daily temperature and the collection time. Increasing temperatures significantly reduced the genotyping success in our study, and Nsubuga et al. (2004) also recorded a decreasing DNA amount with increasing temperature in mountain gorilla scat samples. While Hájková et al. (2006) reported a lower PCR success rate for samples collected at high temperatures, we had a positive effect on PCR success. This result appears in contrast with the effect of temperature on the genotyping success and with what was expected since the high temperatures should cause an increase in hydrolytic enzyme activity (Hájková et al. 2006). It could be possible that temperature effect was confounded with other undetected environmental factors.

In accordance with Hájková et al. (2006), we recorded no effect of storage time on genotyping success. The range of storage times was too narrow to test the effect on PCR success rate. Lampa et al. (2008) observed a drastic decrease in amplification success (from 80 to 63 %) after 1 week, but in our data set, only a few samples exceed 7 days of storage.

We found that humidity negatively influenced the PCR success rate, as reported for other carnivores (Farrell et al. 2000; Murphy et al. 2007) and was expected for the increased activity of enzyme and bacteria in moist conditions (Lampa et al. 2008). As for genotyping success, jelly samples presented the highest PCR success rate. The high PCR success of jellies compared with mixed spraints seems to support the distinction made by Lampa et al. (2008) between the two types of mucous secretions found in jellies and in ‘spraints plus mucus’.

We analyzed seven Lut and six OT microsatellite loci in order to highlight the better performing microsatellites. In small and isolated otter population, we studied had a low genetic variability, and the number of alleles per locus was lower than in other studies on both L. lutra and Lontra canadensis (Hájková et al. 2009; Brzeski et al. 2013; Cohen et al. 2013). OT19 had the highest number of alleles (4) while Lut453 was monomorphic.

In accordance with Buchan et al. (2005), we detected an influence of DNA fragment size on genotyping success. On the contrary, we did not observe any increase in the ADO and FA rate with increasing fragment size.

Direct observation or radiotracking of rare and elusive species is difficult and expensive, but the need for demographic and ecological data is critical for monitoring and conserving wild populations. The traditional spraint survey method used to estimate abundance in Eurasian otter populations (Reuther et al. 2000) is quickly being improved by the addition of non-invasive genetic data. Non-invasive genetics has the potential to provide more detailed data about population abundance, structure, sex ratio, spatial organization and degree of kinship, but there is also the need to increase the success rate and the reliability of information that can be obtained by this approach.

In conclusion, we found the sample type and age, air temperature and humidity and collection time as important factors to account for when planning a non-invasive genetic sampling of Eurasian otter wild populations. Nevertheless, previous studies (Lampa et al. 2008; Bonesi et al. 2012) highlighted also other important factors such as time to extraction, sex and dominance factors. Further investigations of environmental factors, DNA analysis protocols and otter marking behaviour may provide valuable improvements in non-invasive genetic studies. The development of new microsatellite primers could greatly improve the individual identification in small and little variable populations which are the most urgent target of conservation biology.