Introduction

The Pacific white shrimp Litopenaeus vannamei (L. vannamei) is native to the Pacific coast of Mexico southward to Peru where the water temperatures normally higher than 20 °C throughout the year. As L. vannamei shows fast growth, relatively low protein requirements (20–25%), and ability to adapt well to variable environmental conditions, it is now becoming the predominant aquaculture shrimp farmed in China, Southeast Asia, South, and North America (Mathew Briggs et al. 2005). The total production was 3,668,681 t in 2014 which accounts for >70% of the worldwide farmed shrimp production (FAO). Like the other animals, the L. vannamei exhibited sexual dimorphism in growth trait (Gitterle et al. 2005). The female individuals are bigger than the males after sexual maturity (Pérez-Rostro CIaI, A.M. 2003). Clarification of the sex-determining system will not only enrich the knowledge in crustacean but also benefit the shrimp aquaculture industry towards the production of mono-sex stocks or sex-controlled stocks (Palaiokostas et al. 2013b).

Sex determination is an important area of study in developmental, evolutionary biology and ecology (Charlesworth and Mank 2010). The mechanisms of sex determination in animals are remarkably diverse and were controlled by genetic and/or environmental factor. The genetic mechanism of sex determination commonly shows either XX/XY (male heterogamety), or ZZ/ZW (female heterogamety) mechanism. In eutherian mammals, sex determination is defined by the male heterogametic XX/XY system (Charlesworth and Mank 2010); while in birds (Stevens 1997), lepidopteran insects (Sanchez 2008), and some teleost fish (Chen et al. 2014; Cnaani et al. 2008; Koyama et al. 2015), the sex determination system is principally female heterogametic ZW/ZZ system. In order to clarify the sex determination mechanism in a species, the first step is to map the sex-determining region. Previous reports showed that sex could be controlled by a major sex determination gene in one chromosome or multiple genes in different chromosomes (Gamble and Zarkower 2014). For penaeid shrimp (Decapoda), as non-environmental sex determination has been reported, the sex might be controlled mainly by genetic factor (Benzie 1998; Campos-Ramos et al. 2006; Legrand et al. 1987). The linkage-mapping analysis of Penaeus monodon, Penaeus japonicus, and L. vannamei detected major sex-determining QTLs in one LG (Alcivar-Warren et al. 2007; Du et al. 2010; Li et al. 2003; Pérez et al. 2004; Robinson et al. 2014; Staelens et al. 2008; Zhang et al. 2006), which give evidence to the existence of major sex-determining gene and sex chromosome. The sex-determining region of P. monodon was mapped in a narrow region on LG 30 based on the constructed high-density linkage map (Robinson et al. 2014); however, the marker showing the strongest association with sex was not completely linked to sex. Meanwhile, the sex-determining region in P. japonicus was only mapped in a large region of LG 28 and was still not fine-mapped (Li et al. 2003). Several studies have identified the QTLs of sex determination in L. vannamei, the mapped region was still almost 20 cM and none sex-specific marker was identified (Du et al. 2010). So, more detailed mapping studies are needed to identify the region and genes associated with sex determination.

Linkage mapping and association mapping are two major strategies to identify genetic loci and markers for interested traits. In previous research, mapping of sex-determining loci was conducted by AFLP, RAPD, or SSR markers. To map the sex-determining region or markers accurately, a large amount of markers were needed. With the development of next generation sequencing, markers genotyping such as SNPs and Indel became easier. Among the genotyping method based on next generation sequencing, one effective method was called SLAF-seq, which has proved to be effective in marker-genotyping and linkage map construction (Sun et al. 2013; Zhang et al. 2013). Recently, a high-density linkage map with 6146 marker mapped to 44 sex-averaged linkage groups has been constructed for L. vannamei in our group, the average marker distance was 0.7 cM (Yu et al. 2015).

The aim of the present study was to identify the sex-determining region and sex-specific marker for L. vannamei. Linkage mapping was performed based on the high-density linkage map constructed previously. Association analysis was conducted using the SLAF-seq data of two parents, 50 female progenies, and 50 male progenies. As a result, the sex-determining region was mapped to linkage group (LG) 18 and a sex marker was identified. The results support the female heterogametic (ZW) sex determination mechanism in L. vannamei and will contribute to the mapping of sex-determining genes in the further research.

Materials and Methods

Data Collection

The materials and the data used in this study were the same as previously reported (Yu et al. 2015). Briefly, a full-sib family including two parents and 205 offspring was collected. The sex of each individual was determined by the presence or absence of petasma on the first pleopods. Female individuals were scored as 1 and males were scored as 0. DNA from all these individuals was extracted for library construction. The sample DNA was firstly digested with MseI (New England Biolabs, (NEB)) and then digested with the additional restriction enzyme EcoRI and Nla III. PCR reactions were performed to amplify the digested DNA and barcodes were added to the different samples, then the DNA was pooled and used for library construction. The libraries were sequenced using paired-end sequencing method on Illumina HiSeq 2500 platform (Illumina, Inc., San Diego, CA, USA).

Linkage-mapping Analysis

Based on the next generation sequencing data, a high-density linkage map was constructed. A total of 6146 markers spanning 4271.43 cM were mapped to 44 sex-averaged linkage groups, with an average marker distance of 0.7 cM (Yu et al. 2015). To identify regions of the genome that possess QTLs of sex, the composite interval mapping method applied in MapQTL5 software package was used (Van Ooijen 2004). The LOD score significance thresholds were determined by 1000 permutation tests for the sex trait. A QTL was determined to be significant if the LOD score was higher than the genome-wide significance threshold estimated by permutation.

Validation of Sex Markers Identified by Linkage Mapping

The association between the genotype of mapped markers and the phenotypic sex in the mapping family was re-analyzed using R/SNPassoc (Gonzalez et al. 2007). As the package can only handle the bi-allele markers, the three or four allele SLAF markers were discarded. The markers that are statistically significant after correcting for the number of tests performed (Bonferroni correction) were considered as sex-associated markers. Then, the significant markers detected by linkage mapping and the R/SNPassoc analysis were validated in different populations by Sanger sequencing. The marker sequences were blasted to the assembled genome scaffold of L. vannamei to obtain the sequence beside the markers. Primers were designed to successfully amplify and sequence the SNPs. The PCR profile included an initial 5 min of denaturation at 94 °C, followed by 35 cycles of 95 °C for 30 s, 55 °C for 30 s, and 72 °C for 45 s; and a final step of extension at 72 °C for 5 min. The PCR products were sequenced using ABI Prism 3730 (Applied Biosystems, USA) with the forward or reverse primer. The genotypes of candidate markers in female and male individuals were obtained by checking the Sanger sequence chromatogram at the SNP position. The populations used for verification made up of five commercial lines: SIS commercial line at the year of 2014 (Shrimp Improve System Co., USA), CP commercial line at the year of 2014 (Chia Tai Group, Thailand), Kona Bay commercial line at the year of 2014 (Kona Bay Marine Resources, USA), local selective breeding line selected from Hainan Guangtai Ocean Breeding Co., Ltd. (Hainan province, China), and shrimp individuals imported from Ecuador in 2014. Each line contained 12 female and 12 male individuals. The markers showed complete associations with sex among all the female and male individuals were considered as real sex-associated markers.

Association-mapping Analysis

An association-mapping method similar to bulked segregant analysis (BSA) was applied to further map the sex determination region and sex markers. Data from two parents and 50 female progenies and 50 male progenies with high-sequencing coverage were selected. These data was divided into four groups: female parent (M), pool of female progenies (aa), male parent (P), and pool of male progenies (ab).

De novo marker development was achieved by the following procedures. The paired sequences were combined by ten repeat of “X”. The trimmed SLAF pair-end reads were clustered based on the sequence similarity as detected by BLAT with the following parameters: “−tileSize = 10 and −stepSize = 5” (Kent 2002). Sequences with over 90% identity were grouped in one SLAF locus (Zhang et al. 2013). The SLAF loci were separated by the sequence differences into various SLAF tags. Those SLAF loci with more than four tags were filtered out because SLAF locus can contain no more than four genotypes in a mapping population of diploid species. The read depth of each SLAF tag in each individual or pool was calculated based on the mapping result.

For the polymorphic SLAF locus, read depth of each allele in the four datasets (M, aa, P, and ab) was obtained. The ratio of each allele read depth in female pool (M + aa) dividing male pool (P + ab) was calculated respectively. Then the markers were ranked by the ratio. The read depth in M and P dataset were also used to verify whether the marker was female or male-specific. For the non-polymorphic SLAF markers, similar analysis was conducted. Those sequences detected in M and aa with high read depth but not in P and ab were considered as female-specific locus, and vice versa for male-specific locus.

Validation of Sex Markers Identified by Association Mapping

The candidate sex markers identified by association analysis was first verified in mapping family using 30 female and 30 male progenies, and then were validated in five different populations. The procedure of validation was the same as that in linkage-mapping analysis.

Integration of Linkage and Association-mapping Results

In order to fine-map the sex-determining region in the LGs, besides the marker identified by linkage mapping, markers identified by association mapping were genotyped and mapped to the constructed linkage map. These markers were firstly blasted to the markers identified by linkage mapping to check whether there are same markers. Then the markers detected by association analysis were genotyped by ABI SNaPshot assay in the mapping family. The genotypes of these markers were joined with other data generated from high-throughput sequencing. In order to reduce computational complexity, only markers in the sex-associated LGs were selected. The integrated data were imported to JoinMap 4.0 (Kyasma, NL). A LOD threshold of 10.0 was used for assigning markers into different linkage groups. The Kosambi-mapping function based on regression mapping was used for map construction.

Results

Linkage Mapping of Sex-determining Region

The genome-wide significance thresholds of LOD score detected by permutation test was 5.1. Results from the linkage-mapping analysis showed that a large region from 0 to 21.205 cM in LG18 showed significant association with sex determination (Table 1). The highest LOD score was 71.44, which was observed from 9.144 to 9.242 cM in LG18. Four adjacent markers (Marker13273, Marker16044, Marker23846, Marker12948) were located in this region. The LOD scores for the nearby markers including Marker728, Marker16149, Marker616, Marker6711, Marker1962, Marker13107, and Marker23573 were also high to 71.43, which were very similar to the peak scores (Fig. 1).

Table 1 QTL mapping of sex determination region in Litopeneaus vannamei
Fig. 1
figure 1

Linkage mapping of sex determination region in linkage group 18. The blue line indicates the LOD value for the corresponding markers, the red line indicates the phenotypic variations explained by the corresponding markers

Validation of Candidate Sex Markers Identified by Linkage Mapping

After Bonferroni correction, ten markers in LG18 showed significant associations with sex in the mapping family (Supplementary Table S1). The most significant markers were Marker728 and Marker16149, which were located at 8.605 cM of LG18 (Fig. 2). The other eight markers were located in a region ranged from 2.907 to 15.711 cM. Combining the linkage analysis and association analysis using R/SNPassoc, the most significantly associated region with sex in LG18 was located at 8.605 cM.

Fig. 2
figure 2

Association analysis for candidate sex marker linkage group 18 with sex trait. The X-axis represents candidate sex marker in linkage group 18. The Y-axis represents −log10 p value as determined by R/SNPassoc

These significant markers detected by the two methods were validated in different populations. However, none of them showed complete association with sex determination.

Association Mapping of Sex-associated Markers

As no marker identified by linkage mapping was fully linked with sex in the validated populations, we applied an association analysis approach to further map the sex determination region and sex markers. The association analysis was similar to bulk segregant analysis (BSA) analysis. A total of 8,840,934 paired-end reads from the female parent, 11,626,188 paired-end reads from the male parent, 129 million reads from female offspring pool, and 126 million from male offspring pool were used for further analysis. Based on these data, SLAF markers were developed separately for the four datasets. As a result, 89,699 markers were generated for the female parent and 94,256 markers for the male parent. As the female and male progeny pool contained much more reads, more markers were obtained, including 149,529 markers from the pool of female progenies and 149,659 markers from the pool of male progenies.

The read depth of markers in each dataset was calculated based on the mapping result. For the polymorphic markers, three markers showing significant differences between female and male were identified (Table 2). All the three markers contained two alleles. One allele could be observed both in female and male pool with similar read depth. The other allele showed significantly higher read depth in female than male. For the non-polymorphic markers, a total of 13 candidate sex markers were observed. Among them, 12 markers showed significantly higher read depth in female pool than that in male pool, the other one marker was only observed in male pool (Table 3). All the detected sex-associated markers were displayed in Tables S2 and S3.

Table 2 Candidate sex-associated markers identified from the polymorphic SLAF markers
Table 3 Candidate sex-associated markers identified from the non-polymorphic SLAF markers

Validation of Candidate Sex Markers Identified by Association Mapping

Among the markers identified by association analysis, all could find the hit to the assembled genome scaffold of L. vannamei except for Marker41731, Marker10360, Marker36508, and Marker 99554. Interestingly, the non-polymorphic markers associated with sex, which was detected in one gender in association analysis, could be amplified in both genders in the individuals of the mapping family. Through Sanger sequencing of these amplified fragments, we identified SNPs in the enzyme sites, which showed differences between female and male.

After verification of these candidate markers in the mapping family, we found that three markers including Marker24863, Marker24577, and Marker19299 showed significant difference between female and male individuals in the mapping family (Supplementary Figs. S1, S2, and S3). The primer sequence designed for amplifying these three marker sequences was displayed in Table S4. For Marker19299, the sequence difference between female and male was located at the position of 20 bp in Fig. S3. However, it was not completely different between female and male. So, we further checked the sequencing chromas. Interestingly, the nucleotide at the position of 80 bp of Marker19299 showed a pattern that all females were C/G heterozygotes and all the males were C homozygotes. Another mutation of Marker19299 was observed at 284 bp, there was a 9-bp length insert in female compared to male sequence. The existence of this insert resulted in the sequencing error for the behind sequence. The sequencing chromas of Marker24863 and Marker24577 were checked and they also showed female heterogamety and male homogamety.

The validation of all the identified markers in different populations showed that Marker19299 was completely associated with sex trait. At the 80 bp of the Marker19299 sequence in Fig. S3, all the female individuals were C/G heterozygotes and all the male individuals were C homozygotes. Although the other markers showed the trend of female heterogamety and male homogamety, the pattern was not applicable to all the females and males. This result indicated that Marker19299 was a real sex marker.

Integration of Linkage and Association-mapping Results

After blasting markers identified from association analysis to those in linkage analysis, it showed that Marker41731 in association analysis was the same as Marker65257 in linkage analysis. In the linkage map constructed previously, Marker65257 was located at 1.106 cM of LG 18, which was a QTL for sex with high LOD score. The reconstruction of sex-associated LGs after adding Marker24863, Marker24577, and Marker19299 reveals that all markers except for Marker24577 were mapped to LG 18 (Fig. 3). High degree of collinearity between the reconstructed map and the previous map was observed (Fig. 4). In the LG 18 of the reconstructed map, Marker19299 was mapped to 8.955 cM, which was 1.751 cM away from Marker16149 and Marker728, and it was 1.4 cM away from Marker1297 (Fig. 3). As Marker16149 and Marker728 were the most significant markers in the mapping family, the sex-determining region should be located between Marker19299 and Marker16149 (Marker728). Since the genome assembling of L. vannamei was not finished yet, the sequence between these markers was not fully assembled and annotated.

Fig. 3
figure 3

The reconstructed linkage group 18 after adding markers identified by association analysis. The new added markers were highlighted

Fig. 4
figure 4

Collinearity in LG18 between the previous linkage map and the reconstructed linkage map

Discussion

In the present study, linkage mapping and association mapping analyses were applied to identify QTLs for sex determination and the corresponding sex-associated markers. Linkage mapping analysis identified a major QTL in LG18. The association analysis identified three sex-associated markers, among which one marker was fully sex-associated. The integration of linkage and association mapping precisely mapped the sex-determining region in a narrow region along LG 18. This study was the first report to map QTL of sex determination in a narrow region for L. vannamei. The finding in this study will be useful for mapping of the sex-determining genes and revealing the sex chromosome characteristics in the future study.

Previously, screening of sex-specific markers was performed using random amplified polymorphic DNA (RAPD) and amplified fragment length polymorphism (AFLP) approach (Pan et al. 2015; Vale et al. 2014; Ventura et al. 2011). Recently, next generation sequencing has allowed to genotype thousands of markers simultaneously in a short time. Genotyping-based next generation sequencing has been proved effective in genetic variation discovery (DePristo et al. 2011; Kerstens et al. 2009; Yu et al. 2014), linkage map construction (Li et al. 2014; Shao et al. 2015; Wu et al. 2014), population genetic research (Fumagalli et al. 2013; Larson et al. 2014), and association mappings (Houston et al. 2012; Kim et al. 2010). So, many studies focused on the mapping of sex determination QTLs using the next generation sequencing platform. Researchers identified sex-linked SNP markers using RAD sequencing and supported ZW/ZZ sex determination mechanism in Pistacia vera L. (Kafkas et al. 2015). In another study, seven male and ten female Anolis carolinensis were sequenced using RAD-seq technology, one male-specific molecular marker was identified (Gamble and Zarkower 2014). In aquaculture, most of the similar study were reported in fish, including Nile tilapia (Palaiokostas et al. 2013b), Atlantic halibut (Palaiokostas et al. 2013a), tiger pufferfish (Kamiya et al. 2012), and zebrafish (Anderson et al. 2012). These studies identified QTLs for sex and male-specific markers, which supported the XX/XY sex determination system in these fish species. Among crustaceans, the sex marker of salmon louse, which was one of the major ectoparasitisms occurred on marine salmonids, has been identified through RAD sequencing. But for penaeid, this study is the first report to identify sex-specific markers using next generation sequencing. Our result illustrated the efficiency of this method for identifying QTLs and the sex-associated marker. The data of the female heterogamety shown by sex-associated markers give more evidence to support the ZW/ZZ sex determination mechanism in L. vannamei.

To map QTLs for sex and identify sex-associated marker, most of the study used family material and some use unrelated individuals collected from different populations. Usually, the family material was firstly used to map the sex loci in the chromosome, and then the mapped loci were validated in the other individuals, like what was performed in this study and the research of yellowtail, pistacia, and Atlantic halibut (Kafkas et al. 2015; Koyama et al. 2015; Palaiokostas et al. 2013a). Recently, as the development of marker-genotyping, researchers proposed an approach to sequence the unrelated individuals from natural populations to screen the sex-associated markers (Gamble and Zarkower 2014). One advantage of this method is that there’s no need to perform experimental crosses and construct families, and the materials were easy to be collected. In our opinion, both the methods were efficient in identifying sex-specific markers. For species without genome reference, it would be better to map the sex-determining gene using family material. For species with its reference genome available, mapping of the sex-determining gene can be achieved directly by genome-wide association study (GWAS) using different populations.

Linkage analysis, as well as association analysis, is a high-effective way for identifying the genome position which controls the phenotype. In the present study, both methods were applied to identify the sex-determining region and sex markers in the L. vannamei. Linkage analysis found the sex-associated chromosomes and identified the loci significantly linked with sex. At the beginning, a candidate sex chromosome was detected by linkage analysis. Unfortunately, none of the markers identified by linkage mapping could be used as sex marker in the other populations. So, an association mapping approach similar to BSA analysis was performed. More sex-associated markers were identified, including one marker fully associated with sex. Most of the markers detected in association analysis were not identified in the linkage analysis. One reason was that the marker development procedure in linkage analysis was only conducted in parents (Yu et al. 2015) while the marker identified in the association analysis was performed in both parents and pool of offspring, and the high read depth in the pool of offspring could lead to more markers. Another reason was that the markers mapped in the linkage map should be successfully genotyped in at least 75% of the offspring, the markers genotyped in less than 75% of the offspring will not be appeared in the linkage map, and these markers will be discarded from the linkage analysis. Our study illustrated that the integration of linkage and association analyses could make good use of the data generated from the next generation sequencing and could assist to fine map the sex-determining region.

In the previous cytological study, no heteromorphic sex chromosomes were observed in L. vannamei (Campos-Ramos 1997). It indicated that the sex chromosome pair was very similar in female and male. The large region of specific sequence in one sex may not be existed in shrimp. The fact that there are no heteromorphic sex chromosomes indicated that the sex chromosome of shrimp was young. The sex chromosome pair has not diverged yet (Charlesworth and Mank 2010). So the non-recombining region in this species may be small. Besides, the recombination rate was reported to be high in L. vannamei (Zhang et al. 2006); the detected sex-linked markers may cross over frequently during meiosis. So, the linkage disequilibrium of the identified sex-linked markers with the sex-determining genes may disappear in the other population. This can partly explained why some of the screened sex-linked markers in family were not applicable in the other populations. Nevertheless, the Marker19299 was detected fully associated with sex both in family and populations. This sex marker was mapped to the most significant region-associated with sex in the LG18, which as a result fine-mapped the sex-determining loci in a region of 1.751 cM. To our knowledge, this is the most precise sex-determining region for L. vannamei. With the help of genome sequencing, the fine-mapping of sex-determining genes and clarifying sex determination mechanism of L. vannamei will be possible in the near future.

Conclusions

Overall, this study successfully mapped sex-determining region and identified a marker completely associated with sex by linkage and association analyses. The sex-determining region was fine-mapped in the LG 18. The sex marker showed female heterogamety, which supported the hypothesis of ZW/ZZ sex determination system in L. vannamei. The identified sex-associated marker could be of high value towards the determination of genetic sex and the production of mono-sex stocks or sex-controlled stocks. These findings will contribute to an improved understanding of sex determination and the further fine-mapping of sex-determining genes for L. vannamei.