Introduction

DNA fingerprinting has become an imperative tool in forensic science and provides a valuable aid to the criminal justice system. The current trend of forensic DNA typing relies on the analysis of a combination of short tandem repeats (STR) markers from the DNA isolated from the questioned biological samples for its possible match with the reference samples with highest degree of fidelity [16]. Molecular analysis through DNA typing is not only useful for unravelling criminal and civil cases of varied nature such as paternity disputes, identification of mass disaster victims, investigations of missing persons and sexual assault cases, but also determines the correct genetic sex of an individual [89]. Though many phenotypic parameters have been reported for the sex determination of a skeletonized body, molecular methods of sex determination have been proven to be superior to any other currently available technique [14]. Additionally, most of the phenotypic tests such as cranial and pelvic traits, meatus acusticus internus characteristics, anthropometric measurements of the limbs and hands require intact bone(s), which are mostly unavailable in some forensic scenarios [1].

In this regard, Y chromosome specific markers are highly useful in sex determination of human skeletal remains as well as the origin of the biological samples [62]. These markers can also provide vital information in sexual assault cases to determine the presence of male DNA in the victim’s belongings. Most of the currently available commercial forensic DNA typing kits have included amelogenin marker for sex determination along with other STR markers [122]. US National DNA Index System (NDIS) and Combined DNA Index System (CODIS) have also recommended Amelogenin marker to be included for sex determination during forensic DNA typing analysis [45]. However, discrepancies in the results of amelogenin based sex determination are widely reported throughout the globe [73]. Failure of amelogenin marker for sex determination has been attributed to many factors, such as genetic basis of ethnicity, degradation of biological sample, presence of PCR inhibitors and mutations in primer binding sites to mention a few.

In forensic cases, the molecular technique to be used for sex determination should produce highly precise, reliable and consistent results and any chance of possible errors should be seconded by additional/alternative techniques. To overcome this problem, many Y specific markers for sex determination are flourishing day by day to be used either independently or in combination with amelogenin to produce highly reliable results. Some of the possible alternative markers include the STS gene, SRY gene, TSPY, DXY156, DYZ1 and many SNPs [15]. Additionally, advanced techniques such as NGS and pyrosequencing have also proved to be useful in sex determination at molecular level. In this regard, this article discusses the possible alternative markers and their expediency in distinguishing sex with higher fidelity in forensically relevant samples.

Amelogenin: currently used exclusive sex determination marker in forensic application

Amelogenin gene produces a heterogeneous population of mRNAs due to alternative splicing, subsequently generating heterogenous group of amelogenin proteins responsible for tooth enamel matrix formation [49]. Though other genes and/or proteins are also involved in tooth enamel formation, 80–90% of the same is formed by the amelogenin protein [36] (Fig. 1). Thus, any mutation/alteration in the amelogenin locus cause a disease affecting the structure and appearance of enamel, called amelogenesis imperfect [103]. AMELX is the main source of such mutations and this disease has not yet been connected with any mutations in the AMELY. Various studies have concluded that X linked amelogenesis imperfecta is a heterogeneous disease having different phenotypic effects in varied cases [66]. A deletion of over 5 kb in the amelogenin has been found to cause amelogenesis imperfecta in males while in females, the cause has been found to be a non-sense mutation in exon 5 [25]. Likewise, a recent study has identified a deletion in the Amelotin (AMTN) which is a matrix protein responsible for mineralization during enamel development, responsible for Amelogenesis Imperfecta [108].

Fig. 1
figure 1

Structural and functional proteins for the tooth enamel matrix development and its distinguishability for male and female individual

In humans, both AMELX and AMELY genes are present on sex chromosomes at p22.1-22.3 region and at Yp11.2 region respectively, which are responsible for synthesizing the same Amelogenin protein [91]. AMELX is responsible for 90% of the amelogenin proteins whereas only 10% is contributed by AMELY [7]. Nucleotide sequences show 89% homology between AMELX and AMELY along with the presence of highly conserved protein coding regions at 2–6 exons [15]. Hence, the homology, location of the genetic marker and distinguishability between AMELX and AMELY make it a suitable marker for genetic basis of sex determination in forensic DNA typing applications.

Usefulness of amelogenin marker in sex determination

Amelogenin based sex determination relies on the presence of two homologues of Amelogenin i.e. AMELX and AMELY, which exhibit length polymorphism. They differ both in size and sequence and hence can provide sex differentiation. In comparison to AMELY, AMELX possesses a 6 bp deletion contributing to the difference in size between these two homologous genes. PCR amplification using the commonly used primer sets produce fragments of 106 and 112 bp in length for AMELX and AMELY respectively [82]. Thus, in females only X specific fragment can be observed, whereas, in males both X and Y specific fragments are observed after electrophoretic separation [33]. Hence, the male and female can be distinguished on the basis of the number of fragments detected in them [91, 103].

Sullivan et al. [110] initially developed a single set of primers spanning part of the intron 1 of Amelogenin gene for quantitative assessment of sex at DNA level using less than 1 ng DNA template. Gradually, to accommodate amelogenin marker in multiplexing systems, varied primer sets have been developed targeting different fragments of the gene and flanking regions. In this context, intron 3 of the Amelogenin gene has been explored greatly to design primer sets targeting the 6 bp difference in amplicons between AMELX and AMELY (Table 1).

Table 1 List of primer sets targeting the Amelogenin gene to detect AMELX and AMELY polymorphism

Discrepancies in amelogenin based sex determination

Amelogenin has been used for decades for sex determination at molecular level in forensics as well as for genetic and molecular profiling in archeology and anthropology. In non-human studies, amelogenin marker is useful in the reproduction field, archaeozoology, meat production and sample identification [37]. It is included in all the present day available genotyping kits and no other substitute has ever been proposed. The efficacy of the amelogenin marker came into question when cases of false detection of males as females started surfacing [128]. The amelogenin failure cases and their studies have increased several folds in the last decade. Amelogenin marker may give false negative results due to allele dropouts resulting from either deletions or mutations in the same region. Males harboring deletions and mutations in the AMELY region would be identified as females in the absence of any other corroborating evidences. Discrepancies in the Amelogenin based sex determination due to allele dropouts can result in different false positive and false negative results as explained in Fig. 2. Additionally, deletions in the amelogenin region of the Y chromosome are far more frequent than in the X chromosome [73, 83, 109]. The failure due to deletions can range up to 10% in an endogamous population. However, such discrepancies can be detected by sequencing and using different genotyping kits or different sets of primers [60, 77, 85]. The different nature of genetic anomalies resulting in the non-coherent discrimination between males and females has been described in details.

Fig. 2
figure 2

Probable results and problems associated with amelogenin typing in normal as well as compromised samples; a normal male, b normal female, c male with Y deletion, d male with X deletion, e female with X deletion in both the chromosomes, f female with X deletion in any one chromosome

Mutation in the primer binding region

One of the major causes of allele dropout is mutation in the primer binding region of the amelogenin gene. Failure in primer binding results in allele dropout which ultimately results in misidentification of the gender. Such mutations prevent the binding of primer and hence the amplification of AMELY [101, 106]. As a result, a male can be falsely identified as a female due to the absence of AMELY fragment in the amplified product. A study by Borovko et al. [13] reported two cases of AMELX and AMELY dropout due to mutations in primer binding sites in Belarusian population. Another study involving 5534 subjects of Polish population showed the failure of AMELX amplification in 0.02% of cases using SGMPlus, ProfilerPlus and PowerPlex16 kits. However, when alternative primer sets are used, an X homologue product was amplified confirming the mutation in primer binding site [78]. Further studies showed that mutations may cause failure in some of the cases but most failures are caused by a deletion in the AMELY region [77, 109].

Y deletion

Most of the cases of amelogenin failure result from the absence of AMELY in the genotyping results when the subject had the Y chromosome. These discrepancies are due to deletion of nucleotide sequences on the Y chromosome that includes the AMELY region [71]. In cases of Y deletion, known male samples show only the presence of AMELX with 6 bp deletion resulting in the appearance of only one peak after genotyping. In these cases, the presence of Y chromosome can be assured either by karyotyping or by the use of alternative markers present on Y chromosome [83]. Such Y deletions have been reported on 6 instances by a study involving 29,432 male individuals of Australian population [109]. Other study involving an Indian population confirmed Y deletion at a higher rate in Indian population appearing in five cases when 270 individuals were subjected to examination [113]. Y deletion is present in almost all populations throughout the globe and their corresponding frequencies of occurrence have been given in Table 2. As major part of Y chromosome does not undergo recombination, any anomalies in the Y chromosome are passed as such to the next generations leading to the formation of geographically specific haplogroups [97]. In this context, haplogroups belonging to the Indian and Asian Ancestry have been found to have a higher frequency of Y deletion.

Table 2 Global scenario of Y deletion and its characteristics involving different populations

Y deletion mapping

Deletions in the AMELY region are usually studied by sequencing upstream and downstream markers surrounding the AMELY region. A combination of sequence-tagged-site (STS) deletion mapping, binary-marker, Y-STR haplotyping and TSPY copy number estimation is used to study these deletions by referring to the reference database sequence of the Y chromosome [57]. Further, the size of the deletions can be detected on the basis of presence of desired markers in the amplified product [60]. Many studies have reported the size of deletion to be approximately 2.5 Mb [71, 112]. Primers devised in earlier studies along with new set of primers, are generally used for Y deletion mapping. Using different set of primers leads to generation of different size of deleted products. Jobling et al. [57] mapped the deletions by designing Y specific STSs and verifying STSs near the AMELY region for Y specificity in already published STSs. Additionally, Mitchell et al. [85] estimated the size of Y deletion in 5 AMELY null males of Australian population to be of two different sizes, ranging from 304 to 731 Kbp and 712 to 1001 Kbp. For this study, markers surrounding the AMEL locus were targeted to map the size of deletions and confirmed the presence of the deletions on the short arm of the Y chromosome. A further study by Jobling et al. [57] speculated the presence of five different classes of Y deletion after examining 33 Y STR markers i.e. Class I, I-sY59, II, III and IV. However, study of Takayama et al. [112] reported a Y deletion that did not belong to any of the above mentioned classes. Though AMELY deletion reveals a normal tooth phenotype, it has many classes. In Class I deletion (known as PRKY), along with AMELY, serine-threonine protein kinase and TBL1Y normally expressed in prostate and foetal brain also get deleted. Whereas, in non-Class I deletions, Y linked PCDH11Y (Protocadherin 11) expressed in brain is lacking. Another reported Class III case lacked testis specific Y linked Transforming Growth Factor-Beta-Induced Factor 2-Like (TGIF2LY) [57].

AMELY deletion in Indian population

The failure rate of amelogenin varies among populations throughout the globe. However, the maximum failure rate has been observed in the Indian subcontinent populations. The landmark studies regarding the Y deletion in Indian population have been reported by Santos et al. [102], Kashyap et al. [60] and Thangaraj et al. [113]. Various studies reporting frequencies of Y deletions in Indian as well as other populations have been given in Table 2. Human Y chromosome haplogroup represents the mutations in the non-recombining portions of the Y chromosome. This represents the common paternal descends [69]. About 5% of Indian males belong to, the J2 haplotype lineage harboring mutation at M172 genetic marker [6]. This lineage originated in the Middle East and was later distributed across Europe and Asia. Indian males belonging to the J2 lineage are found to have more frequent Y deletions [57, 60, 77, 127]. Additionally, “Haplogroup O” of Indian population has been found to possess a 5 bp deletion at position M175, whereas 1 bp deletion is common at M17 UEP in M173 lineage [130]. In this context, Kashyap et al. [60] concluded that Y deletion in amelogenin is lineage specific and inherited from one generation to another. Similarly, Cadenas et al. [18] suggested a common ancestry for AMELY null males on the basis of a finding that reported 5 AMELY null males belonging to the same lineage i.e. J2b2-M241. In this case, the Y STR profiles of two individuals were identical in nature whereas, other studied profiles had striking similarity.

X deletion

AMELX dropout is a rare phenomenon as compared to AMELY dropout. Mutation in AMELX can be the cause of AMELX dropout in males but it would not affect the gender identification as there would be normal amplification of AMELY. Additionally, in case of females, AMELX dropout is difficult to detect as it is very rare to have a mutation in both the homologous chromosomes and in cases of mutation in single chromosome, a single peak would be observed similar to a normal female as has been shown in Fig. 2. A study on the South Chinese population showed deletion of AMELX at a frequency of 0.037% [93]. Similarly, a female patient with clinical symptoms of microphthalmia has been reported to possess deletion of the entire AMELX gene along with additional 50 monosomic genes localized to Xp22 [47]. Two cases of AMELX dropout have also been identified in Belarusian population; however these have been attributed to the mutation in primer binding sites [13]. Use of Powerplex 16 System (Promega) has also been reported to generate only Y homologue of amelogenin in 10 male individuals of West African population [4].

Most of the currently used human STR kits such as, InnoTyper21 kit, AmpFLSTR™ Identifiler™ kit, PowerPlex® 21 System do not possess any alternative sex determining markers. However, few new generation kits such as GlobalFiler™ (Thermo Scientific), PowerPlex® Fusion 6C (Promega Crop.), Investigator 24plex GO! Kit (Qiagen) and SureID® PanGlobal (Health Gene Tech.) harbor DYS391, DYS570, DYS576 and Y InDel as alternative sex determining markers [30]. However, when AMELY deletion occurs, simultaneous deletion of DYS570 and DYS576 occurs as they are present within the AMELY region on the Y chromosome (Fig. 3) [59]. Additionally, the position of DYS391 is very close to the AMELY region, which increases the chance of co-deletion of DYS391 along with other Y-STR markers. Besides, none of the markers get amplified in female samples, thus becomes difficult to distinguish between the causes of non-amplification, i.e. female sample or sample with PCR inhibitors. Thus, alternative markers should be assessed to the currently available set of commonly used STR markers for sex determination.

Fig. 3
figure 3

Modified from Kareem et al. [59]

Relative positions of the commonly used Y-STR markers on the Y chromosome.

Amelogenin marker and PCR inhibitors

Sex determination using STR markers can be affected by PCR inhibitors, especially in cases of forensic samples. Commonly known PCR inhibitors may arise from the environmental sources, as microbial by-products or chemical contamination during DNA isolation. Though the mechanism of action among different PCR inhibitors differs, the PCR process being an enzymatic reaction is highly sensitive to them. These inhibitors can have profound effect on STR amplification and often result in dropouts, peak imbalance and stutters. Though no specific study on this aspect has been conducted till date, the potential role of these inhibitors in affecting the amplification process of amelogenin marker cannot be ignored. In this regard, a list of potential inhibitors affecting the Amelogenin amplification as well as other markers has been given by Dash and Das [26].

Degraded vs. mixed samples

Analysis of Amelogenin marker may cause ambiguity between degraded and mixed forensic samples. In case of mixed samples of two heterosexual origins, Amelogenin marker generates both X and Y peaks with imbalanced peak heights and a difference of 70% in magnitude [41]. On similar lines, degraded forensic samples also produce DNA profiles with heterozygous imbalance complicating the analysis process [44]. Most of the primer sets used for sexing using amelogenin marker require a minimum of 106 bp of template DNA for proper amplification. Many studies have reported high fragmentation of forensic samples which in turn generate DNA fragments as low as 61 bp [27, 76]. In this context, Fazi et al. [34] reported the limitation of amelogenin marker in as many as 28 degraded DNA samples from analyzed touch DNA and hair shaft sources. Limitations of using amelogenin marker in ancient DNA analysis have also been reported [82].

Alternative markers for sex determination

For forensic application, the molecular technique used should be fool proof and the technique(s) with even little chance of error should be seconded by additional or alternative techniques. Amelogenin may lead to various kinds of discrepancies in sex determination due to deletion and/or mutations depending on the nature and type of biological evidence. In this regard, a multiplex method involving other sex determination markers along with the amelogenin is the best proposed method for reliable sex determination to avoid any possible anomalies. Some of the alternative sexing markers with potential forensic applications have been described in detail. An extensive list of primer sets for amplification of the alternative sexing markers has been provided in Table 3. However, huge challenges come into existence when samples of forensic relevance are being analyzed. Hence, a comprehensive list of the alternative markers with their corresponding pros and cons has been given in Table 4.

Table 3 Primer sets for amplification of alternative sexing markers for molecular diagnosis of sex
Table 4 Advantages and disadvantages of the alternative sexing markers/techniques for forensic applications

Steroid sulfatase (STS)

Steroid sulfatase is a widely dispersed enzyme that plays an important role in the formation of steroids encoded by the STS gene found at the Xp22.3 [116]. It is pseudo autosomal in nature and on Y chromosome, a homologue of STS is present as a pseudogene which is transcriptionally inactive. Certain point mutations and/or deletions have been reported in this gene resulting in the development of X linked Ichthyosis as well as other neurodevelopmental disorders like Attention-deficit hyperactivity disorder (ADHD) and Autism [42].

STS-X, sized about 146 kb with intron 1 of 35 kb is the prime target for development of sex determination marker [126]. Morikawa et al. [86] developed primers for amplifying the first intron of the STS gene and its homologue on the Y chromosome resulting in the DNA fragments of 158 and 166 bp on X and Y chromosome respectively. This size variation between the amplicons is used to differentiate STS and its homologue on Y chromosome i.e. STSP1 and hence can be employed for sex determination [86]. STS sequences have been reported successful for sex determination in 53 mammalian species with greater efficacy [48]. Additionally, Morikawa et al. [86] used certain additional markers including STS to confirm the sex of an individual to be male, which was incorrectly reported to be female by the amelogenin analysis.

SRY gene

The SRY gene, also known as the sex determining region is responsible for the determination of male sex in humans as it promotes the development of precursor cells into sertoli cells instead of follicle cells [43]. SRY is a transcriptional regulator that directly up-regulates the SOX9 i.e. Sry box9. It dominates the Wnt/b-catenin pathway which promotes the female development and in turn allows expression of SOX9 to initiate male sex development [10]. SRY acts only on the sex gonads and the rest of the body is conveyed the sex determination message via hormones i.e. if the male gonads are removed in fetal stage, the embryos with XY phenotype would develop as female [80]. Additionally, mutation in SOX9 may generate a female individual by affecting the development of Y chromosome [54]. Being directly related to the male phenotype, this marker has an edge over others.

A new set of primers has been developed by Drobnič [32] for determining sex using SRY marker. Similarly, Steinlechner el al. [109] successfully depicted the use of SRY marker for sex determination in males with amelogenin deletion. A similar study on forensic samples confirmed the sensitivity and reliability of this marker for forensic applications either as a single marker or in multiplex system with high efficacy in mixture analysis [61]. In this regard, Codina et al. [24] developed a multiplex system involving the simultaneous amplification of amelogenin, SRY and four mini X-STR loci for application in sex determination of mixed as well as degraded forensic samples. Additionally, SRY marker has also shown its potential advantage over other sexing markers in sex determination of challenged forensic samples i.e. incinerated teeth [90].

TSPY

TSPY is a multi-copy gene family with repetitive units and a variation of copy number in the range of 30-60 bp located on the short arm of Y chromosome and regulates the process of spermatogenesis [124]. The whole gene consists of six exons and five introns which approximate to about 2.8 kb in size. A recent study has concluded that, men with less than 33 copies of TSPY have a 1.5 times higher risk of infertility [39, 119]. A minimum of 6 locations on the Y chromosome have been reported to be occupied by TSPY gene family members [28].

As several genes of the TSPY are present on the Y chromosome, different primers can be designed to exhaust the high repeats of this marker for sex determination. As TSPY is only present on the Y chromosome, female samples will show no peak in the electropherogram for the TSPY marker while the male samples will show a single peak which would symbolize the presence of any of the markers of the TSPY gene family. The high copy number would prove to be a merit in sex determination as it increases the sensitivity of this marker. Based on results of a study done for determination of male DNA after intense kissing, the sensitivity of TSPY marker has been proved for sex determination [58]. A similar study on ≤ 3 years old filter paper stored samples concluded that, TSPY is a better marker than SRY as 50% of the samples yielded a positive result with TSPY in comparison to 30% results obtained with SRY [20]. Further studies reinforced the fact that higher copy number in TSPY makes it sensitive and yield better results in forensic samples as compared to SRY which has only one copy. Such studies also established the high sensitivity of TSPY [11].

Giachini et al. [39] reported considerable variation in TSPY copy number in different haplogroups of Italian population. A study on sensitivity of TSPY marker confirmed that, TSPY could give results in DNA as low as 0.004 ng (about 1 sperm cell) whereas SRY could work only up to 0.08 ng (about 25 sperm cells) [53]. Additionally, the efficiency of TSPY in analyzing samples with female DNA as high as 500 times was also found remarkable. The amplification of TSPY was also efficient in the post-coitus samples. It could be amplified till 72 h while SRY could give results only up to 24–48 h. Reports have also shown the detection of TSPY 7 days post-coitus [53] showing superiority of this marker over others for sex determination.

DXYS156

DXY156 is a Penta nucleotide [(TAAAA)n] repeat multi allelic marker located in a long interspersed element (LINE) on both X and Y chromosome, which was first isolated and mapped by Chen et al. [22]. Linkage analysis can be used to map this Penta nucleotide marker to the long arm of X chromosome. Additionally, a sequence homologue is also present at the short arm of Y chromosome [67]. The X and Y counterparts of this marker vary in length i.e. no. of repeat motifs. It has been reported that, 11 repeats or higher are found in locus at Y amplicons while X locus consist of 10 or less number of repeats [115]. In this regard, Cali et al. [19] reported an Adenine insertion in the fourth repeat at the Y locus which could differentiate the X and Y counterparts effectively. They also recommended involving the DXYS156 marker in multiplex analysis kits, for forensic and archeological causes, as this marker is present on both X and Y, it would provide an internal positive control and would result in two peaks similar to amelogenin. Due to its multiallelic nature, it shows variations among different populations or different geographic regions. Reports also suggest the utility of this marker for ethnic studies as well as to study specific patterns of the same allele [63].

Additionally, DXYS156 is highly polymorphic for Indian population. Different population groups exhibit different degrees of variability at Y locus for this marker, whereas, X locus has limited variability as compared to Y locus [88]. This marker does not undergo recombination and hence any mutation in the Y locus tends to accumulate over time. This contributes to the generation of genetic diversity among various ethnic groups of a population [19].

SNPs

Single nucleotide polymorphisms (SNPs) are the most abundantly found variations in the human genome which are present both in coding as well as non-coding regions and account for almost 85% of variations in humans. About 1.9 million SNPs have been registered in the databases till date, proving the relative abundance as well as promising applications of these markers [31]. The added advantages provided by SNP analysis are the low fragment size of 60–80 bp, cost effectiveness, the speed of analysis and huge range of multiplexing [35]. Though SNPs are not as discriminatory as the STR markers, but they promise better results in cases of mass disasters or missing person analysis where reference samples may not be available [131]. Another advantage of SNPs assay is their usefulness in lineage studies using markers on mtDNA or Y chromosome as well as the analysis of phenotypic information based on the SNPs results [69, 75].

SNP markers have been successfully used for human identification as well as gender determination. The advantage of using SNPs for sex determination includes the requirement of lesser sized DNA fragment which is useful in case of degraded forensic samples [68]. In this regard, Wei et al. [121] constructed a SNP panel for ABO blood grouping as well as sex determination. Allwood and Harbison [2] developed a method of sex determination which used single base extension primers to flag Y chromosome regions. This method was found robust on all different aspects of forensic importance like sensitivity and mixed analysis. Recently, a mini-multiplex SNaPshot assay was developed for analysis of degraded samples which included five SNPs for sex determination present on Y chromosome [8]. The superiority of using Y-SNPs over Y STRs include their low mutation rate which can be as low as 100,000 times thus providing a concrete ancestry signature [62, 87]. Additionally, 25 SNPs markers have been analyzed on X chromosome for sexing as well as genetic homogeneity study of Mediterranean populations [114]. In this regard, a novel method has been discovered by Masuyama et al. [82] for accurate sex determination from as low as 20 pg DNA samples by targeting SNPs in the amelogenin gene and sex determining regions on Y chromosome. Another method targeting known X and Y SNP markers called as TriXY-Homogeneous genetic sexing is also being used for sex determination in degraded samples as well as in prenatal diagnosis of gender related birth defects [79]. A recent discovery has suggested the use of SNPs in the genes GATA4 and WWOX for sex determination as well as progression of cancer [70].

However, in current scenario SNPs are categorically being used for forensic phenotyping as they can be sued for the prediction of ethnicity. This ability of SNPs becomes instrumental in cases where there are no suspects [51]. Having highlighted the importance and efficacy of SNPs it’s important to note that SNPs have several benefits but they can’t substitute STRs altogether. Still their role becomes imperative like mtDNA analysis and Y SNPs as they can give you supportive genetic information [17].

DYZ1/Alu assay and others

DYZ1 is a loci present on the long arm of Y chromosome, which may have up to 4000 copies. The high copy number provides an efficient male specific assay. The loci is absent in female samples, thus generating a sex differentiation among humans. The qRT-PCR assays using DYZ1/Alu is considered to be 1500 times more effective than amelogenin assay [34].

About 1 million copies of Alu are present in DNA consisting 10% of the total genome [84]. Microchip electrophoretic sorting of five polymorphic Alu insertions (RC5, A1, PV92, TPA and ACE) have been reported for human ethnicity determination, whereas, two monomorphic Alu insertions (AluSTXa and AluSTYa) for sex typing [92]. Rahman et al. [98] found DYZ1 efficient in discriminating between tissue samples of same individual and samples from sibling males [98]. In this regard, tissue specific individualization is unique to DYZ1 and has high forensic utility. Moreover, its utility in diagnosis of mosaicism gives it an upper edge over all other markers [98]. Sequence and copy number variation of DYZ1 have been reported in prostate cancer biopsied samples [125].

Additionally, pyrosequencing of short PCR products present on amelogenin have also been regarded as a useful tool for sex determination in degraded samples [117]. A novel technique targeting short intergenic sequences (< 50 bp) on both gonosomes has been developed recently called as triplex for the X and Y chromosome (TriXY) [79].

Next generation sequencing

With the advancements in molecular biology, Next Generation Sequencing (NGS) techniques now provide a unique and conclusive result regarding sex determination of any sample irrespective of the nature, condition and presence/absence of genetic disorders. The massive parallel reactions and advanced computational methods make NGS the technology of future. The technology comes with added advantages of reduced costs and enhanced speed. Additionally, Rockenbauer et al. [100] established that many nucleotide substitutions, deletions and insertions are not detected by currently practiced capillary electrophoresis based STR analysis. However, the same can be detected by NGS as it focuses on both the numbers and polymorphisms of repeats.

Detection of a male individual as well as the Y copy number variation (Y-CNV) across the whole chromosome has been studied in much detail with its forensic relevance [81]. Additionally, many sex markers can be targeted to identify sex of an individual rapidly using SEXCMD to human whole exome or RNA sequence datasets using NGS [56]. Application of NGS has also been proven to be useful for sex determination in cases with abnormalities in sex chromosomes such as premature ovarian failure (POF) and sex reversal cases [74]. In this regard, patients with sexual disorder due to mutation in CBX2 gene were categorically characterized for their correct sex using DamID-NGS approach [9]. A recent study emphasized the combinatorial use of NGS, capillary electrophoresis and pyrosequencing, collectively called as ‘NGS+’ for Y-STRs and Y-SNPs typing for sex determination of biological samples at optimum level [96]. Massively parallel sequencing also provides a way to sequence all the available genetic material and type various different kinds of markers in a single run including the sexing makers [5]. The major advantage of using massively parallel sequencing is its capacity to provide results on sex determination using the highly degraded DNA in which the conventional capillary electrophoresis based processes fail to yield results [118].

Alignment of B-allele frequency of X chromosome with reference allele frequency of X chromosome can rapidly identify the sex of the individual from whole genome sequence data [56]. Due to sequence homology of mammalian X and Y chromosomes, Webster et al. [120] developed a statistical tool to minimize technical artifacts in genomic data for reliable sex determination by NGS. NGS has also been reported to be used for reliable sex determination in cases with atypical chromosomal, gonadal or anatomical sex [50]. Besides sex determination, assessment of copy number variation using NGS data is useful in determining effect of azoospermia factor (AZF) loci on male fertility, determination of TSPY copy number variation and detection of AMELY deletion [81]. Additionally, the currently available NGS kits for forensic application such as ForenSeq DNA Signature Prep Kit (Illumina) and HID-Ion AmpliSeq Identity Panel (Ion Torrent), contain additional sex determining markers besides amelogenin increasing their usefulness [3]. Though inception of new and more relevant SNP markers for sex determination by NGS is still in its infancy [104], requirement of scanty amount of DNA sample as low as 62 pg for this purpose is an added advantage [129].

Future challenges of sex determination: chimerism, legal-biological discrepancies and genetic disorders

The future of forensic genetic typing would largely be based on advanced technologies like next generation sequencing. In this regard, RNA markers for sex determination using SNPs are also going to come into the mainstream forensics very soon. SNPs may dominate the sex determination field due to their rapidity and sensitivity. Sex determination in mass disaster cases and conditions with limited samples, would be made possible by SNP markers and the next generation sequencing technology. Additionally, the next generation sequencing technology is going to overturn the capillary based genotyping for sex determination in the upcoming years with faster, contamination free and automatic results with meager amount of DNA samples. The future challenges have been described below.

Chimerism, the presence of genetically varying population of cells in an individual mostly arises due to bone marrow transplant [65]. In this regard, newly transplanted bone marrow migrates to other long bones and is subsequently grafted there followed by the production of new cells with the same genetic lineage as that of the donor; thus, the DNA profiling of the individual differs depending on the sample taken i.e. blood, hair or saliva [52]. With regard to gender determination, no matter what genetic marker we use, as long as the source is blood, the gender detected would be that of the donor instead of the patient [111]. In this respect, recently occurred allogeneic blood transfusion as well as pregnancy with male child may lead to microchimerism which might adversely affect the molecular sex determination using any advanced technique [12].

In the modern era of gender revolutions, genetic sex and gender may not necessarily be the same. On the basis of genetics an offspring can either be a male, female or intersex in case of genetic disorders. But gender is a social term encompassing the individual choices and societal prejudices. A person born as female might identify herself as a male or vice versa and this complicates the issue of gender based identification [107]. On the birth of an intersex child, the parents knowingly or unknowingly assign a gender to the child, which may create a problem in the long run. There occurs a contradiction between the legal gender and biological gender in such cases of intersex and other related issues [55]. In this regard, the gender identity disorder (GID) may also present issues for societal acceptance of molecular sex determination results [99]. Furthermore, people with GID, identify themselves as the opposite sex and often undergo gender transformation surgery [46]. In cases of altered gender, there will be incongruity between the social gender and the biological gender. In the light of increasing instances of people undergoing such surgeries and openly accepting their self-proclaimed sexual identities, sex determination using genetic basis is going to face challenges in near future. The cases of trisomy are most common as XXY, XXX or XYY in humans [23]. On similar lines, Turner syndrome i.e. XO is also witnessed frequently [64]. In such cases the amelogenin or any other sex determining marker will result in correct analysis of genotypic sex but it may not coincide with the legal gender of the person.

Additionally, cases of disorders of sex development such as Androgen Insensitivity syndrome, Gonadal dysgenesis and 5-alpha (2)-reductase-deficiency (ARD) may result in difference in genotypic and phenotypic sex due to abnormalities in the genitalia development [38]. In this regard, 8 SRY positive female athletes were reported among 3,387 cases studied [29]. It can be understood how such abnormalities can have profound effects in the forensic arena. As the gender of the reference samples need to be matched with the registered gender to conclude the involvement, any discrepancy in the gender determination can have a wide impact on the forensic and legal outcomes.

Conclusion

Amelogenin has been the touchstone for sex determination for more than a decade. However, numerous studies have shown the loopholes concerned with amelogenin based sex determination. The cause of such anomalies can be mutations and/or deletions in the Y chromosome. The percentage of such anomalies is very low considering the population sizes but in forensics science even a low percentage of discrepancies matter. An unattended case of Y deletion can lead to false identification of a male as female and hence lead to exclusion of a suspect. Various markers and some advanced technologies like the pyrosequencing and NGS are promising alternatives to using the amelogenin marker. Additionally, use of SNPs, DYZ1/Alu assay can also provide useful information regarding the molecular sex of an individual. Rejecting amelogenin is not an option, multiplex assays that include at least one different marker for sex determination is the solution for this problem. The best marker would have high differentiating power, better sensitivity, good results in mixture analysis and an internal positive control i.e. an X homologue. SNPs seem like the most promising marker for the future of forensic sex determination considering its ability for multiplex, automation and time benefits as they would be a part of the next generation sequencing technologies. In the era of gender revolutions, the marker incorporated to aid amelogenin in sex determination should also propose a solution for determining the gender of individual with incongruent biological and legal sex. With genetic revolutions in mind, the contrasting chromosomal sex and gender complicates the situation and hence, molecular basis of sex determination plays a pivotal role in solving these discrepancies with an added advantage. Moreover, further studies of haplotype groups having higher Y deletion frequencies such as J2, are needed to understand this anomaly in a better way.