Keywords

Introduction

Advances in molecular genetics in the 1980s allowed the study of polymorphic traits directly at the DNA level. The use of DNA in forensic science developed rapidly after Alec Jeffrey’s discovery of polymorphic repeat sequences in the DNA molecule in 1985. Since then, the identification of biological samples collected from both individuals and from the crime scenes in the determination of paternity-kinship relations and other criminal investigations has carried out using DNA analysis techniques. In forensic cases, such as terrorism, murder, sexual assault, and theft, it is possible to link up between the suspect and the crime scene by using DNA profiles, which are isolated from biological materials detected at the crime scene (Jeffreys et al. 1985; Robertson et al. 1990; Chan 1992). Although VNTR (variable number of tandem repeats) loci, which were used in the first period of DNA analysis in forensic sciences, have a high discrimination power, it has been replaced by new technologies on account of the need of good qualified (non-fragmented) and excessive amounts (300–500 ng) of DNA, long and laborious analysis times, exposing radioactive materials, and so on. For the last 20 years, STR loci have been widely used in forensic identification (Lee et al. 1994; Robertson et al. 2002).

Short tandem repeats (STR) loci have been used as ideal genetic markers in forensic science in recent years due to their small amplicon size, successful results in degraded biological samples, capability to do multiplex analysis, and no requirement of expensive equipment (Weber and May 1989; Edward et al. 1992). Although STR loci have been still used today, typing problems are experienced in extremely degraded biological samples, which are collected from the crime scene, and successful results for comparison cannot be obtained many times. In order to solve this problem, mini short tandem repeats (miniSTRs) that allow typing even in degraded samples were put on the market in the early 2000s (Coble and Butler 2005). In recent years, for the same purpose, researchers have been focused on different DNA polymorphisms, such as single-nucleotide polymorphism (SNP), which takes up a very little space in DNA, and InDel polymorphism (Pereira et al. 2009a, b).

Insertion

It is a type of mutation that disrupts the natural sequence as a result of addition of one or more bases to the DNA base sequence. This addition can be a base or as much as a whole chromosome (Fig. 1). As a result of DNA polymerase shift, it is usually thought to be formed by addition of base sequences, which are not adjacent to microsatellite sites, to the main sequence (Gelbart et al. 2002; Kondrashov and Rogozin 2004; Rodriguez-Murillo and Salem 2013).

Fig. 1
figure 1

Insertion on a chromosome level (Gordon and Egner 2013)

Deletion

It is a type of mutation that occurs as a result of the deletion of one or more bases in the DNA sequence. Deletion, as well as insertion, can occur on one or more bases, as well as in chromosome size (Fig. 2). If a deletion occurs when part of the chromosome breaks off and disappears, it can lead to serious genetic diseases (Gelbart et al. 2002; Kondrashov and Rogozin 2004; Rodriguez-Murillo and Salem 2013).

Fig. 2
figure 2

Deletion on a chromosome level. (Gordon and Egner 2013)

InDel (Insertion/Deletion)

Point or gene mutations are defined as changes that occur in the DNA sequence and can be passed on to subsequent generations. These mutations usually occur in one or more nucleotides and cause changes in the structure of the genome. Point mutations can also occur in the form of insertion or deletion (Campbell et al. 2006) (Fig. 3). However, insertion or deletion mutations are much more important changes than point mutations. Because insertion or deletion of one or more bases in the DNA chain usually leads to the shift of the genetic code that starts from the point where the insertion or deletion occurs, this causes polymorphism by creating significant changes in the structure of the gene (Pereira et al. 2009a, b).

Fig. 3
figure 3

Schematic representation of insertion and deletion on the DNA sequencing. (Gordon and Egner 2013)

InDels and somatic/gonosomal chromosome mutations are terms used to express mutation combinations, which include deletion or insertion separately or together, in the studies of forensic molecular genetics, evolution, and population genetics (Gelbart et al. 2002; Kondrashov and Rogozin 2004; Gregory 2004).

InDels, which account for 16–25% of all genetic variations in the genome, are the most commonly seen DNA polymorphism after SNPs, and 1.6–2.5 million InDel polymorphisms have been identified in human population studies. This, in turn, suggests that hereditary changes that occur as a result of insertion or deletion (InDel) mutations can be used as a genetic marker since they are seen in the human genome frequently. Even though they are so common, studies with InDels are limited (Hongbao 2005). However, since InDels have power of discrimination and heterozygosity, it is possible to use InDels in studies of human identification, ancestry, and evolutionary and molecular anthropology. For this reason, today, it is an alternative polymorphism to SNPs and STRs, which are used in DNA profiling in forensic identification (Reiner et al. 2005; Pereira et al. 2009a, b, 2012; Martínez-Cortés et al. 2015).

Use of Insertion/Deletion Polymorphism in Forensic Genetics

Insertion/deletion polymorphism (InDel) is known to be as a length polymorphism, which formed as a result of insertion and/or deletion of one or more nucleotides on the genome. Differences are used to discrimination of two people from each other in forensic identification, which makes polymorphisms the basis of identification (Rodriguez-Murillo and Salem 2013).

Polymorphism occurs as a result of successive mutations and is passed down from generation to generation according to Mendelian laws. In 2002, James Weber and colleagues identified over 2000 biallelic insertion/deletion polymorphisms on the human genome. Thus, studies of InDel polymorphism began for the first time in forensic sciences (Weber and May 1989). InDel loci have been used in forensic identification since they are able to study with multiplex PCR and have high heterozygosity and small amplicon lengths (60–200 bp) (Fig. 4). They can also be used to identify disaster victims in mass fatality cases, such as aircraft accidents, terrorist attacks or natural disasters, ancestry determination, evolutionary research, and molecular anthropology (Manta et al. 2012; Martínez-Cortés et al. 2015). More successful results can be achieved in identification by using InDel polymorphism, which are called next-generation genetic variations together with STR and SNP loci (Sanchez et al. 2006; Pereira et al. 2012).

Fig. 4
figure 4

Comparison of PCR product sizes of InDels with STRs

InDel loci have been used in forensic sciences, especially in recent years. Commercial kit production for InDel loci is highly restricted. So forensic scientists often create their own InDel panels and try to popularize them (Guangyao et al. 2015).

Commercial Kits Based on InDel Analysis

InDel loci have been used in forensic science in recent years. However, InDel kit production is also quite limited. Currently, there are three commercial InDel kits made for the use in forensic science.

Investigator® DIPplex Kit

It is a kit containing 30 InDel loci found in somatic chromosomes that have been marketed by QIAGEN firm in the last 2–3 years. The kit also includes the locus of amelogenin. Selected InDel loci are smaller than 160 bp and are specifically designed for the use in the identification or anthropological research. In addition, they are analyzed by ABI 310, 3130, 3130XL, 3500, and 3500XL capillary electrophoresis (Investigator® DIPplex Handbook 2014).

InDelPlex INDEL Polymorphism Detection Kit

It is a kit developed by Pereira R. et al. in partnership with the Institute of Molecular Immunology and Pathology of Oporto University and the University of Santiago de Compostela and commercialized with Genomica firm over the past few years. The kit allows multiplex PCR amplification of 38 InDel region and produced for the use of forensic identification and clinical diagnostic purposes. These analyses can be done by using ABI 310, 3130, 3130XL, 3500, and 3500XL capillary electrophoresis (Pereira et al. 2009a, b, 2012).

Mentype® DIPplex PCR Amplification Kit

It is a kit containing 30 InDel loci found on somatic chromosomes that have been marketed in recent years by the Biotype firm and also includes the locus of amelogenin. This kit restricted amplicon length to ~150 bp, which makes the kit perfectly suitable for analyzing critical stains. The kit can be analyzed by using ABI 310, 3130, 3130XL, 3500, and 3500XL capillary electrophoresis (Mentype® DIPplex PCR Amplification Kit Handbook 2009).

Use of Gonosomal InDel Loci in Forensic Genetics

InDel loci exist on both somatic and sex chromosomes. Compared to somatic chromosomes, there are InDel variations in similar ratios on the X and Y chromosomes too. It can be used in addition to autosomal genetic markers in father/daughter relationship in X-linked polymorphism analysis. In the cases of paternity of two relative men (such as father/son), paternity can be determined by the use of polymorphism on the X chromosome if the child is a girl. It is because suspicious fathers will have different X chromosomes since they have different mothers (Szibor et al. 2005; Prinz and Sansone 2001).

Since the Y chromosome is transferred unchanged from father to son, it is observed in male members of the same family in the same form, except for mutations and genetic abnormalities. For this reason, population genetics, biogeographic lineage determination, male kinship evaluation or link analysis, and forensic genetic identification studies can be done by utilizing Y chromosome-related polymorphism analysis. In paternity cases, if the child is a boy, it is possible to get results by typing Y chromosome loci of any man (grandfather, uncle, cousin, etc.), who is in the family tree of the father candidate, especially when DNA cannot be obtained for various reasons (such as the father candidate cannot be found, DNA cannot be obtained from his biological material or he is dead, etc.) (Roewer et al. 2000; Corach et al. 2001; Prinz and Sansone 2001). Similarly, in paternity cases, where the father candidate is dead or could not be found, it is possible to get results by analyzing the X-linked polymorphism between the grandmother candidate and child (if the child is a girl) because the child gets one of her X chromosomes that her father gets from her grandmother. As a result, the paternal X chromosome in a girl will necessarily be coming from one of her grandmother’s X chromosomes (Szibor et al. 2005, Prinz and Sansone 2001).

Pregnancies that occur after sexual assault crimes can result in abortion. In 6–8 weeks of abortions, the tissue and maternal blood materials are found together with miscarriage, and it is difficult to separate them microscopically. In this case, if the fetus is female, the presence of the X chromosome, which was inherited from the suspected father, can be determined by X-linked polymorphism analysis (Szibor et al. 2005; Prinz and Sansone 2001).

The Y chromosome is also used in illuminating sexual assault crimes. In cases where the victim is a woman, the victim’s vaginal swab sample contains a mixture of DNA belonging to the victim and the perpetrator. Since there is no Y chromosome in women, only the profile of the perpetrator is obtained when identification is performed using Y chromosome loci in the mixture DNA (Roewer et al. 2000; Corach et al. 2001; Prinz and Sansone 2001).

By analyzing the polymorphism associated with X chromosome, it can also be determined whether two girls’ father is the same, regardless of whether their mothers are different or the same. Since the father has one X chromosome, he will pass on the same X chromosome to all girls. But if their father is not the same, the X chromosome loci transferred from their father will also be different (Szibor et al. 2005; Prinz and Sansone 2001).

By using gonosomal and autosomal InDels together with STR and SNP loci, more successful results can be achieved in such cases or in identification of samples, which were collected from the crime scene (Szibor et al. 2005; Prinz and Sansone 2001).

Analysis of InDel Loci

Although InDel loci are biallelic, it is analyzed by the method of fragment analysis, such as STRs. These steps are followed in the identification of InDels as in STRs: first, extraction and quantification of DNA sample, then amplification of InDel loci of DNA by using fluorescence-marked InDel primers in multiplex PCR, and, finally, obtaining the profile of person by separating these loci in capillary electrophoresis with ABI 310, 3130, 3130XL, 3500, and 3500XL devices.

As a result of the analysis of InDel loci in capillary electrophoresis, three types of alleles can be seen in electropherogram because either deletion, insertion, or a combination of both can be transferred from the mother and father. These are expressed as deletion (Del – minor allele), insertion (Ins – major allele), and insertion-deletion (InDel), where both deletion and insertion are transferred together. The base pair size of the identified InDel loci on the DNA is defined by researches. Since it is defined how many base deletions or insertions they undergo, determining only the alleles as insertion or deletion will be sufficient to say the number of base pairs.

Before the analysis in capillary electrophoresis, this base pair data is defined in panel manager. It is created with a panel for each dye color (primer) with bins. In this way, the data of all alleles that exist in the population, just like in analysis of STR loci, is defined into the device. In capillary electrophoresis of DNA, if a person’s peak was observed in the first bin, as shown in the figure, this shows the person’s allele is “Del” because it has an incomplete base sequence. If the insertion has been transferred from the parent, the peak will be observed in the second bin, as shown in Fig. 5, which means that the person has the insertion as a result of addition of base, and the allele is “Ins.” If a peak is observed in both bins, it means that the person has received both an insertion and deletion polymorphism from the parent. This, in turn, refers to the third allele, which we call “InDel” (Fig. 5).

Fig. 5
figure 5

Electropherogram image of insertion/deletion (InDel) polymorphism. (Fondevila Álvarez et al. 2011)

In this way, the profile of people analyzed with InDel multiplex panel kits in all regions can be determined just like in STR analysis.

The number of loci that will reach sufficient (99.9%) discrimination power for human identification of InDel loci differs from STR loci number. In order to use InDels for identification purposes, as in STR analysis, a sufficient number of loci to use are determined, depending on the heterozygosity rates and discrimination power of the locus contained in the multiplex kit or panel. The number of loci contained in InDel kits or panels developed by researchers, which exist on the market today, has a 99.9% discrimination power, and this locus number ranges from 20 to 40 (Ünsal et al. 2017).

While expressing D70+ as the insertion allele in the figure, D70- refers to the deletion allele.

Conclusion

Currently, STR systems are widely used for human identification in criminal laboratories. Forensic scientists are developing new polymorphic systems that can obtain results from all kinds of biological samples as an alternative to polymorphic systems (Coble and Butler 2005).

Insertion/deletion polymorphism occurs in the form of the addition or loss of one or more bases in the human genome, which is causing polymorphism and can be used in human identification and genealogical determination in illuminating forensic cases. Especially in cases where DNA obtained from biological evidence from the crime scene is degraded or trace amount; As an alternative to STRs, small-sized systems on DNA are preferred for analysis in forensic sciences. In such cases, successful identification results can be obtained by using InDel polymorphism (Pereira et al. 2009a, b, 2012).

The loci that these systems occupy on DNA are quite small (60–200 bp), and the potential for a successful DNA profile is high, even if the DNA is degraded. The amount of DNA that can be obtained from a degraded or trace biological sample is either non-existent or ranges from 100 pg to 1 ng. Since it is known that the amount of DNA required for multiplex PCR used for identification studies is 0.5–1 ng, it has been determined by studies that identification with InDel loci gives successful results.Therefore InDels have been used in forensic science in recent years, and studies continue to develop InDel panels as an alternative or complement to STR and SNP loci in identification (Ünsal et al. 2017).