Easykin: a flexible and user-friendly online tool for forensic kinship testing and missing person identification

Li, Ran; Wang, Nana; Zang, Yu; Liu, Jiajun; Wu, Enlin; Wu, Riga; Sun, Hongyu

doi:10.1007/s00414-023-03083-1

Easykin: a flexible and user-friendly online tool for forensic kinship testing and missing person identification

Original Article
Published: 25 September 2023

Volume 137, pages 1671–1681, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

International Journal of Legal Medicine Aims and scope Submit manuscript

Easykin: a flexible and user-friendly online tool for forensic kinship testing and missing person identification

Download PDF

Ran Li ORCID: orcid.org/0000-0001-6473-4570^1,2,3,
Nana Wang^1,3,
Yu Zang^1,3,
Jiajun Liu^1,3,
Enlin Wu^1,3,
Riga Wu^1,3 &
…
Hongyu Sun ORCID: orcid.org/0000-0002-5926-4495^1,3

412 Accesses
Explore all metrics

Abstract

In forensic kinship testing and missing person identification, it is a fundamental question to choose the most informative reference relatives, select appropriate genotyping systems, and evaluate the weight of evidence comprehensively. Despite that several useful tools have been developed, they have not addressed these questions satisfactorily. In this paper, we develop a flexible and user-friendly online tool, Easykin, to address the aforementioned issues. It has some promising features: (i) Pedigrees can be constructed easily and presented intuitively with just a few mouse clicks. (ii) System power can be estimated before testing based on certain set of markers and reference relatives. (iii) The pruning function of EasyKin enables users to choose appropriate subsets of available references. (iv) Parameters at a specific LR for a single case may ease evidence interpretation. (v) The user interface (UI) is an HTML-based dashboard, which is friendly to both professional and non-professional users and can be used anytime and anywhere. Here, we presented three common cases as examples to demonstrate how kinship testing and missing person identification can be improved with EasyKin. In conclusion, this tool provides a one-stop solution for forensic use, that is, instructing users to choose appropriate kits and reference relatives before testing, calculating LR in the testing, and providing parameters for data interpretation after testing. EasyKin is freely available at https://forensicsysu.shinyapps.io/EasyKin/.

Kinship analysis: assessment of related vs unrelated based on defined pedigrees

Article 20 November 2015

An evaluation of the SureID 23comp Human Identification Kit for kinship testing

Article Open access 14 November 2019

The Advantages of Noncriminal Genetic Databases in Identifying Missing Persons and Human Remains

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The determination of genetic relatedness is frequently adopted in several forensic applications, such as kinship confirmation after separation, inheritance disputes between illegitimate children, immigration cases, and personal identification of missing persons, unknown bodies, and disaster victims [1,2,3]. In such cases, the pedigree structure can be determined by using a likelihood ratio (LR) method based on genetic marker data for a set of persons, so that the determination of a relationship and the identification of a person of interest (POI) are achieved.

Generally, two questions need to be answered before testing: (i) How many markers are needed and (ii) how many reference relatives and who should be genotyped if there is a choice? Many studies have shown that adding genetic markers (STR and/or SNP) can improve discrimination between relatives and non-relatives [3,4,5,6,7]. However, the number of added markers depends on the detection systems available in the laboratory. In addition, further testing may be impossible for DNA samples with limited quality and quantity, such as trace DNA or degraded DNA. Therefore, this question can be converted to whether it is sufficient to perform a kinship analysis with available kits or genetic data. With respect to reference relatives, choosing the most informative references and/or typing more relatives can also improve the discrimination power of a genotyping system in kinship testing and missing person identification. Ge et al. [1] suggested that first degree relatives (parents and full siblings) were the most preferred relatives and references with less genetic dependence were superior to those with more genetic dependence. However, it is possible that the most suitable candidates, e.g., parent(s), are not available and more distant relatives need to be genotyped. One of this kind is the well-known “Missing Grandchildren of Argentina”, where the biological parents of POI were murdered and their bodies still remain missing [8]. Sometimes, there may be many relatives, say ten full siblings. Conceivably, it is not necessary to genotype all of them. Furthermore, if a reliable conclusion cannot be made after initial testing, further data must be gathered by recruiting additional family members. Prioritization problems may be encountered because the addition of one relative may provide higher discrimination power than the one of another [3, 9]. Last but not the least, different labs may have different thresholds to confirm a relationship [10, 11]. Selecting a lower threshold decreases the false negative rate (FNR) but at the cost of increasing the false positive rate (FPR). A higher threshold generally results in a higher accuracy but lower effectiveness [12]. Accordingly, the number of kits/markers and reference relatives need to be increased to reach an explicit and reliable conclusion. Beyond these questions, data interpretation also matters. More parameters are needed to comprehensively interpret DNA evidence besides LR itself and corresponding posterior probability in the court.

Despite that several useful tools, such as Familias [13], EasyDNA [14], forrel [9], Bonaparte [15, 16], and Converge Software [17] have been developed, they have not addressed the issues mentioned above satisfactorily. For example, Familias is useful for LR calculation and simulation, but it does not provide solutions for choosing reference relatives. The R package forrel, using a conditional simulation method, is a good tool for prioritizing additional family members for genotyping in missing person cases. However, it is not friendly to laypeople, particularly those unfamiliar with coding. Therefore, we developed a flexible and user-friendly online tool, i.e., Easykin, for forensic kinship analysis and missing person identification. This tool has several promising features. First, it can be used to estimate the system power for a specific set of markers and reference relatives at the consultation and commissioning stage. Importantly, the system power of subsets of available references can also be evaluated, making it easy to choose appropriate references or combinations of them. Second, two mutually exclusive hypotheses can be constructed easily and presented intuitively with just a few mouse clicks. Finally, the user interface (UI) is an HTML-based dashboard, which is friendly to both professional and non-professional users and can be used anytime and anywhere.

Methods

Pedigrees and references

For the purpose of simplicity in pedigree construction, 1st and 2nd degree relatives as well as several genetically unrelated individuals can be chosen. At current version, reference relatives include father/mother, 0–6 children (0–3 sons and 0–3 daughters), paternal grandparent(s), 0–3 paternal uncles, 0–3 paternal aunts, maternal grandparent(s), 0–3 maternal uncles, 0–3 maternal aunts, 0–6 full siblings, 0–6 paternal half siblings, 0–6 maternal half siblings, 0–6 grandchildren (the children of son), 0–6 grandchildren (the children of daughter), 0–6 nephews/nieces (the children of brother), and 0–6 nephews/nieces (the children of sister). Several genetically unrelated individuals can also be included, e.g., spouse, the mother of paternal half sibling, the father of maternal half sibling, daughter-in-law, son-in-law, brother-in-law, and sister-in-law. Theoretically, more than one billion scenarios can be constructed, covering the majority of common cases. For more complex scenarios, say involving incest, users are encouraged to upload their own pedigrees under the instruction in the user guide.

In order to determine the most appropriate reference relative(s), a pruning function is implemented in this tool. During this process, each reference is pruned one by one from the original pedigree, thus generating a series of subsets of them. For example, if three references are available, all possible subsets/pedigrees are POI+S1+S2, POI+S1+S3, POI+S2+S3, POI+S1, POI+S2, and POI+S3 (Fig. 1).

Simulation

The alleles of founders (i.e., individuals without parents in the pedigree) are randomly assigned according to the allele frequencies of each locus. All markers are assumed to be unlinked and in Hardy-Weinberg equilibrium and linkage equilibrium. Founders/parents transmit a single allele to his/her offspring with an equal probability. Mutations are also incorporated, with a higher rate for paternal mutation than for maternal mutation (e.g., 3–5 folds). After fully assigning the pedigree, we eliminate the genotypes of samples who are not available in the testing. One hundred pedigrees are simulated under two mutually exclusive hypotheses by default, but the number can be increased if necessary.

LR calculation

Kinship is assessed by comparing two alternative hypotheses: H1: person of interest (POI) is the specific member of the putative pedigree and H2: POI is unrelated to the putative pedigree. The likelihood ratio (LR) is calculated as follows:

$$\textrm{LR}=\frac{P\left(E|H1\right)}{P\left(E|H2\right)}$$

where E represents the DNA evidence, i.e., the joint DNA profiles (e.g., STR) of all tested samples and P(E| H) represents the probabilities of the DNA evidence under each hypothesis (H1 or H2). Likelihoods are calculated using Elston–Stewart (E-S) algorithm [18], which is implemented in the R package Familias [19].

System power estimation

With simulated pedigrees, an empirical log-normal distribution of LRs for H1 and H2 can be obtained. Then, the probability of log₁₀LR at a threshold (t) can be easily estimated using the function pnorm in R. Hypotheses are supported based on the following threshold ranges (t₁ and t₂; t₁ < t₂): (i) H1 true: log₁₀LR > t₂, (ii) H2 true: log₁₀LR < t₁, and (iii) inconclusive: t₁ ≤ log₁₀LR ≤ t₂. Accordingly, several parameters are calculated for the estimation of system power, including sensitivity (Sen), specificity (Spe), positive predictive value (PPV), negative predictive value (NPV), false positive rate (FPR), false negative rate (FNR), inconclusive, and effectiveness. They are defined as follows:

Sen: proportion of pedigrees under H1 judged as H1 true;
Spe: proportion of pedigrees under H2 judged as H2 true;
PPV: proportion of pedigrees correctly judged as H1 true;
NPV: proportion of pedigrees correctly judged as H2 true;
FPR: proportion of pedigrees under H2 judged as H1 true;
FNR: proportion of pedigrees under H1 judged as H2 true;
Inconclusive: proportion of pedigrees that cannot be judged as either H1 true or H2 true;
Effectiveness: proportion of pedigrees that can be judged as H1 true or H2 true.

For details on how these metrics are calculated, please refer to Supplementary Table 1.

Effectiveness indicates how many cases will be successfully addressed with defined thresholds. It is a good indicator of overall performance and is classified into four levels, < 0.8 as unsatisfactory, > 0.8 as acceptable, > 0.9 as good, and > 0.99 as perfect.

Implementation

The user interface (UI) of EasyKin is an HTML-based dashboard using shinydashboard (version 0.7.1), which leverages functions from the R package shiny (version 1.5) for the application. Familias is utilized for pedigree construction and LR calculation. Package DT provides an R interface to the JavaScript library DataTables and is used for data presentation. The UI can be accessed from commonly used web browsers (e.g., Google Chrome, Microsoft Edge, Mozilla Firefox, and Apple Safari) and may be utilized from desktop, tablet, or smartphone devices at https://forensicsysu.shinyapps.io/EasyKin/ (a stand-alone version is also available at https://github.com/Ryan620/Easykin). An example of user interface of EasyKin is shown in Fig. 2. All simulations, calculations, and presentations in this tool are performed using R programming.

Results

First, a general workflow is recommended in Fig. 3 for kinship testing and missing person identification using EasyKin. Step 1: Construct two alternative hypotheses with available reference relatives. Step 2: Generate a number of virtual families according to the allele frequencies of STR markers, which are included in the available kits in one lab. An empirical log-normal distribution of LRs for H1 and H2 can be obtained with these simulated pedigrees. Then, by setting appropriate thresholds, parameters of system power, i.e., Sen, Spe, PPV, NPV, FPR, FNR, Inconclusive, and Effectiveness, are estimated. Step 3: Users can now make a decision on which kit(s) and reference relative(s) should be included after balancing FPR and FNR as well as Effectiveness. Considering that there may be cases where many references are available and possibly not all of them are necessary, the pruning function in EasyKin can be used to choose the most informative subsets of them. Step 4: Process sample collection, DNA genotyping, and LR calculation. Step 5: Evaluate the weight of evidence and prepare for data interpretation, including LR itself, posterior probability, a corresponding verbal equivalent, and parameters of system power under the specific LR.

Next, we will present three examples to demonstrate how kinship testing and missing person identification can be improved with EasyKin.

Example 1—pairwise full sibling testing

Pairwise full sibling testing is the second most common type in forensic practice after paternity testing. We assume that one forensic lab has three STR sets, i.e., AmpFlSTR Identifiler (Thermo Fisher Scientific, San Francisco, CA, USA), Huaxia Platinum (Thermo Fisher Scientific, San Francisco, CA, USA), and Microreader 23sp (Suzhou Microread Genetics, Jiangsu, China). The performance of the three kits can be evaluated based on simulation data with EasyKin. According to [12], thresholds of t₁ = −2 and t₂ = 2 are required. Under these thresholds, none of these kits can individually reach a perfect effectiveness (> 0.99) unless combining them, e.g., Set 4 and Set 5 (Table 1). Therefore, a combination of AmpFlSTR Identifiler + Microreader 23sp or Huaxia Platinum System + Microreader 23sp is suggested for pairwise full sibling testing in this lab. Although it is possible that LR values of individual cases may reach the defined thresholds using a single kit, a sufficient set of markers is still suggested for a lower error rate (Table 1). And vice versa, stricter or higher thresholds are suggested when using low power systems, which may be contrary to our instincts.

Table 1 System power for pairwise full sibling testing using different STR sets. Thresholds: t₁ = −2 and t₂ = 2

Full size table

Example 2—personal identification of a unknown body

A man was found dead 20 years ago and his body was cremated after genotyped with AmpFlSTR Identifiler (Thermo Fisher Scientific). For years, his identity remains unknown, until recently a man claims to be his (full) brother. Now, we are commissioned to confirm their relationship.

In this case, marker sets cannot be expanded further due to a lack of DNA of POI. We first evaluated the performance of pairwise full sibling testing with 15 STR loci included in AmpFlSTR Identifiler. As shown in Table 1 (Set 1), the effectiveness was unsatisfactory (0.7866) and the error rate was relatively high, i.e., FPR = 0.0005 and FNR = 0.0014. Therefore, we requested for more reference relatives to participate in the test and were informed that merely an aunt of POI was available. After adding the aunt, the effectiveness increased to 0.9152 and error rate decreased significantly, with FPR < 0.0001 and FNR = 0.0005, indicating that this set of references was able to improve the performance. Then, blood samples of the two references, along with DNA profiles of POI, were sent to our lab and genotyped with Goldeneye 25A (Peoplespot, Beijing, China). Genotypes and LRs are listed in Table 2. The combined LR (CLR) was 119.2377 (log₁₀CLR = 2.0764), exceeding our defined thresholds (t₁ = −2 and t₂ = 2) and thus supporting their relationship. In addition, we also calculated LR values for POI and his brother, which fell between t₁ and t₂, thus inconclusive. Given this, pre-estimation with EasyKin is helpful to guide the test and can avoid multiple sampling in forensic caseworks (i.e., collect related samples in one time, not successively).

Table 2 Genotypes and LR values in Example 2. POI: the deceased man; S1: putative aunt of POI; S2: putative full sibling of POI; POI was genotype with AmpFlSTR Identifiler while S1 and S2 were genotyped with Goldeneye 25A, which covers all the markers in AmpFlSTR Identifiler

Full size table

Example 3—inheritance dispute

In an inheritance dispute case, a boy (POI) claimed to be the child of a deceased man. The mother of POI (known), putative grandparents, a putative paternal half-brother, and his mother were available for the test.

This kind of scene is frequently encountered in practice and we need to determine who should be included before testing. First, we need to evaluate the performance if all these references are genotyped using a certain kit, e.g., Huaxia Platinum System. In this case, we require stricter thresholds with t₁ = −4, t₂ = 4 and effectiveness > 0.99. Pedigrees under H1 and H2, LR distribution and corresponding system power are shown in Fig. 4. We can anticipate that ~99.89% of cases (effectiveness) will pass the defined thresholds with very low error rates, i.e., FPR < 0.0001 and FNR < 0.0001, indicating a sufficient system power for the testing. In the next step, by pruning the pedigree, we find that the number of references can be reduced without significant decrease in accuracy and effectiveness (Table 3). Four combinations, i.e., POI+S4+S5+S6+S3+S7, POI+S4+S5+S6+S3, POI+S4+S5+S6+S7, and POI+S4+S5+S6, have effectiveness > 0.99. The least number of references is achieved with the combination of POI, his mother, and both putative grandparents (POI+S4+S5+S6). Not surprisingly, POI+S4+S5+S6+S3 and POI+S4+S5+S6 have the same discrimination power given that the two subsets are equivalent. S3 is a singleton in both H1 and H2 and provides no further information about the deceased man unless her son (S7) is also genotyped. If looser thresholds are defined, say t₁ = −1, t₂ = 1 and effectiveness > 0.99, the number can be reduced further to only two references (both putative grandparents) at the cost of a higher error rate (Supplementary Table 2). Therefore, it may not be necessary to genotype S3 and S7 (as well as S4 at looser thresholds).

Table 3 System power for different subsets after pruning the original pedigree in Example 3. Markers: 23 STRs in Huaxia Platinum System; thresholds: t₁ = −4 and t₂ = 4; simulations: n = 500; sample labeling of POI and S1–S7 corresponds to those in Fig. 4; rows colored gray represent subsets with effectiveness > 0.99; cells with “-” mean NULL outputs as the two hypotheses are equivalent

Full size table

It is noteworthy that the reduction of references may be problematic in this case if S5 and S6 have more than one child. If POI+S4+S5+S6 or POI+S5+S6 are genotyped, we can only say that POI is the grandson of S5 and S6 (if support), not necessarily to be the child of the deceased man. From this point of view, whether to perform reference pruning depends and varies in real cases.

Discussion

In this paper, we introduced a flexible and user-friendly online tool, named EasyKin, for forensic kinship testing and missing person identification. The three examples demonstrated that if we estimate the system power in advance using EasyKin, appropriate kits and informative references can be easily determined before testing. It may be helpful to avoid multiple sampling and superfluous testing in real cases, thereby reducing time and economic cost.

Although it is possible that LRs of individual cases may reach the defined thresholds with a smaller number of STRs, a sufficient marker system (if available) is always suggested considering a higher accuracy at the same thresholds (Table 1). With regard to references, more relatives generally indicate higher system power, and typing as many of them as possible is encouraged. There are some more considerations to take into account. First, singleton individuals (e.g., spouses) are useless unless other specific relatives are genotyped. For example, S3 cannot provide any more information unless her son S7 is also genotyped in Example 3. Second, distant relatives (third degree or more distant relatives) can only provide limited discrimination capacities and are less recommended with conventional autosomal markers. That is why only 1st and 2nd degree relatives are included by default in the construction of pedigrees in EasyKin. Nevertheless, lineage markers residing on the Y chromosome and mitochondrial DNA (mtDNA) genome can still be used to increase the LR values for these distant kinship analyses [20, 21]. However, it may be challenging to perform additional amplifications in cold cases (Example 2) and forensic investigations of small amounts of DNA. Third, if many references are available, (possibly) not all of them are necessary and the pruning function in EasyKin can be used to choose the most informative subsets of them. Besides the scenario in Example 3 of this study, we also performed the pruning function for pedigree F9 in [22]. We found that the maternal aunt should not have been included as she provided no further increase in effectiveness (Supplementary Table 3).

In addition to genetic markers and references, the threshold also matters. We notice that different labs may have different thresholds to confirm a relationship [10]. Previous works tend to focus only on inclusion and apply a single threshold [4, 23]. At present, double thresholds are widely used in China so that both FPR and FNR can be balanced. As recommended in Specification of parentage testing (GB/T 37223–2018) [24], Technical specification for identification of biological full sibling relationship (SF/T 0117–2021)[25], and Specification for identification of biological grandparent-grandchild relationship (SF/Z JD0105005–2015)[26], a relationship is affirmed if LR > 10,000 while it is rejected if LR < 0.0001, otherwise inconclusive. In accordance with these specifications, EasyKin is designed to estimate the system power (Sen, Spe, PPV, NPV, FPR, FNR, Inconclusive, Effectiveness) under either single or double thresholds. However, with the above fixed LR-threshold method[27], t = 10,000 may be too high for cases with low statistical power and will lead to high false negative rates under single threshold and low effectiveness under double thresholds. Therefore, lower thresholds were also applied in some studies [4, 12]. Beside, Marsico et al. proposed a flexible and case-specific LR threshold, named LR decision threshold (DT) [22]. The DT approach allows dealing with underpowered pedigrees and obtaining thresholds with manageable FNR and FPR. The concept is similar to the estimation of optimal cutoff in ROC curves. Although the authors did not intend to provide a LR threshold for reaching a conclusion in the identification process, the DT approach is very instructive on threshold determination.

In real applications, interpretation of evidence is also crucial. In order to improve the communication between forensic scientists and laypeople, EasyKin converts likelihood ratio to verbal equivalents, which are often used to express the strength of evidence in court. According to [28], proposed verbal scales are null support (LR = 1), weak or limited support (LR > 1–10), moderate support (LR > 10–100), strong support (LR > 100–1000), very strong support (LR > 1000–10000), and extremely strong support (LR >10,000). However, we would like to point out that a verbal scale should always be accompanied by a numeric expression of the value of evidence, especially when the value of the evidence is weak/limited. Besides LR itself, system power at a specific LR may also act as good metrics for the interpretation. With EasyKin, users can drag the slider to the proper position (using the single threshold mode) after LR calculation to evaluate the evidence for individual case. If H1 is supported, Sen, PPV, and FPR are useful for data interpretation while Spe, NPV, and FNR can be used if H2 is supported. Take the case in Example 2 as an example. Given the current LR value 119.2377 (log₁₀CLR = 2.0764) as the threshold, Sen, PPV, and FPR are 0.9120, 0.9999, and 0.0001, respectively. Correct rates (PPV and NPV) and error rates (FPR and FNR) may be, to some degree, more straightforward and easier to understand for jurors and lawyers. Therefore, these metrics can also be used for interpreting the value of evidence.

We compared the performance between EasyKin and Familias [2] (desktop application), the latter of which is a popular and free software for kinship analysis. Taking the case in Fig. 4 as an example, we just need about 15 seconds (s) for hypothesis construction with EasyKin, approximately twelve folds faster than Familias (about 3 min). Therefore, fast and intuitive construction of hypotheses is one of the main advantages of EasyKin. With respect to the speed of simulations, EasyKin cost 5.33 s, 52.24 s, 520.00 s for 100, 1000, and 10000 simulations while the runtime was 4.45 s, 39.44 s, and 396.59 s with Familias. Although EasyKin is a little slower, parallel computation may be processed to speed up the simulation with the stand-alone version of EasyKin (https://github.com/Ryan620/Easykin).

We noticed that the runtime of different relationships differed greatly. Thirty-seven common scenarios in forensic casework listed in Ge et al.’s study [1] were simulated and the runtime was compared. Results showed that only several seconds were needed for most scenarios with 100 simulations under “Equal” mutation model using the 23 STRs in AmpFlSTR Huaxia. More time are expected when cousins are included, e.g., 2040.38 s for two cousins (they are also cousins) plus POI (Supplementary Fig. 1). In addition, we found that the runtime increased linearly with both the number of STRs and the number of simulations (Supplementary Fig.2).

There are still some limitations for current version of EasyKin. First, three mutation models are implemented, i.e., “Equal,” “Proportional,” and “Stepwise,” but the calculation under “Equal” model is more efficient and faster. If the “Stepwise” model is specified, LR calculation may be time-consuming for some scenarios. Some common and intuitively simple pairwise relationships still cost several to thousand seconds. In our one test with 100 simulations and 23 STRs in local mode, parent-child, full-sibling, half-sibling, grandparent-grandchild, and avuncular-nephew relationships needed approximately 6 s, 23 s, 7 s, 7 s, and 4000 s, respectively. Therefore, we recommend users choose the “Equal” model for pedigree simulations given the almost identical LR distribution under the three mutation models. Second, the relationships among the references are not validated but EasyKin automatically calculate the LRs for all pairs of references. If any false relationship is found, the individual(s) with false relationship should be removed. Similarly, the true relationship may differ from both of the stated hypotheses and it may introduce bias in the test results [29]. This kind of issue will be studied in our future work. Finally, dependence among markers, especially those on the same chromosomes, should also be considered. If any dependence is found, one of them should be excluded.

Conclusion

EasyKin is a flexible and user-friendly online tool for kinship testing. It provides a one-stop solution for forensic use, that is, instructing users to choose appropriate kits and reference relatives before testing, calculating LR automatically in the testing, and providing metrics for data interpretation after testing. We think it will greatly benefit both forensic and non-forensic practitioners.

Data availability

The datasets generated and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Ge J, Budowle B, Chakraborty R (2011) Choosing relatives for DNA identification of missing persons. J Forensic Sci 56. https://doi.org/10.1111/j.1556-4029.2010.01631.x
Kling D, Tillmar AO, Egeland T (2014) Familias 3 - extensions and new functionality. Forensic Sci Int Genet 13. https://doi.org/10.1016/j.fsigen.2014.07.004
Pinto N, Simões R, Amorim A, Conde-Sousa E (2019) Optimizing the information increase through the addition of relatives and genetic markers in identification and kinship cases. Forensic Sci Int Genet 40:210–218. https://doi.org/10.1016/j.fsigen.2019.02.019
Article CAS PubMed Google Scholar
Tamura T, Osawa M, Ochiai E et al (2015) Evaluation of advanced multiplex short tandem repeat systems in pairwise kinship analysis. Leg Med 17:320–325. https://doi.org/10.1016/j.legalmed.2015.03.005
Article CAS Google Scholar
Carboni I, Iozzi S, Nutini AL et al (2014) Improving complex kinship analyses with additional STR loci. Electrophoresis 35:3145–3151. https://doi.org/10.1002/elps.201400080
Article CAS PubMed Google Scholar
Zhang Q, Zhou Z, Wang L et al (2020) Pairwise kinship testing with a combination of STR and SNP loci. Forensic Sci Int Genet 46. https://doi.org/10.1016/j.fsigen.2020.102265
Phillips C, García-Magariños M, Salas A et al (2012) SNPs as supplements in simple kinship analysis or as core markers in distant pairwise relationship tests: when do SNPs add value or replace well-established and powerful STR tests? Transfus Med Hemotherapy 39:202–210. https://doi.org/10.1159/000338857
Article Google Scholar
Kling D, Egeland T, Piñero MH, Vigeland MD (2017) Evaluating the statistical power of DNA-based identification, exemplified by ‘The missing grandchildren of Argentina. Forensic Sci Int Genet 31:57–66. https://doi.org/10.1016/j.fsigen.2017.08.006
Article CAS PubMed Google Scholar
Vigeland MD, Marsico FL, Herrera Piñero M, Egeland T (2020) Prioritising family members for genotyping in missing person cases: a general approach combining the statistical power of exclusion and inclusion. Forensic Sci Int Genet 49:102376. https://doi.org/10.1016/j.fsigen.2020.102376
Article CAS PubMed Google Scholar
Thomsen AR, Hallenberg C, Simonsen BT et al (2009) A report of the 2002-2008 paternity testing workshops of the English speaking working group of the International Society for Forensic Genetics. Forensic Sci Int Genet 3:214–221. https://doi.org/10.1016/j.fsigen.2009.01.016
Article CAS PubMed Google Scholar
Annual report summary for testing in 2013. In: AABB. https://www.aabb.org/sa/facilities/pages/relationshipreports.aspx. Accessed 17 Jun 2021
Li R, Li H, Peng D et al (2019) Improved pairwise kinship analysis using massively parallel sequencing. Forensic Sci Int Genet 38:77–85. https://doi.org/10.1016/j.fsigen.2018.10.006
Article CAS PubMed Google Scholar
Egeland T, Mostad PF, Mevåg B, Stenersen M (2000) Beyond traditional paternity and identification cases. Selecting the most probable pedigree. Forensic Sci Int 110:47–59. https://doi.org/10.1016/S0379-0738(00)00147-X
Article CAS PubMed Google Scholar
Fung WK (2003) User-friendly programs for easy calculations in paternity testing and kinship determinations. Forensic Sci Int 136:22–34. https://doi.org/10.1016/S0379-0738(03)00218-4
Article PubMed Google Scholar
Bruijning-van Dongen CJ, Slooten K, Burgers W, Wiegerinck W (2009) Bayesian networks for victim identification on the basis of DNA profiles. Forensic Sci Int Genet Suppl Ser 2:466–468. https://doi.org/10.1016/j.fsigss.2009.08.024
Article Google Scholar
Slooten K (2011) Validation of DNA-based identification software by computation of pedigree likelihood ratios. Forensic Sci Int Genet 5:308–315. https://doi.org/10.1016/j.fsigen.2010.06.005
Article CAS PubMed Google Scholar
Converge Forensic Analysis Software - CN (2021) https://www.thermofisher.cn/cn/zh/home/industrial/forensics/human-identification/forensic-dna-analysis/forensic-dna-data-interpretation/converge-forensic-analysis-software.html. Accessed 17 Jun 2021
Elston RC, Stewart J (1971) A general model for the genetic analysis of pedigree data. Hum Hered 21:523–542. https://doi.org/10.1159/000152448
Article CAS PubMed Google Scholar
Familias: book, R version and courses. https://www.familias.name/. Accessed 17 Jun 2021
Kayser M (2017) Forensic use of Y-chromosome DNA: a general overview. Hum. Genet 136:621–635
Article CAS PubMed PubMed Central Google Scholar
Parson W, Gusmão L, Hares DR et al (2014) DNA Commission of the International Society for Forensic Genetics: revised and extended guidelines for mitochondrial DNA typing. Forensic Sci Int Genet 13:134–142. https://doi.org/10.1016/j.fsigen.2014.07.010
Article CAS PubMed Google Scholar
Marsico FL, Vigeland MD, Egeland T, Piñero MH (2021) Making decisions in missing person identification cases with low statistical power. Forensic Sci Int Genet 54. https://doi.org/10.1016/j.fsigen.2021.102519
Cho S, Shin ES, Yu HJ et al (2017) Set up of cutoff thresholds for kinship determination using SNP loci. Forensic Sci Int Genet 29:1–8. https://doi.org/10.1016/j.fsigen.2017.03.009
Article CAS PubMed Google Scholar
Specification of parentage testing (GB/T 37223–2018) issued by State Administration for Market Regulation, Standardization Administration of the People’s Republic Of China 2018. https://www.moj.gov.cn/pub/sfbgw/zwfw/zwfwbgxz/202101/P020210122423062713689.pdf. Accessed 17 Jun 2021
Technical specification for identification of biological full sibling relationship (SF/T 0117–2021) issued by the Ministry of justice of the people’s Republic of China. Release on 17 Nov, 2021. https://www.moj.gov.cn/pub/sfbgw/zwxxgk/fdzdgknr/fdzdgknrlzyj/lzyjsfhybzj/202112/W020211207597115986640.pdf. Accessed 17 Jun 2021
Specification for identification of biological grandparent-grandchild relationship (SF/Z JD0105005–2015) issued by the Judicial authentication administration of the Ministry of justice of the people’s Republic of China.2015. http://www.moj.gov.cn/pub/sfbgw/zwfw/zwfwbgxz/202101/1565869722167038371.pdf. Accessed 10 Aug 2022
Kruijver M, Meester R, Slooten K (2014) Optimal strategies for familial searching. Forensic Sci Int Genet 13:90–103. https://doi.org/10.1016/j.fsigen.2014.06.010
Article PubMed Google Scholar
Marquis R, Biedermann A, Cadola L et al (2016) Discussion on how to implement a verbal scale in a forensic laboratory: benefits, pitfalls and suggestions to avoid misunderstandings. Sci Justice 56:364–370. https://doi.org/10.1016/J.SCIJUS.2016.05.009
Article PubMed Google Scholar
Brustad HK, Vigeland MD, Egeland T (2021) Pairwise relatedness testing in the context of inbreeding: expectation and variance of the likelihood ratio. Int J Legal Med 135:117–129. https://doi.org/10.1007/s00414-020-02426-6
Article PubMed Google Scholar

Download references

Funding

This research was funded by the National Natural Science Foundation of China (82293655, 81971798) and the Natural Science Foundation of Guangdong Province (2019A1515011527).

Author information

Authors and Affiliations

Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, No. 74 Zhongshan Road II, 510080, Guangdong, People’s Republic of China
Ran Li, Nana Wang, Yu Zang, Jiajun Liu, Enlin Wu, Riga Wu & Hongyu Sun
School of Medicine, Jiaying University, Meizhou, 514015, People’s Republic of China
Ran Li
Guangdong Province Translational Forensic Medicine Engineering Technology Research Center, Zhongshan School of Medicine, Sun Yat-sen University, No. 74 Zhongshan Road II, Guangzhou, 510089, Guangdong, People’s Republic of China
Ran Li, Nana Wang, Yu Zang, Jiajun Liu, Enlin Wu, Riga Wu & Hongyu Sun

Authors

Ran Li
View author publications
You can also search for this author in PubMed Google Scholar
Nana Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yu Zang
View author publications
You can also search for this author in PubMed Google Scholar
Jiajun Liu
View author publications
You can also search for this author in PubMed Google Scholar
Enlin Wu
View author publications
You can also search for this author in PubMed Google Scholar
Riga Wu
View author publications
You can also search for this author in PubMed Google Scholar
Hongyu Sun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongyu Sun.

Ethics declarations

Ethics approval

This study was approved by the Ethics Committee of Zhongshan School of Medicine, Sun Yat-sen University (Guangzhou, China), approval number [2020]044.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

ESM 1

ESM 2

ESM 3

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Li, R., Wang, N., Zang, Y. et al. Easykin: a flexible and user-friendly online tool for forensic kinship testing and missing person identification. Int J Legal Med 137, 1671–1681 (2023). https://doi.org/10.1007/s00414-023-03083-1

Download citation

Received: 29 June 2023
Accepted: 05 September 2023
Published: 25 September 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s00414-023-03083-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Easykin: a flexible and user-friendly online tool for forensic kinship testing and missing person identification

Abstract

Similar content being viewed by others

Kinship analysis: assessment of related vs unrelated based on defined pedigrees

An evaluation of the SureID 23comp Human Identification Kit for kinship testing

The Advantages of Noncriminal Genetic Databases in Identifying Missing Persons and Human Remains

Introduction