Introduction

Age of majority is the threshold at which an individual is legally considered to have attained adulthood and, in most countries, it is set at 18 years [1,2,3]. Several exceptions exist and, before the age of majority, a minor may acquire partial or full legal emancipation through the consent of parents, legal guardian, or the court approval. In criminal proceedings, this does not mean that emancipated subjects under the age of 18 can no longer benefit from protection by the specific national law. In terms of the best interest and rights of minors, they cannot be detained with adults until at least 18 years and, in cases of uncertainty about the real age, the individual must be given the benefit of the doubt until proven otherwise; this means that, in such cases, a minor must be treated as such [3, 4].

In the European Union (EU), the best interest of the child, in particular of unaccompanied minors, is protected in the Dublin Regulation (Article 6) [5]. Increasing cross-border migration has resulted in numerous unaccompanied minors who cannot prove their chronological age with valid identification documents or who deliberately provide false age information [3,4,5,6,7]. Being unable to prove age, or incorrectly assessing a minor as an adult, can restrict the minor from having access to some basic human rights such as healthcare and education. This fact puts into focus age assessment: the judicial authorities often request a medical age assessment solid report issued by a forensic expert in which the most likely age and/or the minimum age is indicated [8, 9].

The European Asylum Support Office (EASO) has contributed intensively to the debate with its recently updated “Practical Guide on age assessment”, in which a distinction between “radiation-free” techniques and those involving radiation is made. In addition, priority is given to the least intrusive methods [10]. The EASO established that, within the EU, no agreement has been reached so far about the possible ethical and legal justification of methods [10].

The Study Group on Forensic Age Diagnostics (AGFAD: Arbeitsgemeinschaft für Forensische Altersdiagnostik), of the German Society of Legal Medicine, created and updated the guidelines for age estimation and institutionalisation of quality control, with special attention to legal and ethical implications [3, 11]. These researchers stated that age assessment reports should indicate data on the reference population regarding genetic and geographic origin, and variations of age-related parameters. That is important in order to allow the decision-making institutions to consider any doubt such that it leads to the more favourable legal outcome for the undocumented minors [3, 11, 12].

Along with the analysis of hand-wrist bones, the examination of lower third molar mineralisation, performed by an orthopantomogram of the mandibular region, is one of the most employed methods for the age periods towards 18 years [13,14,15,16]. In the last few years, many works have focused on the third molar because it is still in development around the age of 18 years old [17,18,19,20,21]. The most important one according to Demirjian [22] uses eight stages. However, it has been demonstrated that ethnicity plays an important role when a certain phase is reached [23,24,25,26]. Recently, Rolseth et al. [27] highlighted that, although such variations have been interpreted as ethnic differences, this can very likely be attributable to inadequate sample size in each stage of classification and age mimicry bias.

Cameriere et al. [28] developed a quantitative method for assessing if an individual is younger than 18 years. Its accuracy has been already tested on several populations but the outcomes of these studies have not been mutually compared to detect country-specific or ethnical differences in a pooled international group [29, 30].

The aim of this research was to compare the validity of third molar maturity index (I3M) to assess age of majority (I3M < 0.08 suggests that the subject is older than 18 years; instead I3M ≥ 0.08 suggests that the subject is younger than 18 years) between various countries based on the available data about sensitivity and specificity from five different regional groups’ radiographic samples.

Materials and methods

The sample consisted of 6157 radiologically scored orthopantomograms (OPGs), coming from 15 countries and analysed in detail in previous studies (AL = Albania, AUS = Australia, CHN = China, CO = Colombia, DOM = Dominican Republic, ET = Egypt, F = France, I = Italy, IND = India, J = Japan, PL = Poland, RCH = Chile, SRB = Serbia, TR = Turkey, ZA = South Africa) [31,32,33,34,35,36,37,38,39,40,41,42,43,44,45] (Fig. 1).

Fig. 1
figure 1

Pie chart showing the distribution of the nationality of individuals in the sample

The following regional groups were considered and modified from Liversidge et al. [46], Hennessy and Stringer [47], and Cavalli-Sforza et al. [48]: Australian (AUS), African (AF), South American (SA), European (E), and Asian (A) (Fig. 2). It is not a racial categorisation but only a geographical representation of living human population [49].

Fig. 2
figure 2

Pie chart showing the distribution of the considered regional groups

The distribution of countries for different regional groups is reported in Table 1, and the age distribution according to sex and regional group is showed in Table 2. The following I3M range categories were considered to analyse the relationship between index and real age (years) of each subject: [0, 0.08), [0.08, 0.22), [0.22, 0.4), [0.4, 0.7), [0.7, 1), [1, 2.4]. These categories were chosen according to the available published researches on the I3M [29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44].

Table 1 Sample distribution according to the country and regional group (abbreviations for countries: AL, Albania; AUS, Australia; CHN, China; CO, Colombia; DOM, Dominican Republic; ET, Egypt; F, France; I, Italy; IND, India; J, Japan; PL, Poland; RCH, Chile; SRB, Serbia; TR, Turkey; ZA, South Africa. Abbreviations for regional groups: AUS, Australian; AF, African; SA, South American; E, European; A, Asian)
Table 2 Age and sex distribution of the sample according to the different regional groups (2929 males, 3228 females. Abbreviations for regional groups: AUS, Australian; AF, African; SA, South American; E, European; A, Asian)

Regarding the selection criteria of the sample, they were as follows: clear OPGs and all permanent teeth present, including third molars. The presence of any visible dental and bone pathology (e.g., large carious lesions or endodontic treatments), children with any systemic diseases or endocrine anomalies (e.g., dysmorphology, abnormally short roots), and subjects with previous root canal treatment in the left lower permanent teeth were excluded from the study. Heavily rotated and impacted teeth with no visible roots were excluded also from the analysis.

Data was recorded in an excel file, whose columns contained individual’s identification number, sex, date of birth, and date of OPG.

Statistical analysis

A subject is positive in the test for the age of majority if his value of I3M is less than 0.08. On the contrary, a subject is negative to the test if the value of his I3M is greater than or equal to 0.08. In this work, data about sensitivity (the proportion of individuals being 18 years of age or older whose test is positive) and specificity (the proportion of individuals younger than 18 whose test is negative) of five different regional groups were pooled together using a bivariate modelling approach.

Given the prevalence of adults/minors in the populations, we evaluated the positive/negative post-test probability, i.e., the proportion of subjects testing positive/negative who truly are adults/minors. These probabilities can help a court judge make an uncertain decision about an individual’s age of majority. Furthermore, we defined the positive likelihood ratio, LR + , and negative likelihood ratio, LR − , of the test which are the likelihood of a test result, positive or negative, in adult subjects divided by the likelihood of the test result in minor subjects:

$${\displaystyle \begin{array}{c} LR+=\frac{P\left({I}_{3M}<0.08|\ subject\ is\ adult\right)}{P\left({I}_{3M}<0.08|\ subject\ is\ minor\right)}\\ {} LR-=\frac{P\left({I}_{3M}\ge 0.08|\ subject\ is\ adult\right)}{P\left({I}_{3M}\ge 0.08|\ subject\ is\ minor\right)}\end{array}}$$

A likelihood ratio close to 1 means that the test result does not change the likelihood of adult age of a subject appreciably. The more the likelihood ratio for a positive test (LR +) is greater than 1, the more likely the subject is adult. The more a likelihood ratio for a negative test is less than 1, the less likely the subject is adult. Thus, LRs correspond nicely to the concepts of ruling in and ruling out adult age of a subject.

The results of each group can be summarised by a 2 × 2 contingency table (Table 3). It summarised the number of subjects who resulted positive to the test and are adult (true positive values, tpi, i = 1,…,5), then subjects who resulted positive to the test and are underage (false-positive values, fpi, i = 1,…,5), followed by those negative to the test who are adults (false-negative values, fni, i = 1,…,5), and finally those negative to the test who are underage (true negative values, tni, i = 1,…,5). The data model is binomial:

Table 3 Contingency table showing the discriminating performance of the test
$$\begin{array}{c}{tp}_{i}\sim \mathrm{Bin}\left({n}_{1i}, {p}_{1i}\right), i=1,\dots ,5\\ {fp}_{i}\sim \mathrm{Bin}\left({n}_{2i}, {p}_{2i}\right), i=1,\dots ,5\end{array}$$

where Bin(n.p.) stays for binomial distribution with parameters n and p, and pki, i = 1,…,5, is the probability that the test results positive in adults, k = 1, or in underage, k = 2 in the population i = 1,…,5. In a more technical jargon, p1i is the true positive rate or sensitivity, sei, and p2i is the false positive rate or complementary rate of the specificity, 1-spi, of the ith regional group, i = 1,…,5. Before estimating sensitivity and specificity in the five regional groups and possibly their pooled estimates, their dependency from the regional group was tested transforming them as follows:

$$\begin{array}{c}{D}_{i}=\mathrm{logit}\left({p}_{1i}\right)-\mathrm{logit}\left({p}_{2i}\right)\\ {S}_{i}=\mathrm{logit}\left({p}_{1i}\right)+\mathrm{logit}\left({p}_{2i}\right)\end{array}$$
(1)

where,

$$\mathrm{logit}\left(p\right)=\mathrm{log}\left(\frac{p}{1-p}\right), 0<p<1$$

The transformations (1) were performed because the logit scale allows the practitioner, first, to define the parameters on the range (− ∞, + ∞) and can plausibly fitted by the normal distribution and, second, the linear transformation should leave the pair \(\left({D}_{i}, {S}_{i}\right)\) approximately independent.

Since the main aim is to assess the likelihood that sensitivity and specificity of the five regional groups are sampled from the same population, the equality of mean vectors \(\left({D}_{i}, {S}_{i}\right)\) i = 1,…,5, for the five regional groups, was tested using the one-way multivariate analysis of variance (MANOVA) analysis. Considering the invertibility of relation (1), affirming that the bivariate distribution of vectors \(\left({D}_{i}, {S}_{i}\right)\) i = 1,…,5, does not depend on the regional groups is equivalent to affirm also the independence of both sensitivity and specificity from the regional groups.

Consequently, if the one-way MANOVA analysis and Wilks’ test point out that the hypothesis of equality of means cannot be rejected (the p-value is greater than the significant level of the test, α = 0.05, then it is possible to assess the validity of I3M by pooling all the data together). The result of the one-way MANOVA analysis is usually reported in a MANOVA table. Statistical analysis of data and related graphs was carried out by the R statistical program version 4.0.3 [50].

Results

Table 4 showed that only 6% of the individuals with I3M < 0.08 were minor. In fact, if the number of minors in the first category ((I3M class [0, 0.08) = (27 + 149)) and the total number of individuals in the class ([0, 0.08) = 2972) are considered, the proportion of minors with I3M < 0.08 = (27 + 149)/2972 = 0.06 or 6%. Similarly, when I3M increased from the second class [0.08, 0.22) to the last one [1, 2.4], the percentage of adults wrongly classified as minor also decreased from 31.8% [calculation: (28 + 72 + 23)/1015 = 0.318 or 31.8%] to 6.4% [(12 + 3 + 2)/263 = 0.0646 or 6.46%] with an overall percentage of 17.4% = (number of adult individuals with I3M ≥ 0.08)/(number of subjects with I3M ≥ 0.08) = (228 + 72 + 23 + 87 + 33 + 6 + 47 + 7 + 1 + 29 + 7 + 0 + 12 + 3 + 2)/(6157 − 2972) = 0.17488 = 17.488%.

Table 4 I3M and age distribution of the overall sample. Both age and I3M classes are closed on the left

The result of the one-way MANOVA, reported in Table 5, showed that the Wilks’ Lambda was equal to 0.762 with a p-value of 0.944. Therefore, both sensitivity and specificity did not depend from the regional groups and its validity can be tested pooling all the data together.

Table 5 One-way MANOVA

Table 6 displayed the strong association between adult age and positivity of the test (i.e., I3M < 0.08). In fact, 5431 of 6157 individuals were correctly classified. According to the results reported in Table 6, the sensitivity was 84% with a 95% confidence interval equal to (82%, 85%), and its specificity was 94%, with a 95% confidence interval equal to (93%, 95%). The proportion of correctly classified individuals was 88% with a 95% confidence interval equal to (87%, 89%).

Table 6 Contingency table describing performance of the I3M

Considering the probability that an individual in the sample of the considered regional groups is 18 or older was equal to 54%, the estimated positive post-test probability (Table 7) was 94%, with a 95% confidence interval of 93% and 95%. Hence, the probability that a subject positive on the test (i.e., I3M < 0.08) is 18 years of age or older resulted 94%. In addition, the high LR + and low LR − values showed that minors and adults were clearly discriminated.

Table 7 Validity of I3M test for pooled regional groups

Discussion

Due to the increasing rise in the number of people attempting to reach Europe from different countries, unaccompanied minors have become a recognised part of today’s global and mixed migration flows [10, 23]. Therefore, when the age estimation procedures are performed, it should be fundamental to take into account their geographic origin or nationality [10, 23].

Since the original study was conducted for the first time in 2008 in a Caucasian sample [28], one of the main concerns has been its validation in several populations. Previous studies have already demonstrated the validity of I3M to estimate age of majority in numerous samples coming from different countries [30, 51].

In this study, the aim was two-fold: to address a specific target issue more related to the daily forensic routine, whether a subject is below or above the legal age of 18, and to assess the influence of ethnicity in the application of I3M. The I3M was studied on a pooled collection of OPGs from several world regions. The obtained results showed that both sensitivity and specificity did not depend on the geographic origin and that this age marker could be applied without considering ethnicity as a statistical variable for discriminating between child and adult.

In the daily forensic practice, when verifying the status of the unaccompanied minors and implementing the assisted return procedures become very complex, the scientific best age estimation procedures should be considered for a uniform and optimal ethical age estimation practice, being thus a relevant task for the forensic experts [23, 51]. In those cases, the main aim is to achieve “the best interest” of the minor, to avoid ethically and technically unacceptable misclassifications (minors wrongly classified as adults or the opposite), and to allow other experts to reproduce all performed examinations, to check the results and to evaluate the clinical interpretations [52]. A more precise indication about a particular age (or age range) becomes important not only for discriminating between child and adult but also to answer to the question of how long a subject, assigned to the status “child”, remains in that status [23].

In a recent systematic review and meta-analysis published by Santiago et al. [30], the slight tendency of higher specificity over sensitivity-pooled values has been highlighted, thus meaning that the I3M is better in classifying the subjects younger than 18 years of age when compared with its accuracy of detecting the subjects older than this age. In this study, the sensitivity and specificity were 84% and 94%, respectively, with a 95% confidence interval. This supports the fact that this method is usually more precise at identifying minors and discriminating between the two age categories at the extremes of the root maturation (i.e., early and apical stages) [20, 47, 53].

Several works on third molar development and maturity stages in a specific population have been already published [54]. Whilst sex dependence was always taken into account, the impact of ethnicity and nutrition or socio-economic status on these procedures has not been analysed properly [23].

On the other side, only few studies have been carried out so far to analyse the problem of ethnicity and compare the influence of this variable on a pooled sample coming from different continents. The obtained outcomes were dissimilar according to the characteristics of each used sample and the performed method [20, 54].

Thevissen et al. [54] showed that, in spite of some changes in speed and onset of development, differences in value were small and not constant over the considered age range. Conversely, Liversidge et al. [47] indicated that some group differences were evident in third molar timing, but this had some impact on the confidence interval of estimated age in females and little impact in males because of the large variation in age. Olze et al. [55], in their study about the application of Demirjian method on three population samples (one German, one Japanese, and one South African), concluded that population-specific standards would enhance the accuracy of forensic age estimates based on wisdom tooth mineralisation in living minors involved in criminal proceedings.

All these studies indicated that some discrepancies eventually observed are largely due to inter-individual variability in mineralisation time of third molar teeth more than to the population-specific criteria [3, 38, 49, 56].

Although the OPGs were used in this study, age estimation could be performed also by taking one periapical dental radiograph. In certain forensic cases, where an orthopantomogram is available, or patient safety and maximum reduction of radiation doses have to be accomplished, this method could be useful. Anyway, consent protocols should be followed, and informed consent must be obtained every time for these procedures in accordance with international paediatric guidelines and the specific jurisdictional regulation [10, 23].

The main strength of this research was the possibility to gather reliable data from the world regions that are represented in these samples of ascertained ethnic background, general health, and chronological age. On the other side, limitations were the relatively uneven age distribution of the sample and the difficulty in separating ethnicity from socioeconomic status. In fact, since the OPGs were selected retrospectively from data obtained for dental care, a limited number of radiographs were available in Asia and Australia, where no further information was available from the whole regions. In addition, nearly half (47.2%) of the population sample is from Europe. Finally, no sample information was reported about the socioeconomic status.

As regards the future challenges and research objectives, a new large sample is being currently gathered in order to provide new data on the I3M in a better-balanced sample distribution, in which both the right and left lower molars, and the mesial and distal root, are considered for the analysis. To date, only few studies have showed differences in the relative development of each root within the same third molar [57].

Conclusions

Based on the results of this study, further population-specific reference data for the I3M are not required. The I3M is useful to identify whether a subject has reached the legal age of 18 years, regardless of his or her ethnicity.

In order to accomplish the legal certainty, the welfare of the minor, and the fair distribution of resources, the I3M could be included in a new catalogue of age markers with available references, associated data, and sound statistical support. In fact, since different age markers cover different age ranges, if a specific age marker may not provide a useful result to decide whether a subject’s chronological age is below or above the threshold of 18 years, the usage of more than one age marker should be recommended [23].

Regarding the socioeconomic status, further databases should be collected in order to investigate possible differences in the I3M application between samples coming from diverse regional groups.