Introduction

Age assessment in living individuals plays an important role in forensic issues including identification, immigration, and sentencing in most jurisdictions. According to the laws in different countries, ages of 16, 18, and 21 are of legal relevance [1,2,3,4]. Forensic age assessment in living juveniles and young adults is mainly based on the evaluation of hand ossification, mineralization, and eruption of second and third molars or clavicular ossification [2, 5,6,7]. The Study Group on Forensic Age Diagnostics (AGFAD) recommends that physical examination, radiographic assessment of the left hand, and dental examination contribute to the diagnosis of chronological age. If the skeletal development of the hand is complete, an additional radiological examination of the sternal extremity of the clavicle is recommended [8].

The investigation of regressive dental changes has been proposed for age diagnostics in adults after the completion of clavicular ossification [9,10,11,12]. For dental age assessment, the main criterion is the mineralization stages of third molars. However, previous studies have shown that the development of third molars can be completed under the age of 18 years in Chinese individuals [13,14,15]. Recently, several new dental techniques have been proposed to judge a subject below or above 18 years of age once the root mineralization of the third molars is completed, including the radiographic visibility of the root pulp and periodontal ligament in the lower third molars [16, 17]. Both of them can be helpful for the exclusion of ages under 18 years in Chinese individuals [18, 19].

In 1947, Gustafson was the first to propose a scientific method for dental age assessment [20]. According to Gustafson, the characteristics of secondary dentin formation, periodontal recession, attrition, apical translucency, cementum apposition, and external root resorption correlated with chronological age. Matsikidis proved in 1981 that the characteristics first introduced by Gustafson for extracted teeth can also be applied to dental films [21]. As not all the characteristics presented by Gustafson in an extracted tooth are equally evaluable in radiographs, Olze et al. [11] in 2012 modified the Gustafson’s criteria by utilizing the characteristics of secondary dentin formation, periodontal recession, attrition, and cementum apposition in orthopantomograms building a multiple regression formula, which can be recommended for age assessment in living individuals. In 2017, Timme et al. [22] proved the applicability and reliability of this method with a large cohort and a wide age range, including older individuals.

In this study, we tested the applicability and reliability of the method proposed by Olze et al. in a northern Chinese population and checked whether the formulas developed by Olze et al. and Timme et al. are suitable for the Chinese population too.

Materials and methods

The material of this study consisted of 1300 orthopantomograms from 650 male and 650 female northern Chinese individuals between 15 and 40 years of age, collected from the Department of Oral Radiology at the Stomatological Hospital of Xi’an Jiaotong Univeristy, China, in the period between 2011 and 2014. The first examiner was a forensic odontologist with profound professional experience in age assessment based on orthopantomograms (Guo Y.-C). Prior to the study, he discussed the features of different stages with a dentist experienced in dental age estimation by means of Gustafson’s criteria (Olze A.). After discussion and training, the first examiner was very qualified in this method.

Table 1 shows the age and sex distributions of the studied population. The characteristics of secondary dentin formation, periodontal recession, attrition, and cementum apposition were determined in mandibular premolars according to the stage classifications presented by Olze et al. [11] (Figs. 1, 2, 3, and 4). The evaluated teeth were selected according to the exclusion criteria from the recommendations presented by Matsikidis (Table 2) [21]. The following stage classifications developed by Olze et al. were used in this study, including the characteristics of secondary dentin formation, periodontal recession, attrition, and cementum apposition:

Table 1 Distribution of the sample by age and sex
Fig. 1
figure 1

Stage classification to determine the degree of secondary dentin

Fig. 2
figure 2

Stage classification to determine the degree of periodontal recession

Fig. 3
figure 3

Stage classification to determine the degree of attrition

Fig. 4
figure 4

Stage classification to determine the degree of cementum apposition

Table 2 Exclusion criteria according to Matsikidis

Secondary dentin formation

  • Stage 0 Pulp horn reaches to above crown equator

  • Stage 1 Pulp horn reaches at maximum to crown equator

  • Stage 2 Pulp horn exceeds enamel-cementum boundary and falls short of crown equator

  • Stage 3 Pulp horn reaches at maximum to enamel-cementum boundary

Periodontal recession

  • Stage 0 No periodontal recession

  • Stage 1 Periodontal recession into cervical root third

  • Stage 2 Periodontal recession into middle root third

  • Stage 3 Periodontal recession into apical root third

Attrition

  • Stage 0 No attrition, cusp tips present

  • Stage 1 Beginning attrition with loss of cusp tips

  • Stage 2 Attrition reaching into dentin

  • Stage 3 Attrition reaching into dentin with opening of pulp cavity

Cementum apposition

  • Stage 0 No visible cementum apposition

  • Stage 1 Beginning apical cementum apposition

  • Stage 2 Clearly visible cementum apposition, reaching beyond the apex

The evaluation of the orthopantomograms was randomized and blinded, i.e., without knowing the dates of birth or the dates of X-ray examination. The identification number, sex, date of birth, date of X-ray examination, and the stages of the teeth characteristics of each test subject included in the study were recorded. The chronological age was calculated by the confirmed date of the radiographic examination and the date of birth.

We randomly divided the sample into a training dataset and a test dataset (Table 3). Five male and five female subjects in each age group were randomly selected to a test dataset. The correlation between the chronological age and the individual characteristics was studied by means of a stepwise multiple regression analysis in the training dataset. The dependent variable was chronological age; the independent variables were characteristics based on Olze et al. The modeling of the linear regression model was developed in single steps with the prognosis-relevant influencing variables secondary dentin formation, periodontal recession, attrition, and cementum apposition. The test dataset was used to verify the Chinese prediction formula and the formulas developed by Olze et al. and Timme et al. To compare the performances of different age prediction formulas on the Chinese population, we calculated the error of the age prediction which was defined as the difference between the dental age (DA) and the chronological age (CA). The mean differences of DA and CA showed the direction of the error (underestimation or overestimation), and the mean absolute differences of DA and CA demonstrated the magnitude of the error. Mann–Whitney U test was applied to assess the significances of the difference for DA and CA between the present study formula and Olze’s and Timme’s formulas.

Table 3 Distribution of study subjects in training dataset and test dataset

To assess intra-observer agreement, 100 randomly selected orthopantomograms were evaluated by the first examiner after a 3-month interval. For inter-rater agreement evaluation, the same 100 orthopantomograms were evaluated by a second examiner. The second examiner was a dentist without experiences in dental age assessment. Cohen’s kappa coefficients were calculated for intra- and inter-rater agreement.

Results

Tables 4 and 5 record the number and percentage of missing and non evaluable teeth in each age group in males and females, respectively. Depending on the studied teeth, 70.00–82.92% of cases were evaluable. Considering the evaluated teeth in each age group, the percentages were 44.00–100% and 36.00–100% in males and females, respectively. Tables 6 and 7 show the frequencies of the various stages of the examined features for males and females, respectively. The intra-rater agreement was 0.884, 0.883, 0.893, and 0.639 for secondary dentin formation, periodontal recession, attrition, and cementum apposition, respectively. The inter-rater agreement was worse than the intra-rater agreement in every characteristc. It was 0.652 for secondary dentin formation, 0.572 for periodontal recession, 0.587 for attrition, and 0.285 for cementum appostion, respectively. Tables 8 and 9 show the multiple regression equations according to the training dataset with age as the dependent variable and individual characteristics as independent variables. Tables 10 and 11 show the comparison of mean differences and mean absolute differences between dental age and chronological age in different formulas according to the test dataset. The p values between Olze’s formula and the present formula ranged from 0.000 to 0.371 in males and from 0.000 to 0.165 in females, respectivly. While for the comparison between Timme’s and the present formula, the p values differed from 0.001 to 0.354 and 0.237 in males and females, respectively.

Table 4 Number and percentages of teeth excluded or missing in each age group in males
Table 5 Number and percentages of teeth excluded or missing in each age group in females
Table 6 Frequencies of the various stages of the examined features for males
Table 7 Frequencies of the various stages of the examined features for females
Table 8 Regression equations, correlation coefficients (R), coefficients of determination (R2), and standard errors of estimate (SEE) of multiple regression analyses with age as the dependent variable and dental age changes as independent variables for teeth 34, 35, 44, and 45 of males
Table 9 Regression equations, correlation coefficients(R), coefficients of determination (R2) and standard errors of estimate (SEE) of multiple regression analyses with age as the dependent variable and dental age changes as independent variables for teeth 34,35,44,and 45,females
Table 10 Comparisons of mean differences and mean absolute differences in years between dental age (DA) and chronological age (CA) from Olze’s, Timme’s, and present study formulas for males in the test dataset
Table 11 Comparisons of mean differences and mean absolute differences in years between dental age (DA) and chronological age (CA) from Olze’s, Timme’s, and present study formulas for females in the test dataset

Discussion

According to Gustafson’s criteria, phenomena like secondary dentin formation, periodontal recession, attrition, cementum apposition, external root resorption, and apical translucency correlate with chronological age [23]. Olze et al. [11] in 2012 studied 1299 orthopantomograms from 649 male and 650 female Germans aged from 15 to 40 years. Because external root resorption and apical translucency of premolars were hard to determine in orthopantomograms, only the characteristics of secondary dentin formation, periodontal recession, attrition, and cementum apposition were evaluated in all the mandibular premolars. Compared to the staging system presented by Matsikidis, their staging system with less stages for regressive dental changes was better applicable to orthopantomograms. With the aid of a multiple regression analysis, they studied the correlation between the chronological age and individual characteristics. Although regression analysis should only be used for metrically scaled variables, previous research showed that by applying the Bayes theorem to ordinally scaled variables, an improvement in the accuracy of age diagnosis could not be achieved in comparison to when applying regression analysis [24]. Olze et al. recommended their method for forensic age assessment in living individuals with the limitation that the quality of the X-ray images may restrict the applicability of the new method presented. In 2017, Timme et al. [25] investigated the validity of the technique developed by Olze et al. They reviewed 2346 orthopantomograms of 1179 male and 1167 female Germans aged between 15 and 70 years. The same four characteristics of premolars were studied. Regression analyses were performed separately for the age cohorts 15 to 40 years and 15 to 70 years. They concluded that these methods could be used for age assessment in the living and that more precise regression formula could be presented for the age group between 15 and 40 years compared to the age cohort of 15 to 70 years.

Against the background that all studies published to date concerned Caucasians, the aim of the present study was to test the validity of the method in a northern Chinese population. Stepwise multiple regression analysis was used in the present study to develop calculation formulas to assess the age of a living individual based on characteristics of mandibular premolars with significant correlation to age. The calculated R values range from 0.80 to 0.83, which are better than the values presented by Olze et al. [11] and Timme et al. . [25], who calculated R values ranging from 0.65 to 0.73 and 0.69 to 0.77, respectively. For the values of standard error of estimate, the range is 4.29 to 4.75, which is also better compared to previous studies. The calculated ranges for the standard error of estimate by Olze et al. [11] and Timme et al. [25] are 5.3 to 5.7 and 4.6 to 5.2, respectively. These findings prove the applicability of these characteristics for age assessment in Chinese individuals.

In the present study, we selected 260 individuals into the test dataset in order to examine the accuracy of different formulas. Tables 10 and 11 show the comparison values of the mean differences and absolute differences in years between DA and CA from the results of Olze et.al. [11], Timme et al. [22], and the present study for males and females, respectively. According to Olze’s formula, the mean differences were by the range of − 1.546 to 1.472 years and − 1.432 to 1.541 years in males and females, respectively. Based on Timme’s formula, the mean differences were by the range of − 1.654 to − 0.653 years in males and by the range of 4.554 to 6.532 years in females. According to the formula developed by the present study, the mean differences were by the range of 0.984 to 1.754 years for males and 1.654 to 2.165 years for females. The significant statistical differences (p < 0.05) between Timme’s formula and the present study formula were found for teeth 35, 44 and 45 in males and teeth 34, 44 and 45 in females. While comparing present study formula with Olze’s formula, the significant statistical differences (p < 0.05) were found in teeth 35 and 45 for males and teeth 35 and 44 for females.

The mean absolute difference does not consider whether dental age is underestimated or overestimated and only quantifies the distance from the true age. Mean absolute differences for the Olze’s formula were 3.434–4.183 years and 3.632–4.657 years for males and females, respectively. Based on Timme’s formula, mean absolute differences were 4.085 to 4.979 years for males and 5.643 to 7.045 years for females. According to the formula developed by the present study, the mean absolute differences were 3.539–4.522 years in males and 3.964–4.870 years in females. Similar with the mean differences, the significant statistical differences were observed only in some teeth for both males and females. Comparing with the Olze’s study, the present study formula was significantly more accurate (p < 0.05) only for tooth 45 in males. The present study formula performed more accurate (p < 0.05) for teeth 35 and 45 in males and teeth 34, 44 and 45 in females when comparing with the Timme’s formula. Taking both mean differences and mean absolute differences into account, the Chinese formula did not always perform better compared with Olze’s and Timme’s formulas for both males and females.

In order to prevent systematic over- or underestimation of age, an even age distribution of the sample was ensured [26]. After exclusion of non-evaluable teeth due to the quality of the radiographs and missing teeth, 70.00–82.92% of the cases were suitable for evaluation. The number of assessable cases is higher here than the values Olze et al. and Timme et al. presented in their studies, in which 45–60% and 15–59% of cases were evaluable, respectively. Considering the percentages of evaluated cases in each age group in the present study, most of the values were over 70.00% indicating that revised Gustafson’s criteria could also be used in Chinese subjects for age estimation.

The intra-rater agreement was almost perfect except for cementum apposition, indicating that stages of cementum apposition are hard to determine compared to other characteristics. The inter-rater agreement was worse than the intra-rater agreement in all characteristics. This result can be explained by the fact that the second examiner was a dentist, who had no experiences in age assessment methods. Thus, when using this method for age estimation, it should only be applied by qualified forensic odontologists.

In conclusion, the technique presented by Olze et al. [11] could be validated in a Chinese population. The method is applicable for dental age assessment in adults. One of the limitations of this method is the percentage of radiographs that cannot be evaluated. Compared to the other three characteristics, the stages of cementum apposition are harder to determine, which might influence the precision of the age diagnosis. In further studies, the applicability of this method in Africans should be examined.