Introduction

Age estimation is an important part of identification procedure for unidentified adult subjects in forensic centers, among other biological characteristics such as sex, stature, and ancestry group [1].

Several anthropological methods have been proposed for age estimation, mainly based on the pubic bone [2] or the sternal extremity of the 4th right rib [3,4,5,6]. Among these different methods, the Iscan method based on the 4th rib is widely used. The joint surface displays different macroscopic patterns throughout age: smooth in adolescents, sharp and cupshaped in middle aged adults, and irregular in elders [7,8,9]. Morphometric characteristics of the sternal end of the 4th rib that were found most correlated with age on a forensic sample of 414 male were the postero-superior pit depth, the fine serrations of the ovoid (delineating the pit shape), and its posterior flaring [10]. In histological studies, physiologic osteoporosis begins after third decade, causing a progressive enlargement of the marrow cavity at the expenses of cortical thickness [7, 8, 11]. In addition, the proportion of collagen decreases from 90% in the first decade to 55% between the fourth and eight decade of life [12]. A linear increase in the mineralization of costal cartilage with age was indeed described on an X-ray study [13]. Based on these observations, Iscan et al. proposed in 1984 a method categorizing the 4th rib into 9 different groups, correlated with age subject [5], revised in 2010 by Hartnett in an 8 stage classification [14]. Initially developed onto an adult population of American-European ancestry [5], and this method was then validated in different populations [15,16,17].

Post-mortem computed tomography (PMCT) is a currently widely used tool in forensic medicine [18,19,20]. PMCT non-destructively evaluates several features that help for identification: presence of orthopedic implants, old fractures, comparative identification based on previous radiological exams [21], or pre-existing medical condition [21,22,23]. Anthropological methods for age estimation have been adapted to PMCT, using pubic symphysis shape [24], 4th right rib evaluation [25, 26], and a more complex method using multiple parameters (endocranial sutures of the skull, trabecular structure of the humeri and femora, and pubic symphysis shape) [27]. The 4th right rib method was first tested on CT of dry bones [28] and then validated on two studies in two independent Australian populations using Iscan’s phase description [26] and revised method for volume rendering reconstructions [25]. However, this PMCT-based 4th rib method needs to be validated in other populations. Moreover, even if correlation between 4th rib score and age has been largely shown, the individual age prediction confidence interval for a given PMCT 4th rib score is not clearly defined in literature.

The aim of our study was to validate the PMCT 4th right rib method for age estimation in the population of MLIP and to provide an easy-to-use graphical method for age estimation.

Materials and methods

Population

All 310 PMCT performed at MLIP between January 2019 and January 2020 were screened for inclusion. Inclusion criteria were availability of age in MLIP medical file, age ≥ 18 years. We excluded the subjects whose score was not assessable due to the impossibility to identify the 4th rib (rib fractures) or to correctly define the degenerative changes (insufficient bone quality). All PMCT were performed onto a single scanner (LightSpeed VCT, 64 detectors, GE Medical Systems) without contrast enhancement. The thoracic region was explored using the following parameters: 120 kV, adaptative mA, 0.625 slice thickness, bone reconstruction filter.

PMCT evaluation

The revised 4th rib scoring method proposed by Merritt et al. [25] was used to classify the sternal extremities of the ribs. Analysis was done on native images and using multiplanar (MPR) and volume rendering (VR) reconstructions. As Iscan first described the method on the 4th right rib, main analysis was done for the right 4th rib. The classification evaluated the following degenerative changes in the sternal extremity of the 4th rib: (1) pit depth, which corresponds to the articular cavity and gets deeper with age; (2) pit shape, which is V-shaped in early stages and enlarges into a U-shaped pit in late stages; (3) rim and wall configuration: walls are smooth and thin in early stages and become thinner with sharped edges with age; (4) osteophytes which can be seen in the last stages. Multiplanar reconstructions and native images were used to evaluate the pit depth and pit shape. Maximum intensity projections (MIP) and volume rendering (VR) reconstructions were used to visually assess rim and wall configuration. Each subject was then classified between phase 1 and phase 8, as proposed by Merritt et al. [25] (Table 1). To assess the possibility to use this method on other ribs in case of unavailability of the 4th rib, additional scoring of 3rd and 5th right ribs and 3rd to 5th left ribs was made. Since subjects had various putrefactive status, bone quality could not be properly assessed due to the variable presence of gas impairing its evaluation. First scoring session was made by two radiologists, independently to compute inter-observer reliability. One radiologist read all images 3 months later to compute intra-observer reliability.

Table 1 Criteria used for 4th rib score rating

Statistics

Analysis was performed with R software version 3.6.0 and package “cocor” [29]. Intra- and inter-observer reliability was evaluated with weighted kappa. Systematic bias was evaluated by comparing score distribution between the two readers using paired Wilcoxon test. Spearman correlation was performed to evaluate relationship between age and score. Multivariate analysis was performed for adjustment on sex, using generalized linear model. Confidence intervals for individual prediction of age based on 4th rib score and sex were computed with bootstrapping (number of replications B = 10,000). Additional multivariate analysis with a supplemental interaction term between 4th rib score and sex was performed in order to study the relationship between these two variables. Transition analysis was performed using a cumulative probit model in order to evaluate mean transition age between two consecutive 4th rib scores [30]. Correlation coefficients for different ribs evaluation were compared with Hittner method and Holm correction was used for adjustment on multiple testing. Results are expressed as median, interquartile range (IQR), and 95% confidence interval (95%CI) when applicable.

Ethics

As consent could not be obtained after death, in accordance with French legislation, the absence of written opposition was verified (Reference Methodology MR-004 n°0,509,211,020). As this study implied retrospective analysis of routinely acquired data, formal approval by Ethics Committee was not required.

Results

Population

Among 310 subjects screened for inclusion during study period, 255 fulfilled inclusion criteria. Of these, 5 were excluded due to rib scoring impossibility (2 poly-traumatisms, 1 scanned after autopsy, 2 with insufficient bone quality). Among 250 studied subjects, 68 (27%) were female and 182 (73%) were male. Median age was 51 years (IQR 36–68 years, range 18–98) and age distribution is shown in Fig. 1.

Fig. 1
figure 1

Distribution of age and sex for included subjects

Intra- and inter-observer reliability

Inter-observer and intra-observer reliability for 4th rib scoring were almost perfect (weighted κ = 0.82, 95%CI: 0.70–0.96; κ = 0.85, 95%CI: 0.78–0.93, respectively). Score distributions were not statistically different between reader 1 (median 5, IQR 4–6) and reader 2 (median 5, IQR 4–7) (p = 0.29).

Relation between score and age

There was high positive correlation between 4th rib score and subject age (rho = 0.72, p < 0.001) (Fig. 2). After adjustment on sex, association between age and 4th rib score was significant for all scores > 4 (Table 2) and linear model fit was R2 = 0.58. Confidence intervals for age prediction based on 4th rib score and sex were estimated using bootstrapping onto the linear model and are synthetized in Fig. 3. Typical 4th rib changes for each stage are illustrated in Fig. 4. Multivariate analysis with additional interaction term between sex and 4th rib score did not show any significant interaction between these two variables (p ≥ 0.1). Transition analysis displayed a larger variability of the mean age-at-transition for greater scores (Table 3).

Fig. 2
figure 2

Distribution of age and sex according to 4th rib scores in population study

Table 2 Multivariate linear regression analysis for age prediction
Fig. 3
figure 3

Confidence intervals for age estimation. The 95% confidence intervals (95%CI) for age estimation according to the linear regression model are displayed for each score and each sex category. Confidence intervals were estimated using bootstrapping

Fig. 4
figure 4

Illustration of typical 4th rib changes for each score. This figure illustrates the morphological changes of 4th rib on PMCT for each score. The upper and middle rows depict respectively axial and coronal PMCT views centered on the right 4th rib, used for evaluating pit depth and pit shape. The bottom row depicts volume rendering reconstructions, useful to assess rim and wall configuration

Table 3 Mean age-at-transition between 4th rib scores

Evaluation of other ribs

Correlation between age and was significantly higher using the right 4th rib score (rho = 0.72) than using the right 3rd rib (rho = 0.63, p < 0.001), right 5th rib (rho = 0.68, p = 0.01), left 3rd rib (rho = 0.66, p = 0.004) but not statistically different from left 4th rib score (rho = 0.68, p = 0.07) and from left 5th rib score (rho = 0.68, p = 0.052).

Discussion

Reproducibility

This study confirms an almost perfect inter- and intra-observer reliability for 4th rib method scoring. Regarding the morphological assessment of 4th rib score, previous studies by Haj Salem et al. [16] and Munoz et al. [17] reported high intra-observer agreement (κ = 0.73 in [16]) and inter-observer agreements (respectively κ = 0.75 and κ = 0.81). Previous PMCT studies by Merritt [25] and Blaszkowska et al. [26] also found high intra-observer agreements (κ = 0.85 and κ = 0.76, respectively) and inter-observer agreement (κ = 0.82 in [26]). Our study confirms in a new population that the 4th rib scoring method adapted to PMCT is a method as robust as the morphological evaluation. Our study also confirms that the bilateral 3rd to 5th ribs scores have lower correlation with age as in a previous study [26]; however, this effect may be partially due by measurement bias, as the right 4th rib was probably assessed with more accuracy.

Age range estimation

The 4th rib classification adapted in PMCT is strongly correlated to age (rho = 0.72), as previously demonstrated on previous PMCT study by Merritt [25] (rho = 0.84). However, these high correlation values do not imply that one can predict age based on 4th rib score. Two different approaches have been proposed in literature to evaluate the relationship between 4th rib score and age. First, regression techniques allow to predict age using a linear equation, but this prediction may be limited by the large confidence intervals for individual prediction. In the study by Blaszkowska et al. [26], standard error for age prediction is evaluated to ± 11.2 years, which corresponds to 95% confidence intervals spanning over 43 years under normal law. Our results confirm the large span of these confidence intervals and also underline the important overlap between age prediction intervals for adjacent scores. The second approach proposed in literature is the transition analysis [30], which does not aim to predict individual age for patients but modeling the passage of individual subjects from one stage to another [31]. In our study, the standard error for transition ages was in same magnitude as in a previous study using 4th rib score in PMCT [25] (1.4 to 2.9 in our study as compared to 1.4 to 3.01 in the previous study).

Therefore, neither approach allows to make a reliable individual prediction of subject age, given its 4th rib score. This issue is recurrent in anthropological literature [32], and may be hindered by multimodal approaches [33, 34]. Bascou et al. proposed a sequential approach based on two different PMCT skeletal characteristics: first the Suchey-Brooks method on the pubic bone, then the measure of bone density if the Suchey-Brooks score is comprised between 4 and 6 [35]. Machine learning techniques such as random forests and gradient boosting [36] could improve individual prediction by combining several weak classifiers such as 4th rib score, Lovejoy method on the auricular surface of the ilium [37, 38], cranial sagittal suture evaluation [39], and acetabulum morphological features [40] that have already been adapted to PMCT. Image analysis algorithms such as deep learning convolutional networks have already been proposed for age estimation in living individuals using brain MRI data [41] and could be applied onto PMCT data.

Limits

Our study presents several limits. First, it has been showed that biogeographical origin may change 4th rib classification [42] and hence may improve age prediction models, but this parameter was not evaluated in our study. Ethnicity of subjects examined at MLIP is varied but was not recorded for ethical and regulatory reasons. Second, our population displayed a number imbalance between male and female, female subjects representing about ¼ of scanned subjects. This imbalance is probably responsible for the larger confidence intervals in small 4th rib scores (1 and 2) for women age estimation. Third, the same classification method was used for both male and female subjects; however, Iscan demonstrated a difference in aging process of the sternal extremity of the rib [5]. We chose to use a single scoring system for male and female since (1) there was a number imbalance between male and female; (2) the use of a single scale seems easier and more acceptable in clinical routine; and (3) differences in previously described 4th rib score in male and female are subtle and did not seem appropriate for PMCT evaluation. However, we included sex in our multivariate and predictive models, therefore adjusting for this limitation. Moreover, we verified that there was no statistical interaction between sex and 4th rib score, confirming that a unique scoring system may be more adapted to this clinical situation.

Conclusion

PMCT usage has increased over past years in forensic medicine, for different purposes such as identification. Our study, based on 250 subjects, confirms the high reliability of the 4th rib method applied to PMCT and its high correlation to subject age, but emphasizes on the large confidence interval for subject-level age prediction. Age prediction techniques may benefit from multimodal and machine-learning approaches in order to reduce these confidence intervals.