Introduction

The postmortem interval (PMI), or the time since death, is the time between death and the discovery of the body [1]. Estimating the PMI is important for understanding the circumstances surrounding death and for contributing to the identification of human remains. The PMI can be used to (i) narrow the list of possible missing persons to whom the remains may belong [2, 3]; (ii) include or exclude suspects related to the investigation based on the time interval [2,3,4,5]; and (iii) project the state in which a body may be found in the context of a missing person search [3].

Numerous studies have investigated the rate of decomposition from a fleshed body to the skeletal remains. In forensic anthropology, the standard methods for estimating PMI are based on a morphological study of the decomposition process and its changes [6]. In 2005, Megyesi et al. [7] proposed a classification of decomposition stages based on the method proposed by Galloway et al. [8]. The system involves a sum of scores, according to the visual observation of the state of decomposition divided and characterized into four categories (fresh, early decomposition, advanced decomposition and skeletonization) with stages within to describe the remains. From the fleshed body to skeletal remains, the authors independently assigned a score to the head and neck, trunk and extremities. The Total Body Score (TBS) is obtained from the sum of each of the three anatomical regions [7]. With this classification, the authors proposed that a quantitative method, already existing in scientific disciplines such as forensic botany or entomology, could be applied by forensic anthropologists. As decomposition is a sequential process with small changes, the use of qualitative categories that describe small changes would increase the statistical power of the method and the estimation of the PMI [7].

Although the process and sequence of decomposition are well-known, variations over time are affected by a wide variety of factors [9, 10]. Temperature is considered to be the factor that most affects the decomposition process [7, 11, 12]. Given the absence of methods in forensic anthropology that consider decomposition as being dependent on factors other than time, Megyesi et al. [7] proposed a relationship between TBS and temperature through the accumulated-degree day (ADD). ADD is defined as the sum of average daily temperatures for the number of days the corpse had been decomposing [13]. The daily average between maximum and minimum degree was considered as temperature data (degrees Celsius) and 0ºC was the base temperature because biological processes were significantly slowed down or even inhibited at this temperature [7, 14].

With the combination of TBS as a dependent variable and ADD as an independent variable, Megyesi et al. [7] proposed a retrospective study to calculate the PMI. Although the study was developed using qualitative results, the method allows for the possibility of reproducibility, consistency, comparability and validity [4]. Since its publication, numerous studies have used TBS and ADD to estimate PMI in different species (for example, in piglets [15], rabbits [16] and horses [17]), geographical regions (such as North America [18, 19], Europe [1, 20] and Africa [21, 22]) and environments (e.g., water [23, 24], terrestrial surfaces [16, 25], buried [26, 27] and hanging [28]). The investigation of this correlation has had mixed results: some authors have obtained positive outcomes [25, 29], while others have not shown a significant relationship between these variables [30, 31].

The present systematic review and meta-analysis aimed to assess the accuracy of estimating PMI using the relationship between TBS and ADD.

Materials and methods

Protocol

The “Preferred Reporting Items for Systematic Reviews and Meta-Analyses—PRISMA” checklists and flow diagram published by Moher et al. [32] were followed, although these do not fully fit the content of this study [33].

Literature search strategy

A systematic search was carried out for studies that used the method proposed by Megyesi et al. [7] to quantify the degree of decomposition through a TBS that allows ADD and PMI to be calculated. The SciELO, LILACS, PubMed, Scopus and Web of Science databases were used. The following formula of terms was applied, which was adapted to the rules of each database:

(TBS OR total body score OR PMI OR postmortem interval) AND (ADD OR accumulated degree-days) AND (estimation OR improved OR evaluation OR application OR comparison OR scoring).

The search was conducted on October 17, 2022. Observational studies published in scientific journals written in English or Spanish, which analyzed the decomposition of humans and/or pigs according to the TBS of Megyesi et al. [7], had a known date of death and which indicated the ADD obtained, either as an average with a known sample size or through the details of each individual that allows the average to be calculated, were included.

Once the articles were obtained and duplicates had been eliminated, a first selection was made based on the title and abstract, followed by a second selection based on the full publication. These evaluations were carried out independently by two researchers using the Rayyan application which allows blind decisions and notes between reviewers and facilitates the resolution when there is disagreement between them [34]. Finally, a manual search was performed in the bibliography and reference lists of the studies which were considered eligible.

Data extraction

The data extracted were year and journal of publication, country of origin of the sample, sample size, type of sample (human or pig model), environment (terrestrial or aquatic) and, if applicable, burial depth (on the surface or buried), TBS, the average and standard deviation of the real ADD and the calculated ADD [7]. If the calculated ADD was not specified in the study, it was obtained from the formula given by the authors. To facilitate the analysis, studies which worked with humans and pig models were treated as two independent studies and were differentiated by an a and b, respectively, in the figures.

Risk of bias in studies

The risk of bias was assessed using QUADAS-2, as suggested by the Cochrane Collaboration for Systematic Reviews [35]. All publications with values of ≥ 70% were considered to have a low risk of bias and were included.

Data analysis

Being a continuous variable, the meta-analysis worked by contrasting two measures: the reference test, which corresponded to the real ADD, and the diagnostic test, known as the calculated ADD. For this, the average and standard deviation of the ADD obtained empirically by each study at a given TBS was used and compared with the ADD and standard deviation calculated using the formula of Megyesi et al. [7] for the same TBS.

Due to the high variability present between the included studies (given the diversity of study subjects, geographical and climatic differences, possible distinctions in burial methods and treatment of corpses), the meta-analysis carried out used a random effects model based on the study of DerSimonian & Laird [36] which takes both within and between study variances into account [37, 38]. The type of effect size used, which quantifies the magnitude and direction of the effect if interest, was a standardized mean difference; in this case, a positive value shows that the overall mean of the calculated ADD is higher than the real ADD, whereas a negative value indicates that the calculated ADD is lower than the real ADD. This was interpreted by Cohen [39] who suggested the classifications of small (d = 0–0.2), medium (d = 0.21–0.5), large (d = 0.51–0.8) and very large (d = > 0.8) [40]; this information was plotted on a forest plot.

Heterogeneity between studies was estimated using the I2 test. A value of 0% indicates no observed heterogeneity, with 25% indicating low heterogeneity, 50% moderate heterogeneity and 75% high heterogeneity [41]. A moderator analysis was performed for remains which were buried or on the surface, in aquatic or terrestrial environments, human or pig models and TBS values. To detect publication bias, Egger’s method [42] was applied and a funnel plot was obtained.

In addition, meta-analyses were performed to examine the results between subgroups: outcomes were compared between pig vs. human models, as well as between the decomposition stages described by Megyesi et al. [7];  fresh body (TBS = 3), early decomposition (TBS = 4–6), advanced decomposition (TBS = 17–24) and skeletonization (TBS = 25–35). For the latter, the average of the general ADD was not necessarily used, but different averages were calculated if possible, grouping the individuals according to the TBS observed in each group; this included cases of the same individual in different stages of decomposition. All statistical tests, graphs and figures were performed using R (version 4.2.1; R Core Team, 2022) and the ‘metafor’ package version 3.8.1 [43].

Results

Study selection

The search of the five databases identified 1228 articles, with 1081 remaining after the removal of duplicates. The review of the title and abstract reduced this number to 43 and the reading of full publications decreased it further to 21. Finally, the manual search resulted in the addition of 4, leaving a total of 25 articles for analysis (Fig. 1). As there were two publications that worked with human individuals and pigs, the statical analysis was carried out as if there were 27 articles.

Fig. 1
figure 1

Flow diagram describing the process of the systematic review, following the guidelines proposed by PRISMA

Table 1 summarizes the general characteristics of the studies. These were carried out with populations from the United States of America (N = 9), South Africa (N = 5), Australia (N = 3), the United Kingdom (N = 2), Sweden (N = 2), the Netherlands (N = 1), China (N = 1), Italy (N = 1) and Serbia (N = 1). Overall, 24 of the 25 articles were published in specialized forensic science or legal medicine research journals, both locally and internationally. The total number of participants was 751 (542 humans and 209 pigs).

Table 1 Main characteristics of eligible studies

Risk of bias in studies

The results of QUADAS-2 showed a low risk of bias and, hence, a high methodological quality (Table 2). Two questions in the tool were considered not applicable: Question 5 (about the index test) asked about the use of a threshold to define the index test, which does not apply to these variables, while Question 8 (about flow and timing) asked about the time interval for the analysis, which was not considered because index test (calculated ADD) and reference standard (real ADD) were compared over the same time period [44].

Outcomes and meta-analyses

General results

The results of the meta-analysis showed an overall mean difference of 0.5758 (p = 0.0141; SE = 0.2345; 95% CI = 0.1162–1.0354), which indicates that the value obtained from the calculation proposed by Megyesi et al. [7]  was higher than the real ADD (Fig. 2). According to Cohen’s interpretation [39], this mean difference was medium, which means that the estimation formula’s predictions have a moderate but noticeable difference from the actual ADD.

Table 2 Risk of bias assessed for each study using the QUADAS-2 tool

The I2 value was 92.40% (p < 0.0001), implying high heterogeneity. The evaluation of moderators showed that only the use of humans (k = 13) or pigs (k = 14) (p = 0.0271) and TBS value (p = 0.0015) presented statistical significance. In the case of environment, where the categories were terrestrial (k = 27), aquatic (k = 1) or both (k = 1) (p = 0.5142), and of burial depth, where the categories were surface (k = 23), buried (k = 3), on the surface or in a sewer (k = 1), there was no statistical significance (p = 0.3267).

Fig. 2
figure 2

Forest plot showing the mean difference between Control ADD and Calculated ADD. MD = mean difference. CI = confidence interval. The red diamond shows the result averaged in general and yellow diamonds show the result averaged for human and pig subgroups

Egger’s test shows hints of possible publication bias (p = 0.0049) which is also seen in the funnel plot (Fig. 3).

Fig. 3
figure 3

Egger’s funnel plot

Results for human versus pig

Human

Thirteen papers reviewed decomposition in the human body and were analyzed further. The results of the meta-analysis showed a mean difference of -0.0038 (p = 0.9891; SE = 0.2792; 95% CI=-0.5511–0.5435) where the values of calculated ADD were lower on average than the real ADD (Fig. 2). The mean difference was small and statistically insignificant; therefore, the Megyesi’s formula [7] has a reasonably accurate estimate of ADD when used on humans. The I2 value was 91.89% (p < 0.0001), showing high heterogeneity. Egger’s test shows no indication of publication bias (p = 0.2440).

Pig

Fourteen papers reviewed decomposition in pig bodies and were analyzed further. The results of the meta-analysis showed a mean difference of 1.1128 (p = 0.0050; SE = 0.3961; 95% CI = 0.3363–1.8892) which indicates that the calculated ADD values were higher on average than real ADD (Fig. 2). The mean difference is very large and statistically significant; hence, this discrepancy suggests that this formula lacks sufficient precision to provide reliable data when used in pigs. The I2 value was 90.42% (p < 0.0001), indicating high heterogeneity. Egger’s test does not show signs of publication bias (p = 0.0640).

Results for decomposition stage

Fresh body

Six publications with 49 observations (29 humans and 20 pigs) were analyzed. The results of the meta-analysis showed a mean difference of 0.0066 (p = 0.9740; SE = 0.2020; CI 95%=-0.3894 to 0.4026) (Fig. 4) where calculated ADD values were higher than real ADD values on average. The mean difference was small and statistically insignificant, which means that the results of the calculated ADD are statistically indistinguishable from real ADD in the fresh body stage. The I2 value was 0% (p = 1.0), indicating no heterogeneity. Egger’s test shows no evidence of publication bias (p = 0.7437).

Early decomposition

There were 16 publications with 416 observations (321 humans and 95 pigs) which were analyzed. The results of the meta-analysis showed a mean difference of 0.0338 (p = 0.6271; SE = 0.0695; 95% CI=-0. 1025 to 0.1700), that is, the calculated ADD values were higher on average than the real ADD values (Fig. 4). The mean difference is small and statistically insignificant; therefore, the formula can be considerable reliable for estimating ADD in the early decomposition stage. The I2 value was 0% (p = 0.9116), indicating no heterogeneity. Egger’s test shows no evidence of publication bias (p = 0.2008).

Advance decomposition

There were 19 publications with 542 observations (416 humans and 126 pigs) analyzed. The results of the meta-analysis showed a mean difference of -0.3378 (p = 0.1239; SE = 0.2195; 95% CI=-0.7680 to 0.0925) (Fig. 4) where the value of calculated ADD is lower on average than the value of real ADD. The mean difference was medium but statistically insignificant; thus, the formula has a noticeable difference from real ADD, potentially due to random variability. The I2 value was 87.55% (p < 0.0001), implying high heterogeneity. Egger’s test does not show signs of publication bias (p = 0.0976).

Skeletonization

Seventeen publications with 370 observations (251 humans and 119 pigs) were analyzed. The results of the meta-analysis showed a mean difference of 1.6583 (p = 0.0101; SE = 0.6448; 95% CI = 0.3945–2.9221) (Fig. 4) in which the average value of calculated ADD was higher than the real ADD. This mean difference is very large and statistically significant, which suggests that the formula is not accurate for calculating ADD in the skeletonization stage.

The I2 value was 95.96% (p < 0.0001), showing high heterogeneity. Egger’s test, while not significant, suggests a trend toward publication bias (p = 0.0574).

Fig. 4
figure 4

Forest plot showing the mean difference between Control ADD and Calculated ADD in subgroups according to the stages of decomposition (Megyesi et al., 2005). MD = mean difference. CI = confidence interval. Yellow diamonds show the result averaged for every subgroup

Summary of meta-analysis results

Table 3 summarizes the results. The general study of the mean difference, considering humans and pigs without differentiation by TBS, showed a medium magnitude of difference where the value of ADD is overestimated when using the formula of Megyesi et al. [7]. The results by type of sample indicated that this overestimation is also present in analysis when using pigs, with a very large magnitude of difference, whereas its use to estimate ADD in humans has a minimal underestimation with no statical significance. By decomposition stage, its use in fresh body and early decomposition stages results in an estimated ADD value that is insignificantly overestimated; as it progresses to the advanced decomposition stage, the estimated value is slightly underestimated and the estimated ADD value in the skeletonization stage is ultimately significantly overestimated.

The heterogeneity of the results was high, except for the fresh body and early decomposition stages, where it was null. Additionally, in the overall analysis there is a clear suggestion of publication bias, which does not appear in the analysis of the reviewed subgroups, except for the skeletonization stage.

Table 3 Summary of the results

Discussion

PMI estimations primarily rely on soft tissue degradation [45], influenced by key abiotic factors: temperature, moisture, pH and the partial pressure of oxygen [46], with the first one being the principal variable influencing the velocity and rate of decomposition [11]. According to Megyesi et al., over 80% of the observed variation in human decomposition can be explained by elapsed time and temperature [7]. The ADD metric is derived from this relationship, standardizing the accumulated temperature over time and allowing its application in PMI estimation [46], in addition to enabling comparison between different methods and studies that use it. With the aim of standardizing decomposition for use in PMI calculations, Megyesi and colleagues developed their TBS, which quantifies the sequential process of soft tissue degradation independently of other processes and places an emphasis on the relationship between time and temperature through ADD [7].

Based on the stages described qualitatively from cases in various climates, clothing states and completeness, and recognizing that decomposition follows a semi-continuous and predictable pattern [45], a quantitative points value system was proposed for widespread forensic application. This method, based on TBS and ADD, has been widely criticized; however, it has continued to be used without modification by researchers [47]. Criticisms focus possible variations by microhabitat between body discovery sites and weather stations locations, affecting temperature measurement precision [48, 49], or relying on photographs rather than direct observation [47]. In addition, despite high replicability in its use in humans [50], observers experience influence assessments using digital images [51]. The major criticism targets its intended worldwide application, as methods that use ADD may not account for sufficient geoclimatic variation to correctly record the decay rates in different regions [21, 52].

This study initially analyzed the relationship between ADD and TBS in general, without differentiating by burial status, environments (terrestrial or aquatic), species (human or pig) or degree of decomposition. The results indicate that the effect of ADD on TBS overestimates the PMI according to the formula of Megyesi et al. [7]. This overestimation can be attributed to the challenges of generating a universal mathematical model from a dependent variable based on numerous small observable changes [53] and errors in understanding the linear regression statistical model, as noted in previous studies [5, 54]. Despite this, the method of Megyesi et al. continues to be used without modification [47].

It is important to consider the high heterogeneity in the results, which limits their interpretation. This study aimed to find variables that explain this heterogeneity. While some variables provided further insights about the accuracy of the TBS and ADD relationship to estimate PMI, it was not possible to fully clarify it. Possible reasons include differences in habitats not analyzed in this study, significant variations in the real ADD results based on the same TBS within the same studies due to the difficulties in accurately measuring temperature differences between nearby places [48, 55] or the grouping of individuals with very dissimilar characteristics, as seen in Tomić et al. [20], where the average was generated from two pig cases at different depths, one on the surface and the other in a sewer hole.

Another pertinent aspect to consider is publication bias. The overall analysis reveals a clear presence of publication bias, attributed to selective reporting favoring studies with positive results over those with non-significant results or findings deemed less relevant [56]. Additionally, there is a linguistic and geographical bias, wherein studies published in languages other than English or in non-indexed national journals are less likely to be included in meta-analyses [57].

Interestingly, this publication bias is absent in the subgroup analyses. This discrepancy could be explained by heterogeneity, as the subgroups may comprise more homogeneous studies, thereby reducing heterogeneity and diminishing the impact of publication bias [58], as seen in fresh body and early decomposition stage results. Additionally, in the overall analysis, a few large or influential studies with significant results may drive the suggestion of publication bias [42]. In subgroup analyses, these influential studies could be less prominent or excluded, as seen in the advanced decomposition stage. However, skeletonization, which shows a tendency towards publication bias, aligns more closely with the overall results and does not meet these conditions.

Regarding the reviewed variables, most experimental studies analyzed samples located on the surface. However, some observed differences in buried samples. Pittner et al. [26] found that morphological characterization tables were adequate but noted a significant temporal difference, with TBS underestimating the ADD for burial intervals. Marais-Werner et al. [27] compared decomposition processes and found a similar pattern for buried and aboveground pigs, although buried pigs decayed 55.2–88.9% slower. These temporal differences suggest that corrections or recalculations are necessary for future research [26]. In this study, no significant differences were found between surface and buried decomposition processes. However, the validity of these findings is limited due to the small number of studies and their sample sizes.

This was repeated in PMI comparisons in territorial and aquatic settings. Heaton et al. [23] found a significant correlation between ADD and decomposition across various study locations. In contrast, Alfsdotter & Petaros [24] demonstrated that while surface decomposition correlated with ADD after formula adjustments, this relationship performed poorly in aquatic environments, possibly due to additional decomposition factors. The present meta-analysis found no significant differences, yet the limited number of studies and small sample sizes restrict the validity of the findings. This may be linked to the previously observed publication bias and suggests a lack of published studies that provide detailed insights into the variables and their specific influences on the relationship between TBS, ADD and PMI.

The results from human samples indicate a slight underestimation, suggesting that the method of Megyesi et al. [7], although minimal, underestimates PMI based on TBS and ADD. However, the results exhibit high heterogeneity. Various authors comparing their findings with the proposed method report a strong correlation, yet also note instances of significant PMI overestimation, where their samples reached ADD in a shorter time [28, 59], as well as an underestimation when more time was required [26, 60]. In contrast, a meta-analysis of studies using pigs shows PMI overestimation, also with notable heterogeneity. Comparisons using the method of Megyesi et al. [7] indicate a PMI overestimation in some studies [61], while others highlight a consistent underestimation [52, 62].

The subgroups based on decomposition stage exhibit low heterogeneity for the fresh body and early decomposition stages, but high heterogeneity for advanced decomposition and skeletonization. This translates to the first two presenting estimated values of ADD which are indistinguishable from the actual ADD value, whereas the latter two exhibit moderate underestimation in advanced decomposition and very significant overestimation in skeletonization. Thus, as TBS increases, so does the error of the formula for calculated ADD, leading to greater discrepancies in PMI estimates across various studies.

This aligns with a recognized limitation: estimating time since death becomes more complex with time and tissue decomposition progression [63,64,65]. Myburgh et al. observed a linear decay rate initially, which becomes more variable at higher TBS values [21]. Similarly, Suckling et al. noted reduced prediction accuracy in advanced decomposition and skeletonization stages [59], while Wescott et al. reported increased standard deviation in ADD over time [31].

Conclusions

It was concluded from the statistical analysis carried out in the present investigation that the method to estimate PMI, based on the relationship between TBS and ADD, exhibits high accuracy in humans during early decomposition stages. However, its accuracy deteriorates as decomposition progresses, leading to overestimation. This indicates that the method is not optimal as a universal formula and tends to underestimate PMI in human models while overestimating it in pig models. Limitations of this study include high observed heterogeneity and dissimilar results between studies.

Regarding decomposition stages, the method showed the highest accuracy in estimating PMI for the fresh body and early decomposition stages. In contrast, advanced decomposition and skeletonization stages demonstrated increased estimation errors as the degree of soft tissue degradation increased.

While the relationship between TBS, ADD and PMI proposed by the authors represents a significant contribution to forensic sciences, its application in different geoclimatic contexts necessitates adaptation and modification.