Introduction

Insect physiology encompasses a broad array of topics, ranging from the intricacies of metabolic pathways to the physiological adaptations that enable survival in diverse environments. Among the myriad physiological processes studied within this discipline, the composition and temporal variation of cuticular hydrocarbons (CHCs) in insects have garnered significant attention [1,2,3,4,5]. CHCs, comprising a complex mixture of long-chain hydrocarbons and their derivatives, are vital components of the insect cuticle, playing pivotal roles in various ecological interactions, such as communication, reproduction, and defense [5,6,7].

The Calliphoridae family, commonly known as blow flies, comprises a diverse group of insects with considerable ecological and forensic significance. Among these, Calliphora vicina holds particular interest due to its widespread distribution and close association with carrion decomposition [8,9,10]. The development stages of C. vicina serve as a critical temporal marker in forensic entomology, aiding in estimating minimum postmortem intervals (PMI min) in forensic investigations. As blow flies progress through their life cycle, transitioning from larval stages to puparia and ultimately emerging as adult flies, the puparia retain residual hydrocarbons on their cuticles. Understanding the temporal dynamics of CHCs in C. vicina puparia holds excellent potential for forensic applications.

In the late stages of decomposition, when the body has reached the skeletal stage, and no soft tissues or dead insects are available, empty puparia remain significant forensic evidence [11, 12]. These puparia can be found in various locations, such as on the body, beneath the body, or buried in soil [13, 14]. This distribution occurs as insects migrate away from the original feeding source to find a suitable place for pupation. Therefore, empty puparia have a high potential to be discovered as evidence in forensic investigations [15]. Puparia play a vital role in forensic entomology and ecological research by providing insights into decomposition timelines, species identification, and environmental processes [12, 16,17,18,19]. Studies on insect stages other than empty puparia are abundant [18, 20,21,22,23,24]. Yet, research focusing specifically on empty puparia of blow flies, specifically in field conditions, as crucial evidence during advanced stages of decomposition is relatively limited in comparison [18, 25, 26]. Addressing the limitations and knowledge gaps associated with their use requires interdisciplinary collaboration, methodological advancement, and a detailed understanding of insect biology and ecosystem dynamics. Overcoming these challenges will enhance the reliability and applicability of puparial analysis in both forensic science and ecological studies, contributing to advancements in understanding environmental health and forensic investigation methodologies.

Given the interdisciplinary nature of CHCs in C. vicina puparia, this study seeks to integrate principles from insect physiology, ecology, and forensic entomology to shed light on the temporal variation of CHCs in a species of significant ecological and forensic importance. Prior research has emphasized that more than half of the lipids from blowfly puparia comprise alkanes falling within the n-C25 to n-C30 range [27]. In this context, our investigation specifically focused on the analysis of CHCs in empty blowfly puparia, with a particular emphasis on five predominant CHCs - n-Pentacosane (n-C25), n-Hexacosane (n-C26), n-Heptacosane (n-C27), n-Octacosane (n-C28), and n-Nonacosane (n-C29) [2, 4, 28]. These CHCs are chosen for their recognized stability and lower volatility when compared to smaller chain CHCs. This study employs the machine learning model XGBoost for age estimation of empty puparia based on hydrocarbon concentrations, connecting baseline research with the practical utilization of blowfly puparia in environmental forensics and ecological studies.

Materials and experiment design

The rearing process followed the protocol outlined in our previous study [4] 300 larvae were divided into three groups of 100 larvae each. These larvae were placed in plastic cups with 20 g of minced pork meat and transferred to plastic boxes filled with sawdust to aid pupation (refer to Fig S1 in Supplementary file). Empty puparia were collected from these three groups and placed in three conditions: buried outdoors, placed above-ground outdoors, and stored indoors (Fig S2). For the outdoor buried condition, ten empty puparia were buried approximately 5 inches deep in nylon stockings at the Institute of Legal Medicine in Frankfurt am Main, Germany. The study was conducted from late October to early February, encompassing the winter season. Sampling occurred on days 1, 28, 56, and 90, with two empty puparia collected per time point per replication. Similarly, above-ground outdoor and indoor conditions each involved the same sampling protocol. Samples were prepared by submerging two puparial cases in n-hexane and methadone-d9 (used as internal standard), followed by ultrasonication, drying with nitrogen air, and GC-MS analysis with hexane reconstitution and blank runs for carryover prevention as detailed in our previous study [28].

CHC profile analysis

Chemical analysis was conducted using an Agilent GC-MS system comprising a 7693 GC and a 7890 B MSD, equipped with a split/splitless injector in splitless mode. Samples, injected with 1 µl at 250 °C, were separated on an Agilent VG-1 ms capillary column (30 m × 250 μm I.D. × 0.25 μm film thickness). The temperature program started with a 2-minute hold at 100 °C, followed by ramps to 200 °C at 25 °C/min, 260 °C at 3 °C/min, and 320 °C at 20 °C/min, holding for 2 min. Helium flowed at 1.2 ml/min. The 5977 B MSD operated in positive ion mode (70 eV), using Selected Ion Monitoring (SIM) and scan modes (m/z 45–600) from 4 min into the run. Quantification of n-C25 to n-C29 compounds utilized SIM, with specific ions (352, 366, 380, 394, 408) and methadone-d9 (303) as internal standard. Calibration curves (0.4–50 ng/µl) were constructed with Agilent Chemstation, yielding regression coefficients > 0.994 (S3). Regular calibrations and hexane blanks ensured analysis accuracy and reliability.

Statistical analysis

Statistical analysis encompassed One-way ANOVA followed by Dunnett’s test to identify significant variations in hydrocarbon means across different age groups of empty puparia. To track changes in CHC concentrations over time, normalization involved dividing initial day one concentrations by subsequent time points and expressing results as percentages. A correlation heatmap in IBM SPSS version 25 investigated relationships between compound concentrations and weathering time alongside canonical discriminant analysis (CDA) and regression analysis. Following these analyses, for practical utility in age prediction of empty puparia, we employed machine learning technique, specifically eXtreme Gradient Boosting (XGBoost). Recognizing that simple regression analysis and CDA may not directly serve this purpose, we utilized XGBoost to develop a predictive model relevant to age determination.

Machine learning via XGBoost modeled the age prediction of puparia using n-C25 to n-C29 concentrations after log transformation (because of heteroscedasticity in data) with the ‘xgboost’ library in R. 70% of the data was allocated for model training and the remaining 30% solely for testing the model’s generalizability to unseen data points as test data. A robust 5-fold cross-validation approach was applied, dividing the dataset into five subsets to facilitate accurate performance assessment. The training set was used for parameter tuning via 5-fold cross-validation, where we explored a grid of parameters including the number of boosting rounds (‘nrounds’), maximum tree depth (‘max_depth’), and learning rate (‘eta’). During tuning, regularization parameters such as ‘gamma’, ‘colsample_bytree,’ ‘min_child_weigh’t, and ‘subsample’ were fixed [29]. The selection of hyperparameters aimed at minimizing the Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). Below is the R script for this method.

figure aa
figure ab
figure ac
figure ad
figure ae

Results

The study explored variations in the CHCs of empty puparia of C. vicina across different natural environmental conditions for 90 days. The chromatograms depicting these conditions are presented in Fig. 1. Examination of hydrocarbons at different time intervals revealed a consistent highest concentration of n-C27 hydrocarbon in all three conditions, as detailed in Table 1. N-C26 is omitted in the analysis due to concentrations below 2%. Subsequent to n-C27, the concentrations of hydrocarbons followed a descending order: n-C29, n-C28, and n-C25. From the data logger readings, the highest temperature observed outdoors in buried conditions reached 12.5 °C, while the lowest dropped to 2 °C.

Fig. 1
figure 1

Chromatograms representing the cuticular hydrocarbons (CHCs) extracted from empty puparia of Calliphora vicina in three conditions i.e. (I) Buried, (II) Above-ground and (III) Indoor showing peaks 1 to 4 for n-C25, n-C27, n-C28, and n-C29, respectively

Table 1 Cuticular Hydrocarbons with their mean of concentrations (ng/µl) and standard deviation for each time point/Age (in days) for three environmental conditions

Conversely, outdoor above-ground conditions experienced a maximum of 10 °C and a minimum of -3 °C. Indoors, temperatures ranged from a maximum of 21.5 °C to a minimum of 16 °C (Fig S4). Key trends include a significant decrease in n-C25 concentrations in buried and above-ground settings over time. n-C27 concentrations consistently decline in buried and above-ground conditions while remaining stable indoors. n-C28 concentrations decrease in above-ground and indoor settings. n-C29 concentrations decrease in buried and above-ground conditions, with a slight, statistically insignificant increase indoors.

In the buried condition, n-C27 and n-C29 exhibited a significant (p < 0.0001) declining trend over the observation period, indicating decreased concentrations from Day 1 to Month 3. However, no significant changes were observed in the concentrations of n-C25 (p = 0.419) and n-C28 (p = 0.223) during the same period. This suggests that the burial environment may selectively influence certain hydrocarbons in empty puparia of C. vicina, with n-C27 and n-C29 showing a more pronounced response (Fig. 2A).

Fig. 2
figure 2

Residual ratio (%) vs. Weathering time (days) plot. A. Outdoor (buried) Among the studied hydrocarbons, n-C7, and n-C29 displayed a decreasing trend after 28 days. B Outdoor (Above-ground) displaying decreasing trend over the time. C Indoor, where hydrocarbons showed stability

Conversely, all hydrocarbons, including n-C27, n-C28, n-C25, and n-C29, displayed a significant decline in the above-ground condition. Concentrations decreased notably from Day 1 to Month 3. This uniform decline suggests a consistent environmental influence on the hydrocarbon composition of puparia when placed above-ground (Fig. 2B).

In the indoor condition, no significant (p > 0.05) changes were observed in the concentrations of the analyzed hydrocarbons over time. This relative stability suggests that the indoor environment has a less discernible impact on the hydrocarbon composition of empty puparia compared to the outdoor conditions (Fig. 2C). The distinct trends observed in different environmental settings highlight the nuanced responses of C. vicina puparia to varying conditions, providing valuable insights into the environmental factors shaping their hydrocarbon profiles.

The correlation analysis between weathering time and CHC concentrations in empty puparia buried in soil revealed intriguing relationships (Fig. 3A). Notably, a negligible correlation was observed between weathering time and n-C25 concentration (r= -0.0471), indicating minimal impact of time on this specific hydrocarbon. In contrast, a strong negative correlation was identified between weathering time and n-C27 concentration (r= -0.8248), suggesting a significant decrease in n-C27 levels over time. Similarly, a strong negative correlation (r= -0.7778) was found between weathering time and n-C29 concentration, indicating a pronounced reduction in n-C29 levels with increasing time. When exploring relationships between CHCs, weak to very weak correlations were identified. For instance, a weak positive correlation (r = 0.1425) was observed between n-C25 and n-C27 concentrations, while a moderate positive correlation (r = 0.6451) existed between n-C27 and n-C29 concentrations.

Fig. 3
figure 3

Heatmap displaying the correlation matrix of CHCs and Weathering time (days). A Outdoor-buried. B Outdoor-Above-ground. C Indoor. Each cell in the heatmap represents the correlation coefficient between the corresponding CHCs and weathering time. Negative correlations are represented by red color, positive correlations by blue

The correlation analysis between weathering time and CHC concentrations in empty puparia placed outdoors above ground yielded noteworthy insights into the dynamics of CHC changes over time in a natural environment. Moderate to strong negative correlation was observed between weathering time and each of the specific CHCs, namely n-C25 (r= -0.6546), n-C27 (r= -0.8171), n-C28 (r= -0.6765), and n-C29 (r= -0.8733). When examining the interplay between different CHCs, significant positive correlations were identified. For instance, a strong positive correlation (r = 0.7606) was observed between n-C25 and n-C27 concentrations, indicating a concurrent increase or decrease in both hydrocarbons over time. Similarly, moderate positive correlations were found between n-C25 and n-C28 (r = 0.6164), n-C27 and n-C28 (r = 0.6301), and n-C27 and n-C29 (r = 0.6416) (Fig. 3B). These correlations suggest potential synchronized variations in the concentrations of these CHCs during the weathering process.

In indoor conditions, weathering time shows a weak negative correlation with n-C25 (r = -0.026), indicating a slight tendency for the concentration of this hydrocarbon to decrease with increasing weathering time. Similarly, the correlations with n-C27 (r = -0.157) and n-C28 (r = -0.051) are negative but remain weak, suggesting only marginal decreases in their concentrations over time. The positive correlation between weathering time and n-C29 (r = 0.123) is also weak, signifying a slight increase in concentration with more extended weathering periods. However, the correlations between weathering time (days) and the concentrations of CHCs appear to be generally weak and insignificant. The correlation coefficients range from − 0.026 to 0.366, suggesting subtle or limited associations between these variables (Fig. 3C).

In the CDA conducted to assess the discriminative potential of CHCs among four distinct age groups, the results revealed significant differentiation between the groups, with an overall accuracy of 76.7%. Specifically, variables nC27 and nC29 emerged as crucial contributors to accurate group classification. The Eigenvalues provided further insight, with the first function explaining a substantial portion of the total variance (94.2%), indicating strong discriminatory power. However, in varying scenarios, such as in outdoor above-ground conditions, where the first function explained a remarkably high percentage (98.5%) of the total variance, and in indoor conditions, where it accounted for a relatively lower proportion (57.7%) of the cumulative variance, the discriminatory power varied, highlighting interpretations of the data across different contexts (for details refer to Fig. S5 in Supplementary information). Further, regression analyses conducted on three datasets—burial, above-ground, and indoor—revealed significant findings for CHCs in Calliphora vicina empty puparia. In the burial dataset, the model showed a strong correlation (R = 0.888, R²=0.789) with nC27 and nC29 as significant predictors (p < 0.001). Similarly, the above-ground dataset exhibited a robust model (R = 0.947, R²=0.896) emphasizing the importance of nC29 (p < 0.001). Conversely, the indoor dataset demonstrated a weaker model (R = 0.361, R²=0.130) with non-significant predictors overall (p > 0.05). Detailed results for each dataset can be found in the Supplementary file S6. To enhance precision in age estimation, the study transitioned to machine learning models, capitalizing on their effectiveness in predicting CHC dynamics.

Age estimation of empty puparia placed in outdoor buried using XGBoost model

The predictive model’s performance in estimating the age of puparia based on hydrocarbon concentrations in buried conditions was evaluated and presented in Fig. 4. The model optimization was carried out using a robust 5-fold cross-validation methodology coupled with hyperparameter tuning through a grid search approach. The optimized model parameters included Boosting Rounds set to 100, a Maximum Tree Depth of 6, and a Learning Rate of 0.1.

Fig. 4
figure 4

Illustrating the predicted age versus the actual age of puparia based on hydrocarbon concentrations in buried conditions. The robust 5-fold cross-validation methodology and hyperparameter optimization through grid search led to optimized parameters (Boosting Rounds: 100, Maximum Tree Depth: 6, Learning Rate: 0.1). Cross-validation results showed an average training RMSE of 5.36 and MAE 4.05, while the model achieved an RMSE of 5.54 and an MAE of 3.57 on a test dataset. The R-squared values further emphasize the model’s accuracy, with a value of 0.9737 for the training data (A) and 0.9719 for the test data (B)

The cross-validation results demonstrated that the model achieved an average training RMSE of 5.36 and an MAE of 4.05. When applied to the test dataset, the model’s performance was comparable, with an RMSE of 5.54 and an MAE of 3.57. These error metrics indicate the model’s capability to predict the age of puparia with high precision.

Furthermore, the R-squared values underscore the model’s accuracy and reliability. The training dataset yielded an R-squared value of 0.9737, while the test dataset exhibited a closely matching R-squared value of 0.9719. These high R-squared values reflect the model’s strong predictive power and robustness in estimating the age of puparia under buried conditions.

Age estimation using XGBoost on outdoor (above-soil)

The optimized model parameters through grid search included Boosting Rounds set to 50, a Maximum Tree Depth of 3, and a Learning Rate of 0.3. The cross-validation results demonstrated that the model achieved an average training RMSE of 0.03. When applied to the test dataset, the model’s performance was consistent, with an RMSE of 0.05 and an MAE of 0.04. These error metrics indicate the model’s capability to predict the age of puparia with very high precision.

Furthermore, the R-squared values underscore the model’s accuracy and reliability. The training dataset yielded an R-squared value of 0.9999, while the test dataset exhibited an R-squared value of 0.999. These exceptionally high R-squared values reflect the model’s strong predictive power and robustness in estimating the age of puparia under above-ground outdoor conditions (Fig. 5).

Fig. 5
figure 5

The predicted age versus the actual age of puparia based on hydrocarbon concentrations in above-ground outdoor conditions with key insights into the model’s performance. Cross-validation results (Boosting Rounds: 50, Maximum Tree Depth: 3, Learning Rate: 0.3) showed an average training RMSE of 0.03, while the model achieved an RMSE of 0.05 and a mean absolute error (MAE) of 0.04 on a test dataset. The R-squared values further emphasize the model’s accuracy, with a value of 0.9999 for the training data (A) and 0.999 for the test data (B)

Age estimation of empty puparia placed indoor

The model was optimized with 50 Boosting Rounds, a Maximum Tree Depth of 6, and a Learning Rate of 0.1. The cross-validation results demonstrated an average training RMSE of 9.2. For the test dataset, the model achieved an RMSE of 10.32 and an MAE of 8.54.

The R-squared values further emphasize the model’s accuracy, with the training data showing an R-squared value of 0.902 and the test data an R-squared value of 0.922. These high R-squared values also confirm the model’s effectiveness and reliability in indoor conditions (Fig. 6).

Fig. 6
figure 6

Illustrating the predicted age versus the actual age of puparia based on hydrocarbon concentrations with key insights into the model’s performance in Indoor conditions. Cross-validation results (Boosting Rounds: 50, Maximum Tree Depth: 6, Learning Rate: 0.1) showed an average training RMSE of 9.2, while the model achieved an RMSE of 10.32 and a mean absolute error (MAE) of 8.54 on an independent test dataset. The R-squared values further emphasize the model’s accuracy, with a value of 0.902 for the training data (A) and 0.922 for the test data (B)

Further results and details about the XGBoost model and interpretation are provided in Supplementary file S7.

Discussion

The investigation into the CHC profiles of empty puparia of Calliphora vicina across burial, above-ground, and indoor conditions over 90 days yielded valuable insights into the nuanced responses of these hydrocarbons to diverse environmental settings. Burial conditions revealed significant decreases in CHC concentrations, particularly notable for hydrocarbons such as n-C27 and n-C29. This decline suggests a pronounced response of CHCs to soil burial, likely driven by microbial degradation and chemical interactions with the soil matrix. Microbial activity in the soil, including the metabolism of organic compounds by bacteria and fungi, likely contributed to the degradation of longer-chain hydrocarbons [30,31,32,33,34,35]. Moreover, chemical processes such as oxidation and hydrolysis, facilitated by soil moisture and pH levels, may have further contributed to CHC degradation [36, 37].

In contrast, above-ground conditions exhibited uniform decreases in CHC concentrations across all hydrocarbons, indicating a consistent environmental influence on CHC degradation. Exposure to stressors like UV radiation, temperature fluctuations, and atmospheric pollutants likely accelerated hydrocarbon degradation processes. UV radiation, in particular, can induce photodegradation of CHCs, while temperature fluctuations and atmospheric pollutants can catalyze oxidative processes [3, 38,39,40]. Conversely, indoor conditions demonstrated relative stability in CHC concentrations over the observation period, suggesting a lesser impact of indoor environments on CHC degradation. Protection from environmental stressors like UV radiation and temperature fluctuations likely slowed down CHC degradation processes indoors. The weak correlations between weathering time and CHC concentrations in indoor conditions underscore the limited associations between environmental factors and CHC dynamics indoors. Understanding these influences is pivotal for accurate age estimation and forensic applications in entomology, and further research into the specific mechanisms driving CHC degradation in varied environments will enhance our comprehension of CHC dynamics and their forensic utility.

In our laboratory-based investigation, we assessed the changes in CHCs of empty puparia placed in two different storage media: soil (representing outdoor conditions) and paper towels (representing indoor conditions) [4]. After eight weeks, notable degradation was observed for all the CHCs studied in the soil medium. However, in the current study, only two of the four investigated CHCs showed degradation in buried conditions, while all CHCs exhibited a decline in concentration over time when puparia were placed above ground, resembling the effects of outdoor conditions. In contrast, the paper-towel medium, simulating indoor conditions, displayed a slower degradation pattern. Specifically, after 120 days, significant degradation was noted for n-C25 and n-C28, while concentrations of n-C27 and n-C29 increased [4]. In the current indoor investigation, no significant variations were noted. Indeed, while laboratory conditions attempt to mimic real-world environments, there can still be variations in degradation patterns between laboratory and actual outdoor conditions. Factors such as temperature fluctuations, humidity levels, microbial activity, and exposure to sunlight may influence the degradation rates of organic materials, including insect remains, in natural settings compared to controlled laboratory environments. These environmental variables create dynamic conditions that can significantly impact the decomposition process.

In a notable study, the degradation of CHCs in empty puparia of Chrysomya rufifacies Macquart, 1842 (Diptera: Calliphoridae) was investigated in laboratory and field conditions. The findings revealed a stark contrast in the rate of hydrocarbon weathering between the two settings, with much more rapid degradation observed in the field [26, 41]. This disparity underscores the profound influence of environmental conditions on the degradation kinetics of puparial hydrocarbons. The accelerated weathering observed in the field suggests a complex interplay of factors such as temperature fluctuations, exposure to UV radiation, microbial activity, and moisture levels, all of which contribute to the breakdown of organic compounds [33, 38, 39, 42]. Another study compared the differences in hydrocarbon and fatty acid esters, as well as transesterified waxes, between lab-preserved old puparia and fresh ones of Hydrotaea aenescens (Diptera: Muscidae) over a 15-year period [43]. The comparison between recent (2012) and older (1997) puparia contents has revealed significant differences in composition. Specifically, there is an observed general decrease in the chain length of the n-alkane distribution pattern, coupled with an increase in the length of ester chains. Both extracts also contain traces of three hopane hydrocarbon congeners. Furthermore, differences in hydrocarbon and fatty acid esters from transesterified waxes are clearly discernible. Their distribution patterns exhibit similarities to those reported by Zhu et al. for puparia exposed to weathering over 90 days [26, 41]. A recent study utilized multivariate analysis to investigate the weathering and aging of empty puparial cases from two blow fly species over nine months [44]. Quantifying the weathering time of these insect traces could significantly aid in narrowing down longer PMI.

While laboratory studies offer controlled environments conducive to systematic experimentation and observation, they may not fully capture the intricacies of real-world degradation processes. Consequently, extrapolating findings from laboratory experiments to natural scenarios requires careful consideration of potential differences and limitations. Field experiments or observational studies play a crucial role in complementing laboratory research by providing insights into the complexities of decomposition dynamics in diverse environmental contexts. The current study is an attempt in this direction.

Also, a relevant study investigated temporal variations in the reflection spectrum of Chrysomya megacephala Fabricius, 1794 (Diptera: Calliphoridae) pupa using hyperspectral imaging (HIS), along with proposing the eXtreme Gradient Boosting Regression (XGBR) model as an optimal approach for estimating pupa development time based on HSI data [45]. Their findings shed light on the dynamic changes occurring during pupa development and the potential of HSI in monitoring such changes non-invasively. Moreover, the introduction of the XGBR model represents a promising advancement in accurately estimating pupa age, which is crucial in forensic entomology for determining PMImin [45]. Previous research has employed artificial neural networks (ANN) to identify fly maggots based on images of spiracles, predict the age of adult flies, and estimate the weathering time of empty puparia in other forensically significant fly species [46,47,48]. Our recent study focused on the weathering patterns of empty puparia of another forensically important blowfly Lucilia sericata, various machine learning models, including ANN, support vector machine (SVM), and XGBoost, was employed to predict the age of empty puparia. Our first approach was regression analyses (detailed in Supplementary file S6), which yielded moderate R2 values of 0.78 for buried conditions and 0.89 for above-ground conditions. We sought to enhance accuracy and predictive performance, particularly in discerning age groups based on CHCs, so we turned to CDA. However, to achieve superior results, we implemented XGBoost. XGBoost is versatile and can be applied to various supervised learning tasks, including regression, classification, and ranking problems. The study reported that XGBoost demonstrated superior performance to the other models, achieving higher accuracy in age prediction. This highlights the effectiveness of advanced machine learning techniques, particularly XGBoost, in forensic entomology research for accurately estimating late PMI. A review study highlights the challenges in estimating PMI in highly decomposed bodies and skeletal remains, citing environmental factors and the scarcity of reliable time since death markers as key obstacles [49]. It emphasizes the need for a multidisciplinary approach, combining taphonomic, morphological, and entomological assessments and advanced techniques like fluorescence and proteomics [11, 49,50,51,52,53,54,55]. The study underscores the variability in decomposition rates and the importance of using multiple methods to enhance accuracy tailored to each case’s specifics and available resources. Understanding the limitations and varying reliability of these methods is crucial for forensic practitioners aiming to achieve accurate PMI estimations in cases involving severe decomposition.

Limitations of the study include the fact that it did not investigate other influential environmental factors such as microbial activity, UV exposure, moisture levels, and geographical variations. While these factors are known to likely impact the degradation of CHCs, their specific contributions require further empirical validation through subsequent research efforts. This study establishes a foundational baseline for future investigations in this direction.

Furthermore, enhancing the machine learning models used in this study by incorporating additional variables (as it was beyond the logistic and time frame of the current study), such as a broader range of CHCs and a more extensive dataset, is essential for advancing the accuracy and reliability of age estimation models based on CHC profiles. These enhancements represent critical steps towards refining forensic entomology methodologies to achieve more precise estimations of PMI. However, before these methods can be reliably applied in practical forensic settings, further research is needed to confirm and expand upon these findings.

Conclusion

The study provides valuable insights into the intricate dynamics of CHC profiles in empty puparia of Calliphora vicina across diverse environmental conditions. The findings reveal distinct degradation patterns, with burial and above-ground settings showing significant declines in CHC concentrations over time, while indoor conditions exhibit relative stability. These results highlight the complex interplay between environmental factors, microbial activity, and chemical processes influencing CHC degradation. The study emphasizes the importance of considering environmental variables in forensic entomology and ecological research, particularly for age estimation and environmental monitoring purposes. However, the study’s limitations include its focus solely on selected environmental variables. Factors like microbial activity, UV exposure variations, moisture levels, and geographical influences, known to impact CHC degradation, were not comprehensively investigated. Future research should validate findings across broader environmental contexts, expand datasets, and incorporate additional variables to refine CHC dynamics understanding. Moreover, the integration of machine learning techniques with entomological expertise holds potential for standardized methodologies for age determination, enhancing the reliability of entomological evidence in legal contexts and opening avenues for future research and development.