Introduction

Perinatal asphyxia is a major cause of neonatal morbidity and mortality, affecting 5–10 per 1000 live births [27]. Acute kidney injury (AKI) represents a serious complication as it is associated with prolonged hospital stay and mechanical ventilation [25], as well as with adverse neurological outcomes [39]. Its incidence is estimated at 40% of asphyxiated newborns, although presenting wide variance depending on the definitions [40]. The pathophysiology of AKI after a hypoxic insult is multifactorial and it is based on renal hypoperfusion due to diving reflex and increased oxidative damage [45]. Recent research has focused on the potential role of various preventive measures, suggesting the administration of theophylline in neonates at high risk for post-asphyxial AKI [34]. However, the interpretation of creatinine measurements in the early postnatal life is unreliable, since it reflects maternal concentrations and can overestimate the glomerular filtration rate (GFR) due to its unfavorable kinetics [12]. As a result, there is growing interest in the identification of novel biomarkers that would allow early risk stratification and accurate prediction of AKI in neonates with perinatal asphyxia [3].

Neutrophil gelatinase-associated lipocalin (NGAL) is a 178 amino acid protein expressed mainly by neutrophils and by various epithelial tissues at lower levels. It presents bacteriostatic properties, contributing to the innate immunity processes [7]. AKI is followed by a rapid rise of serum and urinary NGAL, as it is able to promote renal tubular recovery due to its anti-apoptotic effects [30]. As indicated by recent meta-analyses, NGAL represents a promising biomarker of AKI with high performance in the settings of sepsis [54] and exposure to contrast medium [52]. Furthermore, there is evidence that it can serve as a useful marker in pediatric populations [18] with high predictive efficiency, especially in AKI after cardiopulmonary bypass [11]. NGAL is also increased in a variety of pediatric urologic conditions and can contribute to the evaluation of renal damage in children with infection [51] or obstruction of the urinary tract [20].

The role of NGAL in the prediction of AKI in asphyxiated neonates has been recently explored in several observational studies; however, no consensus exists regarding its exact efficacy in this population. The present meta-analysis aims to accumulate current literature knowledge in the field in order to determine the pattern of NGAL elevation and assess its performance in the prediction of AKI in newborns with perinatal asphyxia.

Materials and methods

Study design

The present systematic review and meta-analysis was designed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [26]. Eligibility criteria were predetermined by the authors. Date or language restrictions were not applied during the literature search to avoid language bias. The studies were selected in three consecutive stages. Firstly, the titles and abstracts of all electronic articles were screened to evaluate their eligibility. Subsequently, the articles that met or were presumed to meet the criteria were retrieved as full texts. In the final stage, all observational studies (both prospective and retrospective) that reported serum or urinary levels of NGAL among asphyxiated newborns that developed AKI were selected. Case reports, review articles, and animal studies were excluded. Any discrepancies regarding the methodology, retrieval of articles, and statistical meta-analysis were resolved through the consensus of all authors.

Literature search and data collection

Literature search was primarily conducted using the Medline (1966–2018), Scopus (2004–2018), EMBASE (1980–2018), and Clinicaltrials.gov (2008–2018) databases, along with the reference lists of the electronically retrieved full-text papers. Google Scholar (2004–2018) database was also searched in order to identify possible additional sources. The date of the last search was set at 12 April 2018. The search strategy included the following terms: “NGAL, neutrophil gelatinase, lipocalin, perinatal, neonatal, birth, asphyxia, encephalopathy, acute kidney injury, renal failure” and is schematically presented in the PRISMA flowchart (Fig. 1).

Fig. 1
figure 1

Search plot diagram

Quality assessment

The quality of all included studies was evaluated with the Risk Of Bias In Non-randomized Studies (ROBINS-I) assessment tool, which assesses the potential bias due to confounding, selection, classification, deviation from intended intervention, missing data, measurement, and reporting of the outcomes [42]. Moreover, the methodological quality of the studies included in the diagnostic accuracy analysis was also judged with the QUADAS-2 tool, which consists of four domains: patient selection, index test, reference standard, and flow and timing [53].

Definitions

Perinatal asphyxia was defined on the basis of Apgar score, fetal distress, pH, metabolic acidosis, hypoxic event, and need for neonatal resuscitation. Acute kidney injury was diagnosed with creatinine criteria (serum creatinine ≥ 1.5 mg/dl) or according to AKIN (Acute Kidney Injury Network) [28] and KDIGO (Kidney Disease: Improving Global Outcomes) [23] guidelines. The definition of AKI in each study is provided in Suppl. Table 1.

Statistical analysis

The statistical meta-analysis of diagnostic accuracy was carried out in R (3.4.3 version), using the MADA package [10]. A bivariate model was fitted to calculate summary estimates of sensitivity and specificity, since it takes into consideration the potential correlations between these two quantities, due to threshold effect [35, 36]. Summary receiver operating characteristic (SROC) curves were constructed and the area under the curve (AUC) for serum and urinary NGAL were separately estimated. Confidence intervals (CI) were set at 95%. Subgroup analysis was performed regarding the following variables: cutoff values, AKI definition, study design, and location. Additionally, as a secondary analysis, a univariate approach was implemented to provide pooled estimates of likelihood ratios (LR) and diagnostic odds ratio (DOR) of the two tests. A Fagan’s nomogram was drawn, which determines the post-test probability, according to the calculated positive and negative likelihood ratios [5]. The pre-test probability was set at 30%. The possibility of publication bias was assessed using Deeks’ funnel plot, which is constructed by plotting the inverse of square root of the effective sample size (ESS) against the ln(DOR) [8].

Sensitivity analysis

Leave-one-out analyses were performed to explore the effect of individual studies on the overall outcome. One study was sequentially omitted at a time and pooled estimates of sensitivity, specificity, and AUC were calculated. This analysis was also performed in R (MADA package) [10].

Results

Included studies

Eleven studies [1, 4, 13, 16, 31, 33, 38, 44, 46, 48, 49] were finally included in the present review, with a total number of 652 asphyxiated neonates. Among them, 257 developed AKI, while the rest 395 served as the control group. The methodological characteristics of the included studies (country, study design, patient exclusion criteria, asphyxia and AKI definition, type of sample, assay method, prematurity, and NGAL cutoff value) are presented in Suppl. Table 1. The most important patients’ characteristics that were extracted included number, gender, gestational age at delivery, birth weight, nature of birth, Apgar score, and baseline serum creatinine levels (Suppl. Table 2). Nine studies [1, 4, 13, 16, 31, 33, 38, 48, 49] were eligible for the diagnostic accuracy meta-analysis, as they provided adequate data for the construction of the 2 × 2 table. Sensitivity and specificity of the included studies are illustrated in forest plots (Suppl. Figure 1).

Excluded studies

Six studies [6, 14, 17, 19, 41, 43] were excluded after reading the full text. Four of them [14, 17, 19, 41] did not report the outcome of interest, while the population of one study [43] did not consist exclusively of newborns with perinatal asphyxia. Moreover, one study was not included, since it contained less than five AKI cases [6].

Quality assessment

The outcomes of ROBINS-I tool show that the overall risk of bias was evaluated to be moderate (Suppl. Table 3). The results of the QUADAS-2 tool are illustrated in Fig. 2. Higher risk was identified in the section of index test, since all studies did not use pre-specified cutoff values but the optimal ones. Furthermore, risk of bias in the domain of patient selection was pointed out in two studies [31, 38] due to case-control design, while applicability concerns were acknowledged in one study as a result of differentiations in the applied definitions [49].

Fig. 2
figure 2

QUADAS-2 evaluation

Qualitative synthesis

The results of all studies were assessed qualitatively (Suppl. Table 4). Serum levels of NGAL were assessed in six studies [1, 4, 13, 33, 38, 44]. All studies reported significantly higher serum NGAL concentrations in the AKI group, when measured both in the first and third days of life. However, this effect was not observed in measurements in postnatal days 7 and 10 [33, 38]. Pejovic et al. [33] discriminated AKI cases in two stages according to the AKIN criteria, finding higher NGAL in both subgroups. Interestingly, stage 2 was associated with significantly greater NGAL values compared to stage 1. Urinary NGAL was explored in seven studies [1, 16, 31, 38, 46, 48, 49]. In six of them, NGAL was significantly increased in the urine of asphyxiated newborns with AKI, while this was not observed in one case-control study including 13 neonates [38]. More specifically, elevated urinary NGAL was reported in five out of seven studies in the first day of life and in three out of four studies in days 3–4. In addition, Tanigasalam et al. [48] revealed significant differences of urinary NGAL in all stages of AKI compared to the control group, with higher values in stage 3.

Quantitative synthesis

The summary ROC (SROC) curves of serum and urinary NGAL are depicted in Fig. 3. The summary sensitivity and specificity of serum NGAL for the detection of post-asphyxial AKI were calculated at 0.818 (95% CI [0.668, 0.909]) and 0.870 (95% CI [0.754, 0.936]), respectively, while AUC was 0.912. The pooled DOR was estimated at 34.880 (95% CI [7.068, 172.122]), while the positive LR at 5.8 (95% CI [2.618, 12.849]) and the negative LR at 0.213 (95% CI [0.095–0.479]). Pooled estimates of sensitivity, specificity, and AUC for urinary NGAL were 0.897 (95% CI [0.829, 0.940]), 0.729 (95% CI [0.561, 0.850]), and 0.899, respectively. The summary DOR was calculated at 33.183 (95% CI [13.508, 81.518]), with the positive LR to be 3.339 (95% CI [2.060–5.414]) and the negative LR 0.141 (95% CI [0.084, 0.236]). The outcomes of the subgroup analysis are shown in Table 1. Of note, serum NGAL presented higher AUC values in cohort studies, using the AKIN definition, with cutoff > 100 ng/ml, while urinary NGAL appeared to be more specific in studies with lower thresholds. Fagan’s nomogram for serum NGAL indicated that the post-test probability was increased to 71.3% and decreased to 8.4%, when the pre-test probability was 30%. Respectively, the post-test probabilities for urinary NGAL were 58.9 and 5.7% (Fig. 4). Deeks’ funnel plot revealed no evidence of publication bias (p value > 0.05) (Suppl. Figure 2).

Fig. 3
figure 3

Summary ROC curve (bivariate model) for the performance of serum and urinary NGAL in the prediction of acute kidney injury in asphyxiated neonates

Table 1 Outcomes of the subgroup analysis
Fig. 4
figure 4

Fagan’s nomogram for the calculation of the post-test probability of acute kidney injury

Sensitivity analysis

The outcomes of the leave-one-out analysis revealed that no single study was found to significantly affect the overall outcome (Suppl. Table 4). By omitting one study at a time, sensitivity of serum NGAL ranged from 0.766 to 0.874, specificity from 0.845 to 0.906, and AUC from 0.873 to 0.949. Concerning urinary NGAL, sensitivity ranged from 0.887 to 0.927, specificity from 0.674 to 0.781, and AUC from 0.881 to 0.934.

Discussion

Acute kidney injury constitutes a leading cause of neonatal morbidity, especially in the setting of perinatal asphyxia. Early risk stratification is essential to guide clinical management, provide measures of prevention, and improve mortality rates. More specifically, recent research has focused on the potential role of theophylline in preventing post-asphyxial acute kidney injury, since it is able to increase renal blood flow by inhibiting the pre-glomerular vasoconstriction mediated by adenosine [32]. In this context, several clinical trials [15, 21, 29, 34] have investigated the effectiveness of this approach, concluding that the administration of theophylline early after birth is associated with significantly higher urine output and improved glomerular filtration rate. Moreover, there is evidence that therapeutic hypothermia in newborns with perinatal asphyxia is associated with reduced incidence and severity of acute kidney injury, as well as with lower mortality rates [47]. As a result, effective early risk assessment is crucial to identify the group of asphyxiated neonates that would benefit the most from the application of the above preventive strategies [40].

Nevertheless, current criteria for AKI detection have important shortcomings, since creatinine represents a late biomarker due to its constant rate of generation by muscle tissue and varies widely, as it is influenced by maternal renal function and GFR alterations in the early postnatal life [2]. Consequently, it is necessary to find out an optimum marker that combined with clinical factors would maximize the ability to promptly predict AKI in asphyxiated neonates [50].

The findings of the present meta-analysis suggest that NGAL is elevated in newborns with perinatal asphyxia complicated with AKI and may serve as a promising biomarker, given its high diagnostic performance. Urinary levels presented greater sensitivity (89.7%), while serum levels turned out to be more specific (specificity: 87%). As indicated by the subgroup analysis, the predictive accuracy of serum NGAL was superior when thresholds ranged from 140 to 157 ng/ml. On the contrary, the used cutoff values of urinary levels presented wide variation (18–652 ng/ml), rendering thus the interpretation of the test problematic. It is important to mention that NGAL levels showed a significant rise early in the postnatal period, while this effect was attenuated after the first week of life, highlighting the potential role of this marker in the follow-up of AKI cases. Furthermore, severe forms of AKI were associated with increased levels of both serum and urinary NGAL [33, 48], while a positive correlation with Sarnat stages of hypoxic-ischemic encephalopathy was noted [1]. As reported in two studies, the utility of NGAL in the prediction of perinatal asphyxia severity is also supported by its linear correlation with Apgar score (r = − 0.41, p value < 0.05) [4], umbilical lactate concentration (r = 0.57, p value < 0.05), and pH (r = − 0.55, p value < 0.05) [44].

Strengths and limitations of the study

The present study constitutes a systematic review combining, for the first time, current literature in the field of NGAL utility on the prediction of AKI among asphyxiated neonates. A recent meta-analysis [22] has also shown the potential efficacy of NGAL in AKI among critically ill neonates; however, the present meta-analysis includes a significantly greater number of studies (11 vs. 5) and focused only in cases with perinatal asphyxia in order to determine the accuracy of NGAL in this specific population. Statistical meta-analysis was performed using a bivariate model, which is a random effects model that overcomes the limitations of the conventional Moses-Littenberg method [9]. The methodological quality of the studies was evaluated by two independent tools (ROBINS-I and QUADAS-2), in order to reveal possible sources of systematic bias. Potential confounders were identified and subgroup analysis was conducted. Also, sensitivity analysis was performed to assess if a single study exerted significant effect on the overall outcome and the results remained stable. It is necessary to note that omitting the study [49] with the higher risk of bias did not significantly alter the net diagnostic accuracy (AUC 0.885 vs. 0.899). A Fagan’s nomogram was finally drawn, since it is a clinically useful tool that allows the rapid determination of post-test probability, using one of the two NGAL tests.

However, this meta-analysis is based on the results of nine studies, including a moderate number of patients. Since the majority of them reported their outcomes in terms of median and range, a quantitative synthesis regarding the differences of the absolute NGAL values between the two studied groups was impossible to be performed. Moreover, the variability of the applied cutoffs was conspicuous, while the tendency of studies to use the optimal ones may have overestimated the calculated diagnostic accuracy of the test. Possible sources of heterogeneity that could contribute to the wide variability of NGAL measurements include differentiations in patient eligibility criteria and definition of perinatal asphyxia, as well as in the laboratory assay used for the detection of the marker. In addition, the population of seven out of nine studies consisted exclusively of term neonates; therefore, evidence regarding NGAL utility in premature newborns remains insufficient. Most measurements were performed in the first 3 days of life and thus the role of this marker in AKI follow-up is uncertain. It is also important to state that the use of creatinine as the gold standard test constitutes an inherent limitation of all studies investigating the accuracy of novel biomarkers, due to its unreliable nature in the perinatal period, as earlier discussed.

Implications for current clinical practice and future research

The present meta-analysis suggests that NGAL represents a promising marker in neonates and may serve as a candidate test to detect perinatal asphyxia complicated by AKI. Effective stratification of AKI risk would allow intervening early in the disease process by offering prophylactic measures and guide clinical management regarding fluid balance and the avoidance of nephrotoxic medications. However, several aspects need to be elucidated before NGAL use can be applied in the clinical setting. Future large-scale multi-center prospective studies should be conducted in order to accurately calculate the diagnostic efficacy of NGAL and to determine the range of its concentration in asphyxiated neonates with and without AKI, as well as in healthy controls. The AKIN [28] and KDIGO [23] criteria should be used as the reference test, while definitions based on single creatinine measurements should be avoided. Furthermore, studies should pre-determine their cutoffs based on the values proposed in the present review, in order to achieve a more realistic estimation of NGAL accuracy. The effect of possible confounders, especially severity of perinatal asphyxia, stage of AKI, and prematurity should be taken into consideration and analyzed separately. Other perinatal factors able to modify NGAL levels, such as congenital infections and neonatal sepsis [24, 37], should not be ignored. Also, sequential measurements of NGAL would allow defining the optimal timing of sampling and assessing its utility as a marker of recovery from renal injury. Head-to-head comparison of serum and urinary levels should be performed to find out which of the two is the most useful test. In this context, NGAL should be evaluated in conjunction with other novel biomarkers and clinical factors in order to construct a combined predictive model that would maximize the accuracy of AKI detection among newborns with perinatal asphyxia. Finally, randomized clinical trials are needed to assess if NGAL constitutes a useful tool in guiding decisions regarding clinical management and preventive strategies, especially the administration of theophylline early after birth.

Conclusions

The outcomes of the present meta-analysis suggest that serum and urinary NGAL present high performance in the detection of post-asphyxial AKI. However, since the available evidence comes from a limited number of studies, future large cohorts are needed to confirm its diagnostic efficacy, as well as to determine the optimal timing of measurement and the cutoff values to be applied in the clinical practice.