Introduction

Urinary tract infections (UTIs) frequently occur in infants and children. Before reaching 6 years of age, nearly 1.8 % of boys and 6.6 % of girls have suffered from at least one UTI [1]. In clinical practice, acute pyelonephritis (APN) and lower UTIs are two common forms of UTI that occur during infancy and childhood. Lower UTIs do not normally have complications, while APN is severe form of UTI that may result in renal scarring and subsequent complications, such as hypertension, hyposthenuria and chronic renal failure [2, 3]. The nonspecific signs and symptoms, including fever, flank pain or dysuria, make the clinical differentiation between APN and lower UTIs difficult, even for the experienced clinician [4, 5]. Commonly used laboratory parameters or other novel markers, such as erythrocyte sedimentation rate (ESR), serum leukocyte counts, neutrophil counts, C-reactive protein (CRP), interleukin (IL)-6 and IL-8, have low specificity and cannot reliably differentiate APN from lower UTIs, particularly in young children [68].

Differentiating APN from lower UTIs is important because the accurate diagnosis and prompt treatment of APN can prevent renal scarring and subsequent complications. Renal 99mTc-dimercaptosuccinic acid (DMSA) scintigraphy is considered the gold standard imaging method for APN diagnosis. However, there are factors that limit wide application of early DMSA scintigraphy in clinical practice, including the cost, availability and risk of exposing young patients to attendant radiation and sedation [9, 10]. Therefore, it is necessary to develop a reliable, specific and alternative biological marker that could be widely used by clinicians to differentiate APN from lower UTI in infants and children.

Procalcitonin (PCT), a 116-amino acid propeptide of calcitonin, is an early, reliable diagnostic marker of serious bacterial infections and sepsis that has several advantages compared with other potential biomarkers, including a wide biological range, short time of induction after bacterial stimulus and long half-life [11]. Recent studies have reported a correlation between serum PCT levels and renal parenchymal damage in UTIs [12, 13]. Increasing evidence indicates that serum PCT is a potential marker for differentiating APN from lower UTIs in children; however, the different studies that have evaluated the clinical utility of this test are controversial. This controversy has arisen due to the single study design of the previous studies and the fact that the various diagnostic threshold values used to perform sensitivity and specificity analyses disagreed among the studies [1421]. Additionally, new studies have been performed since the publication of the previous meta-analyses, and our understanding of PCT is still developing [22, 23]. Based on the above, we conducted an updated meta-analysis to comprehensively investigate the performance of serum PCT for the differentiation of APN and lower UTI in infants and children.

Methods

We performed this systematic review and meta-analysis in accordance with the preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidelines [24].

Data sources and search strategy

Systematic literature searches of Medline (via PubMed), Embase (via Elsevier), Web of Science and Cochrane Library were performed to identify relevant studies that assessed the accuracy of PCT for APN diagnosis until November 30, 2014, using the following search terms without language restrictions: (procalcitonin or PCT) and (acute pyelonephritis or APN). Manual searches were completed following a review of the reference lists of selected articles to identify any missing articles.

Study selection

Eligible studies were identified based on the following inclusion criteria: (1) enrolled infants and children who were diagnosed as UTIs and underwent serum PCT detection and subsequent renal 99mTc-DMSA scintigraphy; (2) evaluated the efficacy of PCT for diagnosing APN based on DMSA scintigraphy as a reference standard; (3) data were available to obtain true-positive (TP), false-positive (FP), true-negative (TN) and false-negative (FN) diagnostic results of APN and lower UTI; and (4) the sensitivity, specificity, positive predictive values (PPVs) or negative predictive values (NPVs) for the diagnosis of APN based on certain cutoff PCT values could be calculated in each study. Conference abstracts were excluded. Studies involving patients with renal dysplasia and renal scarring on DMSA scan, which were often associated with previous APN, were excluded.

Date extraction and quality assessment

The characteristics of each included study were extracted using a standardized form: first author, year of publication, study design, country of origin, sample size in each group, demographic characteristic of the study population (age, sex), PCT levels, referred cutoff values of PCT, time for DMSA scan and whether first episode. The Quality Assessment of Studies of Diagnostic Accuracy included in Systematic Review (QUADAS) assessment tool, which contains 14 items, was applied for the quality assessment of the included studies. Each question was assigned a response of yes, no or unclear for each study [25]. We graded them as 1 score to yes, 0 score to no or unclear for each of the 14 items. Because the quality assessment related strongly to the reporting of results, a well-conducted study could score poorly if the methods and results were not reported in sufficient detail. Two reviewers screened the studies and performed the study selection, data extraction and quality assessment independently. Any disagreements were resolved through discussion or a third reviewer.

Data synthesis and analysis

All synthesis and analysis were performed with the use of STATA 12.0 (StataCorp, College Station, TX). We used the bivariate mixed-effect model to estimate the pooled sensitivity, specificity, diagnostic odds ratio (DOR), area under the receiver operating characteristic curve (AUROC) and positive and negative likelihood ratios (LRs) of included studies by the MIDAS module [26]. In order to propose the potential optimal cutoff value, we conducted the data synthesis on two common threshold values of PCT (0.5 vs. 1.0 ng/mL) used in the literature. We used the derived logit estimates of sensitivity, specificity and respective variances to construct a hierarchical summary receiver operating curve (HSROC) by the METANDI module. The closer the AUROC was to 1, the better the test performance. Based on the summary sensitivity and specificity, we calculated the pretest probabilities of 25 % versus the corresponding posttest probabilities with the detection of serum PCT after admission. Graphs were produced with STATA 12.0 program. The I 2 test was used to assess heterogeneity of pooled statistical variables, with I 2 > 50 % indicating significant heterogeneity and I 2 < 25 % indicating no obvious heterogeneity. To explore the potential source of heterogeneity, we stratified the studies into several subgroups according to the characteristics of studies. Specificity, sensitivity, DOR, AUROC, I 2, PLR, NLR and 95 % confidence intervals (95 % CI) were calculated for each subgroup to quantitatively estimate the proportion of total variation. We investigated publication bias by Deeks’ test. Publication bias was considered to be present if there was a nonzero slope coefficient (p < 0.05) [27].

Results

Search results and study characteristics

As shown in Fig. 1, our systematic search initially yielded 239 publications in total, with 127 duplicates removed. Another 33 articles, including reviews, case reports and commentaries, were excluded after screening the titles and/or abstracts. A total of 79 studies remained for evaluation via detailed reading. Several studies did not meet our inclusion criteria because they were focused on therapy, risk evaluation or adults. Thirty studies did not include patients with lower UTIs. Additionally, we could not extract data for a 2 × 2 quadrant table in two studies. An additional manual search of the reference lists of the included studies and relevant reviews did not identify any other articles. Finally, 18 studies were included in this meta-analysis. The main characteristics of the included studies are listed in Table 1. These studies were performed in ten countries throughout Europe, America and Asia from 1998 to 2014, representing an international experience. A total of 1482 patients with UTIs were enrolled, and all the studies were conducted in infants and children. Only four of the 18 included studies had sample sizes greater than 100 subjects. The serum PCT levels were provided in most studies, and they were significantly different between patients with APN and those with lower UTIs. Ten of the included studies evaluated the performance of serum PCT using more than two cutoff values. Eleven of the 18 studies were conducted in patients experiencing their first UTI. Only one study did not report the time between admission to the hospital and performance of a DMSA scan, whereas the other studies reported performance of a DMSA scan within 7 days.

Fig. 1
figure 1

Flow chart of study selection

Table 1 Characteristics of included studies evaluating the performance of serum PCT in the diagnosis of APN among infants and children

Quality assessment

As summarized in Figure S1, the methodological quality of each included study was moderate to high according to the QUADAS standard. Seventeen of the studies were designed as prospective research and contained sufficient clinical information. In most studies, the index test results were not interpreted with knowledge of the blinded reference tests to eliminate bias, and uninterpretable, indeterminate or intermediate results were not reported.

Data analysis and calculations

Because some studies used different cutoff values for serum PCT, the 18 included studies were stratified into two groups according to two commonly used cutoff values in the literature (0.5 and 1.0 ng/mL) to investigate the diagnostic value of serum PCT for differentiating between APN and lower UTI. Because the numbers of TP, FP, TN and FN results were not directly provided by some studies, we calculated these indexes from the provided sample size, sensitivity and specificity values. Fourteen studies provided data for a cutoff value of 0.5 ng/mL, with a total of 1049 patients (55 % had APN), and seven studies (680 patients in total, 58 % had APN) used a cutoff value of 1.0 ng/mL. The relevant data from the individual studies of the two groups are listed in Table S1 and S2. The sensitivity of serum PCT with a cutoff value of 0.5 ng/mL ranged from 0.31 to 1.00, and the specificity ranged from 0.23 to 0.90. The sensitivity of serum PCT with a cutoff value of 1.0 ng/mL ranged from 0.52 to 0.94, and the specificity ranged from 0.83 to 1.00. Figures 2 and 3 present forest plots of the pooled analysis. As summarized in Table 2, the pooled sensitivity of serum PCT (≥0.5 ng/mL) for the diagnosis of APN in infants and children with UTIs was 0.86 [95 % confidence interval (CI), 0.73–0.93] and the specificity was 0.76 (95 % CI 0.66–0.83) with a DOR of 18.90 (95 % CI 6.78–52.71). The heterogeneity across studies was significant, as evidenced by the I 2 statistic of 95.78 %, and it was not derived from the threshold effect (r = −0.26, p = 0.38 > 0.05) because of the identical cutoff values used in these studies. However, serum PCT ≥ 1.0 ng/mL had a similar sensitivity of 0.84 (95 % CI 0.78–0.89) but a higher DOR of 55.06 (95 % CI 22.57–115.48) and specificity of 0.91 (95 % CI 0.86–0.94) without noticeable heterogeneity (I 2 = 0.00 %). The areas under the receiver operating characteristic curve (AUROC) of the two cutoff values were 0.86 (95 % CI 0.83–0.89; Fig. 4a) and 0.94 (95 % CI 0.92–0.96; Fig. 4b). The threshold effect was evaluated using the Spearman correlation coefficient of sensitivity and specificity (r = −0.32, p = 0.48 > 0.05), which suggested that there is no heterogeneity from the threshold effect when using 1.0 ng/mL as a cutoff value within the studies.

Fig. 2
figure 2

Forest plots of the pooled sensitivity and specificity of serum PCT (cutoff value, 0.5 ng/mL) for differentiation APN from lower UTIs. The black squares in the gray squares and the horizontal lines represent the point estimate and 95 % confidence interval (CI), respectively. The dotted line represents the pooled estimate, and the diamond shape represents the 95 % CI of the pooled estimate

Fig. 3
figure 3

Forest plots of the pooled sensitivity and specificity of serum PCT (cutoff value, 1.0 ng/mL) for differentiation APN from lower UTIs in infants and children. The black squares in the gray squares and the horizontal lines represent the point estimate and 95 % confidence interval (CI), respectively. The dotted line represents the pooled estimate, and the diamond shape represents the 95 % CI of the pooled estimates

Table 2 Pooled diagnostic accuracy for differentiation APN from low UTIs
Fig. 4
figure 4

Hierarchical summary receiver operating characteristic (HSROC) plots of serum PCT for the diagnosis of APN in young. a Derived from studies using cutoff value of 0.5 ng/mL. b Derived from studies using cutoff value of 1.0 ng/mL. The summary point represents the summary sensitivity and specificity, the 95 % confidence region represents the 95 % confidence intervals of the summary sensitivity and specificity, and the 95 % prediction region represents the 95 % confidence interval of sensitivity and specificity of each individual study included in the analysis. The plot also includes study estimates indicating the sensitivity and specificity estimated using the data from each study separately

No potential publication biases were identified in the studies using cutoff values of 0.5 ng/mL (t = −0.66, p = 0.52, Figure S2A) and 1.0 ng/mL (t = 0.06, p = 0.95, Figure S2B), as evidenced by the symmetric plots.

Subgroup analysis

To identify the sources of significant heterogeneity when using a cutoff of 0.5 ng/mL, the area of origin, ratio of males/females, first episode reported and timing of DMSA scan after admission were chosen as potential variables. Table 3 summarizes the results of the subgroup analysis.

Table 3 Diagnostic accuracy of serum PCT (cutoff value, 0.5 ng/mL) in various analyzed subgroups

Area of origin

Because the incidence of UTIs differs across countries, different races and ethnicities may be important factors that could affect the performance of PCT. Based on the origin of the subjects enrolled in this meta-analysis, we classified the 14 publications into two subgroups: Europe or America and Asia. Mild differences were evident in the statistical variables of each subgroup. The six studies from Asia had a relatively higher specificity (0.80; 95 % CI 0.68–0.89), DOR (22.19; 95 % CI 7.09–69.45) and AUROC (0.88; 95 % CI 0.85–0.91), while the specificity, DOR and AUROC of the eight studies in Europe or America were 0.72, 16.09 and 0.84, respectively. The difference between the subgroups was not statistically significant (p = 0.60).

Ratio of male/female

Physiologically, females are more susceptible to UTIs than males. Therefore, we divided the patients into two subgroups based on the sex ratio (males/females ≥0.5 and <0.5). The DOR was 17.63 for males/females ≥0.5 and 20.87 for males/females <0.5. The difference in DOR between the subgroups was not statistically significant (p = 0.23). The heterogeneity observed in each subgroup was still quite large because I 2 > 50 %.

First episode

Five studies did not report whether the subjects were experiencing their first UTI; therefore, we separated these studies. The DOR and AUROC of these five studies were 13.45 and 0.81, respectively. However, the nine remaining studies in which all included subjects were experiencing their first UTI showed improved serum PCT performance (DOR 30.50; AUROC 0.90) without obvious heterogeneity (I 2 = 0.00 %).

Timing of DMSA scans after admission

Of the 14 studies, five studies conducted a DMSA scan less than or equal to 72 h after admission, while the patients of the other studies may have received a DMSA scan less than or equal to 7 days after admission. The DOR and AUROC of the former subgroup (timing of DMSA scan ≤72 h) were 13.12 and 0.84, respectively, without heterogeneity across studies (I 2 = 0.00 %). However, the DOR and AUROC of the studies in which the timing of the DMSA scan was ≤7 days were 20.29 and 0.87, although it is difficult to reach a definitive conclusion due to the significant heterogeneity (I 2 = 94.24 %).

Discussion

PCT has been recognized as a marker for APN since 1998 [28]. However, the reported diagnostic value in children varies worldwide. A previous meta-analysis conducted by Mantadakis et al. [22] was published on a similar subject in 2009; however, some deficiencies in their design and data analysis may have affected the accuracy of their conclusions. First, only ten studies that used a serum PCT cutoff value of 0.5–0.6 ng/mL were used to calculate the estimated pooled statistics, which had obvious heterogeneity. Second, there were some errors in the data extraction from the included articles. Additionally, no further analyses were performed to identify the source of heterogeneity. In another meta-analysis from 2013, which included 18 studies from January 1993 (when PCT was first described in relation to bacterial infection) to September 2011, Leroy et al. [23] studied procalcitonin as a predictor of both APN and scarring in children with UTIs. The diagnostic accuracy of serum PCT was low: The pooled sensitivity and specificity were 0.71 (95 % CI 0.67–0.74) and 0.72 (95 % CI 0.67–0.76), respectively, for the cutoff value 0.5 ng/mL and 0.65 (95 % CI 0.61–0.69) and 0.87 (95 % CI 0.83–0.90), respectively, for the cutoff value 1.0 ng/mL. When examining studies included in that meta-analysis individually, we determined that some publications did not perform the current gold standard, DMSA renal scintigraphy, which was not strictly in accordance with the inclusion criteria. Furthermore, the authors did not provide the extracted data used for the analysis in detail. Despite considering three PCT cutoff values, the number of publications used to evaluate the performance of each cutoff value was unknown, and the authors did not offer any novel viewpoints. Moreover, in the last three years (from September 2011 to September 2014), at least six additional publications that assessed the suitability of PCT in pediatric UTIs have been performed; therefore, it is appropriate to re-evaluate the diagnostic value of PCT.

In this systematic review and meta-analysis, we summarized the results of all published studies that assessed the performance characteristics of serum PCT for the differential diagnosis of APN and lower UTIs in infants and children in detail. Our findings are consistent with previous studies; however, we add more evidence and novel views that support the utilization of serum PCT as a marker of APN in clinical practice. We enrolled 1482 pediatric patients with UTIs in total and applied two commonly used cutoff values for serum PCT (0.5 or 1.0 ng/mL) in the analysis.

The results revealed that a serum PCT cutoff value of 0.5 ng/mL had a DOR of 18.90 (95 % CI 6.78–52.71) and an AUROC of 0.86 with significant heterogeneity across studies. Notably, according to the subgroup analysis, we determined that lack of information about the number of episodes and timing of the DMSA scan after admission, rather than the area of origin or ratio of males/females, may be the major sources of heterogeneity in the studies with this cutoff value. Nine of the 14 included studies reported that all subjects were experiencing their first episode and exhibited good performance for pooled specificity, DOR and AUROC and lacked heterogeneity (Table 3), indicating that well-controlled studies may provide more reliable information for selection. Therefore, several pediatric patients in the other studies may have previously suffered APN, and these patients were responsible for the observed heterogeneity. Patients in five studies had a DMSA scan within 72 h after admission, and it appears that this group had a higher pooled sensitivity (0.88; 95 % CI 0.68–0.97) and AUROC (0.84; 95 % CI 0.80–0.87) compared with patients who had a DMSA scan within 7 days. Some possible explanations for this observation are that some patients may not have arrived at the hospital in time or may have used antibiotics before admission; therefore, a DMSA performed within 72 h may be more accurate. There was a potential skewing of the reference standard test in the studies that performed DMSA within 5 or 7 days because of the different positive results caused by time point selection, specifically for patients with mild renal parenchymal involvement, which could directly contribute to the observed heterogeneity. To our knowledge, the incidence of APN in different genders and races may differ; however, we failed to further stratify the patients enrolled in our study due to the limited valid data.

Conversely, our analysis determined that 1.0 ng/mL may be an optimal serum PCT value for APN diagnosis, with a DOR of 2.60, high sensitivity (0.84; 95 % CI 0.75–0.90), high specificity (0.91; 95 % CI 0.86–0.94) and an AUROC of 0.94, which further suggested that its overall predictive accuracy was high. The likelihood ratios and posttest probabilities are also relevant for clinicians. The overall positive LR (9.17; 95 % CI 5.81–14.47) of this cutoff value was relatively reliable compared with 0.5 ng/mL; for example, an infant or child with APN is almost nine times more likely to have a positive test result than a patient with a lower UTI. With a pretest probability of 58 %, the posttest probability of a positive test result increased to 93 %, which was much higher than the posttest probability of 81 % calculated using 0.5 ng/mL as a cutoff value. Furthermore, the negative LR of 0.19 reduced the posttest probability of a negative test result to 20 %. The main strength of the present study compared with previous studies is that the data synthesis and analysis were conducted using a commonly used cutoff value (0.5 ng/mL) and a possible optimal cutoff value 1.0 ng/mL. In addition to conventional DOR, sensitivity, specificity and AUROC, we also provided the LR to ensure that the results can be better understood by clinicians. On the basis of our results, we consider 1.0 ng/mL to be an effective cutoff value for serum PCT to distinguish APN from lower UTIs.

There are some limitations to our study. First, the age of the included patients ranged from 3 weeks to 12 years. The sensitivity of serum PCT may vary at different life stages; however, due to the nature of the data extracted from the research studies, we were not able to stratify the subjects according to age. Second, we were unable to formally assess the related clinical signs and symptoms or the coexistence of infections at other sites, which may influence serum PCT levels. Third, the sample sizes in a majority of the included studies were less than 100, and only seven studies evaluated the diagnostic value of a cutoff of 1.0 ng/mL for serum PCT. More than half of the studies were performed in Asian populations, which may have led to selection bias. Moreover, the LRs were calculated from dichotomized data. The results of the serum PCT test were artificially classified as either positive or negative. Therefore, it is possible that some useful intermediate information was lost.

In conclusion, this meta-analysis demonstrates that a serum PCT cutoff value of 1.0 ng/mL provided good diagnostic value for discriminating between APN and lower UTIs in infants and children. Moreover, several published studies have suggested that serum PCT is also a predictive marker of renal scarring or vesicoureteral reflux in patients with UTIs. Further studies are needed to validate the diagnostic and potential prognostic value of this cutoff value of serum PCT in large multicenter cohorts.