Background

Current bedside tests for UTI are not sufficiently accurate. The best available point-of-care test for UTI, the leukocyte esterase test, has a 79% sensitivity (i.e., it misses 21% of children with a true UTI) and 87% specificity (i.e., 13% of those negative for UTI will be incorrectly diagnosed with UTI and receive antimicrobial treatment) [1]. Accordingly, there has been interest in identifying tests that can more accurately differentiate children with and without UTI. More accurate tests could reduce both overtreatment and missed cases.

Current point-of-care tests for the diagnosis of acute pyelonephritis (APN) are also suboptimal; among these, the best markers are procalcitonin (PCT) and C-reactive protein (CRP), which are only useful for ruling in and ruling out APN, respectively [2].

NGAL is a small protein released by activated neutrophils in response to infection. Thus, several investigators have compared the accuracy of NGAL to the leukocyte esterase test as a screening test for UTI. Furthermore, because renal tubular cells release NGAL in response to injury or infection [3], others have investigated its potential as a marker of APN.

Although the accuracy of NGAL for acute kidney injury has been reviewed, its utility in the diagnosis of UTI and APN has not. This systematic review will examine whether plasma or urine NGAL (pNGAL and uNGAL, respectively) can be used to accurately diagnose UTI or APN.

Methods

Types of studies

The protocol for this review was submitted to PROSPERO (https://www.crd.york.ac.uk/PROSPERO/) on November 13, 2018 (CRD42020137166). We considered published studies that evaluated the results of an index test (NGAL) against the results of urine culture (for UTI) or dimercaptosuccinic acid (DMSA) for acute pyelonephritis (APN). Cross-sectional and cohort studies were acceptable; case-control designs were included only if controls were representative of children being tested for UTI or APN (case-control studies with extreme cases or controls tend to result in biased estimates) [4]. We excluded studies in which another inflammatory marker (e.g., C-reactive protein, erythrocyte sedimentation rate) was used to select those getting DMSA. We included studies of symptomatic children from birth to 18 years of age with suspected UTI. Studies limited to children with neurogenic bladder or children with major genitourinary abnormalities were excluded. Studies in which another test was initially used to screen patients (e.g., WBC count to screen for UTI) were excluded.

Index tests

Studies that examined the accuracy of NGAL were considered, regardless of how NGAL was measured. Some studies reported creatinine normalized uNGAL values (uNGAL/creatinine), while others just reported uNGAL.

Target conditions and reference standards

UTI and APN were the two target conditions examined. The reference standard for diagnosis of UTI was the urine culture. We defined UTI as growth of one or two organism(s) at ≥ 10,000 colony-forming unit/ml (CFU/ml) from a catheterized specimen, or ≥ 100,000 CFU/ml clean catch, midstream, or bag specimen, or any growth from a suprapubic specimen [5]. The reference standard for assessing the presence and extent of pyelonephritis is a DMSA scan obtained within 2 weeks of diagnosis of a UTI [6].

Search strategy

We searched MEDLINE and EMBASE. Searches were carried out on December 7, 2019. Search strategies are presented in the electronic supplement (online resource 1). The reference lists of all included articles and relevant systematic reviews were reviewed to identify additional studies not found through the electronic review. Two authors independently applied the selection criteria to all citations (titles and abstracts). The full text of all articles identified by either author was retrieved and reviewed. We limited the review to articles written in English, Spanish, French, German, Portuguese, or Italian.

Data extraction and management

For each study meeting the inclusion criteria, we extracted the following information: age of participants, fever required (yes, no), used perineal bags to collect urine specimen (yes, no), and case control (yes, no). Two by two tables were constructed independently by two authors from the data in the publication. Only studies for which two by two data were available (or could be reconstructed) were included. Two authors independently used the QUADAS II questionnaire to assess the risk of bias in all studies meeting inclusion criteria [7]. Disagreements were resolved by discussion.

Statistical analysis and data synthesis

The primary analysis was to compare the NGAL test against the reference standard. We had planned to pool accuracy values (using a random-effects bivariate model using SAS Proc NLMIXED in SAS; see registered protocol for more detail) and statistically investigate heterogeneity only if 5 or more studies reported accuracy values for the NGAL test using the same cutoff. Otherwise, if different studies use different cutoff values when reporting accuracy, instead of reporting pooled accuracy (i.e., sensitivity and specificity) values, we had planned to examine how accuracy varied with threshold on a summary receiver operating characteristic curve (i.e., curve used to summarize performance of a diagnostic test based on data from a meta-analysis).

Results

The results of the search strategy are shown in Fig. 1. Of the 1892 records identified, 60 records were retrieved and reviewed. A total of 12 studies met our inclusion criteria; one author was successfully contacted for clarification. The characteristics of key excluded studies are detailed in the electronic supplement (online resource 2).

Fig. 1
figure 1

FLow Diagram

uNGAL or uNGAL/creatinine for UTI

Four studies (920 patients) provided data for these analyses [8,9,10,11]. Two [9, 11] presented data for uNGAL over creatinine and three [8, 10, 11] presented data for NGAL not normalized to urine creatinine. Of the four studies, two used bags for urine collection, three included only febrile populations, and two were case control in design (Table 1). In all studies (Table 2, also see online resource 3 for graphical representation), NGAL had markedly higher accuracy than previously reported pooled sensitivity and specificity values [95% confidence interval] for the leukocyte esterase test (0.79 [0.77, 0.81] and 0.87 [0.86, 0.88], respectively) [1]. Direct comparison between uNGAL and existing tests for the diagnosis of UTI (leukocyte esterase test or WBC count) was limited due to the small number of studies, but data from studies that provided this is reported in Table 3; in all but one study, the NGAL test (or NGAL/creatinine) was more sensitive than the currently available test and had comparable or better specificity; in one small study [9], uNGAL was more sensitive but less specific than the leukocyte esterase test. An important methodological limitation was that in two of the four included studies, patient selection could have introduced bias (online resource 4).

Table 1 Characteristics of the included studies according to index test and target condition
Table 2 Accuracy of NGAL for urinary tract infection (UTI) and acute pyelonephritis
Table 3 Accuracy of leukocyte esterase or white blood cell (WBC) count for diagnosis of UTI in included studies

uNGAL for APN

Three studies (162 patients) provided data for this analysis [9, 12, 13]. Of the three studies, one used bags for urine collection, all three included only febrile children, and two were case-control studies (Table 1); two studies reported uNGAL, and two reported uNGAL/creatinine. An important methodological limitation was that in one of the three included studies, patient selection could have introduced bias (online resource 4). The accuracy of uNGAL was low in all three studies (Table 2 and online resource 3 for a graphical representation).

pNGAL for APN

Six studies (474 patients) provided data for this analysis [12, 14,15,16,17,18]. Of the 6 studies, none used bags for urine collection, four included only febrile children, and none of the studies were case control in design (Table 1). Both sensitivity and specificity were above 80% in two of the six studies [14, 16] (Table 2 and online resource 3 for a graphical representation).

Discussion

In this review we examined the accuracy of urinary NGAL for the diagnosis of UTI and the accuracy of urinary and plasma NGAL for the diagnosis of pyelonephritis. With regard to UTI diagnosis, we found that urinary NGAL consistently outperformed the leukocyte esterase test in both sensitivity and specificity in the majority of included studies. The non-uniformity of cutoffs used across the different studies precluded the calculation of an overall sensitivity and specificity. Nevertheless, in most studies examined, the accuracy of NGAL significantly exceeded the accuracy of the leukocyte esterase test.

For differentiating pyelonephritis from cystitis, neither uNGAL nor pNGAL demonstrated sufficient accuracy to replace the DMSA scan for definitive diagnosis. Furthermore, the number of studies in each of the three analyses (uNGAL for APN, uNGAL/creatinine for APN, pNGAL for APN) was small. This lack of data was compounded by methodological shortcomings in a large proportion of studies (use of bag specimen, non-representative patient populations, inclusion of children without fever). The pre-test probability of pyelonephritis in young febrile children with a UTI is approximately 60% [20]. A positive DMSA scan increases the probability of UTI to 93% and a negative DMSA scan decreases the probability of pyelonephritis to 18% (assuming the DMSA has a sensitivity and specificity of 86% and 91%, respectively). To rule in pyelonephritis, a useful test would need to increase post-test probability of pyelonephritis to at least 85%, with 85% selected as a clinically reasonable threshold above which many clinicians would feel reasonably confident in their diagnosis of pyelonephritis and which is almost a high as a DMSA test. The post-test probability of pyelonephritis exceed 85% in only two [14, 16] of the six studies of pNGAL and one [9] of three studies of uNGAL. To rule out pyelonephritis, a useful test would need to reduce the post-test probability of pyelonephritis to less than 20%, with 20% representing the cutoff below which most clinicians would feel reasonably confident in their diagnosis of cystitis and which is close to the DMSA test. The post-test probability of pyelonephritis was less than 20% in only two [16, 17] of the six studies of pNGAL and one [13] of three studies of uNGAL. In summary, the bulk of the data, albeit sparse, suggest that the utility of both pNGAL and uNGAL for the diagnosis of pyelonephritis is limited.

The small number of studies with relevant data was a major limitation on this analysis. This was especially patent in the diagnosis of pyelonephritis, where cutoffs and estimates of test accuracy varied widely between studies without explanation. Another potentially large, but less obvious, limitation of studies examining the accuracy of NGAL as a diagnostic marker of pyelonephritis was the failure to differentiate between the monomeric and dimeric forms of the protein. Previous studies have demonstrated that while the renal epithelium produces NGAL predominantly in the monomeric form, activated neutrophils in the urine (and in the infected kidney) secrete NGAL primarily in its dimeric form [21]. Since assays that could differentiate the two forms were not readily available until recently, none of the studies included attempted to differentiate the two forms and thus did not consider the effects that differentiating the two forms of the protein might have had on diagnostic accuracy. This may explain why uNGAL was more accurate for diagnosis of UTI compared to diagnosing APN. This suggests that future studies of the diagnostic accuracy of NGAL should pay particular attention to the form of NGAL being measured. Inclusion of children without fever was another limitation. However, except for two studies, all children in the included studies were febrile. Thus, only a small number of children without fever (n = 31 across all studies) were included. This is unlikely to have significantly affected our results. Furthermore, only one of the 5 analyses conducted (plasma NGAL for APN) included studies with afebrile children. Differences in the definition of UTI across studies were another limitation. Of the studies looking at the accuracy of NGAL for UTI, two studies allowed for the presence of a second organism on culture as long as the child met the criteria for UTI with the first pathogen. Neither of these studies used perineal bags to collect samples. We frequently encounter cases in which children with likely UTI have growth of a second organism, usually with counts < 10,000 CFU/ml, and thus feel that inclusion of these studies is reasonable; similar criteria have been used in large recent studies [22,23,24].

In summary, urinary NGAL offers a promising alternative to the current leukocyte esterase test for diagnosing UTI. Although we used 96-well plates and ELISA kits to measure NGAL, individual testing using existing analyzers in many hospital laboratories is becoming increasingly available. Larger studies are needed to validate the superiority of this marker for the diagnosis of UTI in children. For the differentiation of cystitis from pyelonephritis, we found that neither plasma nor urine NGAL levels appeared to be sufficiently accurate; however, further study examining different molecular forms of NGAL (monomeric vs. dimeric) for the diagnosis of APN are warranted.