Introduction

Given that the prevalence of urinary tract infection (UTI) in children is low (i.e., many children are tested for each UTI detected), highly specific screening tests are needed so that empiric treatment can be restricted to those with a true UTI. Neither the leukocyte esterase nor leukocyte count obtained using urine conventional microscopy are sufficiently specific (specificities of 87% and 86%, respectively) [1]; approximately 50% of children initially treated with antibiotics for UTI based upon urinalysis are subsequently found to have a negative urine culture [2]. More specific markers would reduce the number of false positive results and reduce unnecessary antimicrobial use. In this study, we evaluated markers belonging to established inflammatory pathways that could be used in conjunction with the leukocyte esterase test to increase its specificity in the diagnosis of UTI.

Methods

From October 2010 to June 2015, we prospectively enrolled a convenience sample of children 1 month to 10 years of age, at the time they presented to one of three Emergency Departments (Children’s Hospital of Pittsburgh, Children’s National Medical Center, Hasbro Children’s Hospital) or to one outpatient pediatric clinic affiliated with the Children’s Hospital of Pittsburgh, with fever and/or urinary symptoms. We included only children who were treated for UTI by the caring clinician and had pyuria on urinalysis or dipstick (≥ 10 WBC/mm3, ≥ 5 WBC/hpf, or ≥ 1+ leukocyte esterase). Exclusion criteria are listed in Fig. 1 and included recent antibiotics or steroids, chronic disease or genitourinary anomaly, another infectious condition (pneumonia, sepsis, or meningitis), and bagged urine specimen. We defined a positive urine culture as growth of ≥ 100,000 CFU/mL of at least one uropathogen from a specimen collected by clean catch or ≥ 50,000 CFU/mL of at least one uropathogen from a catheterized specimen. There were no samples collected by suprapubic aspiration. No bagged urine specimens were used for urinalysis or culture. The institutional review board at each respective site approved this study.

Fig. 1
figure 1

Flow diagram

Urine sample collection and processing for proteins

We used urine collected at the time of presentation in most instances; when this was not available, we collected an additional sample using a perineal collection bag. We filtered urine samples (Millex filters, Millipore) and stored them in cryovials at − 80 °C. Processing generally occurred within 1 h of collection. However, if a delay was anticipated, samples were stored in a specimen refrigerator until processing.

Blood sample collection and processing for proteins

We collected blood samples from children whose parents consented to phlebotomy. C-reactive protein (CRP) was measured using Siemens Dimension Vista 500 Intelligent Lab System. Another aliquot was spun at 2700 rpm for 10 min and plasma was subdivided into multiple cryovials for protein measurement (cytokines, neutrophil gelatinase-associated lipocalin (NGAL), and procalcitonin). Specimens were processed generally within 1 h of collection, stored at − 80 °C and analyzed in batches.

Protein measurement

We assessed 53 candidate markers; 48 cyokines, NGAL, and DEFA1 (human alpha defensin 1) were measured in both blood and urine using Bio-Rad Pro Human Cytokine 27-plex and 21-plex plates (Bio-Rad, Hercules, CA), Thermo Fisher’s Rapid Human NGAL ELISA kit (KIT037; Waltham, MA), and Hycult DEFEA1 ELISA Kit (HK317; Hycult, Wayne, Pennsylvania), respectively. On each plate, we included duplicates and control samples. CRP and procalcitonin were measured in blood using Siemens Dimension Vista 500 Intelligent Lab System and bioMerieux’s miniVIDAS immunoanalyzer (bioMerieux, Marcy-l’Etoile, France), respectively. We measured urine creatinine concentration using R&D Creatinine Parameter Assay Kit (KGE005; R&D Systems, Minnepolis, Minnesota).

Statistical methods for protein data

We used logistic regression for binary variables and t test for continuous variables (e.g., protein levels). In order to adjust for multiple comparison, the resulting p values were converted to q-values using the Benjamini-Hochberg correction [3]. To access discriminative power, we constructed a receiver operating characteristic curve (ROC) using UTI vs no-UTI status as the outcome and calculated the area under the curve (AUC). Sensitivities and specificities were for the point on the ROC curve that maximized the Youden index (Youden index = sensitivity + specificity − 1). To evaluate the predictive power of combinations of biomarkers, we used classification and regression tree (CART) [4] classifier from R package rpart. To be specific, leave-one-out cross validation was adopted to split the data into training set and testing set, and the predictive score of each testing sample was calculated using the classifier constructed by the training set. Samples or variables with missing values were removed. As a sensitivity analysis, we also present data for the subgroup of children with fever.

Serum RNA processing and sequencing methods

We used Applied Biosystems mini Tempus™ tubes to stabilize the RNA. Total RNA libraries were generated using Illumina TruSeq Stranded Total RNA Sample Preparation Guide (Rev. E). First, we removed globin and ribosomal RNA using biotinylated, target-specific oligos combined with globin and rRNA removal beads. Following purification, RNA was fragmented into small pieces using divalent cations under elevated temperature. Cleaved RNA fragments were copied into first-strand cDNA using reverse transcriptase and random primers, followed by second-strand cDNA synthesis using DNA Polymerase I and RNase H. After ligation of the adapter, we added single ‘A’ base fragments to the cDNA fragments. We purified and enriched products with PCR to create the final cDNA library. We validated cDNA libraries using KAPA Biosystems primer premix kit with Illumina-compatible DNA primers and Qubit 2.0 fluorometer. We examined the quality of the RNA using Agilent Tapestation 2200. The cDNA libraries were pooled at a final concentration 1.8 pM. Cluster generation and paired-read 75-bp sequencing was performed on Illumina NextSeq 500’s. The technician was blinded to the results of the corresponding patient’s urine culture results. Because of limitations in funding, RNA sequencing was undertaken in only a subset of children.

RNA-Seq data preprocessing

The RNA-seq data were aligned to Homo Sapiens UCSC hg19 using TopHat2 (TopHat v2.0.9) [5]. The expression count data of 23,710 unique genes and 45 samples were harvested after using HTseq to convert BAM files. Low-expression genes with mean counts smaller than 5 were removed and 15,722 genes remained after this filtering step.

Statistical analysis of RNA data

We used edgeR [6] to detect differentially expressed (DE) genes. Because of the relatively small sample size, we used p values at 0.05 and fold change 1.5 as the threshold to determine DE genes. In order to investigate the functional annotation of the DE genes, we applied enrichment analysis using the database provided by Chaussabel [7] using Fisher’s exact test. In addition, we performed pathway enrichment analysis using ingenuity pathway analysis (IPA).

Results

Figure 1 describes the flow of children into the study. Compared to excluded children, included children were less likely to be White (p < 0.001) and more likely to be older (p < 0.001). Table 1 describes the demographic characteristics of the 111 children with UTI and the 89 children with no UTI. Mean age of the children we included was 4.5 years. Children with UTIs were younger, were more likely to have fever, were febrile for longer duration, and had a higher maximum temperature. Mean creatinine level in the two groups did not differ significantly (p = 0.39), nor did the number of children receiving ibuprofen (p = 0.99). A urine collection bag was used in four children for collection of urinary proteins. Of the 57 children < 24 months of age, 56 were febrile and 54 had samples obtained by catheterization.

Table 1 Demographic and clinical characteristics of the children includeda

Protein markers

The urinary markers that best discriminated (i.e., AUC ≥ 0.85 and q < 0.0001) between children with UTI and no UTI (Table 2 for top 30 markers, Supplementary Table 4s for complete list of all significant markers) were proteins involved in T cell response proliferation (IL-9, IL-2), cytokine/interferon pathway (IL-13, IL-2, INFγ), innate immunity (NGAL), and response to infection (TNFα, IL-17A), as well as chemoattractants for monocytes (CXCL12) and neutrophils (CXCL1, CXCL8). Division of protein marker level by urine creatinine had little effect on the p-values or the AUC of the urinary markers; accordingly, we present only raw values in the tables included in this report. The predictive power (as measured by the AUC, Fig. 2) of a combination of four urinary markers (IL-2, IL-9, IL-8, and NGAL) using the CART model with leave-one-out cross-validation was 0.94, with a sensitivity of 0.96 (95% CI: [0.92, 1.00]) and a specificity of 0.84 (95% CI: [0.74, 0.94]).

Table 2 Univariate association between top 30 biomarkers and urinary tract infection (UTI) in urine and serum by decreasing Q valuea
Fig. 2
figure 2

Accuracy of a combination of four urinary markers in diagnosing urinary tract infection (area under the ROC curve = .94)

Serum markers that best differentiated children with UTI from those with no UTI included interferon α2 (involved in cytokine signaling), CXCL12 (chemoattractant for monocytes), and IL-3 (Table 2). A combination of five serum markers (interferon α2, CXCL12, HGF, CCL5, and NGAL) identified by CART achieved the highest accuracy; its predictive power (as measured by the AUC) was 0.85, with a sensitivity of 0.82 (95% CI: [0.74, 0.89]) and a specificity of 0.88 (95% CI: [0.81, 0.95]). Unlike a previous study [8], we did not find differences in the levels of urinary human neutrophil peptide 1 (or its gene: human α-defensin 1) in the two groups.

Supplementary Table 5s presents the protein biomarkers in the subgroup of children that were febrile. Supplementary Table 6s presents the results after excluding 12 children with possible contamination (4 children with UTI with significant growth (≥ 10,000 CFU/mL) of > 1 uroopathogens and 8 children without UTI who had significant growth (≥ 10,000 CFU/mL) of one uropathogen at counts lower than those required for a UTI). Results in these two sensitivity analyses were generally similar to the overall results.

RNA expression data

RNA sequencing was only performed on febrile children who had not received antibiotics or corticosteroids before sample collection and for whom we had collected a serum sample that had adequate concentrations of RNA. Furthermore, because of cost, RNA sequencing was only attempted on a subset; we included all children with pyelonephritis confirmed by DMSA, a matched (by age and duration of fever) group of febrile children with cystitis, and all febrile children who had no growth on culture. A total of 36 children with UTI and 9 children with no UTI were included in this analysis (Fig. 1). All had fever and three were males. By controlling p values at 0.05 and absolute fold change at 1.5, we obtained 345 differentially expressed (DE) transcripts. The expression levels of the DE genes can be visualized in Fig. 1s in the appendix; the list of the top 25 DE genes is shown in Supplementary Table 1s. Modular analysis (Supplementary Table 2s) using Chaussabel’s database showed that neutrophil response differed significantly in children with UTI and no UTI [7]. Pathway analysis (Supplementary Table 3s) revealed several candidate pathways with biological plausibility that differ significantly in children with and without UTI.

Discussion

In this study, we have identified urinary and serum markers that appear promising in differentiating children with pyuria who do and do not have UTI. Particularly promising, single markers include urinary IL-9, IL-2, IL-8, and NGAL. At present, NGAL can be measured commercially and ELISA tests are available for the three interleukins. These findings suggest that differences in easily measurable inflammatory proteins in the urine might be useful in differentiating children with and without UTI. Because collection of blood samples is not routinely performed in children with suspected UTI, and because accuracy of serum markers was lower than for urinary markers, it seems prudent to focus on urinary markers.

To understand the implications of our findings, imagine a hypothetical cohort of 1000 children being tested for UTI using the leukocyte esterase test, of whom 7% [9] (or 70) are assumed to have a UTI. Of the 70 children with UTI, the leukocyte esterase test will identify 55 (because the sensitivity of the test is 79% [1]). Of the 930 without UTI, 13% (i.e., 121) will be incorrectly identified as having a UTI (because the specificity of the test is 87% [1]). Assuming that children with a positive leukocyte esterase are empirically treated with antimicrobials, applying the panel of 4 urinary markers identified here to the 176 children (55 + 121) with a positive leukocyte esterase test would reduce the number of children receiving unnecessary antibiotics from 121 to 19 (because the specificity of the panel is 84%) without significantly reducing the number of children with UTI who are appropriately treated with antibiotics (because the sensitivity of the panel is 96%).

A role for the four previously mentioned urinary markers is supported by their pathophysiological role and by previous studies of these markers as diagnostic tools. IL-9 is involved in growth of helper T cells, and these cells have been shown to play a role in the immune response to UTI [10]. Furthermore, in one study, adults with UTI had significantly higher levels of IL-9 compared to age- and gender-matched healthy controls [11]. IL-2 is involved with T cell growth and differentiation following activation by an antigen [12, 13], and its receptor is present on neutrophils [14]. IL-8 is a key neutrophil chemoattractant in the urinary tract, directing the migration of neutrophils across the mucosa and into the urine [15], which is elevated in adults and children with UTI [16, 17]. We found significant differences in NGAL levels (in urine and serum) and in the gene coding for NGAL (LCN2) between the two groups. Although the function of NGAL is not completely understood, it plays a role in sequestering iron required for bacterial growth. It is released from neutrophils, and to a smaller degree from intercalated cells in the renal collecting duct in response to infection or cell injury [18]. Several studies have found that NGAL differs in children with and without UTI, both in serum and/or blood [19,20,21,22,23].

Of the genes identified though RNA sequencing that are supported by previous studies (i.e., LTF, AZUI, CEACAM8, DEFA4, in Supplementary Table 1s and CAMP in Supplementary Table 2s), most seem to have an antibacterial role. Lactotransferrin (also known as lactoferrin, encoded by LTF) interferes with iron uptake and thereby causes bacterial cell lysis. Azurocidin (encoded by AZUI), present in azurophil granules of neutrophils, has activity against gram-negative bacteria [24]. DEFA4 (also known as HNP-4) and CAMP encode for antimicrobial peptides (defensin alpha 4 and cathelicidin, respectively). Carcinoembryonic antigen-related cell adhesion molecules (CECAM) are glycoproteins expressed on mucosal surfaces which are thought to be involved in pathogen colonization [25, 26]. All five genes (or their products) have been implicated in UTI or colonization in previous studies [25,26,27,28,29,30,31,32,33,34]. Our data was slightly puzzling because, even though all of the above genes are involved in the defense against UTI, we found that all were underexpressed in children with UTI. One explanation might be that children who fail to effectively upregulate expression of these genes are more susceptible to UTI. Indeed, some support for this hypothesis comes from a recent study that found children with increased risk of UTI recurrences had fewer copies of α-defensin-encoding DEFA1A3 [31]

Analysis of the gene pathways (Supplementary Table 3s) identified several pathways that appear to have clinical plausibility. Specifically, our data suggests that leukocyte extravasation, natural killer cell signaling, IL-6 and IL-2 signaling, and T cell pathways are important in the pathophysiology of UTI. These findings are generally consistent with previous studies [12, 17, 35,36,37,38,39,40,41,42,43]. Of note, both pathway analysis of gene expression in the serum and urinary protein measurement suggest that IL-2 signaling could also be important in the immune response to UTI. To our knowledge, other than one tangentially related study [44], this is a novel finding.

Our study was limited because the recruited sample did not provide us with adequate power to examine potentially important subgroup differences. Asymptomatic bacteriuria, although possible, is unlikely because all patients were symptomatic to the extent that clinicians tested and treated for UTI and all had pyuria. As with most studies on pediatric UTI, a small proportion children categorized as having a UTI may have actually had contamination of the sample during collection. We attempted to minimize this by avoiding bag-collected urine samples for diagnosis of UTI, by requiring pyuria, and by using established cutoff-offs to define a UTI. Furthermore, as a sensitivity analysis, we present the results in which we exclude children with possible contamination. A strength of the study is performance of both RNA sequencing and protein measurement, which appear to have provided complimentary data.

In conclusion, this study provides preliminary evidence to support a role for measurement of urinary IL-9, IL-2, IL-8, NGAL, and preliminary support for the use of an RNA signature in differentiating, among children with pyuria, those with UTI from those without UTI. These markers, if confirmed by future studies, could be useful in reducing unnecessary antimicrobial use for children with presumed UTI.