Introduction

Chest x-ray (CXR) is among the most commonly utilized tools in the assessment pediatric respiratory diseases and is a standard imaging technique to evaluate children in acute respiratory distress and failure [1]. Respiratory illness accounts for between 20 and 30% of pediatric intensive care unit (PICU) and ward admissions [2]. CXR is estimated to be obtained in 50% of PICU admissions [3]. Despite ubiquitous usage, CXRs accuracy and reliability has never been rigorously evaluated (i.e., compared to chest computerized tomography (CT)) [4]. While chest CT is considered the gold standard pulmonary imaging modality, CXR is the standard against which newer imaging techniques, such as lung ultrasound (LUS), are evaluated in pediatrics to limit exposure to the high ionizing radiation of CT [5,6,7].

Few studies have evaluated the diagnostic accuracy of CXR compared with CT and have demonstrated low to modest sensitivity (range 40–79%) and specificity (range 39–92%) in identifying pulmonary pathology [8,9,10,11]. Clinicians still highly value CXRs despite their low accuracy and continue to regularly utilize them in clinical practice [12]. Recently published guidelines support the use of LUS in pediatric respiratory diseases, yet cited studies include few children with acute respiratory failure (ARF), such as children requiring ancillary respiratory therapies, high supplemental oxygen support (hypoxemic respiratory failure), invasive or non-invasive mechanical support (hypercapnic respiratory failure), or an escalation from baseline respiratory support (acute on chronic respiratory failure) [13]. Many studies of specific disease processes have found LUS has high sensitivity (range 89–97%) and specificity (range 56–99%) identifying findings consistent with the underlying disease process; however, it has also been consistently described that children with a worse clinical course (defined by the requirement of more hospital resources) demonstrate more ultrasound abnormalities [14,15,16,17]. Therefore, our study objective was to compare lung imaging techniques (CXR and LUS) in the evaluation of children with undifferentiated ARF to quantify imaging agreement and determine which technique identifies a higher frequency of pulmonary abnormalities. We hypothesized that LUS would identify substantially more abnormalities than CXR in children with ARF. If borne out, clinicians may be able to reduce the financial cost and radiation exposure associated with CXR utilization while affording a better understanding of pulmonary pathophysiology and allowing for more specific treatments.

Methods

The Institutional Review Board at the University of Wisconsin-Madison (USA) approved this pre-designed secondary analysis of a prospective observational study evaluating the sensitivity and specificity of LUS in children admitted to the pediatric intensive care unit (PICU) with ARF (IRB 2018-071) [18]. ARF for the study was defined by the clinical needed for PICU admission and at least one of the following: (1) non-invasive (high flow nasal cannula (HFNC) ≥ 1 L/kg/min; continuous positive airway pressure; or bilevel positive airway pressure delivered by nasal cannula, nasal mask, or full face mask) or invasive respiratory support; (2) supplemental oxygen with FiO2 > 35% while on HFNC < 1 L/kg/min or nonrebreather/oxymask to maintain saturation ≥ 90%; (3) continuous nebulized therapy; or (4) chronic oxygen or ventilatory support and any increase in baseline support [18, 19]. Children > 37.0 weeks corrected gestational age and ≤ 18 years of age from whom signed, informed parent/guardian consent, and assent when appropriate, were enrolled from December 2018 to February 2020.

A 3-region per hemithorax LUS was completed using a modified BLUE (Bedside Lung Ultrasound in Emergency) protocol within 14 h of PICU admission by one of three pediatric intensivists with point-of-care ultrasound training and clinical LUS experience [20, 21]. The three lung regions were defined by: (1) the anterior chest wall between the anterior axillary line and sternum; (2) the lateral chest wall between the anterior and posterior axillary lines; and (3) the posterolateral chest wall between the posterior axillary line and the spine (Fig. 1A). For each region, the most abnormal ultrasound pattern was identified and a representative 4-s video recorded and uploaded to Q-path imaging system (Telexy Healthcare; British Columbia, Canada). In Q-path, an assessment of ultrasound artifacts was made for each individual region and an overall diagnosis determined (e-Table 1A). For comparison with CXR, A lines were considered a normal ultrasound pattern; multiple B lines and confluent B lines were considered an interstitial disease pattern; and sub-pleural consolidation and lobar consolidation were considered consolidation patterns [21, 22]. All patients/images were examined by a senior intensivist with extensive experience in LUS acquisition and interpretation and blinded to patient historical and clinical information. A second physician (ultrasound fellowship-trained emergency medicine physician) independently reviewed all images to determine an ultrasound diagnosis, but did not document individual ultrasound artifact findings due to loss of documented lung region location during a Q-path system upgrade. Differences in diagnoses between the two were resolved by consensus.

Fig. 1
figure 1

Lung ultrasound (A) and chest radiograph (B) regions. LA left anterior, LL left lateral, LLL left lower lobe, LP left posterior, LU left upper lobe, RA right anterior, RLL right lower (right middle lobe and right lower lobe), RP right posterior, RU right upper lobe

Clinical CXRs completed within 6 h of LUS were evaluated concurrently by two pediatric radiologists who were blinded to patient historical, clinical, and LUS information. CXRs were evaluated for the presence of interstitial lung disease, atelectasis, consolidations, pleural effusions, and pneumothoraces across four lung regions and a radiographic diagnosis was determined [22]. Differences in CXR interpretation were resolved in real-time by consensus (Fig. 1B; e-Table 1B). The six LUS regions were combined into four to correspond with the four CXR regions, approximating the five lung lobes (e-Table 2) [22]. A LUS sub-pleural consolidation (SPC) artifact was considered a consolidation for the primary study analysis [22]. A separate analysis was conducted that did not include SPC as a consolidation, as small consolidations may not be identified on CXR. A CXR finding of atelectasis was considered a consolidation for comparisons. Given the relatively long duration between imaging studies, sub-analyses of studies completed within 4 h and within 2 h of each other were additionally performed.

All LUS exams were performed using a Philips Sparq ultrasound machine (Koninklijke Philips N.V; Amsterdam, Netherlands) with a L12-4 linear probe (frequency 12–4 MHz) or S4-2 phased array probe (frequency 4–2 MHz) with children in a recumbent or semi-recumbent position and (when clinically feasible) rolled onto their side or placed into a sitting position to optimize posterior lung field scanning according to the previously published protocol [18]. In the PICU, a portable single view anterior–posterior (AP) CXR was obtained using a Fujifilm FDR Go Mobile radiograph machine (Fujifilm Medical Systems; Connecticut, USA). In the Emergency Department either a portable single view AP CXR was obtained using a Philips MobileDiagnost wDR 2.2 (Philips Healthcare; Massachusetts, USA) or a two-view AP and lateral CXR was obtained using Samsung GC85 radiograph machine (Samsung/Neurologica Corporation; Massachusetts, USA). All CXR imaging was obtained according to standard hospital thoracic imaging protocols, as requested for clinical care by the clinicians caring for the patient. The reference standard for the final patient diagnosis was an independent, standardized review of the medical record following hospital discharge by a physician not involved in the care of the patient (final diagnosis) and incorporated the history, clinical findings, laboratory tests, radiographic (including CXR) data, patient management, and course to determine the etiology of the ARF [18].

Descriptive statistics were used to define the frequency of imaging abnormalities. Kappa statistics (k) adjusted for maximum attainable agreement (kmax) were used to quantify agreement between techniques [23,24,25]. The 95% confidence intervals (95% CI) for k and kmax were computed by bootstrap replications (B = 5000) [26]. The k/kmax was used when the sparsity of diagnoses made the default maximum of one unattainable, as the upper and lower limits of k are a function of the frequency [24]. Kappa statistical agreement (for both k and k/kmax) was considered: k ≤ 0.00, less than chance agreement; k > 0.00–0.20, slight agreement; k > 0.20–0.40, fair agreement; k > 0.40–0.60, moderate agreement; k > 0.60–0.80, substantial agreement; and k ≥ 0.80, near perfect agreement [25]. Statistical analysis was performed using R version 3.6.3 (R Project for Statistical Computing; Vienna, Austria). The study was registered with ClinicalTrials.gov and data are reported using Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines.

Results

Eighty-eight children had LUS completed for the primary study [18]; 32 of these 88 (36%) had LUS and CXR completed within 6 h of each other and are included in this study (e-Fig. 1). The study cohort includes 2 (6%) patients with status asthmaticus, 17 (53%) with bronchiolitis/viral pneumonitis, 12 (37%) with pneumonia, and 1 (3%) patient with multiple concurrent final diagnoses. CXR was completed a median of 1.25 h (IQR − 2.0 to 5.81) following PICU admission while LUS was completed a median of 4.67 h (IQR 2.70–7.63) following PICU admission and a median of 3.64 h (IQR 1.85–5.88) from CXR. Demographics of the study cohort are presented in Table 1.

Table 1 Demographics of study population, n = 32

There was fair agreement between LUS and CXR derived diagnoses with 58% agreement (k/kmax = 0.36, 95% CI − 0.01 to 0.80) (Table 2A). CXR identified more abnormalities (104 abnormal lung regions across 123 examined regions) compared with LUS (71 abnormal regions) (Table 3A; e-Table 3). Evaluation of specific imaging patterns included: normal, 57% agreement (k = 0.032, 95% CI − 0.11 to 0.17); interstitial pattern, 47% agreement (k = 0.003, 95% CI − 0.20 to 0.20); consolidation, 65% agreement (k = 0.294, 95% CI 0.13–0.46); pleural effusion, 94% agreement (k = NA; one imaging modality did not find an effusion); and pneumothorax, 99% agreement (k = NA; one imaging modality did not find a pneumothorax) (Table 3B).

Table 2 Comparison between ultrasound and chest x-ray derived diagnoses (A) and between the final diagnosis, lung ultrasound, and chest x-ray (B)* Bolded numbers represent agreement between the two imaging modalities
Table 3 Comparison of the number and frequency of lung imaging findings (A) and agreement between imaging modalities (B)

When comparing LUS and CXR with the final patient diagnosis, LUS demonstrated moderate agreement (k/kmax = 0.43, 95% CI 0.12–0.84) while CXR demonstrated slight agreement (k/kmax = 0.18, 95% CI − 0.21 to 0.65) (Table 2B). Additional analyses evaluating agreement between LUS and CXR when SPCs were not considered LUS determined consolidations and when comparing imaging obtained within 4 h and 2 h of each other, did not significantly increase agreement (Table 4).

Table 4 Comparison of agreement between lung ultrasound and chest x-ray findings when sub-pleural consolidations are not considered a lung ultrasound consolidation, n = 32 (A); between lung ultrasound and chest x-ray findings when images obtained within 4 h of each other, n = 16 (B) and 2 h of each other, n = 9 (C)

There was substantial agreement between the intensivist and emergency medicine physician-derived ultrasound diagnoses with 67% observed agreement (k/kmax = 0.61, 95% CI 0.49–0.78) [27]. Agreement in CXR interpretation between radiologists was not determined as interpretations were performed concurrently with differences resolved in real-time.

Discussion

Acute respiratory distress and failure are among the most common indications for pediatric hospitalization [2]. As such, there is great interest in improving diagnostic accuracy and treatment specificity given the high disease burden and associated costs [2]. Given frequent diagnostic uncertainty, many children presenting with respiratory concerns, especially when on the severe end of the disease spectrum, are initially treated with a multi-therapy approach until the underlying etiology becomes better defined, after which therapeutic de-escalation may occur [28,29,30]. LUS has proven useful in adult medicine where it demonstrates improved sensitivity and specificity in determining the etiology of ARF when compared with standard clinical evaluation (history, physical, and CXR imaging) [21, 22, 31, 32]. The few studies evaluating the ability of LUS to determine the underlying etiology in undifferentiated pediatric respiratory distress and failure have been less convincing [16, 18, 20, 33, 34]; though when evaluating a single suspected disease entity, LUS appears to have favorable operating characteristics [5, 6, 14, 17, 35]. Most pediatric studies use CXR as the standard against which LUS is compared [5,6,7] and most use children with a known or suspected disease process (and thus high pre-test probability). In a blinded assessment of temporally related imaging studies, we found only fair agreement between imaging derived diagnoses and fair to slight agreement in identifying specific imaging abnormality patterns. CXR detected more interstitial disease and pleural effusions while LUS detected more consolidations and pneumothoraces, though neither of these were statistically more than CXR.

Chest CT is considered the preeminent pulmonary imaging modality but in contrast to adult medicine, in which CT is considered very early in the disease course, it is hardly ever used for diagnosis of acute pediatric respiratory diseases because of the high ionizing radiation, difficulty in patient cooperation, and high cost [5, 6]. Instead, clinical examination and CXR remain standard of care despite relatively low sensitivity and specificity [5, 8, 9, 36]. In a study of 56 children being evaluated for complicated pneumonia that were unresponsive to therapy, CXR missed 110 CT findings, including parenchymal, pleural, and consolidative findings [36]. In another study of 949 children with suspected pneumonia who underwent CT, CXR demonstrated only 79% sensitivity and 60% specificity [8] and in a study of 36 children with suspected pneumonia who underwent CT, CXR demonstrated 62% sensitivity (specificity not reported) [9]. These two studies additionally evaluated LUS with CT and found that LUS demonstrated slightly improved sensitivity and similar or worse specificity when compared with CXR [8, 9]. Still, meta-analyses comparing LUS to CXR demonstrate pooled estimates of the area under the receiver operating curve of 98% [5, 6]. However, the utility of evaluating LUS compared with an imaging modality with known modest operating characteristics is unclear. A large (413 patients with 1002 CXR/LUS comparisons) recently published study similarly comparing imaging patterns in children with concomitantly obtained CXR and LUS found k agreement ranging from 0.51 (moderate agreement) in the detection of left sided pleural effusion to 0.98 (near perfect agreement) in the detection of right sided pneumothorax [37]; though agreement did differ significantly based on imaged region/location and only 19% of patients had underlying respiratory pathology. Another study of 81 children comparing LUS with CXR found k agreement of 0.64 (substantial agreement) for the diagnosis of pneumonia [38], though agreement for other disease processes are not specifically provided. Newer LUS studies are evaluating temporal trends and changes over time including using the lung ultrasound score to evaluate children with bronchiolitis [14, 39, 40] and pediatric acute respiratory distress syndrome [41], and to evaluate changes in lung aeration over time [42].

Pediatric researchers are also beginning to utilize alternative ultrasound modes (i.e., non-B mode) to more thoroughly define LUS specific patterns for individual disease processes [43]. M-mode has classically been utilized to detect pleural motion (or lack thereof) over time; now doppler ultrasound is being used to describe consolidation characteristics [43]. Color and pulsed wave doppler have been used to identify the presence and type of vessels within consolidations [44], helping to differentiate fluid or mucus filled bronchograms, lung necrosis, and abscesses [43]. Incorporation of lesion characteristics including size, number, and distribution of consolidations, position and motion of bronchograms, as well as pleural effusion characteristics and the distribution, movement, and characteristics of vertical artifacts can all significantly complement LUS evaluation [45, 46]. Contrast-enhanced ultrasound and microvascular flow imaging are additional techniques being studied which may provide benefit in LUS [47, 48]. Inclusion of real-time ultrasound during therapeutic trials and inclusion of clinical information may further inform LUS examination [33, 49].

LUS also has proposed benefits outside of its diagnostic abilities, including its ubiquity in patient care units, it does not rely on another provider for performance (it is interpreted by a provider with clinical knowledge of the patient), it promotes provider time spent at the bedside, it is readily repeatable, that allows immediate integration into treatment plans, and that it has a rapid time to acquisition (typically < 10–15 min but shorter with a directed clinical question) [13, 18, 33, 50]; though in certain instances or circumstances these may not be considered beneficial. LUS has additionally been used to guide therapeutic intervention and patient reassessment including in guiding fluid administration/diuresis, lung recruitment, and chest tube insertion [13, 51]; in emergent or time-sensitive situations such as determination of the presence of a pneumothorax [13, 51, 52]; it has been combined with point-of-care cardiac exams to evaluate for reversible causes of cardiopulmonary arrest [51]; and it may be helpful in rapid assessment of reflux/aspiration risk [53]. Our data and similar recently published studies [37,38,39, 41, 42] suggest that it is likely that LUS plays a complimentary role to CXR in children with respiratory distress and ARF given differences in strengths and weaknesses [54], though our study was not specifically designed to test this. CXR provides an overall picture of the patient at a moment in time, is well established in clinical care, and is completed by another provider allowing for quick examination of multiple patients, while LUS provides more regionalized information, is performed by providers caring for the patient, and can readily be repeated following changes in therapeutic intervention or patient pathophysiology.

A major limitation in the use of LUS is in its inability to detect peri-hilar lesions which are especially common in pediatrics, as it requires findings which extend to the pleural surface [49, 50]. Additional limitations specific to our study include: failure to evaluate LUS and CXR operating characteristics with CT (the gold standard pulmonary imaging modality); evaluation of only the most abnormal LUS pattern in each lung region which may have decreased agreement; evaluation of only 3 LUS regions per hemithorax and inclusion of single view AP CXR which likely decreased the number of identified pathology and thus imaging sensitivity, though chosen for the study design given its clinically pragmatic approach; utilization of approximate lung anatomy (lobes) and corresponding imaging location which may not have correlated with actual anatomy; imaging interpretation being blinded to the patient’s history (and interpretation by different providers) which likewise may have decreased agreement (and is different from clinical practice, though chosen for our study design to reduce confirmation bias and maximize reliability); inability to use a LUS expert consensus of ultrasound artifact findings (due to loss of documented lung region location during a Q-path system upgrade) which may have increased or decreased lesion and location specific agreement; failure to compare agreement in lesion size which may influence a patient’s clinical presentation and course; failure to include patients with pulmonary edema, pneumothorax, or pleural effusion which may have increased agreement; utilization of standard clinical imaging (CXR) in the final diagnosis; and the 6-h duration between imaging which may have been sufficient for changes in pulmonary pathophysiology (either due to progression or improvement with treatment) so as to change imaging patterns (though sub-analyses completed at shorter intervals did not significantly increase agreement). Though chest CT is considered the “gold standard” lung imaging modality, given its limitations, it is not the “standard of [clinical] care” used in pediatrics; future studies should consider evaluating imaging modalities in the context of clinical care (rather than using a blinded interpretation as in this study), as well as evaluating how image acquisition and interpretation influences clinical management.

Conclusion

There is fair agreement between CXR and LUS-derived imaging diagnoses in children with ARF when imaging is completed within 6 h of each other. Given this, clinicians should consider the benefits and limitations of specific imaging modalities when evaluating children with ARF. Additional studies are necessary to further define the role of LUS in children with ARF given the small sample size of our study.