Introduction

Tissue sampling is often necessary for histopathologic diagnosis, treatment planning, and subsequent management of aggressive bone tumors. Image-guided core needle biopsy (ICNB) has become an increasingly common method of tissue sampling for adult patients at many institutions, as it has been shown to be safe and effective, with a lower complication rate and cost compared to open surgical biopsy [1,2,3,4]. Diagnostic yield of ICNB is an important metric as successful diagnosis leads to timely implementation of management plans and obviates the need for repeat biopsy with accompanying cost and potential complications. Diagnostic accuracy is also important, as inaccurate biopsies can lead to inappropriate patient management and treatment.

There is heterogeneity in the literature regarding the diagnostic efficacy of ICNBs of suspected malignant osseous lesions, with values ranging from 69 to 98% for diagnostic yield, and values ranging from 73 to 98% for diagnostic accuracy [5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24]. Moreover, there is divergence on the technical, lesion-related, and patient factors that influence these two measures. [11, 18,19,20]. Additionally, while it has been reported that obtaining three total core specimens optimizes the diagnostic yield of ICNBs of osseous lesions, there have been (to our knowledge) no reports focusing on the total core length that should be obtained to optimize yield [5]. While intuitively, increased lesion sampling should increase diagnostic efficacy, it also might increase procedure time, potential for complications, sedation, and exposure to ionizing radiation. A threshold value for total core length that optimizes diagnostic yield therefore serves as an important recommendation to guide radiologists during biopsies. Additional study is also needed on top of the 88 bone biopsies so far reported to accurately determine the total number of core specimens that optimize diagnostic yield [5].

The purpose of this study is to evaluate the diagnostic yield and accuracy of ICNB of suspected malignant osseous lesions in (to our knowledge) the largest cohort of adult patients thus far published, evaluate what factors influence these measures, and offer technical recommendations to optimize yield.

Materials and methods

Patient selection criteria

Institution review board approval was obtained for this Health Insurance Portability and Accountability Act–compliant retrospective study. Informed consent was waived for retrospective participation and data analysis. All consecutive patients who received an ICNB at a single tertiary care academic institution from July 2010 to October 2021 were identified using the radiology information system. Patients who received the biopsy for the sampling of suspected osteomyelitis were excluded, as additional clinical factors (such as antimicrobial exposure) could influence the diagnostic efficacy of ICNB, and evaluating these was outside the scope of the present study. Additionally, pediatric patients were excluded, as there might be limited generalizability between pediatric and adult patients due to potential differences in technical biopsy sampling as well as pathology. Lastly, lesions with classically benign appearances on prior imaging (e.g., osteoid osteoma, giant cell tumor, osteochondroma) which were biopsied immediately prior to lesion ablation were excluded, as they were sampled in very small numbers compared to the overall number of biopsies included in the study, and the overall findings of the study might not be generalizable to these lesions. This left only adult patients who were referred for biopsy for sampling of a suspected malignant lesion (based on both clinical history and prior radiologic findings) to be included. Furthermore, cases included in the study where there was subsequent surgical excision were also evaluated for diagnostic accuracy. Of this subset of patients, cases in which patients received radiation therapy or chemotherapy treatment following biopsy but prior to surgical excision of a lesion were excluded from accuracy analysis.

Patient, lesion-related, and clinical factors

For each biopsy, the following data from the procedure report and electronic medical record were documented: patient age, sex, type of lesion (lytic, sclerotic, mixed, or radiographically occult (visualized on MRI or PET/CT)), presence of a known primary malignancy, lesion anatomic location (spine, pelvis, extremity, clavicle/ribs/sternum/scapula, and skull), biopsy needle gauge, number of bone cores obtained, total core length obtained (sum length of all cores obtained, measured by the performing radiologist immediately after sample acquisition), imaging modality used (CT, fluoroscopy, or US), radiation dose, total monitored sedation time, doses of fentanyl and versed received, procedure-related complications and if the biopsy specimen was sufficient for diagnosis (histologically interpretable and adequate for diagnosis by the pathologist). Additionally, if the lesion was later surgically excised, it was recorded whether the surgical pathology from the excision was concordant with the pathology from ICNB and if any complications occurred during excision. Lesion size and extraosseous extension were not evaluated as they were absent from the majority of procedure reports and imaging was not available for analysis of earlier cases. Diagnostic yield was defined as the total number of biopsies that yielded a sufficient histopathologic diagnosis divided by the total number of biopsies. Diagnostic accuracy was defined as the number of biopsies in which the histopathologic diagnosis obtained from the biopsy was concordant with the histopathologic diagnosis of the final surgically excised specimen (which was considered the reference standard as maximal tissue is available for analysis), divided by the total number of lesions excised.

Biopsy technique

All biopsies were supervised and/or performed by a musculoskeletal radiology attending physician. The majority of the biopsies were performed by one of three attending physicians (including J.W.J., T.J.H., and J.C.B., each with 14, 12, and 12 years of post-fellowship experience respectively at the end of the study period). Greater than 90% of the biopsies were performed using a coaxial technique with the Oncontrol Needle system (Fig. 1). Once obtained, biopsy specimens were subsequently dislodged, placed in formalin with or without a bone tray, and sent for laboratory analysis (onsite histopathologic analysis was not performed). Pathologists were not blinded to clinical, procedural, or imaging data, including prior biopsy results of a lesion, during analysis of surgically excised lesions.

Fig. 1
figure 1

Images of the Oncontrol needle system used throughout the body demonstrating non-diagnostic and inaccurate biopsies. a Unenhanced axial CT image shows the outer cannula of the coaxial needle system (black arrow) and hollow biopsy needle (white arrow) traversing a lytic lesion in the T5 vertebral body. The biopsy specimen obtained was insufficient for histopathologic diagnosis. b Unenhanced axial CT image shows the outer cannula (black arrow) of the coaxial needle system as well as hollow biopsy needle (white arrow) sampling lytic lesion in the proximal humerus. This biopsy was inaccurate, as the histopathologic diagnosis from ICNB was necrosis with acute inflammation, but the histopathologic diagnosis after the lesion underwent surgical excision was necrotic metastatic adenocarcinoma. c Lateral fluoroscopy image shows the outer cannula of the coaxial needle system (black arrow) and hollow biopsy needle (white arrow) traversing the lesion in the L5 vertebral body. This biopsy was inaccurate, as the histopathologic diagnosis from ICNB was marrow fibrosis, but the histopathologic diagnosis after the lesion underwent surgical excision was Extranodal Rosai-Dorfman Disease. d Unenhanced axial CT image shows outer cannula (black arrow) of coaxial needle system as well as hollow biopsy needle (white arrow) sampling lesion in L2 spinous process. This biopsy was inaccurate, as the histopathologic diagnosis from ICNB was densely sclerotic bone with medullary fibrosis, but the histopathologic diagnosis after the lesion underwent surgical excision was metastatic pheochromocytoma

Statistical analysis

Multivariate logistic regression analysis was performed (by expert statistician C.L.) to analyze which factors (listed above) predicted diagnostic yield and accuracy using R statistical software (version 4.1.3, 2022, opensource). Factors were first tested in univariate analysis, and after backward stepwise selection, statistically significant factors were then subsequently tested in multivariate analysis to control confounding [25]. The odds ratio for categorical values was calculated in the usual fashion. For continuous values, the odds ratio represents the odds that a value produces an outcome compared to one unit less than that value. p values were obtained by the chi-square test for categorical factors, or t-test for continuous factors. A p value of < 0.05 was considered statistically significant. After multivariate analysis, different cutoff values (all values in the range of obtained data) of total core length and core number were tested and marginal odds ratios in relation to yield (odds that a biopsy above the cutoff is diagnostic compared to below the cutoff) were calculated. The statistically significant cutoff in which the marginal odds ratio plateaued was determined to optimize yield [26].

Results

Patient demographics and selection

Of the 4193 ICNBs performed on 3898 patients initially included in the study, 1872 ICNBs performed on 1719 patients were excluded per the exclusion criteria. This left 2321 ICNBs performed on 2179 patients as the study sample. Of those cases, 123 lesions that underwent ICNB in 122 patients were subsequently surgically excised and these data were used for analysis of diagnostic accuracy. See Fig. 2 for specifics regarding how many biopsies and patients were excluded for each exclusion criterion. The mean patient age was 62 years with a standard deviation of 14 years. There were 1190 male patients (54.6%) and 989 female patients (45.4%). Overall, 1491 (68.4%) of the patients had a known primary malignancy.

Fig. 2
figure 2

Study selection criteria. a Flow diagram demonstrating patient selection for diagnostic yield analysis. b Flow diagram demonstrating patient selection for additional diagnostic accuracy analysis

Diagnostic yield

Details pertaining to lesion-related and technical factors are summarized in Table 1. Overall diagnostic yield was 98.2%, with 2279 of 2321 biopsies yielding a sufficient histopathologic diagnosis. In multivariate statistical analysis, increased total core length (odds ratio [OR] = 2.34, 95% CI (1.41–3.90), p = 0.001), core number (OR = 1.51, 95% CI (1.06–2.16), = 0.02), and presence of primary malignancy (OR = 2.81, 95% CI (1.40–5.62), p = 0.004) were found to be associated with improved yield. Lesion location in an extremity (OR = 0.27, 95% CI (0.11–0.68), = 0.006) and using fluoroscopic imaging guidance (OR = 0.33, 95% CI (0.12–0.90), = 0.03) were found to be associated with lower yield. See Table 2 for univariate statistical analysis and Table 3 for multivariate statistical analysis of the factors selected after backward stepwise selection. The cutoff at which increased total core length reached a threshold value in relation to increased diagnostic yield was found to be 20 mm (marginal OR = 4.16, 95% CI = (2.09–9.03), < 0.001) with corresponding box plot and contingency table shown in Fig. 3. The cutoff at which increased core number reached a threshold value in relation to increased yield was found to be 3 total cores obtained (marginal OR = 2.78, 95% CI (1.34–6.54, = 0.005) with corresponding box plot and contingency table shown in Fig. 4. Marginal odds ratios of different tested cutoff values for total core length and number of cores can be seen in Table 4, and the values in this table for total core length are represented graphically in Fig. 5. The most common histopathological diagnoses of malignant lesions can be seen in supplemental Table 1 (electronic supplemental material).

Table 1 Lesion-related and technical factors
Table 2 Univariate statistical analysis for all analyzed factors
Table 3 Multivariate statistical analysis for factors selected by backward stepwise selection
Fig. 3
figure 3

a Boxplot displaying total core lengths of diagnostic and non-diagnostic biopsies, with a 20 mm total core length cutoff shown as a horizontal line on the box plot. b 2 x 2 contingency table compares diagnostic and non-diagnostic biopsies with biopsies above and below the 20mm total core length cutoff. Marginal odds ratio shown below table

Fig. 4
figure 4

a Boxplot displaying the total number of cores obtained from diagnostic and non-diagnostic biopsies, with three core cutoffs shown as a horizontal line on the box plot. b 2 x 2 contingency table compares diagnostic and non-diagnostic biopsies with biopsies above and below the three-core cutoff. Marginal odds ratio shown below table

Table 4 Tested cutoff values for total core length and number of cores in relation to diagnostic yield. (Only data for the most pertinent cutoffs are listed below). Twenty millimeters for total core length and three total cores were selected as the statistically significant cutoffs in which the marginal odds ratios plateaued, and thus the values that optimized yield
Fig. 5
figure 5

Connected scatterplot shows marginal odds ratios in relation to diagnostic yield of various total core lengths. Marginal odds ratio of 20 mm cutoff shown as dashed horizontal line on graph

Diagnostic accuracy

Overall diagnostic accuracy was 97.6%, with pathology from the initial bone biopsy specimen being concordant with the surgically excised specimen in 120 of 123 lesions. The sensitivity of ICNB diagnosing malignancy was 97.9% (93/95, 95% CI (92.6–99.7%)) and specificity was 100% (28/28, 95% CI (87.7–100%)) (Fig. 6). No analyzed factors were found to be statistically significant predictors of accuracy. Specific histopathologic diagnoses in nonconcordant biopsies are also provided in Table 5.

Fig. 6
figure 6

2 x 2 contingency table comparing tissue diagnosis from image-guided core needle biopsy to tissue diagnosis from surgical excision (considered reference standard)

Table 5 Specific histopathologic diagnoses in 3 cases in which diagnosis from core needle biopsy specimen was nonconcordant with diagnosis from surgically excised specimen

Complications

Complications were documented in 0.5% (12/2321) of biopsies. In seven biopsies, patients experienced hematomas and three patients experienced pneumothoraces, all without change in hemodynamic status. One patient experienced a foot drop after a sacral biopsy and the patient was lost to follow-up. In one patient the needle tip fractured and a 1.7 cm needle fragment was embedded in the bone. Complications from surgical excision were experienced in 0.8% (1/123) of patients, in which the patient experienced an intraoperative femur fracture.

Discussion

In the literature, estimates of diagnostic yield and accuracy of ICNB of suspected malignant osseous lesions are relatively divergent, there is heterogeneity concerning which factors influence these measures, and few studies offer technical recommendations to optimize diagnostic yield [5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24]. By examining a large cohort of patients, our study aimed to evaluate diagnostic yield and accuracy, further expand upon what factors influence these measures, and offer technical recommendations that operators may use to optimize the diagnostic yield of biopsies. We found diagnostic yield and accuracy to be 98.2% (2279/2321) and 97.6% (120/123) respectively and found that increased total core length, core number, and presence of primary malignancy were associated with improved yield, while lesion location in an extremity and using fluoroscopic imaging guidance were associated with lower yield. We also found that obtaining a total core length of 20 mm and three total cores optimized diagnostic yield.

The values we obtained for diagnostic yield and accuracy are on the higher end of those reported in the literature. In relatively older studies, Yang et al (in an analysis of 508 biopsies) reported yield to be 91%, Omura et al (in an analysis of 444 biopsies) reported yield to be 71%, and Sung et al (in an analysis of 309 biopsies) reported yield to be 92% [19, 20, 24]. Notably, M Wu et al (in an analysis of 96 biopsies) also reported a yield of 98% [7]. In more contemporary studies, Yang et al (in an analysis of 247 biopsies) reported yield to be 80%, and Kirhira et al (in an analysis of 585 biopsies) reported yield to be 82% [6, 17]. M Wu et al and Yang et al both reported a diagnostic accuracy of 98%, similar to our study results [7, 24]. Other studies, however, report lower accuracy values than what we found in our current study. Sung et al reported accuracy to be 84%, Omura et al reported accuracy to be 86%, and Rimondi et al (in an analysis of 430 biopsies) reported accuracy to be 93% [19, 20, 22].

Open surgical biopsy has long been considered the reference standard for tissue acquisition of bone tumors [27]. Given the low complication rate, a vast number of biopsies included, and very high rates of diagnostic accuracy and yield found in this study however, we believe that our study results suggest that ICNB could be considered the gold standard and preferential method of tissue sampling for suspected malignant osseous lesions. The most notable benefits of ICNB versus open biopsy are the reduced cost and complication rate [1,2,3,4]. Other studies have also reported no statistically significant difference in diagnostic accuracy between ICNB and open biopsy for bone tumors, and further support the implementation of ICNB over open biopsy [3, 28].

Identifying factors associated with diagnostic yield might identify ways in which to improve yield. Increased core number and total core length being positively associated with increased yield is an intuitively expected result; however, in the literature, it has not always been replicated. J Wu et al produced similar results to our study and found that an increased number of cores and total core length were associated with improved diagnostic yield [5]. In larger retrospective studies, however, Yang et al found no difference in diagnostic yield with increased core number, and Li et al found no difference with increased total core length [6, 16]. Our study did not find an association between needle gauge and yield and is concordant with other studies in the literature in this regard [5, 16].

The only other positive predictor of diagnostic yield found in our study was the presence of a known primary malignancy. A similar result has been obtained by Hwang et al who reported that cancer history is associated with increased yield [10]. This association might be due to the known primary malignancy lowering the quality of specimens needed by the pathologist to diagnose metastatic lesions.

The two factors found in our study to be associated with lower yield were fluoroscopic imaging guidance and lesion location in an extremity. Several smaller studies have reported no difference between CT and fluoroscopic imaging guidance but might have been underpowered to detect such a difference [6, 19, 20]. Compared to CT guidance, fluoroscopic guidance might decrease yield due to its limited visualization of small lesions [21]. However, in larger lesions with complete marrow replacement, fluoroscopic guidance would not seem to be inferior to CT guidance. Furthermore, fluoroscopic guidance may improve procedure efficiency and decrease ionizing radiation exposure, and operators should weigh these factors when choosing which imaging modality to use [21]. In this study, CT guidance was primarily utilized to provide better soft tissue visualization and was used for lesions in which the biopsy tract ran adjacent to sensitive soft tissue structures. For lesions in anatomic areas without important adjacent soft tissue structures (such as the feet or lumbar spine), fluoroscopic guidance was generally favored.

Yang et al reported a similar result to our finding that lesion location in an extremity decreases yield and reported that elbow and forearm lesions had lower yield compared to other central locations [24]. Other studies in which lesions were grouped into peripheral and central lesions, however, found no association between lesion location and yield [16, 20]. Lesions in extremities might have decreased yield by having a greater relative proportion of cortex to the medulla, making them technically harder to sample.

Notably in our study, we did not find that lytic lesions increased yield, similar to the results reported by Omura et al [20]. Several other studies in the literature however have reported lytic lesions to have increased yield [5, 6, 16].

While our study has shown that increased lesion sampling via increased core number and total core length increases diagnostic yield, increased sampling ostensibly also increases procedure time and might increase the potential for complications, sedation, and exposure to ionizing radiation. Thus, obtaining threshold values for the core number and total core length at which diagnostic yield plateaus is important. The only study in the literature that has focused on optimizing technical parameters for bone biopsy (to our knowledge) has been the study by J Wu et al, who in an analysis of 88 bone biopsies, reported that diagnostic yield reached a plateau at three total cores obtained [5]. Our current study of 2321 biopsies has replicated this finding. Additionally (to our knowledge), no other studies in the literature have investigated the total core length that optimizes diagnostic yield. We found that the relationship between increased total core length and increased yield starts to plateau at 20 mm. Thus, we recommend that for bone biopsies of suspected malignant lesions, operators should attempt to obtain three cores and 20 mm total core length to optimize diagnostic yield.

Our study did not find any factors that influenced diagnostic accuracy. However, due to the relatively small sample size of lesions included in the accuracy analysis, it is possible this analysis was underpowered. Other study limitations include that the study reflects the experience of only one academic institution, and thus demographics of the population, biopsy referring patterns, and preferential use of certain equipment or imaging modalities might not reflect the diversity of clinical practice. Additionally, patients were not randomized, and some data such as lesion size and extraosseous extension were unavailable. Furthermore, there might be relatively limited generalizability of the study results for primary osseous malignancy as only 2.2% of the lesions included in the study were malignant primary bone tumors.

Additionally, the lack of information regarding whether patients were oligometastatic or had diffuse disease might affect the diagnostic yield/accuracy if the majority of patients had multiple lesions to choose from. Lastly, operator heterogeneity of the radiologists who performed the biopsies might have confounded results.

In conclusion, image-guided core needle biopsy is a safe procedure with a very high rate of diagnostic yield and accuracy. Our study results suggest ICNB could be considered the preferential method for tissue acquisition in suspected malignant osseous lesions. Increased total core length, core number, and presence of primary malignancy are associated with improved yield, while lesion location in an extremity and using fluoroscopic imaging guidance are associated with lower yield. We recommend that operators attempt to obtain a total core length of at least 20 mm and three total cores to optimize diagnostic yield.