Introduction

The incidence of thyroid nodules and thyroid cancer is on the rise; the latter has more than doubled since 1973 [1]. Although 5% of women and 1% of men have palpable thyroid nodules, ultrasound will reveal a nodule in 19–67% of individuals [1]. This increased rate of detection has direct clinical, financial, and therapeutic implications. Although the rate of thyroid cancer in any one of these nodules is typically 5–15% [1], thyroid cancer, nevertheless, must be ruled out. Fine-needle aspiration biopsy (FNAB) has been shown to be the most accurate, quick, and safe means to do this [2,3,4]. Ultrasound guided FNAB is superior to palpation guided FNAB, insofar as it yields significantly more diagnostic samples [5,6,7], while also reducing the false-negative rate.

Despite the established efficacy of FNAB in the workup of thyroid masses, relatively high non-diagnostic rates continue to hinder the full potential of the FNAB. These rates for ultrasound guided FNAB in the literature range widely from 2 to 29% [8, 9]. Such non-diagnostic results increase costs, patient stress, and time to diagnosis. Most importantly, studies have shown that initial non-diagnostic samples can harbor a malignancy in 1–4% of cases [10].

Several studies in the radiology literature have examined the influence of needle gauge and/or biopsy technique (aspiration vs. capillary action) on diagnostic accuracy in ultrasound guided FNAB of neck masses using varying classification systems [11,12,13]. However, only one similar surgeon-led study in the otolaryngology literature has examined the effect of needle size on non-diagnostic rates [14]. Although Bhakti et al. reported non-diagnostic rates in a one surgeon experience with a combination of both capillary and aspiration technique [15], no surgeon-led study in the otolaryngology literature has differentiated between aspiration and capillary technique to analyze the relationship of technique on non-diagnostic rates. This study reports a one-surgeon experience detailing 5 years of thyroid FNAB using the Bethesda criteria. The study examines the relationship between non-diagnostic rates and FNAB technique.

Materials and Methods

A minimal risk IRB was approved in conjunction with a HIPAA waiver for informed consent. A total of 164 thyroid ultrasound guided FNAB were performed by the senior author at George Washington University Hospital form September 2009 to June 2014. Patients consisted of 33 males and 131 females. Patient age ranged from 23 to 89 years. FNA was performed on all nodules >10–15 mm, as well as nodules <10 mm that demonstrated suspicious features on ultrasound [16]. Suspicious features included microcalcifications, intranodular hypervascularity, irregular margins, hypoechoic solid nodule, and nodal involvement [16].

The following data was collected for each FNAB sample: age and gender of patient, anatomic location of biopsy, size, biopsy technique, needle gauge, adequacy of sample, FNA diagnosis, final pathologic diagnosis when applicable.

Each nodule underwent ultrasound imaging by the senior author. Echogenicity, vascular pattern, size, location, margins, composition, and calcification were recorded (Table 1). Nodules were characterized as either > or <50% cystic based on ultrasound findings.

Table 1 Study demographic and ultrasound characteristics of thyroid nodules in relation to diagnostic and non-diagnostic results

Ultrasound guided FNA was performed by the senior author with either a 23 or 25-guage needle. Informed consent was obtained for all patients who underwent FNAB. The needle was used to aspirate or draw up cellular material via capillary action from the solid portion of the nodule. The aspirated material was used to form air-dried smears by forcefully expunging it onto a thin prep slide, smearing it with a second slide, and then allowing it to air dry. Material drawn up by capillary action was gently spread along the horizontal length of the thin prep slide, smeared and then allowed to air dry. All biopsies were performed without an on-site cytopathologist. Slides were sent to cytology department for Diff-Quik staining.

The following needle gauge and biopsy techniques were used during 9/2009-10/2012, 10/2012-10/2013, and 10/2013-06//2014: 23-gauge with aspiration, 25-gauge with aspiration, and 23-gauge with capillary action, respectively. Technique was altered during these time periods in an attempt to lower non-diagnostic rates. All biopsies were performed using 4 passes.

Cytopathology results were retrospectively reviewed and classified based on the 6-tiered Bethesda criteria [10]. Results were classified as: non-diagnostic, benign, atypical cells of undetermined significance/follicular lesion of undetermined significance, suspicious for follicular neoplasm, suspicious for malignancy, and malignant (Table 2). Specifically, “non-diagnostic” results included nodules that were “completely cystic, virtually acellular (<6 groups of at least 10 cells) or otherwise unrevealing due to blood or artifact [10].” Although the cytopathology system at our university varied slightly from the Bethesda system, we were able to extrapolate our cytopathology classifications.

Table 2 Distribution of FNA diagnoses of study compared against literature values

Patient demographics and ultrasound features of thyroid nodules were compared in non-diagnostic versus diagnostic cases using a 2-tailed independent-groups t test (continuous data), chi square test or Fisher’s exact test (for categorical data). Multivariate logistic regression analysis was performed to determine independent factors that predicted non-diagnostic cases after accounting for all covariates that were potentially associated with insufficiency (based on having univariate p < 0.10). SAS (version 9.3, Cary, NC) was used for all data analysis with p < 0.05 considered statistically significant.

Results

Nodule composition (p < 0.0001), needle gauge (p = 0.01), and technique (p = 0.01) were significantly associated with non-diagnostic rate (Tables 1, 3). The non-diagnostic rate was lowest for nodules that were “cystic <50%” (16%) and highest for “cystic >50%” (72%). The non-diagnostic rates for 23- and 25-gauge needles were 22 and 43%, respectively. Aspiration technique resulted in a non-diagnostic rate of 32%, while the rate for capillary action was 13%. The non-diagnostic rate by year of surgeon was 0, 32, 14, 34, 39, and 7% for years 2009 through 2014, respectively (Table 3). The non-diagnostic rate according to year by technique was 27, 43, and 13% (Table 4).

Table 3 Non-diagnostic rates by year in practice
Table 4 Non-diagnostic rates—years by technique

After multivariate logistic regression, predictors with significant independent effects included age, nodule composition, and technique (Table 5). The prediction model was significant (p < 0.0001) with a strong effect size (pseudo-R2 = 0.25) and good prediction accuracy (c = 0.82). Each year of patient age raised the odds of insufficiency by 4% (OR 1.04 [95% CI 1.01–1.07], p = 0.018). Nodules that were “cystic >50%” had 13 times the odds of being non-diagnostic, compared to solid nodules. Nodules biopsied via aspiration had 21 times the odds of being non-diagnostic compared to nodules that underwent capillary action, after accounting for the other covariates.

Table 5 Multivariate regression analysis—variables independently associated with non-diagnostic rates

Of all nodules with FNAB results (n = 164), 52 underwent surgical sampling. The malignancy rate was 26.9%, as 38 nodules were benign and 14 were malignant. Overall sensitivity, specificity, PPV, NPV, and accuracy for detecting malignancy were 64, 100, 100, 88, and 90%, respectively. “Suspicious for follicular neoplasm” and “non-diagnostic” were excluded in this analysis.

Discussion

FNAB has been shown to be an efficacious primary diagnostic tool in the workup of head and neck masses. Indeed, the accuracy of detecting benign and malignant lesions in the present study was 90%. The sensitivity, specificity, PPV, and NPV for detecting benign vs. malignant lesions was 64, 100, 100, 88%, respectively. The high sensitivity, specificity, and accuracy in the present study support the use of FNAB as a primary screening tool in the clinical setting. Furthermore, these values are replicated in the literature. Wong et al. evaluated 13 similar studies in the context of the Bethesda criteria, stating that ultrasound guided FNAB was highly sensitive (84–94%) and specific (98–99%) in detecting malignancy [17]. A separate meta-analysis on the Bethesda criteria reported a PPV and NPV of 98.6 and 96.3% for the identification of benign and malignant lesions [8].

The literature, and the present study, support the efficacy of ultrasound guided FNAB, but non-diagnostic rates remain high (2–29%), despite the implementation of the Bethesda criteria in 2007 [8, 9]. The wide range may be due to the use of different biopsy techniques and varying operator experience among the studies. Lack of standardization is not an issue due to the implementation of the Bethesda criteria.

In an attempt to improve diagnostic adequacy of FNAB samples, we focused on the association between non-diagnostic rates, and needle gauge/biopsy technique. Both a 23-gauge needle (p = 0.01) and capillary action (p = 0.01) were associated with a statistically significant decrease in non-diagnostic rates when compared to 25-gauge needle and aspiration technique (Table 1). Again, several studies in the non-otolaryngology literature have examined the relationship between biopsy technique and non-diagnostic rates. Most of these studies, including Tauro et al. and Tubulin et al., have found no statistically significant difference in these rates based on aspiration versus capillary action [19,19,20,21,22]. Nevertheless, Tauro et al. concluded that aspiration may be better for largely acellular nodules, whereas capillary action may be better for cellular lesions [21]. Tubulin et al. concluded that capillary action is superior to aspiration insofar as it is easier to learn and faster to perform [19].

Degirmenci et al. is an exception to this trend, as they did note a statistically significant decrease in non-diagnostic rates with the use of a capillary action over aspiration [13]. Dey et al. also reported that capillary action could provide better sampling than aspiration [23]. Most recently, Zhou et al. suggested that choice of technique should be based on the size of the nodule [24]. Aspiration provided lower non-diagnostic rates in nodules that were 5.1 to 10 mm and >20 mm in their study, whereas aspiration and capillary action were of equal diagnostic benefit in the remainder of the nodules.

Similarly, most studies do not report an association between non-diagnostic rates and needle size [11, 12, 14, 25], although smaller needle size has been shown to decrease pain [11] and provide more cellular material [26]. Naiim et al. reported a decrease in non-diagnostic specimens when the number of passes was increased from 1 to 3, but related no difference when 2 or 3 passes were used [11].

It is important to note the studies that have examined the relationship between technique and non-diagnostic rates have done so prior to the implementation of the Bethesda criteria. In fact, to the best of our knowledge, ours is the only study in the literature to have examined the association between non-diagnostic rates and technique in the context of the Bethesda criteria.

The present study demonstrates a clear decrease in non-diagnostic rates with capillary action and thus may provide guidance for the novice clinician interested in implementing ultrasound guided FNAB. Capillary action technique has been demonstrated to provide specimens with greater cellularity and higher quality diagnostic material.

Multivariate logistic regression analysis was performed to determine independent factors predictive of non-diagnostic results after accounting for all covariates that were potentially associated with non-diagnostic results. Capillary action and age, but not needle gauge, was found to provide a significant decrease in non-diagnostic results after accounting for confounding variables. Similarly, nodules “cystic >50%,” as well as samples from older patients, were more likely to yield a non-diagnostic cytopathology result.

Interestingly, multivariate logistic regression analysis demonstrated that physician experience was not an independent factor for the degree of diagnostic adequacy, although there was a statistically significant relationship between physician experience and non-diagnostic rates (p = 0.014). There are likely two reasons for this observation. First, as explained previously, the use of aspiration with a 25-gauge needle during the majority of 2012 and 2013 resulted in a higher non-diagnostic rate compared to 2009–2012. This created a non-linear decrease in non-diagnostic rates across the years of the study, which prevented physician experience from being acknowledged as an independent factor in the degree of non-diagnostic samples. Second, at the cessation of the study, the most recent year, 2014, in the physician’s career was still incomplete. It is likely, given the trend in 2014, that a greater sample size incorporating the remainder of 2014 would demonstrate physician experience to be an independent factor in the prediction of non-diagnostic results.

There are several limitations to this study. The retrospective design of the study can lead to selection bias. Moreover, only the nodules that underwent surgery were used to calculate sensitivity, specificity, and accuracy. The decision to proceed with surgery was favored in patients with a worrisome presentation, or suspicious features on ultrasound and/or FNAB. The FNAB reporting system used at our institution is not the same as the Bethesda system, although it is sufficiently similar to allow for easy extrapolation. Additionally, the results of this study may not be generalizable to the public as it represents a small, single surgeon’s patient population at a tertiary care center with cases that may be different in complexity compared to what is usually seen by community practitioners; it is possible that given the complexity, cases referred were more likely to be non-diagnostic than what might be seen by in the community. Finally, due to the retrospective nature of study, several different pathologists reviewed the cytology results, which may have led to increased inter-observer variability. The potential confounding relationship between multiple different variables, including physician experience, and non-diagnostic rates was not a limitation, as it was corrected for using logistic regression analysis.

Conclusion

After taking into account confounding variables, including physician experience, our data reveals a statistically significant decrease in non-diagnostic rates with the use of capillary action compared to aspiration technique in US-guided FNAB. A major focus in health care today is providing cost-effective, minimally invasive patient care. In the setting of a rising incidence of thyroid disease, we believe our study demonstrates the need for a larger and more definitive analysis (i.e. prospective vs. randomized control trial) of the relationship between technique and non-diagnostic rates, in an attempt to minimize this inefficiency and help reduce costs associated with repeat FNAB.