Introduction

Fine-needle aspiration (FNA) with cytological evaluation has become a usual technique for screening thyroid cancer [1]. It has reduced the number of thyroid surgeries and increased the likelihood of detecting malignancy in patients undergoing thyroidectomy. Diagnostic performance of thyroid FNA, however, is hindered by non-diagnostic biopsies due to inadequate/insufficient samples or indeterminate cytological patterns, the latter including atypia of uncertain significance/follicular lesions of uncertain significance (AUS/FLUS), and follicular neoplasms/suspicious for follicular neoplasm (FN/SFN), third and fourth categories of Bethesda System for Reporting Thyroid Cytopathology [2]. Factors influencing indeterminate or insufficient FNA include the operator performing the procedure [3], the technique [4], and the experience of the cytologist [5], reaching up to 24 % in a recent multicenter review [6]. The recommended strategy to solve these drawbacks is repeating the FNA, which is diagnostic in 50 % of cases [7, 8]. Surgical follow-up of these groups yields a malignancy ratio that ranges from 6 % to 48 % in AUS/FLUS [9] and between 15 and 30 % in FN/SFN [2, 10].

Ultrasound (US) guided percutaneous thyroid nodule core needle biopsy (CNB) has proved to be useful in non-diagnostic FNA, and in nodules with AUS at FNA. The CNB allows obtaining a sample for histological study and thus a higher diagnostic accuracy than FNA, but this is still in debate [1115]. Some authors report better results with a combination of both techniques [1618]. Concerns regarding potential increase in patient pain and complications, as well as technical issues, have curtailed the spread of thyroid CNB.

Only a few reports about the accuracy of thyroid CNB can be found in the literature and, to our knowledge, there are no studies regarding performance evaluation as a primary tool and potential pitfalls of the technique. In our centre, after using CNB to improve the diagnostic performance of thyroid FNA, our preliminary results encouraged us to use this technique as primary diagnostic test over FNA [19].

The purposes of this study were to analyze the diagnostic accuracy of ultrasound-guided core-needle biopsy (CNB) of thyroid nodules using thyroidectomy specimens as a reference standard and to describe the main causes of its pitfalls.

Materials and methods

Study design

We performed a retrospective review of all CNB of thyroid nodules from October 1, 2005 to December 1, 2013 in our institution. Biopsies were requested for any nodule with sonographic criteria following the ATA guidelines [20]. We included in our study all the patients who had undergone thyroidectomy and had at least one previous CNB of one or more thyroid nodules. In our centre, surgery is indicated after a CNB result of malignancy or follicular proliferation, and in some nodules with benign CNB because of size, growth, local compressive symptoms, or patient’s choice.

US pattern, localization, and size of the nodules, pathological diagnosis of both the biopsy and surgery, complications of CNB, and demographic issues were recorded. If necessary, a consensus was required between radiologist, pathologist, and surgeon. Surgical and pathological reports were recorded no more than 2 weeks after thyroidectomy. The interval between CNB and surgery was less than 12 months in all cases. Informed consent was obtained for all patients, and institutional review board approval was obtained for this study.

CNB procedure

CNB procedures were performed with real-time US guidance, using a 10-12 MHz linear probe in an IU-22 platform (Philips HC, Best, the Nederlands). Three radiologists with more than 20, 8, and 4 years of experience in thyroid biopsies performed all procedures. CNB were performed using a spring-loaded 18-gauge full-core Biopince biopsy needle (Angiotech, Vancouver, Canada). This needle has a variable stroke length of 13, 23, and 33 mm, thus allowing different sample sizes according to the volume and position of nodule, an excellent control of the reach of the biopsy, and allows obtaining good specimens with a relatively small gauge. Each patient was placed supine with the neck extended. After an initial US evaluation to identify the thyroid nodules of interest, 1 % lidocaine was injected in the path. The US probe was placed over the target, and the biopsy needle was advanced with US guidance to the edge of the nodule and then fired (Fig. 1). The approach was usually transisthmic (see Results), introducing the needle from the midline of the neck, trough the isthmus to the nodule. When this approach was not possible (i.e. nodules located close to the carotid artery) a lateral-to-midline approach was used. Usually, two CNB specimens of each thyroid nodule were taken. Cystic nodules were emptied using a 22-G needle before taken a CNB from the residual solid portion. Patients were discharged after a firm local compression of 5-10 min.

Fig. 1
figure 1

A 42-year-old woman was scheduled for biopsy of a left thyroid nodule of 2.5 cm (a). In the sonographic exploration before the biopsy, a small (3.5 mm) hypoechoic nodule was discovered in the right lobe (b). Both nodules underwent biopsy. In c, the needle (arrows) can be seen pointing to the small nodule. A transisthmic approach is used. In d, the needle (arrows) can be seen crossing the target after firing the spring-loaded device. The diagnosis of the biopsy of the bigger lesion was hyperplasia, and the smaller nodule was a papillary carcinoma

Specimen analysis

CNB samples were placed in cooled saline solution and immediately transported to the pathology laboratory. Refrigeration of the transport medium prevented tissue autolysis and contributed to maintain the samples in optimal conditions. Upon arrival in the laboratory, biopsy core samples were processed, fixed in formalin, and embedded in paraffin. Paraffin block slices were taken from the beginning with the microtome, keeping two or three unstained slides mounted for immunohistochemistry, if necessary. A board-certified attending staff pathologist reviewed each CNB specimen. We standardized four main diagnostic categories for the CNB results: insufficient (I); benign (B), including hyperplasic, inflammatory, and other benign lesions; follicular lesion (FOL), including follicular and oncocityc neoplasms; and malignant (M), papillary, medullary, or other thyroid cancer subtypes. Diagnosis of FOL versus hyperplasic lesions was defined according to previously described criteria [18], i.e. microfollicular pattern with scant or absent colloid, sometimes with minimal pleomorphism or discrete nuclear changes. This category could be equivalent to AUS/FLUS, and follicular neoplasms/suspicious for follicular neoplasm (FN/SFN), III and IV categories in Bethesda classification [2]. We do not consider suspicious of malignancy as a diagnostic category because the biopsy specimen allowed a more precise diagnosis.

After thyroidectomy, all surgery specimens were evaluated by the same pathologists who performed the CNB analysis. Definitive surgical diagnoses were categorized as follows: hyperplastic nodule, thyroiditis, other benign lesions, follicular adenoma, oncocytic adenoma, papillary carcinoma (PTC), follicular carcinoma (FTC), oncocytic carcinoma, medullary carcinoma (MTC), and other malignant tumours, based on classical histopathological criteria [19]. Surgical pathology diagnosis was considered the final diagnosis.

Statistical analysis

Positive predictive value (PPV) for malignancy was calculated for every category of CMB diagnosis. Also, as both FOL and M results lead to the recommendation of surgical management, sensitivity and PPV of CNB considering any of both diagnosis as positive, was calculated. Moreover, to test the diagnostic value of thyroid CNB to identify neoplasms, we calculated sensitivity and PPV of CNB for diagnosis of adenoma (follicular or oncocytic) or thyroid cancer considering CNB FOL and M results as positive. As we have only included patients who underwent surgery, calculation of predictive negative value (PNV) or specificity was not appropriate.

Analysis of variance (ANOVA) was used to test differences for age and size of the nodules. Chi-square test or Fisher’s exact test was used to compare frequencies for categorical variables such as nodular content, lobe, solitary or multinodular goiter and categorized size. A two-tailed p-value less than 5 % was considered significant. Statistical analysis was performed with the SPSS package software, version 19.0 (SPSS, Chicago, IL, USA).

Results

In total, 3517 thyroid CNB were performed with 204 (5.8 %) considered insufficient or inadequate in pathological evaluation. Of them, 54 (27 %) were biopsied again, and eight were surgically removed and thus included in this study. All of them were benign.

Only one major complication was observed: a recurrent nerve lesion after a direct puncture of the nerve (myelin was observed in the specimen) that caused a permanent dysphonia. After this case we favoured a transisthmic approach for the biopsy. Minor complications included 56 self-limited hematomas that did not require treatment. From all the 3313 diagnostic CNB, 158 samples were labelled as malignant (M), 160 were classified as FOL, and the remaining as benign (B) nodules. Out of them, 684 nodules in 637 patients had surgery (in 47 patients, two different nodules were biopsied). Eight of them (1.2 %) had unsatisfactory or insufficient tissue to make a diagnosis, all of them benign on surgery.

In 676 nodules undergoing surgery, CNB diagnosis was benign in 374, malignant in 148, and FOL in 154 (Table 1). Demographic and sonographic characteristics of these groups are shown in Table 2. After surgery, 181 nodules were malignant (154 PTC, 16 FTC, five MTC, three anaplastic carcinomas, two poorly differentiated carcinoma, one mucoepidermoid carcinoma) and 495 nodules benign (380 hyperplasic lesions, 60 follicular adenomas, 41 oncocytic adenomas, 10 lymphocytic thyroiditis, one fibrotic nodule, one cavernous hemangioma, one amiloid goiter, and one necrotic cyst). The correlation between CB results and histopathological results is shown in Table 3. In 614 cases (90.8 %) CB diagnosis and final diagnosis were coincident and in 62 (9.2 %) not (Fig. 2).

Table 1 Pathological diagnosis of CNB in patients undergoing surgery
Table 2 Demographic and sonographic characteristics of three CNB diagnostic groups
Table 3 Correlation between CNB result and surgical diagnosis
Fig. 2
figure 2

Diagram of the study population and the final diagnosis

Incidental papillary microcarcinomas were discovered on surgery in 56 patients of the study group, 38 of them in patients with CB diagnosis of hyperplasia and 18 in diagnosis of FOL. These cases were not considered false negatives, and the incidental lesions were not included in the study because they were located in non-biopsied areas, distant from the biopsied nodules.

Discrepancies and causes

Discrepant results are shown in Table 4. A benign CNB corresponded to a malignant final diagnosis in 2.1 % of cases (8/374). The causes of false negative were:

Table 4 Discrepancies core-needle biopsy vs. surgical diagnosis

a. Cystic PTC with small solid component in four cases measuring 12 to 14 mm. One of them revealed their malignancy when simultaneous FNA was performed to empty the cyst.

b. Sampling error in three cases: two well-differentiated follicular variants of PTC with scattered milimetric tumour foci into nodules bigger than 30 mm, and one multifocal PTC with six foci from 2 to 18 mm, none of them sampled on CNB.

c. Error in pathological interpretation. One follicular variant PTC of 40 mm with evident diagnosis of malignancy in the revision of CNB specimen after surgical diagnosis.

False positive patients included two hyperplasic lesions with partial nuclear changes that had been identified as suspicious of PTC in CNB, and one hyperplasic nodule with squamous metaplasia classified as squamous carcinoma in CNB, as previously reported [21]. A diagnosis of malignancy in CNB had a false positive rate of 2 % (3/148). One PTC, diffuse sclerosing variant, was diagnosed as MTC in CNB. Revision after surgical histopathologic analysis, showed an error in pathological interpretation, with negative immunohistochemistry for calcitonin in CNB specimen.

CNB and surgical diagnosis of follicular and oncocytic neoplasms

Follicular non-oncocytic tumours at CNB (n = 89) included 72 truly follicular tumours when operated on: 55 follicular adenomas (one oncocytic variant), nine follicular carcinomas, and eight PTC (seven follicular variant and one cribiform morular). The other lesions were 16 hyperplasic lesions and one MTC.

At surgery, the 65 oncocytic tumours in CNB, resulted in 47 truly oncocytic lesions (40 adenomas and seven oncocytic carcinomas), two PTC oncocytic variant, 15 hyperplasic nodules (all with oncocytic transformation), and one poorly differentiated carcinoma.

Diagnostic performance of CNB

Positive predictive value of a CNB diagnosis of malignancy is 98 % (Table 3). The sensitivity for malignancy if CNB diagnosis of FOL or M are considered positive was 95.6 % (173 of 181) and PPV 57.9 % (173/299). The sensitivity for diagnosis of thyroid neoplasm (adenoma or carcinoma) if CNB diagnosis of FOL an M are considered positive was 95 % (268/282) and PPV 88.7 % (268/302).

Discussion

FNA is an excellent diagnostic tool and the reference standard in evaluating nodular thyroid disease, but limited by insufficient or indeterminate results [22]. Repeated FNA, second opinion diagnosis and group consensus review of cytological specimens may reduce uncertainty, but results are inconclusive in a significant number of cases. In addition, the indeterminate AUS/FLUS category has a high inter-observer variability and the probability of malignancy in these nodules remains unclear [8, 2327].

CNB offers an alternative that could improve the accuracy of diagnosis in these challenging nodules [1416, 2830]. Recently, some authors have published good results when using this technique in combination with FNA [18] or after insufficient, non–diagnostic, or suspicious results in FNA, as in nodules with suspicious US features and benign cytology [11, 12, 1416, 3133]. The results suggest higher diagnostic accuracy for CNB, mainly because it can offer a bigger and a better sample than FNA, as has been demonstrated with samples acquired from surgical specimens [34]. However, since CNB is a seldom used technique, its actual diagnostic power as a primary indication has not been studied so thoroughly as that of the FNA, with only a few articles studying its accuracy, and these reports include selected cases [35].

Only a low number of series compares the results from CNB with that of pathology after surgery. Most of them use instead a stable clinical course and the lack of sonographic changes to consider the nodule as benign, without a definitive histological diagnosis. This approach potentially misses up to 10 % carcinomas, particularly in large thyroid nodules, due to histological heterogeneity [36], also observed in two of our cases. To our knowledge, ours is the biggest published series of thyroid CNB with surgical correlation. Moreover, we have included consecutive thyroidectomy confirmed nodules; regardless if a prior FNA had been performed, thus not selected by previous indeterminate or failed FNA as in other published series [11, 12, 14, 15, 29, 37].

Higher economical cost, technical requirements, time consumption, and concerns about potential complications, have limited the use of CNB. However, we have observed a low rate of insufficient samples for CNB (5.8 %), thus additional procedures are less frequently required. Also, we have observed only one major complication: a neurogenic lesion, also reported with FNA [38], and, in our experience, the procedure, in experienced hands, is similarly as time consuming as FNA cytology. In addition, FNA diagnostic categories show great inter-operator variability owing to technical aspects, application of criteria and subjective interpretation of each cytologist [9]. Our proposed four main diagnostic categories for CNB could avoid these limitations of FNA.

The main problem of FNA in thyroid nodules is the indeterminate cytological patterns, the third and fourth groups of diagnostic categories in the Bethesda system: AUS/FLUS (with a risk of malignancy of 5–15 %), and FN/ SFN (with 15–30 % risk of malignancy) [2]. When these cytomorphologic features appear, the usual practice is repeating FNA for AUS/FLUS or a surgical procedure for FN/SFN. Recommended rate of insufficient FNA in Bethesda’s guidelines [2] are less than 10 % and 7 % for diagnosis of AUS/FLUS. There is no recommended ratio for FN/SFN, but usually is slightly higher than malignant [9]. This means that more than 20 % of FNA do not have a reliable diagnosis. CNB is also affected by this limitation, but insufficient and FOL cases include only 10.3 % of all the biopsies. Almost nine of ten CNB obtained a definitive diagnosis in a single session. In the group labelled as FOL, 79.7 % of the nodules were neoplasms, 18.2 % malignant, similar to reported in undetermined cases with FNA.

Histology is the reference standard diagnostic test in the evaluation of follicular neoplasms. Numerous tissue blocks have to be prepared and analysed to demonstrate or exclude vascular invasion and/or capsular breakthrough as decisive hallmarks of malignancy. CNB that includes the nodule capsule for preoperative diagnosis of indeterminate follicular neoplasms for detecting malignancy has been proposed in a short series [39].

In addition, some groups suggest that FTC may develop from follicular adenoma due to the accumulation of genetic alterations [40]. This prospect would question the nonsurgical option if the adenomatous lineage of the neoplasm could be preoperatively demonstrated.

CNB, in our study, is a highly reliable test, with a disagreement ratio of 9.2 %, lower than 15.3 % reported by Yang et al in a series of 4703 FNA in 2007 [41]. Only in a small number of cases the result of CNB has lead to a mistaken clinical decision. Our results show a very high PPV (98 %) for a malignant CNB in preoperative diagnosis of malignancy of thyroid nodules. Pitfalls of CNB include cystic PTC. So, it should be stressed the importance to drain the cystic component of any cystic lesion before the CNB and to carry out a cytological analysis of the fluid in such cases. Other causes of false negatives are infrequent: sampling error due to histological heterogeneity in non-homogeneous neoplasms with undefined pathologic foci, and mistakes in pathological analysis. False positives are even more infrequent (2 %) and also related to pathological interpretation.

Sensitivity of a CNB diagnosis of malignancy is very high (94.8 %). Thus, a CNB diagnosis of malignancy and a benign diagnosis should be both regarded as highly reliable. PPV for malignancy of a CNB diagnosis of FOL is 18.2 %, similar to Bethesda IV [2] lesions. Moreover, PPV of CNB diagnosis of neoplasm (FOL or M) for diagnosis of thyroid neoplasm (adenoma or carcinoma) is 88.7 %. Difference in PPV between a CNB diagnosis of follicular neoplasm and oncocytic neoplasm is low.

This study demonstrates the potential of CNB as primary diagnostic test for patients with thyroid nodules. Using a robust volume of patients the study confirms that one diagnostic CNB correlates to subsequent pathological study, making unnecessary studies in CNB labelled as benign.

Selection of the type of biopsy needle is, in our opinion, a very important technical item to achieve good results with thyroid CNB. The use of end-cut biopsy needles allows obtaining larger specimens with needles of small bore. Side-cut needles, the other type of CNB needles, have also the inconvenience of having a non-sampling distal end that can cause undesired damage to structures located beyond the biopsied nodule.

Our study has some limitations. It is a retrospective study. The surgical diagnoses were not reviewed by an external pathology expert or a central panel of pathologists, as should be done in studies evaluating accuracy in clinical trials designed for the development of diagnostic test performance. However, the pathologists evaluating the CNB and surgical specimens have an extensive experience. The job was carried out in a single centre with a high volume of cases, and the results cannot be applied to other centre with lesser experience. Given that we have included only cases with diagnostic CNB and surgical correlation, we cannot calculate specificity or PNV of the technique. Also, despite the guidelines of our centre, some cases with M or FOL diagnosis have not been operated on, including five thyroid lymphomas treated without surgery, three invasive anaplastic thyroid cancers, two metastatic carcinomas and six CNB diagnosed as FOL not operated because the patient refused surgery. However, the potential impact of these cases in the measured parameters is low given our big sample size and the high accuracy observed for CNB.

We conclude that CNB is a reliable, safe, and accurate method to approach thyroid nodules in the clinical practice. Our results show high sensitivity and PPV and a low rate of non-diagnostic cases. Pitfalls of CNB are scarce and related to cystic PTC, sampling error due to histological heterogeneity and mistakes in pathological analysis. US-guided CNB can be an alternative technique to FNA in selected nodules, in addition to a technique to be used after insufficient samples or indeterminate cytological patterns in FNA instead of repeating FNA or surgical lobectomy.