Introduction

Cytology from fine-needle aspiration (FNA) is recognized as a main tool for the evaluation of thyroid nodules [1, 2]. The main limit of thyroid FNA is represented by indeterminate reports which occur with a non-negligible rate (i.e., up to 20–25% in different institutions). In presence of an indeterminate FNA report, the histologic examination after surgery is still the more frequently adopted way to achieve a diagnosis. Unfortunately, only one in four of all indeterminate nodules is cancer at final histology. Thus, to preoperatively diagnose thyroid nodules cytologically classified as indeterminate has represented one major challenge in clinical thyroidology [3]. Many imaging or molecular parameters have been investigated in this context [4,5,6,7,8,9,10,11]. However, to date there is no international consensus to use these tools for the management of patients in clinical practice.

In the last decades several international societies have published their guidelines for reporting thyroid cytology. A major challenge of these cytologic systems was to discriminate, between all indeterminate lesions, those at high risk of malignancy requiring surgery from that at low risk in which we could plan a clinical/ultrasonographic follow-up [12]. Specifically, the British Thyroid Association proposed the subclassification Thy 3a (low risk) and Thy 3f (high risk) [13] and Bethesda system the subcategories Category III (low risk) and IV (high risk) [14]. After these, the Italian consensus for the classification and reporting of thyroid cytology (ICCRTC) reported a five-classes system with a subclassification of Tir 3 in Tir 3A (low risk) and Tir 3B (high risk) [15]. Specifically, Tir 3A is defined as increased cellularity with numerous microfollicular structures in a background of scant colloid, where the proportion of microfollicles is not sufficient for the diagnosis of follicular neoplasm; this category also includes partially compromised samples (i.e., preparation artifacts or blood contamination of specimens), with cytologic or architectural alterations that cannot be confidently classified as benign nor otherwise categorized. Conversely, Tir 3B is characterized by high cellularity in a monotonous microfollicular/trabecular arrangement, with scant or absent colloid, that is suggestive for follicular neoplasm; in addition, of very high interest, the Italian classification includes in the subcategory Tir 3B those cases with “mild/focal nuclear atypia”. The latter represents the most significant difference between Italian system and British and Bethesda ones which include the atypia in their categories of indeterminate at low risk, i.e., Thy 3a [13] and Category III [14], respectively. The decision of ICCRTC was mainly based on the high rate of malignancy (25 %) reported in the Category III of Bethesda system until that moment [16, 17]. Currently, no published data are available on the reliability of ICCRTC and in particular on cancer rate of Tir 3A and Tir 3B. Instead, no significantly difference was found in a meta-analysis of papers reporting the performance of Category III and Category IV of Bethesda [18].

Here, we aimed to systematically review all published papers reporting the rate of malignancy at final postoperative histology of thyroid nodules with presurgical FNA diagnosis of Tir 3A or Tir 3B according to the ICCRTC. Furthermore, to obtain more robust evidence from these data, a meta-analysis of the published articles was also attempted.

Material and methods

Search strategy

A comprehensive literature search in online databases of Google Scholar and Scopus was conducted by searching all papers citing the Italian consensus for the classification and reporting of thyroid cytology (i.e., we searched by using the original title of the ICCRTC “Italian consensus for the classification and reporting of thyroid cytology” and then retrieved all papers with this article cited in the references list). This allowed us to retrieve studies reporting a series of nodules undergone FNA and classified according to ICCRTC. A beginning date limit was not used, the search was updated until June 20, 2017 and no language restriction was used. To try to expand our search, references of the retrieved articles were also screened to identify additional studies.

Study selection

Only original articles that reported complete data of nodules with indeterminate FNA according to ICCRTC were eligible for inclusion: the main exclusion criteria were articles that did not provide histologic follow-up of the indeterminate nodules and studies with overlapping patient data. Cases reports were also excluded. Two researchers (PT, LG) independently reviewed titles and abstracts of the retrieved articles, applying the above criteria; then, the same researchers independently reviewed the full-text version of the remaining articles to determine their final inclusion.

Data extraction

For each included study, information was abstracted concerning study data (authors, year of publication and country of origin); number of patients evaluated; number of thyroid cancers and thyroid benignancy detected.

Statistical analysis

The prevalence of cancers among the categories Tir 3A and Tir 3B was obtained from individual studies using the formula: number of cancers/number of thyroid nodules × 100. For statistical pooling of the data, DerSimonian and Laird method (random-effects model) [19] was used. In this model, pooled data represent weighted averages related to the sample size of the individual studies. Pooled data were presented with 95% confidence intervals (95%CI) and displayed using a forest plot. I-square index was used to quantify the heterogeneity among the studies, and a significant heterogeneity was defined as an I-square value > 50%. Egger’s test was carried out to evaluate the possible presence of a significant publication bias. Statistical analyzes were performed using the StatsDirect statistical software version (StatsDirect Ltd; Altrincham, UK).

Results

Eligible articles

The comprehensive computer literature search revealed 62 articles. Review of titles and abstracts excluded 56 articles according to the above criteria, and six articles [20,21,22,23,24,25] were finally included.

Qualitative analysis (systematic review)

All the included studies were retrospective and were published by Italian authors. Data on the final histologic follow-up were clearly identified in all these manuscripts and the true percentage of malignancy could be calculated. Table 1 details the characteristics and findings of the six included studies. Out of the six papers, one aimed to evaluate the reliability of core needle biopsy in indeterminate nodules [20], two investigated the reliability of ultrasound parameters to predict malignancy [22, 24], and the remaining three [21, 23, 25] reported institutional experiences in indeterminate thyroid nodules with specific focus on the new ICCRTC. Total cases of indeterminate nodules with postoperative histologic follow-up was 423 and 156 (36.9%) of these were malignant. Table 2 describes the results of the 6 studies in the subcategories Tir 3A and Tir 3B.

Table 1 Main characteristics of the included studies
Table 2 Results of histologic examination reported in the included studies

Quantitative analysis (meta-analysis)

The rate of malignancy among the pooled series of indeterminate thyroid nodules at low risk according to ICCRTC (Tir 3A) was 17% (95% CI = 12 to 22%), ranging from 6.7 to 26.1% (Fig. 1). The rate of malignancy among the pooled series of indeterminate thyroid nodules at high risk (Tir 3B) was 52% (95% CI = 46 to 58%), ranging from 42.4 and 57.8% (Fig. 2). The included studies were statistically non heterogeneous for sample and prevalence of cancers among nodules classified as Tir 3A (I-square 3.7%) and Tir 3B (I-square 0%). No significant publication bias was evident in these series: Egger’s test for Tir 3A = 2.95 (95% CI = −0.63 to 6.53, P = 0.084); Egger’s test for Tir 3B = 1.95 (95% CI = −4.81 to 0.92, P = 0.132).

Fig. 1
figure 1

Pooled prevalence of cancers in nodules cytologically classified as Tir 3A

Fig. 2
figure 2

Pooled prevalence of cancers in nodules cytologically classified as Tir 3B

Discussion

Here, we aimed to systematically review all published data on the prevalence of cancers in thyroid nodules with preoperative cytologic diagnosis of Tir 3A or Tir 3B according to ICCRTC [15]. Also, we attempted to meta-analyze the results of the included papers. Remarkably, the sample size and the prevalence of cancer in both categories were non-heterogeneous and no significant bias was identified. These data allow us to consider the recent results as strong. The systematic review retrieved six original papers with complete data of thyroid nodules operated upon after indeterminate FNA. The meta-analysis demonstrated that the subcategory Tir 3A had a lower cancer rate (17%) than Tir 3B (52%). This finding is encouraging for all users of Italian system for classification of thyroid cytologic samples in their clinical practice. In fact, this meta-analysis supports the actions suggested from the panel of ICCRTC, i.e., repeat FNA or clinical follow-up in presence of a Tir 3A cytologic report, and surgery in case of Tir 3B. We would like to underline that the present findings are quite different from that recorded in a meta-analysis on the reliability of Bethesda system in distinguishing low risk indeterminate lesions from high risk ones; there, the overall rate of malignancy was 27% for the Category III and 31% for the Category IV [18]. Then, no true difference was found between these two subcategories. Specifically, a preoperative Category III had a higher risk of malignancy than that one would expect. The herein demonstrated good performance of ICCRTC is probably due to the different interpretation of atypia which are features of its high risk subcategory (Tir 3B), while are included in the low risk of British (Thy 3a) and Bethesda (Category IV) systems. We advise for further prospective studies on this topic.

Some potential limitations of the present study have to be discussed. Generally, studies reporting positive findings are more likely to be published than those describing negative results. Also, small sample-size studies are likely to report positive relationship. Our study demonstrated lack of both significant publication bias and heterogeneity between the included studies. However, we retain that a selection bias might be present in this meta-analysis because these papers reported only selected series of patients undergone surgery (to obtain a histologic diagnosis of the indeterminate lesions) and it is not possible to evaluate outcome of all patients non-operated upon. Finally, the herein recorded malignancy rate of both Tir 3A and Tir 3B is higher than that expected [15]; conversely, in the document of ICCRTC the experts report clearly an estimation of risk based on their previous clinical experience.

In conclusion, Italian classification of thyroid cytologic samples can discriminate the indeterminate lesions at low risk of malignancy from that at high risk. This is mainly due to the fact that the Italian system, at variance with British and Bethesda ones, includes in the subcategory at high risk those cases with mild/focal nuclear atypia.