Introduction

Thyroid nodule is a common entity. It occurs as a palpable lesion in 5% of women and 1% of men from iodine-sufficient regions [1, 2], while its prevalence in the general population increases to 70% in ultrasound (US) studies [1,2,3]. The large majority of thyroid nodules is benign. When the malignancy risk is excluded, patients with thyroid nodule(s) can be managed differently. Asymptomatic, benign nodules generally do not require treatment and patients are addressed to clinical follow-up. On the other hand, patients who refer clinical symptoms or discomfort specifically correlated to the goiter, may require a specific treatment [4,5,6]. Traditionally, surgery has represented the main treatment strategy. However, reducing invasiveness in patients with benign thyroid nodules is a crucial breakthrough of our era. Indeed, while thyroid surgery is widely available in specialized centers, it still carries a 2–10% prevalence of complications, such as neck scarring, hypothyroidism, transient or definitive postoperative hypoparathyroidism, recurrent laryngeal nerve injury, and the recognized risks associated with general anesthesia. Also, surgery is expensive and may not be appropriate for a surgically high-risk individual [4,5,6,7,8].

Relevant advancements have been achieved in the last decades with image-guided thermal ablations, such as radiofrequency ablation (RFA), laser ablation, high-intensity-focus ultrasound (HIFU), and microwave ablation (MWA). Solid non-functioning thyroid nodules with a cytological proof of benignancy determining local compressive symptoms or esthetic/cosmetic concerns have represented the main target of these treatments. On the contrary, other benign thyroid lesions, such as autonomously functioning adenoma, still represent a potential challenge [9]. Among all these thermal options, RFA and laser ablation have been certainly the most largely used, while only preliminary sparse results have been reported for the other therapies. Then, due to relevant observational studies and some randomized controlled trials, international guidelines recommend RFA and laser ablation in selected patients with local symptoms due to benign non-functioning solid thyroid nodules (BNFSTN) [10,11,12,13]. Fewer studies have been published on HIFU and MWA which anyway appear as encouraging options [14,15,16,17]. More recently, a relevant document quoted that the technical success of image-guided thermal ablations of a thyroid nodule should be defined as a volumetric reduction rate (VRR) ≥ 50% of the initial nodule volume. According to the same document, clinical success should be defined as the ability of treatment to resolve the clinical condition itself [18]. To date, data on results of RFA and laser ablation has been limited to the first few months of follow-up in the majority of papers, also with inhomogeneous definition and reporting of technical and clinical efficacy. Furthermore, stability of the treatment results over time still remain a major issue to be solved. Obtaining more robust evidence on results durability over time could allow to assume image-guided thermal therapies as a consistent alternative to surgery in symptomatic patients with BNFSTN.

With the present study, we aimed to systematically review the literature on the use of RFA and laser ablation in BNFSTNs to obtain solid evidence about their technical and clinical efficacy over time. End-points of this meta-analysis were VRR, compressive symptoms, cosmetic concerns, and complications.

Methods

This meta-analysis was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [19].

Search strategy

A six-step search strategy was planned. Firstly, we searched sentinel studies in PubMed. Secondly, keywords and MeSH terms were identified in PubMed. Thirdly, in order to test the strategy, the terms “radiofrequency”, “laser”, “thyroid”, and “nodule” were searched in PubMed. This allowed the development of the best strategy to return all identified sentinel studies and the lowest number of non-relevant articles. Fourthly, PubMed, CENTRAL, Scopus, and Web of Science were searched. This allowed the creation of the database of studies to be assessed. Fifthly, studies reporting VRR, compressive symptoms, cosmetic concerns, and complications for RFA or laser ablation in BNFSTN were included. Finally, references of included studies were screened for additional papers. The last search was performed on March 4, 2019. No language neither time restriction was adopted. Three investigators (C.V., M.C., and P.T.) independently and in duplicate searched papers, screened titles and abstracts of the retrieved articles, reviewed the full-texts, and selected articles for their inclusion.

Study selection

Thermal ablation represents a valuable treatment mainly for solid or predominantly solid thyroid nodules. Then, only original papers reporting complete data of BNFSTNs treated by RFA or laser ablation and later followed-up for at least 6 months could be included in this systematic review. Exclusion criteria were: (a) articles not within the field of interest of this review; (b) studies evaluating RFA or laser ablation in malignant, autonomous functioning, or cystic nodules; (c) studies with <6-month follow-up; (d) studies with overlapping patients or nodules data; (e) review articles, editorials, letters, comments, or case/series reports. Three researchers (C.V., M.C., and P.T.) independently reviewed titles and abstracts of the retrieved articles, applying the above criteria. Then, four authors (C.V., G.M., M.C., and P.T.) independently reviewed the full-text of the remaining articles to determine their final inclusion. Discordances were solved in a final collegial meeting.

Data extraction

The following information was extracted independently and in duplicate by three investigators (C.V., M.C., and P.T.) in a piloted form: (1) general information on the study (author, year of publication, country, study type, follow-up duration, number of patients, number, and volume of BNFSTNs); (2) RFA or laser ablation procedure; (3) VRR; (4) compressive symptoms score; (5) cosmetic score; and (6) complications. For cosmetic score, the nodule was classified according to the following scale: 1. no palpable mass; 2. no cosmetic problem but palpable mass; 3. a cosmetic problem only on swallowing; and 4. easily visible mass. For compressive symptoms score, a visual analog scale (scale 0–10) was used, where 0 indicates the absence of compression and 10 an important compression giving serious symptoms and modifying the patient’s quality of life. The main paper and supplementary data were searched. Data were cross-checked and any discrepancy was discussed.

Study quality assessment

The risk of bias of included observational studies was assessed independently by two reviewers (M.C. and P.T.) through National Heart, Lung, and Blood Institute Quality Assessment Tool for Observational Studies [20]. The risk of bias of included RCTs was assessed independently by the same reviewers through the Cochrane Collaboration’s tool for assessing risk of bias for the following aspects: random sequence generation; allocation concealment; blinding of participants and personnel; blinding of outcome assessment; incomplete outcome data; and selecting reporting. For other bias, funding was assessed. Each domain was assigned low, unclear, or high risk of bias [21].

Statistical analysis

The primary outcome was the VRR at 6, 12, 24, and 36 months after RFA or laser ablation. Secondary outcomes included change in compressive symptoms score and cosmetic score from baseline to the last available follow-up. Reported complications were also assessed. The endpoints were analyzed as continuous variables and summarized as weighted mean and weighted mean difference, respectively. If standard deviation was missing in a study for a specific outcome, it was calculated from standard error, 95% confidence interval or from interquartile range; if none of these were available, the largest among the other studies was reported. A subgroup analysis based on the number of sessions and on the baseline nodule volume was planned, with a cut-off of 30 ml. Meta-regressions on VRR based on baseline nodule volume and year of publication were attempted. Pooled data were presented with 95% confidence interval (95% CI). Heterogeneity between studies was assessed by using I2, with 50% or higher regarded as high. Publication bias was assessed with Egger’s test; the trim-and-fill method was used for estimating its effect. Finally, although the objective of the present review was the independent evaluation of RFA and laser ablation in the management of BNFSTN, a comparison between RFA and laser ablation for VRR was attempted through t-test. All analyses were carried out using Prometa 3.0 (Internovi) and RevMan 5.3 (The Cochrane Collaboration) with a random-effect model; p < 0.05 was regarded as significant.

Results

A total of 963 papers were found, of which 197 on PubMed, 36 on CENTRAL, 371 on Scopus, and 359 on Web of Science. After removal of 435 duplicates, 528 articles were analyzed for title and abstract; 440 records were excluded (guidelines, reviews, on techniques other than RFA or laser ablation [e.g. HIFU, MWA, percutaneous ethanol injection], on lesions other than thyroid solid benign non-functioning nodules [parathyroid, thyroid cancer, thyroid cysts, autonomous nodules], case reports, not in humans). The remaining 88 papers were retrieved in full-text and 24 articles were finally included in the systematic review (Fig. 1).

Fig. 1
figure 1

Flow-chart of the systematic review

Qualitative analysis (systematic review)

The characteristics of the included articles are summarized in Tables 1 and 2 [22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45]. The studies were published between 2002 and 2019. Sample sizes ranged from 12 to 1534 BNFTNs. Median post-treatment follow-up was 12 months on both RFA and laser ablation. Mean post-treatment follow-up was 15.5 ± 12.1 and 19.5 ± 14.9 months after RFA and laser ablation, respectively. Six studies were prospective cohort, 10 retrospective cohorts and seven randomized controlled; the study design was not clearly stated in one study [36]. Twelve studies examined RFA and 12 laser ablation. Participants were adult outpatients with a solid thyroid nodule causing compressive symptoms or cosmetic concerns and being ineligible to or refusing surgery. Benignity was defined according to one or two separate fine-needle aspiration cytologies. The nodule was defined as non-functioning if cold on scintigraphy or if the patient was euthyroid. Devices and technical aspects for RFA and laser ablation are reported in Supplementary Data. Overall, 1186 BNFSTN were treated with RFA and 2009 with laser ablation. Nodule volume at baseline was 20.1 ± 22.4 ml in RFA, and 24.6 ± 23.6 ml in laser ablation (p < 0.01).

Table 1 Characteristic of included studies on radiofrequency ablation and availability of data
Table 2 Characteristic of included studies on laser ablation and availability of data

Quantitative analysis (meta-analysis)

The primary outcome was the VRR at 6, 12, 24, and 36 months. Overall, image-guided thermal ablations were associated with a VRR of 60%, 66%, 62%, and 53%. RFA was associated with a VRR of 68%, 75%, and 87%, respectively. There were not enough RFA data for the 36-months follow-up in order to perform a meta-analysis. Laser ablation was associated with a VRR of 48%, 52%, 45%, and 44%, respectively. Similar results were found when assessing only nodules undergone single session treatment. Comparing VRR at each follow-up with the previous one in different ablative modalities, a statistically significant reduction was found only for RFA at 24 months (Table 3 and Fig. 2). A subgroup analysis was performed according to nodule volume at baseline. There were enough data to perform a meta-analysis only for RFA, with nodules smaller than 30 ml achieving a better outcome (Table 4).

Table 3 Volume reduction rate of BNFSTN with all thermal treatments, RFA or laser ablation, according to follow-up and number of sessions
Fig. 2
figure 2

Pooled results of volume reduction rate of BNFSTN considering RFA, laser ablation, and all studies

Table 4 Volume reduction rate of BNFSTN with RFA according to nodule volume at baseline and follow-up

Data regarding change in compressive symptoms and cosmetic score from baseline to the last available follow-up were available only for RFA, showing an improvement of both (Figs. 3 and 4). However, most of studies on laser ablation reported an improvement of both outcomes [34,35,36,37,38,39,40,41,42,43, 45].

Fig. 3
figure 3

Change in compressive symptoms score of BNFSTN treated with RFA from baseline to the last available follow-up

Fig. 4
figure 4

Change in cosmetic score of BNFSTN treated with RFA from baseline to the last available follow-up

The list of reported complications can be found in Supplementary Data.

Overall, a high heterogeneity was found. In order to investigate this, a meta-regression using nodule volume at baseline as explanatory variable was performed, showing a significant interaction with VRR only for RFA. Publication bias was found for overall analysis of image-guided thermal ablations and for laser ablation VRR at 12 months; the trim-and-fill method did not change the estimate (Supplementary Data).

Study quality assessment

The risk of bias of the included studies is shown in Supplementary Data. Concerning the observational studies, statement of the study question, enrollment of patients, description of intervention, outcome measures, patients flow, and statistical analysis were adequate in all. In three studies, eligibility criteria were not clearly described [31, 34, 36]. A selection bias was found in four: Spiezia, 2009 enrolled only elderly patients; Aysan, 2016, Yue, 2017 and Deandrea, 2019 selected nodules depending on the maximum diameter [22, 28, 29, 32]. To treat very small nodules in order to minimize the risk of complications (i.e. subcapsular, close to trachea, or carotid arteries) was based on clinical decision. There was no sample size calculation in any study. Outcomes were blindly assessed in three [22, 41, 44]. Outcome measure of interest were generally taken multiple times, except in two studies [40, 43]. Concerning the RCTs, we found no attrition or reporting bias; no study was funded by industry. Information on random sequence generation and allocation concealment was not reported in one study [26]. Performance bias was rated as high in all, since an open-label design was used. However, no other study design could have been reasonably used: thermal treatments were compared with observation or levo-thyroxine in some studies, and single versus multiple sessions in other studies. Finally, outcome assessment was blinded only in two studies [35, 37].

Discussion

This study was conceived as an attempt to achieve robust evidence on the efficacy over time of thermal ablation to reduce volume of BNFSTN and its correlated symptoms. Particularly, the present study focused on RFA and laser ablation, as the most worldwide used thyroid thermal ablation modalities, but it was not undertaken to compare the performance of these two options. To our knowledge, this is the first high-evidence study assessing the durability of both techniques. Twenty-four studies were found, of which 12 on RFA and 12 on laser ablation. The overall results of our meta-analysis showed that both RFA and laser ablation were effective in reducing nodule’s size. Results were stable up to 2 years for RFA and 3 years for laser ablation. Improvement in compressive symptoms and cosmetic concerns was demonstrated for RFA. Based on our results, we discuss several considerations.

First, both RFA and laser ablation were effective to reduce the volume of BNFSTNs and their success was maintained up to 2–3 years. In the overall analysis, the 36-month VRR appeared to be smaller than the 24-month one, although not statistically significant. It is worth noting that data for the former outcome were retrieved from one study using RFA and four studies using laser ablation. Thus, a comparison between the two endpoints was biased. Two recent studies with long-term results of RFA [46] and laser ablation [47] were recently published. Both studies showed patients overlap with other included papers and anyway they would have not changed the longest meta-analyzable data. Overall, following the herein recorded results, we are able to consider that when a nodule shows a shrinkage at first visits after RFA or laser ablation its volume will be stable later. This finding achieves high interest for clinical practice, because when a patient is treated using RFA or laser ablation instead of surgery the optimal results over time shall be guaranteed. The latter issue, together with the low frequency of complications recorded in these studies (see below), represents a high-level evidence which allows us to consider a non-surgical thermal treatment as a real alternative to surgery. Since thermal ablation has been used in patients with esthetic or compressive symptoms related to a single nodule or a dominant nodule in a multinodular goiter especially when smaller than 30 mL, our results have to be interpreted in this clinical context.

Second, the performance of RFA seems to be better than that of laser ablation in terms of VRR achieved. However, several issues have to be discussed on this topic. First of all, several differences are present between studies on RFA and on laser ablation, including significant different nodule volume at baseline, publication period, geographical region, nodule composition, and energy deposition. While most RFA data were published since 2015, the large majority of laser ablation data were reported in a previous time frame (Fig. 5), including the very initial experiences on BNFSTN. This also explained the longer duration of follow-up in studies using laser ablation. Furthermore, the studies on laser ablation were published only by European authors, while RFA data were reported by European and Asian researchers. Moreover, the studies on RFA defined solid nodules when there was a solid composition in at least 50–80% of the lesions, while this definition was not reported in the large part of laser ablation studies. Both different chronology of the publications and diffusion of RFA versus laser ablation might explain the different size of nodules treated by either treatment; the selection of patients has been probably improved over time [48]. Indeed, nodules treated by RFA were significantly smaller than those undergone laser ablation, and the baseline volume was found to be significantly associated with the efficacy of RFA (Supplementary Data). Sparse data on the comparison between RFA and laser have been published and no significant difference have been observed [30]. Notably, no randomized controlled studies were reported until now. Finally, it should be highlighted how the endpoint of the treatment is not to achieve the maximum VRR possible, but to achieve a stable clinical result (e.g. resolutions of compressive symptoms) with the minimum possible rate of complications. On this topic, a great inhomogeneity among studies was found regarding endpoints and reporting criteria. This further highlight what recently reported regarding the need of an higher standardization in terminology and reporting criteria [18].

Fig. 5
figure 5

Timeline of included studies

Third, ideally, patients should be treated in a single session. Seven studies on RFA and 10 studies on laser ablation used this approach. Moreover, two studies were specifically designed to test this hypothesis. In Huh et al. (2012), 30 patients were randomized to one or two sessions of RFA, respectively; no significant difference between the two groups was found for VRR, compressive symptom, or cosmetic score. However, following the first session, three patients in the first group requested an additional treatment because of unsatisfying nodule-related symptoms [23]. In Dossing et al. (2006), 30 patients were randomized to one or three sessions of laser ablation, respectively; a little incremental effect in nodules undergone multiple treatments was reported, without differences in subjective symptoms [37]. This implies that when expected results are not met, retreatment can potentially be considered [24].

Fourth, even if the algorithm search of the study was not designed to evaluate the side effects, a low prevalence of complications with RFA and laser ablation was reported. Recurrent laryngeal nerve injury (including transient voice change) was the most frequent major complication both with RFA and laser ablation. Other major complications included brachial plexus injury (n = 1), local infection (n = 1), massive colliquative necrosis (n = 1), and nodule rupture with fasciitis (n = 1) among RFA-treated patients; pseudocyst transformation with fasciitis (n = 3) and trachea injury (n = 1) were described in laser ablation-treated patients. No analysis was performed on these data, since studies were selected based on the stated efficacy end-points and that analysis should be affected by selection bias. Moreover, reporting differed among studies; a missing data cannot be assimilated to a lack of the event (Supplementary Data).

Limitations and strengths of the present meta-analysis should be discussed. Firstly, even if the duration of follow-up we found was significant, it might be considered as limited compared to the expectancy of life of treated patients. Secondly, data on success rate, defined as VRR > 50%, were often missing, and data on compressive symptoms and cosmetic concerns with laser ablation were unevenly reported (Supplementary Data) [18]. To follow the standardized reporting in future studies should be useful for future reviews [18, 49]. Thirdly, we were not able to assess the difference between single and multiple session treatments. Lastly, a high heterogeneity was found possibly due to: (1) study design; (2) procedure modality and mastering; and (3) patients’ characteristics other than the extracted ones.

In conclusion, a significant VRR was recorded in BNFSTN treated by thermal ablation. Available data allowed to show that both compressive symptoms and cosmetic concerns were significantly improved by RFA. As a relevant novelty for clinical practice, these results obtained early were stable over time. With this evidence, both RFA and laser ablation can be considered in patients with BNFSTN.