Introduction

Robotic thyroidectomy has gained popularity worldwide [17], offering a scar-less neck and greater cosmetic satisfaction to the patients [8], while it offers a three-dimensional magnified view, hand-tremor filtration, and multi-articulated hand-like motions to surgeons [3]. During the initial period of adopting the robotic system in thyroid surgery, studies were focused on the technical feasibility, safety, and complications of robotic thyroidectomy [3, 9, 10]. Recently, surgical completeness and functional outcomes affecting patients’ quality of life have been investigated [1113].

Voice alteration is one of the main issues affecting the quality of life of patients after thyroidectomy, and prevention of voice impairment is considered a primary goal of thyroidectomy procedures. It is well known that damage to the laryngeal nerve causes vocal cord paralysis and hoarseness. However, in many cases, voice impairment after thyroidectomy is not related to impairment of the recurrent laryngeal nerve (RLN) or external branch of the superior laryngeal nerve (EBSLN) [1416].

Voice change following robotic thyroidectomy has not been thoroughly evaluated, although the short-term voice outcomes after transaxillary robotic thyroidectomy have been reported recently by us and others [1719]. In the present study, we prospectively compared long-term voice functions after robotic thyroidectomy and conventional thyroidectomy. Voice outcomes were evaluated both subjectively and objectively, with evaluation of serial changes. To our knowledge, this is the first report on long-term functional voice outcomes after robotic thyroid surgery.

Patients and methods

Patients

We prospectively enrolled a consecutive cohort of patients undergoing conventional transcervical thyroidectomy or robotic thyroidectomy using a gasless unilateral axillary (GUA) or gasless unilateral axillo-breast (GUAB) approach between March 2011 and August 2012. The inclusion criteria were patients undergoing thyroidectomy with or without central neck dissection for treatment of thyroid nodule(s) and who provided informed consent for voice evaluation. The study was approved by the Institutional Review Board of Hanyang University Hospital.

Indications for robotic thyroidectomy included benign thyroid nodules less than 5 cm in diameter, differentiated thyroid carcinoma less than 3–4 cm with small nodal metastases [20]. The exclusion criteria were identical to those in our previous study evaluating short-term voice function after robotic thyroidectomy: age <18 or >70 years, vocal fold lesions and movement disorders, evidence of injury to the RLN or EBSLN on videolaryngostroboscopic or flexible laryngoscopic examination, history of surgery or irradiation to the neck, maximal extrathyroidal extension, or distant metastasis [18]. We also excluded male patients to eliminate the bias in acoustic parameters, due to the difference of acoustic parameters between male and female patients.

Videolaryngostroboscopic examination with an EndoSTROB DX (Xion, Berlin, Germany) or fiberoptic flexible laryngoscopy was routinely performed the day before surgery and again post-operatively to evaluate paralysis of the RLN or EBSLN. We estimated the motion of the vocal folds, vocal fold mucosal lesions and wave, symmetry of arytenoid excursion, bowing, and degree of glottic closure [18].

Surgical techniques

All robotic thyroid surgeries were performed by a single experienced surgeon (K.T.). The surgical procedures for robotic thyroidectomy using GUA and GUAB approaches have been described in previous reports [2, 3, 21]. Conventional thyroidectomy was performed by the usual method, with a 5–6 cm low-collar skin incision [13]. We routinely identified and preserved the RLN during thyroidectomy. However, we did not identify the EBSLN routinely to minimize the EBSLN injury in the both groups.

Evaluation of subjective voice symptoms

Voice symptoms were evaluated subjectively and objectively the day before surgery and again at 3, 6, 12, and 24 months after the operation. Subjective voice symptom score (VSS) was evaluated using a questionnaire to estimate discomforts associated with thyroid surgery [18]. The questionnaire comprised five questions on changes in voice pitch, range, intensity, fatigability, and singing quality: (1) Do you have vocal fatigue? (2) Do you have a hoarse voice? (3) Is it difficult to produce a high pitch? (4) Is your voice weak or breathy? (5) Do you have difficulty in singing? All questions were scaled as 0 (no symptom), 1 (mild), 2 (moderate), 3 (severe), or 4 (very severe) [18]. The VSS was estimated as the sum of the scores for the five questions.

Objective acoustic parameters and aerodynamic study

Acoustic analysis and aerodynamic measurements were made using voice range profile (VRP) analysis and the multidimensional voice program (MDVP) with Computerized Speech Lab software (MDVP model 4150B, KayPENTAX, Lincoln Park, NJ, USA). The acoustic analysis was recorded by an experienced speech-language pathologist (B.R.Y).

For VRP analysis, patient was instructed to phonate the sustained vowel/a/as loudly and as softly as possible from the lowest frequency to the highest. The lowest frequency (F-low, Hz), highest frequency (F-high, Hz), frequency range (Hz), and intensity range (dB) were measured. For MDVP analysis, the patient produced a sustained vowel/a/at the most comfortable pitch/amplitude for at least 5 s. The parameters measured were noise-to-harmonic ratio (NHR), jitter (%), shimmer (%), and fundamental frequency (F0, Hz). For the aerodynamic study, maximal phonation time (MPT, seconds) was measured by asking the patient to produce the vowel/a/for as long as possible. This was repeated three times and the longest measurement was selected.

Statistical analysis

Student’s t test, Chi squared test, and Fischer’s exact test were used to compare factors between the robotic and conventional thyroidectomy groups of patients. Paired t tests were used to compare pre-operative and post-operative variables. SPSS version 18.0 software (SPSS Chicago, IL, USA) was used for all statistical analyses. A value of p < 0.05 indicated statistical significance.

Results

Clinicopathological characteristics

During the study period, an overall total of 321 patients with thyroid nodules underwent robotic (97 cases) or conventional (224 cases) thyroidectomy. Of these, 165 patients were excluded from the present study due to exclusion criteria. Two further patients who underwent robotic thyroidectomy and four who underwent conventional thyroidectomy were observed to have transient RLN paralysis after operation and they were excluded from the study. Among the 150 female patients, 124 (82.6 %) successfully completed all of the acoustic analysis. Finally, the robotic group comprised 54 patients, and the conventional group comprised 70 patients. The clinical and pathological data of the both groups are summarized in Table 1. The mean age of the robotic group was lower, and operative time was longer than the conventional group.

Table 1 Clinical and pathological characteristics of the robotic and conventional thyroidectomy groups

Analysis of subjective voice symptoms

A comparison of VSS in the two groups of patients is shown in Table 2. At 3 months after surgery, the VSS was significantly lower (corresponding to less voice impairment) in the robotic group than in the conventional group (p = 0.025); however, no difference between the groups was observed at 6 months, 1 year, and 2 years post-operatively (p = 0.203, 0.536, and 0.785, respectively). Within the robotic group, the VSS at 3 months was significantly higher than the pre-operative VSS, but by 2 years there was no significant difference from the pre-operative level. In contrast, the VSS at 2 years in the conventional group remained significantly above the pre-operative VSS.

Table 2 Comparison of pre- and post-operative voice symptom scores in the robotic and conventional thyroidectomy groups

Measurement of objective acoustic and aerodynamic parameters

The results of measurement of aerodynamic and acoustic parameters in the two groups of patients are listed in Table 3. The F-high and the frequency range were higher and wider, respectively, in the robotic group than in the conventional group at 6 months, 1 year, and 2 years after surgery. Within the robotic group, the F-high and the frequency range recovered to pre-operative levels by 6 months after surgery. In contrast, in the conventional group, these parameters had not recovered to pre-operative levels at 2 years. No differences were found between the two groups regarding intensity range, F0, jitter, shimmer, NHR, and MPT.

Table 3 Comparison of pre- and post-operative aerodynamic and acoustic parameters in the robotic and conventional thyroidectomy groups

Discussion

Thyroidectomy causes post-thyroidectomy syndrome, including voice impairment and discomfort during swallowing. Injury to the RLN is the main cause of functional voice impairment after thyroidectomy. The prevalence of RLN paralysis after conventional thyroidectomy is 0.77–7.2 % [2224]. In earlier studies, no difference was reported in the incidence of RLN injury between patients who underwent robotic and conventional thyroidectomy [3, 21, 25]. Nonetheless, impairment of vocal or throat functions occur in 15–84 % of patients who undergo thyroidectomy, even without definite injury of the laryngeal nerves [26]. These impairments may remain for a long period of time after thyroidectomy.

Studies that have evaluated the long-term voice outcomes after thyroidectomy were all confined to conventional thyroidectomy [15, 26, 27]. One prospective observational study evaluating long-term voice functions after conventional thyroidectomy showed that vocal frequency range was narrower, perceptual voice remained rougher, and subjective parameters of vocal symptoms remained worse than during the pre-operative period, even up to 12 months after thyroidectomy [26]. In another study with a mean follow-up period of 4 years, voice change persisted in 28 % of the patients in the absence of definite laryngeal nerve injury [15]. At present, there is no consensus on the definition of a long-term post-operative period in evaluating post-thyroidectomy syndrome. In one study on assessment of post-thyroidectomy voice function, 1–2 weeks after thyroidectomy was considered the early post-operative and 3 months the late post-operative period [28]. However, in the present study, we considered 1 year or more after the surgery as long-term, in accordance with other reports [13, 27].

Previously, we have reported short-term results for voice and swallowing function following robotic thyroidectomy [18]. In our previous study, subjective voice impairment following robotic thyroidectomy was less than with conventional thyroidectomy at 1 day, 1 month, and 3 months post-operatively. In addition, measurements of acoustic parameters gave better results for vocal frequency range and F-high in the robotic group than in the conventional group at 3 months after surgery. These results were reproduced in the present study, even though the patient groups differed and surgery took place later than in the earlier study [18]. In the present study, we analyzed the extended long-term serial voice outcomes up to 2 years post-operatively. Long-term subjective and objective outcomes were significantly better in the robotic group than in the conventional group. At 3 months post-operatively, subjective voice functions were better in the robotic group and voice symptoms had recovered to the pre-operative level by 2 years, whereas voice symptoms had not recovered by 2 years in the conventional group. However, the difference in recovery of VSS to pre-operative levels in the two groups must be interpreted with caution, since the VSS at 2 years was not significantly different between the two groups.

In the robotic group, the F-high was higher and the frequency range was wider at 6 months, 1 year, and 2 years after surgery than in the conventional group. The F-high and the frequency range recovered to pre-operative levels by 6 months in the robotic group, whereas those parameters had not recovered by 2 years post-operatively in the conventional group. We additionally compared the risk factors between patients with VSS or F-high (Hz) that were >10 % increase than pre-operative baseline at 2 years (56 patients) and patients with VSS and F-high ≤10 % increase (68 patients). The analyzed risk factors were robotic versus conventional surgery, total thyroidectomy versus lobectomy, central neck dissection, age ≤30 versus 30 <age ≤50 versus age >50, laryngopharyngeal reflux, smoking, alcohol, and body mass index. In univariate analysis, age (p = 0.039) and conventional thyroidectomy (p = 0.011) were risk factors for worsening VSS/acoustic parameter. In multivariate logistic regression analysis, only conventional thyroidectomy was an independent risk factor (OR = 2.298, 95 % confidence interval 1.029–5.132, p = 0.042).

The proposed factors that affect voice symptoms following thyroidectomy in cases without apparent laryngeal nerve injury are delayed healing, surgical trauma of the strap muscles or cricothyroid muscle, laryngotracheal fixation, subclinical hematoma, laryngeal edema caused by disturbance of venous or lymphatic drainages, local neck pain, vocal fold changes, and injury of the arytenoid caused by orotracheal intubation [14, 18, 2832]. Acoustic and aerodynamic parameters that are influenced by thyroid surgery are decreases in the MPT, fundamental frequency, F-high, and frequency range, and increases in jitter, shimmer, and the NHR [29, 31]. After thyroidectomy, subjective voice symptoms without laryngeal nerve injury are more common than objective acoustic abnormalities [13, 33]. The discordance between objective parameter abnormalities and subjective vocal symptoms is found in approximately one-third of patients who undergo thyroidectomy [28, 34].

Laryngotracheal fixation and altered strap muscle or cricothyroid muscle movement can be the major causes of voice change after thyroidectomy [35]. Adhesion of the strap muscle to the larynx and trachea is caused by scar formation after thyroidectomy, and results in defective pitch control during phonation [14, 18]. Strap muscle movement effects laryngotracheal movement, and damage to the muscles by retraction and adhesion is associated with a reduction in fundamental frequency and range [36, 37]. The impairment of cricothyroid muscle movement is usually related to paralysis of the EBSLN or surgical trauma of the muscle, and it results in lowered pitch, loss of upper range and projection, vocal fatigue, and breathiness [14]. Typical cricothyroid dysfunction can be observed by a shift of the posterior commissure toward the affected side, weak tension, and shortened vocal fold on videostroboscopic examination [14].

We assume that the better long-term results for voice symptoms and vocal pitch after robotic thyroidectomy compared with conventional thyroidectomy might be mainly due to reduced adhesion of the skin flap, the strap muscles and the laryngotracheal structure, and fewer traumas to the strap muscles [18]. Robotic transaxillary approach avoids the need to make an incision in the skin of the neck, and permits limited dissection of strap muscles and fine dissection of the thyroid gland; therefore, laryngotracheal fixation is reduced. The better results for phonatory pitch after robotic surgery may be the result of reduced injury to the EBSLN in comparison with conventional thyroidectomy. Robotic procedures allow fine dissection in a ×10 magnified view, and provide a better view of the small branches of neurovascular structures compared with direct vision, or viewing with a surgical loupe.

The present study has several limitations. First, the different distribution of age between the two groups might have affected the results. Second, electromyography to confirm EBSLN injury was not used in this study, because of the reluctance of patients to undergo an invasive procedure, although we excluded cases with definite EBSLN injury on videostroboscopy examination [33, 38]. Third, the study lacks randomization of the robotic or conventional thyroidectomy techniques. Randomized clinical trial of robotic thyroidectomy is difficult because robotic thyroidectomy is not covered by the national health insurance system in our country and costs 4–5 times more than conventional thyroidectomy. A further study with randomization and a larger patient pool may minimize these limitations. Fourth, VSS questionnaire is not validated. However, VSS questionnaire has been used in the previous study, and was effective in evaluating specific voice symptoms related with thyroidectomy, and showed high compliance due its conciseness [18].

Conclusion

In the long-term post-operative period of up to 2 years after surgery, robotic thyroidectomy via a GUA/GUAB approach has advantages in recovery of subjective voice symptoms and better acoustic parameters of frequency range and highest frequency than with conventional thyroidectomy.