Recently, robotic thyroidectomy using the da Vinci Surgical System has been developed to eliminate neck scaring and to overcome limitations of endoscopic thyroid surgery such as an inadequate surgical view and inaccurate instrument manipulation [16]. The da Vinci Surgical System offers many advantages over traditional endoscopic thyroidectomy including improved surgical ergonomics and surgical dexterity. It provides a magnified high-resolution three-dimensional view, hand-tremor filtration, fine-motion scaling, and precise and multi-articulated motion of innovative instruments.

In robotic or endoscopic thyroidectomy, axillary and breast approaches or modifications of these procedures are the most widely used [1, 35, 713]. The advantages of robotic or endoscopic thyroidectomy using the extracervical approach over conventional open thyroidectomy include better cosmetic results and magnified surgical views.

Robotic thyroidectomy can be performed safely for appropriately selected patients with thyroid nodules. Several authors have described its feasibility and safety, reporting surgical outcomes comparable with those of conventional open thyroidectomy, although it has been performed by only a very limited number of surgeons due to the difficulty of the technique and the need for a period of training [16, 14].

In recent years, increasing attention has been paid to the subjective symptoms and functional outcomes of endoscopic or robotic thyroidectomy and minimally invasive thyroidectomy including postoperative pain, paresthesia, hyperesthesia, voice and swallowing functions, and cosmesis [3, 5, 13, 1518]. Robotic thyroidectomy using a gasless transaxillary or axillo-breast approach is reported to provide better postoperative cosmesis and less postoperative neck paresthesia than open thyroidectomy [35, 19]. However, functional voice and swallowing outcomes after robotic thyroidectomy have not been thoroughly investigated, although one study has evaluated subjective postoperative voice and swallowing symptoms using a questionnaire [19].

The current study aimed to compare postoperative voice and swallowing functions after robotic thyroidectomy with those after conventional open thyroidectomy in the absence of impairment to the recurrent laryngeal nerve (RLN) or the external branch of the superior laryngeal nerve.

Methods

This prospective study involved a consecutive cohort of patients with thyroid nodules who met the inclusion criteria and underwent robotic thyroidectomy using a gasless unilateral axillo-breast (GUAB) approach or a conventional open thyroidectomy with or without central compartment neck dissection from September 2009 to October 2010. All the patients were followed up for at least 6 months after the operation.

The exclusion criteria specified age younger than 18 years or older than 70 years and vocal cord lesions including vocal nodules, vocal polyps, Reinke’s edema, and vocal cord paralysis. Patients with postoperative evidence of RLN or paralysis of an external branch of the superior laryngeal nerve also were excluded as well as patients who did not consent to evaluation of voice and swallowing functions. Patients were ineligible for the study if they had a history of neck or thyroid surgery or had undergone reoperation, completion thyroidectomy, or lateral compartment neck dissection with thyroidectomy. Male patients were excluded to obtain a homogenous analysis of acoustic parameters. Patients with anaplastic and medullary cancer also were excluded.

No randomization for robotic versus open thyroidectomy was performed. The indications for robotic thyroidectomy included follicular neoplasm or benign thyroid nodules smaller than 5 cm in diameter and differentiated thyroid carcinoma smaller than 3 cm at its largest diameter with or without a minimal extrathyroidal extension or a small central or lateral compartment lymph node metastasis. We excluded cases of papillary thyroid carcinoma with extensive extrathyroidal extensions, large multiple cervical lymph node metastases in the lateral or central compartment, or distant metastases. Patients with a history of neck or thyroid surgery or irradiation also were excluded.

The operating procedures for robotic thyroidectomy using the GUAB approach have been described previously [3, 5]. All robotic and open thyroidectomies were performed by one surgeon (K.T.). All patients provided informed consent for the voice and swallowing assessments. The Institutional Review Board of Hanyang University Hospital approved the study.

Voice and swallowing functions were assessed by subjective voice and swallowing evaluation, videolaryngostroboscopy or fiberoptic flexible laryngoscopy, objective acoustic voice analysis, and aerodynamic measurements. Using questionnaires that we specifically designed, the subjective voice and swallowing evaluation was performed the day before surgery and then 1 day, 1 week, 1 month, 3 months, and 6 months postoperatively. The voice questionnaire consists of five questions about changes in voice pitch, range, intensity, fatigability, and singing quality (Table 1). The swallowing questionnaire consists of three questions (Table 2).

Table 1 The subjective voice symptom score questionnaire
Table 2 The subjective swallowing symptom score questionnaire

Voice and swallowing symptoms were scaled as 0 (no symptom), 1 (mild), 2 (moderate), 3 (severe), or 4 (very severe). Voice and swallowing symptom scores were defined as the sums of the scores in the corresponding questionnaires. The maximum voice symptom score was 20, and the maximum swallowing symptom score was 12.

Videolaryngostroboscopy or fiberoptic flexible laryngoscopy was performed to assess motion of the vocal folds, vocal mucosal lesions, arytenoid asymmetry, vocal cord bowing, degree of glottis closure, and mucosal wave characteristics pre- and postoperatively. Objective acoustic voice analysis was performed using the Multidimensional Voice Program and the Voice Range Profile from the Computerized Speech Lab (Model 4150B; KayPENTAX, Lincoln Park, NJ, USA) the day before surgery and then 1 week, 1 month, 3 months, and 6 months postoperatively. Acoustic recordings were collected by an experienced speech-language pathologist with a microphone positioned 15 cm from the patient’s mouth while the patient was in a comfortable sitting position.

For the Multidimensional Voice Program analysis, the patients were instructed to produce the sustained vowel “a” at a comfortable pitch and amplitude for at least 5 s. A period of steady-stable sound lasting 3 to 4 s for each vowel was used in the analysis. In the Multidimensional Voice Program analysis, fundamental frequency (F0, Hz), jitter (%), shimmer (%), and noise-to-harmonic ratio (NHR, dB) were analyzed.

In the Voice Range Profile analysis, the patients were asked to phonate using a sustained vowel “a” as loudly and as softly as possible from the lowest to the highest frequency. The lowest (F-low) and highest (F-high) frequencies (Hz), the frequency range (number of semitones), and the intensity range (dB) were measured from the Voice Range Profile. Maximum phonation times (MPTs) were measured by having patients sustain the vowel “a” at the habitual amplitude and pitch for as long as possible in a single breath as an aerodynamic parameter.

All statistical analyses were performed using SPSS version 15.0 (SPSS, Chicago, IL, USA). A p value of 0.05 or less was considered statistically significant.

Results

During the study period, 205 patients with a thyroid nodule underwent open (141 cases) or robotic (64 cases) thyroidectomy. Of the 205 patients, 5 (7.8%) in the robotic group and 4 (2.8%) in the open group experienced postoperative transient or permanent (1 case in the open group) RLN paralysis and were excluded from this study. In addition, 85 patients were excluded from the study due to male gender, underlying vocal cord lesions, no informed consent, and the like. Of the 111 patients who finally participated in this study, 50 underwent robotic thyroidectomy using the GUAB approach, and 61 underwent open thyroidectomy.

The questionnaire was completed by all 111 patients preoperatively, then 1 day and 1 week after the operation, by 105 patients after 1 month, by 97 patients after 3 months, and by 92 patients after 6 months. The acoustic voice analysis was completed by all 111 patients preoperatively and 1 week after operation, by 102 patients after 1 month, by 92 patients after 3 months, and by 85 patients after 6 months.

Table 3 presents the clinicopathologic characteristics of the two groups. All the patients were females, and the mean age was significantly younger in the robotic group than in the open group. The distribution of pathologies, mean tumor sizes, types of thyroidectomy, and rates of central compartment neck dissection did not differ between the two groups. The operation times were significantly longer in the robotic group than in the open group. The voice risk factors including smoking, alcohol, and laryngopharyngeal reflux did not differ between the two groups.

Table 3 Clinicopathologic characteristics of the robotic and conventional open thyroidectomy groups

None of the patients evidenced any paralysis of the RLN or the external branch of the superior laryngeal nerve at the pre- or postoperative videolaryngostroboscopic or flexible laryngoscopic examination. The changes in voice symptom scores are shown in Table 4. The mean voice symptom scores were significantly elevated 1 day after surgery and did not return to preoperative levels until 6 months postoperatively in either group. Comparison of the two groups showed that voice symptom scores were significantly lower in the robotic group 1 day, 1 month, and 3 months after surgery.

Table 4 Comparison of voice symptom scores (VSS) between the robotic and conventional open thyroidectomy groups

The mean swallowing symptom scores also were significantly elevated 1 day, 1 week, 1 month, and 3 months after surgery but had returned to preoperative levels by 6 months postoperatively in both groups. Comparison of the two groups showed that pre- and postoperative swallowing symptom scores did not differ between the two groups (Table 5).

Table 5 Comparison of swallowing symptom scores (SSS) between the robotic and conventional open thyroidectomy groups

The results of the objective acoustic voice analysis are presented in Table 6. No significant differences between the two groups were found in terms of mean F0, jitter, shimmer, or NHR. The mean F-high was significantly reduced 1 week (p = 0.024), 1 month (p = 0.021), and 3 months (p = 0.033) after surgery in the open group, whereas the robotic group showed no significant effect. In addition, the mean F-high was significantly lower 3 months postoperatively in the open group than in the robotic group. The mean frequency range was significantly reduced 1 week (p = 0.020) after surgery in the robotic group, and 1 week (p = 0.010) and 3 months (p = 0.002) after surgery in the open group. The frequency range was significantly lower 3 months postoperatively in the open group than in the robotic group. The pre- and postoperative intensity ranges did not differ between the two groups. The mean MPT was reduced 1 week (p < 0.001) and 1 month (p < 0.001) after surgery in both groups and had returned to preoperative levels at 3 months postoperatively. The pre- and postoperative MPT did not differ between the two groups.

Table 6 Comparison of objective acoustic parameters between the robotic and conventional open thyroidectomy groups

Discussion

Recurrent laryngeal nerve injury is the main cause of voice alteration after thyroidectomy. However, effects on the voice and swallowing are common complaints after thyroidectomy, even in the absence of impaired RLN function [1830]. Voice alteration after uncomplicated thyroidectomy usually is temporary and includes various changes such as voice fatigue, roughness, low voice pitch, and volume reduction. Subjective voice symptoms increase immediately after thyroidectomy and gradually decrease to the preoperative level 3 or 6 months postoperatively [18, 20, 21, 2528, 30]. Generally, subjective voice symptoms are more common than objective voice alterations. However, several authors have reported objective changes in acoustic parameters including F0, shimmer, highest frequency, frequency range, highest intensity, and dysphonia severity index in the absence of laryngeal nerve injury after thyroidectomy [20, 21, 24, 30].

The mechanism of postthyroidectomy voice alteration without injury to the laryngeal nerve is not well understood. Possible causative factors include laryngotracheal fixation with impairment of vertical movement, cricothyroid muscle dysfunction, operative injury or temporary dysfunction of extralaryngeal strap muscles, orotracheal intubation trauma and laryngeal edema, severe retractile cervical scar, local neck pain, and psychological factors [2022, 27, 28].

The results of this study confirmed that thyroidectomy frequently results in subjective and objective vocal alterations, even in the absence of laryngeal nerve injury. To our knowledge, this is the first study that analyzed objective acoustic parameters after robotic thyroidectomy as well as subjective voice and swallowing symptoms. In this study, subjective voice function was significantly better in the robotic group than in the open group 1 day, 1 month, and 3 months after surgery. Moreover, in the objective acoustic voice analysis, the mean F-high was significantly higher and the frequency range significantly broader in the robotic group than in the open group at 3 months postoperatively.

The loss of frequency range may be associated with unilateral vocal fold paralysis [20]. The reductions in F0, F-high, and frequency range also could be associated with an injury to the external branch of the superior laryngeal nerve [31]. Paralysis of the external branch of the superior laryngeal nerve may result in a low-pitched and monotonous voice, voice fatigue, hoarseness, and breathiness, although its effects are vague and nonspecific in some cases.

Videolaryngostroboscopy is an important diagnostic tool for evaluating the signs indicating paralysis of the external branch of the superior laryngeal nerve. The affected vocal fold seems to loosen and shorten and is at a lower level than the normal vocal fold. Laryngeal electromyography permits a definitive diagnosis of paralysis of the external branch of the superior laryngeal nerve. In laryngeal electromyography, fibrillation potentials and absence of recruitment point to paralysis of the external branch of the superior laryngeal nerve [32].

Because RLN and paralysis of the external branch of the superior laryngeal nerve were excluded in this study, we suppose that the subjective and objective voice alterations after robotic and open thyroidectomy were associated mainly with laryngotracheal fixation and impaired vertical movement of the larynx due to scar contracture or temporary malfunction of the strap muscles after surgery, as suggested previously [20].

It is known that the strap muscles have a positive relation to pitch elevation [33]. After thyroidectomy, the larynx and trachea adhere to the strap muscles because of scar formation, and the strap muscle and subcutaneous soft tissue also adhere to each other. Therefore, movement of the larynx and trachea during speech is impaired, resulting in defective pitch control during speech or singing.

Generally MPT and perturbations are affected by the status of the RLN. When the RLN is injured, the MPT decreases, and jitter and shimmer increase. In our study, MPT declined immediately after thyroidectomy in both groups. Because RLN paralysis was excluded, the reduction of MPT may have been related to the early postoperative neck pain or discomfort.

Less attention has been paid to functional swallowing outcomes after thyroidectomy [18, 19, 23, 27]. However, swallowing symptoms including painful swallowing, swallowing difficulty, and sensation of a lump or coughing during swallowing are common after thyroidectomy. These symptoms usually resolve after a short period but may persist longer as the voice changes in some circumstances [27].

In this study, subjective swallowing function did not differ between the two groups. However, there was a trend toward slightly better postoperative swallowing function in the robotic group, although the difference did not reach statistical significance. A previous study that evaluated voice and swallowing functions after robotic thyroidectomy using the gasless transaxillary approach also showed better subjective swallowing outcomes in the robotic group than in the open group [19]. Further studies with a larger number of cases are warranted to validate swallowing outcomes after robotic thyroidectomy.

Postthyroidectomy voice and swallowing alterations in the absence of laryngeal nerve palsy seem to be associated mainly with the routine healing process. Therefore, it is thought that less invasive techniques such as smaller incisions, limited dissection, and fewer traumas to the strap muscles could prevent this postthyroidectomy syndrome. In a study of voice and swallowing outcomes after minimally invasive video-assisted thyroidectomy (MIVAT), MIVAT caused fewer postthyroidectomy voice and swallowing alterations than open thyroidectomy. The authors suggested that reduced risk of injury to the perivisceral neural plexus, formed by small branches connecting the RLN and the external branch of the superior laryngeal nerve, due to the less extensive and finer dissection of the thyroid bed, could be responsible for the reduction in voice and swallowing changes after thyroidectomy [18].

In this study, the postoperatively subjective voice function and objective acoustic parameters were significantly better in the robotic group than in the open group. Based on these findings, it appears that robotic thyroidectomy using the GUAB approach may have a role in minimizing postthyroidectomy voice impairment, although the dissected area of skin flap is larger than in open thyroidectomy. Robotic thyroidectomy using the GUAB approach is characterized by the absence of cervical skin incision and limited dissection of the strap muscles without cutting. The absence of cervical skin incision, the limited dissection of the strap muscles, and the reduced adhesion between the strap muscles, subcutaneous tissues, and skin in robotic thyroidectomy using the GUAB approach may be the key factors responsible for the better voice outcomes in this study.

Several comparative studies have already shown that robotic thyroidectomy has significant advantages in terms of postoperative cosmesis and neck paresthesia, resulting in early surgical outcomes comparable with those after conventional open thyroidectomy, whereas surgical completeness and complication rates are similar [13, 5, 19]. Our study showed a further advantage in terms of better voice function after robotic thyroidectomy using the GUAB approach.

This study had some limitations. First, although videolaryngostroboscopic examination provided no evidence of paralysis of the external branch of the superior laryngeal nerve in any of the patients, we cannot exclude palsy of this branch because confirmatory laryngeal electromyographic examination was not performed. However, the results of this study can be trusted because paralysis of the external branch of the superior laryngeal nerve in most cases is shown by videolaryngostroboscopic examination or by clinical symptoms.

Second, age, gender, extent of surgery, smoking, alcohol, and laryngopharyngeal reflux can affect voice outcomes. The mean age differed in the two groups and may have influenced the results of the acoustic voice analysis. However, the effect of age on acoustic parameters was probably limited because we excluded very young and very old patients.

Third, we used relatively simple questionnaires that we developed to evaluate subjective voice and swallowing functions. These questionnaires may have limitations in that they were not validated with normal control groups, nor have they been validated in other studies. However, we think they are appropriate tools for assessing postthyroidectomy voice and swallowing alterations because they include questions about the typical symptoms of voice and swallowing after thyroidectomy. Moreover, their use by patients is easy and quick.

In conclusion, postoperative voice function is better with robotic thyroidectomy using the GUAB approach than with conventional open thyroidectomy. This is an additional advantage of robotic thyroidectomy using the GUAB approach in addition to the excellent cosmesis.