Introduction

Since the first report of endoscopic parathyroidectomy in 1996 [1], countless endoscopic and minimally invasive procedures have been developed with the aim of obtaining the safest approach with the best cosmetic result and postoperative outcome after thyroidectomy.

Aiming to shift the scar from the neck, several endoscopic approaches, including transaxillary endoscopic thyroidectomy [2, 3], the breast approach [4], the anterior chest wall approach using a flap-lifting system [5], the axillobilateral-breast approach [6], the bilateral axillo-breast approach [7], and transareola single-site endoscopic thyroidectomy [8] have been proposed, most often by surgeons in Far Eastern countries. Several trials have shown that most of these techniques achieved the goal of shifting the operative scar better than traditional cervicotomy [912] for both benign and malignant disease. However, these endoscopic approaches have also proved to be technically challenging because of two-dimensional vision and inadequate instrumentation; above all, they cannot be considered minimally invasive [3, 13, 14].

As a consequence, minimally invasive video-assisted thyroidectomy (MIVAT), proposed in 1998 [15], with its central cervical access, gained larger popularity in Europe and became the most widespread minimally invasive endoscopic technique for thyroidectomy in North America [16]. Several studies reported MIVAT as safe, easily reproducible, and achieving optimal results compared with traditional cervicotomy in cosmetic outcome, postoperative pain, and postoperative course, not only for benign disease but also for low-risk and intermediate-risk papillary carcinoma [1725].

In recent years, the da Vinci S surgical robotic system was developed to improve the weak points of endoscopic surgery, and recent studies have reported that robot-assisted transaxillary thyroidectomy (RATT), introduced by Chung in South Korea in 2009 [26, 27], is feasible, safe, and has excellent cosmetic results. The robotic approach easily surpasses the endoscopic techniques performed with conventional laparoscopic instrumentation through the same access because it permits magnified and enhanced three-dimensional vision, provides a fourth arm that can be used to retract the thyroid lobe, and allows extended freedom of motion of the robotic instruments as well as comfortable surgeon ergonomics. The first aim of this technique is an optimal cosmetic result, and indeed, RATT represents a “scarless in the neck” operation. In fact, several studies that have compared RATT with axillary endoscopic thyroidectomies have reported that the robotic approach provides better cosmetic results and superior patient satisfaction [2832]. More recently, an alternative “scarless in the neck” approach that takes advantage of the robotic system is the “robotic facelift thyroidectomy” proposed by Terris and Singer in 2012 and performed through a postauricular and occipital hairline (facelift) incision [33].

The enthusiasm generated by the introduction of these new robot-assisted techniques is seriously limited by their cost, which is much higher than those of the traditional approach and video-assisted thyroidectomy. Broome et al report a 217 % increased cost of robotic thyroidectomy compared with traditional thyroidectomy, and this greater expense could well be prohibitive in countries with a flat reimbursement schedule [3436].

In the context of this wide and varied panorama of different approaches designed to reach the best cosmetic result, to the best of our knowledge, there are no published studies comparing RATT and MIVAT. We therefore initiated the present study to prospectively compare two groups of thyroid surgery patients randomly allotted to MIVAT or RATT.

Patients and methods

Study design

This prospective, double-arm study recruited 62 patients undergoing hemithyroidectomy between January 2012 and September 2013 at the Department of Surgery, University of Pisa. Eligibility criteria were benign disease, maximum nodule diameter up to 4 cm, and total thyroid volume of less than 30 mL. Excluded were patients with symptoms attributable to the thyroid nodule (dysphagia, dyspnea, dysphonia), so as not to affect the validity of the SF-36 questionnaire findings. In addition, patients with malignant disease, intrathoracic goiter, previous neck operation or neck irradiation, and those who required central or lateral compartment lymphadenectomy, permanent medication for pain, and anticoagulation medication were excluded. All patients provided an informed consent for the study.

Among 376 patients referred to our department during the study period with uninodular benign thyroid benign disease who were scheduled for surgery, 62 matched the inclusion criteria and were randomly allotted to undergo hemithyroidectomy by the MIVAT technique (group A) or hemithyroidectomy by the RATT procedure (group B). These patients, who were previously informed of the randomization to one of these two techniques, were assigned the day before surgery by draw, with a sealed envelope containing “MIVAT” or “RATT.”

Study end points were mainly two specific outcomes of the procedures: cosmetic result and overall patient satisfaction. Also, operative time and length of hospital stay were considered, both involving important differences in costs of the two different procedures.

Complications were also assessed, even though a statistical comparison was not possible because of the very limited incidence of complications in patients undergoing thyroid surgery.

Surgical treatment

MIVAT

MIVAT was performed with the patient supine, without neck extension, through a central, transverse 1.5 cm incision placed two fingers above the sternal notch. Operative space was maintained by external use of small retractors. A 30° 5 mm endoscope was used during the endoscopic step. Under endoscopic vision, the upper pedicle was dissected and divided with a Harmonic Scalpel. The recurrent nerve and parathyroid glands were identified and isolated. After extraction of the lobe, hemithyroidectomy was achieved. Hemostasis was obtained by Harmonic Scalpel and titanium clips (5 mm). Subcuticular stitches and glue were used to close the wound. Drains were never used.

RATT

The da Vinci S system (Intuitive Surgical Inc.) was used in all procedures. The patient was placed supine with one arm extended over the shoulder. The working space was created using the standard technique through a 5–7 cm skin incision along the anterior border of the pectoralis major muscle. A Chung’s retractor was used to maintain the operative space. Only three robotic arms were used in all operations. During the docking of the robot, the dual-channel endoscope was positioned on the central robotic arm, the Harmonic curved shears were positioned on the upper side arm, and the Maryland dissector (Intuitive Surgical) was positioned on the lower side arm.

The operation started with dissection of the upper pedicle. Under endoscopic view, the upper pole was dragged downward with the Maryland dissector. The upper thyroid vessels were exposed then sealed and interrupted with the Harmonic shears, very close to the thyroid capsule to avoid any damage to the external branch of the upper laryngeal nerve. The Maryland dissector was then used to retract the thyroid lobe medially, and the surgeon started the identification of the recurrent laryngeal nerve, dissecting the tracheoesophageal groove with the Harmonic shears, both bluntly and by cutting. In the same manner, the parathyroid glands were identified and preserved. Once the critical structures were identified, the thyroid lobe was dissected with the Harmonic scalpel and the specimen was removed. A drain was left in the operative space, and the skin was closed with intradermal reabsorbable suture.

Assessment of surgical complications

Direct fiberoptic laryngoscopy was performed in all patients immediately before and 3 months after thyroidectomy to assess vocal cord motility. Recurrent laryngeal nerve injury was considered permanent if persistent 6 months after surgery. Hypoparathyroidism assessment was not considered necessary because all procedures were simple hemithyroidectomies.

Analysis of epidemiological data, follow-up, and cosmetic and satisfaction outcome assessment

Data were collected on age, sex, body mass index (BMI), thyroid volume, nodule diameter, skin-to-skin operative time, total time from intubation to extubation, final histology report, complications, and length of postoperative hospital stay. The cosmetic result was evaluated after 2 months by the by the Patient Scar Assessment Questionnaire (PSAQ). Overall satisfaction was evaluated after 1 month by the SF-36 questionnaire. The completed questionnaires were retrieved by means of a telephone interview by a researcher directly involved in the study but not in the surgery (A.B.).

PSAQ

The PSAQ was used with the aim of providing a valid measure of each patient’s perception of scarring. This scale, designed specifically for the assessment of linear scars, is composed of four subscales: scar appearance, consciousness, satisfaction with scar appearance, and satisfaction with scar symptoms. The symptoms subscale was omitted because of its poor reliability in this kind of scar. For each subscale, a higher score reflects a poorer outcome. The reliability of this scale has been demonstrated in several studies, in particular, to evaluate scars after thyroid and parathyroid surgery [36, 37].

SF-36 2 health survey (version 2.0)

SF-36 2 questionnaire was used to evaluate the burden of surgery. The SF-36 is a multipurpose, short-form health survey that has proven to be useful in the assessment of a patient’s quality of life (QoL). It is composed of eight subscales, each highlighting a different aspect of a patient’s QoL: physical functioning, role-physical, bodily pain, general health, vitality, social functioning, role-emotional, and mental health. For this tool, a lower score indicates a poorer QoL [38].

Statistical analysis

We used the Kolmogorov–Smirnov test to check normality of data distribution. The introductory phase was concluded with a statistical power analysis (ex post) to estimate the sample size required for the specific tests. The 1 − β value of the significant variables was >0.8, assuring a low risk of type II error and an appropriate sample size. We used the Mann-Whitney test and the t test to assess continuous variables and the χ 2 test for categorical variables. The analysis was performed with SPSS 17.0.1 software (SPSS Inc, Chicago, IL).

Results

A total of 62 patients matched the inclusion criteria previously mentioned and underwent hemithyroidectomy. Of those 62 patients, 30 were allotted to group A (MIVAT) and 32 to group B (RATT).

The remaining 314 patients with uninodular benign disease scheduled for surgery in the same period were excluded from the study and underwent hemythyroidectomy by a traditional approach (standard cervicotomy) for the following reasons: the diameter of the thyroid nodule was larger than 4 cm in 279 cases, compressive symptoms attributable to the thyroid nodule were present in 21 cases, patients refused to enter the study in 14 cases.

The groups were well matched for age, sex ratio, nodule diameter, and BMI (Table 1). There were 29 women and 1 man in group A and 31 women and 1 man in group B. Patients were a mean age of 36.9 ± 9.6 years (range 21–62 years) in group A and 32.5 ± 11.25 years (range 18–73 years) group B. There were 16 right lobectomies and 14 left lobectomies in group A, and 15 right lobectomies and 17 left lobectomies in group B. No conversions to open cervicotomy were required.

Table 1 Demographics of patients treated by minimally invasive thyroidectomy (MIVAT; group A) and robot-assisted transaxillary thyroidectomy (RATT; group B)

The preoperative diagnosis was 21 nodular goiters and 9 microfollicular nodules in group A and 19 nodular goiters, 3 toxic adenomas, and 10 microfollicular nodules in group B. The final histology report showed benign disease in all patients except two microfoci of papillary carcinoma (0.3 and 0.5 mm) in two patients in group A and three microfoci of papillary carcinoma (0.1, 0.2, and 0.6 mm) in group B (Table 1).

Operative time (open to close time in the MIVAT group and open to close time [without docking time] in the RATT group) was significantly shorter in the MIVAT group (group A), at 46.5 ± 10.5 min (range 30–70 min) than in the RATT group (group B), at and 85.25 ± 48.76 min (range 19–220 min). Total surgical time (intubation–extubation time, including draping of the operative space and docking time in group B) was significantly shorter in group A (71.6 ± 13.3; range 50–95 min) than in group B (121.5 ± 46.8; range 74–325 min; p < 0.0001) (Table 1).

Complications consisted of one transient recurrent laryngeal nerve injury in each group and one subcutaneous hematoma over the major pectoralis fascia in group B. Hematoma was diagnosed by ultrasound on the first postoperative day and conservatively treated, both with ultrasound-guided evacuation of the blood and with antibiotics administration. There were no long-term complications, as the recurrent nerve injury in both patients healed completely in two months.

Postoperative stay was 1.85 days in group B, which was significantly longer than the 1.15 days in group A (p < 0.0001). This extension of the hospital stay was due to the necessity of keeping the drain (which is not used in group A patients) for longer than one day in most of the group B patients.

Assessment of cosmetic result and overall satisfaction

On the PASQ, the appearance and satisfaction with appearance scores were both significantly higher in group B than in group A, at 16.93 ± 3.78 versus 13.62 ± 2.94 (p < 0.0001) and 15.71 ± 4.60 versus 12.03 ± 3.65 (p < 0.018), and the remaining variables (consciousness and satisfaction with symptoms) were comparable. These results mean that patients who underwent MIVAT (group A) were more satisfied with their scar than those who underwent RATT (group B).

Scores for the SF-36 domains of social activity and general health were significantly higher in group B than in group A [90.01 ± 23.41 vs 74.03 ± 27.34 (p < 0.006) and 90.70 ± 12.10 vs 78.51 ± 17.71 (p < 0.0001)], showing that patients undergoing RATT have a better perception of their general state of health and are more likely to participate in social activities than patients undergoing MIVAT. Scores for bodily pain were significantly higher in group A than in group B (96.82 ± 7.99 vs 81.67 ± 21.43; p < 0.0005), indicating that patients undergoing MIVAT experienced less tenderness than patients undergoing RATT. Scores for the remaining domains (physical functioning, role-physical, vitality, role-emotional, and mental health) were comparable in the two groups. Results are summarized in Table 2.

Table 2 Patient Scar Assessment Questionnaire (PSAQ) and Short Form (SF-36) 36-Item Health Survey Questionnaire scores

Discussion

To our knowledge, this is the first prospective randomized study comparing cosmetic and overall satisfaction after MIVAT and RATT for the treatment of benign disease. Concerning this primary end point of the study, RATT seems not to supersede MIVAT when comparing two groups of patients undergoing thyroidectomy for benign disease, although nodule diameter was significantly larger in the RATT group.

In particular, the PSAQ questionnaire, which specifically evaluates the scar, highlighted that the axillary scar, even if hidden in a remote site, is less appreciated than the scar produced by MIVAT, although the latter is more evident. The appearance and satisfaction with scar appearance scores (Table 2) significantly favored MIVAT. This might be due mainly to the length of the scar after RATT, which is necessarily longer, and less obviously because the axillary skin incision, which is placed in the border of the pectoralis major muscle and does not follow the natural skin folds, might result in poorer healing. Although not all agree about the superiority of small incisions in the neck [37], it is worthwhile noting that other studies [39] reaffirm that the cosmetic result after MIVAT, evaluated with a mean follow-up of almost 2 years with two scales: observer scar assessment score and patient scar assessment score, was classified as excellent, with 1.3 points on a 4-point scale.

For the SF-36 questionnaire, we found that patients undergoing RATT had a better perception of their general state of health and were more likely to participate in social activities after surgery, but they also scored higher on the bodily pain domain at prolonged follow-up.

Since its introduction, the RATT procedure has yielded in published studies better patient outcomes than open thyroidectomy, including lower postoperative pain and increased cosmetic satisfaction [31, 32]. If the increased cosmetic satisfaction provided by RATT and reported by these published series seems reasonable when comparing RATT with traditional cervicotomy because of the hidden location of the skin incision, the lower postoperative pain after RATT is difficult to understand. The difficulty comes from comparing patients undergoing a “maximally invasive” operation (dissection of a large flap over the major pectoralis fascia from the axilla to the neck) and patients undergoing cervicotomy, where, most of the time, in expert hands, an incision not larger than 6–8 cm can allow a perfect thyroidectomy but where also a subcutaneous flap is necessary as is hyperextension of the neck, which might be responsible for a large part of the postoperative distress after conventional surgery. These two technical features are overcome in MIVAT, which is performed through a 1.5 cm incision with the patient in the supine position and does not require dissection of a subcutaneous flap. For this reason in several articles [2124, 3840] MIVAT proved to allow a superior cosmetic result and a better cosmetic outcome in patients with small thyroid nodules. The drawback of this operation though is that, like conventional thyroidectomy, it is performed on a very visible part of the body.

For these reasons, it seemed reasonable to try a comparison between two endoscopic procedures, both of them considered cosmetic, rather than comparing either of them with traditional thyroidectomy.

In spite of the significant limits imposed by the sharp cost increase associated with RATT, it has an advantage over MIVAT in that it allows treatment of larger lesions (up to 6 cm in size) and therefore allows recruiting larger numbers of patients. In fact, one of the major limits of MIVAT is the strict limitation imposed by the maximum thyroid volume (30 mL) and nodule diameter (3 cm), which narrows the patients who can benefit from the MIVAT procedure to 10–15 % of patients being considered for thyroid surgery [17]. Conversely, because the dimensional limit for RATT is 5–6 cm, a larger number of patients will benefit from a cosmetic operation while complaining of larger goiters [26]. This point must not be overlooked as a consistent advantage of the robotic technique compared with MIVAT, especially in Europe, where goiter is endemic and mean thyroid volumes are higher than in the Far East.

In the present series there were no differences with regard to complications, but a statistical comparison was not possible because of the small incidence of complications after thyroid surgery no matter which procedure is employed. Both MIVAT and RATT benefit from the magnification provided by the endoscope (and a 3D image for the robotic approach). Several reports have shown a complication rate similar to traditional thyroidectomy for endoscopic operations [4144]. Hypoparathyroidism was not evaluated in the present series because all patients underwent hemithyroidectomy, which almost always excludes the risk that this important complication will occur.

When analyzing the length of the surgical procedure, we found (Table 1) that the two groups differed significantly. Total time was significantly longer in the RATT group (121.5 ± 46.8 min; range 74–325 min) than in the MIVAT group (71.6 ± 13.3 min; range 50–95 min; p < 0.0001). That must be considered an important disadvantage of RATT; if we consider both the longer operative time and the longer length of hospital stay it must be said that this cosmetic procedure implies a significant increase in costs that is not paralleled by an increased satisfaction of the patients. In fact, in our series, the bodily pain score on the SF-36 questionnaire was higher in group B than in group A (Table 1). The problem of a correct allocation of resources, of course, exceeds the aims of the present study. Furthermore, a cost analysis would not have been performed because of the difficulty in comparing costs in different countries and under different health systems, but other researchers have already stressed that the simple traslocation of a scar from the neck to the axilla probably does not justify the added expense [45].

In conclusion, the results that we found most surprising were those reflecting patient satisfaction with scar appearance as assessed by the PSAQ “appearance” and “satisfaction with scar appearance” subscales (Table 2): this demonstrates that relocating the wound from the neck to a less visible area, such as the axilla, is not enough to meet the full expectations of the patients, probably because the axilla scar is still considered to be too long and its appearance is less likely to improve over time because it cannot be placed in a skin crease. Our results appear to be in agreement with the words of Duh, “For now… neck incisions… are here to stay” [46].