Introduction

Improvements in surgical and anesthetic techniques coupled with the advent of precise electrosurgical and monitoring devices have significantly revolutionized surgical practice in general [1, 2]. Thyroidectomy has particularly benefited immensely from these innovations with excellent outcomes reported globally. Over the years, outcomes such as mortality have significantly reduced to near negligible rates making thyroidectomy a relatively safe operation to perform. Most discussions around thyroidectomy, therefore, focus on its effects on quality of life issues such as recurrent laryngeal nerve injury and voice changes.

When it occurs, voice changes can be associated with significant social, emotional, and in some instances, economic challenges [3]. Beyond inadvertent injury to the nerves during operation, other factors such as intubation trauma and pressure injuries to the recurrent laryngeal nerves by the endotracheal tube have been described as possible mechanisms for developing postoperative voice disorders [4, 5].Traditionally, preoperative indirect laryngoscopy is often performed to document the preoperative state of the vocal cords as a marker of the integrity of the laryngeal nerves. This practice although recommended is not routinely practiced by all thyroid surgeons worldwide [6, 7]. Where these practices are not routine, the true incidence of recurrent laryngeal nerve injury cannot be determined, given the possibility of asymptomatic nerve injury. Similarly, routine voice assessment post-thyroidectomy is not often conducted except when there are major voice changes. It is therefore possible for some forms of vocal dysfunction following thyroidectomy to remain undiagnosed or unreported. These postoperative events are not frequently evaluated, particularly in non-specialized centers where the majority of thyroid operations are performed [8]. Their effect on quality of life has also not been comprehensively evaluated.

In Nigeria, where goiter is endemic and often ignored until they assume enormous sizes [9], it will be ideal to evaluate the pattern of morbidities associated with thyroid operations. This study was conducted to evaluate the pattern and determinants of post-thyroidectomy voice changes in a Nigerian cohort.

Patients and methods

Study design

This was a cohort study of patients undergoing thyroidectomy for benign goiters at a Nigerian tertiary health hospital (Obafemi Awolowo University Teaching Hospital, Ile-Ife) between January 2019 and December 2020.

Consecutive adults scheduled for thyroidectomies on account of benign goiters were included in the study. Patients undergoing revision thyroid surgery were excluded. All patients with postoperative diagnosis of incidental thyroid malignancy were retrospectively excluded from the study.

Preoperative assessment

Preoperatively, each participant completed the Vocal handicap index-10 (VHI-10) questionnaire which is a validated tool for voice assessment. Preoperative fiber-optic laryngoscopy was performed in all patients within 1 week of surgery.

Intra-operative assessment

Intra-operatively, the ease of endotracheal intubation was assessed using the intubation difficulty score. Other details such as duration of surgery (taken as time interval from the commencement of skin incision to the point at which the last skin suture is completed), duration of thyroidectomy, duration of anesthesia and type of thyroidectomy were recorded. Type of thyroidectomy was classified as partial (lobectomy + isthmusectomy), subtotal thyroidectomy (removal of the thyroid gland leaving about 4–5 g on each side), near total thyroidectomy (removal of all the thyroid gland leaving only a little amount of thyroid tissue, estimated at about 1 g) and total thyroidectomy (removal of all thyroid tissue).

Postoperative assessment

Repeat VHI-10 and laryngoscopy were done at 1 week, 1 month, and 3 months post-thyroidectomy. Patients who had persistent nerve injuries at 3 months had additional laryngoscopies up to one year.

Data analysis

All collected data were entered into a computer for analysis using SPSS 22.

Variables of interest included socio-demographic details, VHI scores, IDS, Endotracheal tube sizes and intra-operative events such as type of thyroidectomy, duration of anesthesia and duration of surgery.

A 6-point change in VHI-10 score from baseline was considered to be clinically significant [10]. Median VHI-10 score was calculated and the significance of any change between the preoperative and postoperative scores was determined using Wilcoxon signed test. The relationship between change in baseline VHI-10 score and anesthetic considerations such as intubation difficulty and size of endotracheal tube was determined by Chi square test, while the relationship between change in VHI-10 and duration of surgery and anesthesia were determined using the two-sample t test with equal variance and the level of significance set at p < 0.05. Multivariate analysis was carried out to determine the factors associated with abnormal voice changes. Selection of variables included in the multivariate model was based on result of the univariate analysis. Only factors which were significant in the univariate analysis were included in the multivariate analysis.

Ethical issues

Ethical approval was obtained from the institutional ethics committee before commencement of the study.

Results

Baseline characteristics

A total of 54 participants made up of six males and 48 females completed the study. The median duration of symptoms before presentation was 5 years. Subtotal thyroidectomy was the most common form of thyroidectomy (17, 31.5%), 14 patients (25.9%) had total thyroidectomy, 12 (22.2%) had near total while 11 patients (20.4%) had partial thyroidectomy. The majority (90.7%) had normal baseline endoscopic findings with five patients having vocal cord abnormalities, mainly benign vocal cord lesions (Table 1). The median baseline VHI-10 score for the entire cohort was 6.5. There was no significant difference in the baseline VHI-10 between participants with normal and abnormal preoperative laryngoscopic findings (p = 0.94) (Table 1). The median duration of surgery was 60 min, with about one-third of the patients having moderate to severe difficulty with intubation (Table 2).

Table 1 Participants demographic and baseline parameters
Table 2 Intra-operative details

One week post-surgery assessment

VHI-10 scores

Ten participants (18.5%) had clinically significant change in their VHI-10 values (≥ 6 points increase from baseline) at 1 week. Median postoperative VHI-10 at 1 week was 10 (IQR 12) which was statistically higher than the baseline score of 6.5, p = 0.01.

Laryngoscopy findings

Seven patients (12.9%) had endoscopic features of vocal cord paralysis. Six of these were unilateral while one was bilateral. Of these, four (57.1%) had significant change in their VHI scores while 3 (42.86%) did not.

Factors associated with abnormal voice changes at 1 week

Abnormal postoperative laryngoscopy at one week was significantly associated with change in VHI-10 score (p ≤ 0.01). Extent of thyroidectomy was also associated with clinically significant change in VHI-10 score with total thyroidectomy being more frequently associated with voice changes (p = 0.03) (Table 3). Neither duration of surgery (p = 0.99), anesthesia (p = 0.98), size of endotracheal tube (p = 0.71) nor intubation difficulty showed any significant association with voice changes observed at one week post-surgery.

Table 3 Relationship between type of thyroidectomy and change in VHI-10 at 1 week post-thyroidectomy

One month post-surgery assessment

VHI-10 scores

At one month post-surgery, two of the ten patients with significant VHI-10 change had reverted to baseline while eight still had significant changes from their baseline VHI-10 values.

Laryngoscopy findings

One patient (1.85%) was noticed to have a large vocal cord mass at laryngoscopy (Fig. 1). The patient had clinically significant change from baseline VHI-10 score. The VHI-10 score in this patient reverted to normal after excision of the mass. Histology of the mass confirmed the suspicion of a vocal cord granuloma.

Fig. 1
figure 1

Intubation granuloma detected 1 month post-thyroidectomy

Three months post-surgery assessment

VHI-10 scores

An additional three patients had reverted to baseline VHI-10 score leaving five patients (9.3%) with persistent change in VHI-10 score at 3 months. Overall median VHI-10 score at 3 months was not significantly different from baseline [8 (IQR-10) vs 6.5(10)].

Laryngoscopy findings

Endoscopy at 3 months revealed the persistence of preoperative vocal cord lesions in the five patients with abnormal vocal cord findings at baseline. Forty-four patients (81.5%) had normal laryngoscopy findings while two of the seven patients with vocal cord paralysis at 1 week had reverted to normal. Five patients (9.3%) however had vocal cord paralysis persisting till the third month. Of the five, four were unilateral, of which three were asymptomatic while one had hoarseness of voice. One patient had bilateral vocal cord paralysis (Table 4). At all time points, abnormal endoscopy finding was associated with significant change in VHI-10.

Table 4 Laryngoscopy finding at 3 months

Endoscopic evaluation of patients with persistent vocal cord paralysis

Further evaluation of patients with vocal cord paralysis at about one year post-surgery showed that two of the five patients with abnormal vocal cord findings had reverted to normal, leaving three with endoscopic features of vocal cord paralysis (5.6%). Of the three, two were asymptomatic while one patient (1.8%) had symptomatic vocal cord paralysis.

Predictors of abnormal VHI-10 score

On univariate analysis, abnormal endoscopic findings at 1 week, 1 month and 3 months and the type of thyroidectomy were significantly associated with early postoperative voice changes. On multivariate analysis, abnormal endoscopic finding at 1 week was the only predictor of early postoperative voice change (Table 5). Persistent voice change at 3 months was significantly associated with abnormal endoscopic findings at 1 week, 1 month and 3 months. None of these was however significant on multivariate analysis (Table 6).

Table 5 Predictors of abnormal of abnormal voice handicap index score at 1 week
Table 6 Predictors of abnormal of abnormal voice handicap index score at 3 months

Discussion

This study established the presence of early postoperative voice changes in about one-fifth of patients with a steady decline over time. At three months, half of those with voice changes had reverted to normal highlighting the transient nature of some of the voice abnormalities. The extent of thyroidectomy and the presence of abnormal postoperative endoscopic findings were found to be associated with voice changes but only the former correctly predicted the occurrence of voice changes on multivariate analysis.

Voice assessment post-thyroidectomy has been evaluated in various settings using different tools. This accounts for the wide range of outcomes reported across various studies [11, 12]. Our study utilized the VHI-10 tool and reported prevalence of 9.3% at 3 months. Kovatch et.al in a review of 2325 patients using the same tool reported an incidence of 12.7% at 3 months [13] while Borel et.al in another study of 203 patients reported an incidence of 5.7% [14]. The assessment of voice changes post-thyroidectomy using this tool is novel in Nigeria, and therefore serves as a reference for further studies in the locality.

A spectrum of post-thyroidectomy voice changes ranging from vague symptoms to weakened vocal strength to severe hoarseness has been described [5, 15]. The decline in the prevalence of voice changes with gradual restoration of voice quality over time has also been reported by other studies. Among those with persistent voice changes, however, it is necessary to carefully seek for possible reasons as this may have resulted from an underlying nerve injury. The possibility of this resulting from other causes should also be borne in mind. In this study, a vocal cord granuloma was observed in one of the patients in this study, with restoration of voice quality after excision. A similar finding has been reported by a few studies [16, 17]

This study showed that clinically detectable voice changes were significantly associated with surgical complexity. Understandably, patients undergoing more extensive thyroid surgeries involving bilateral dissections have a higher risk of voice changes. Such operations involve a greater amount of dissection with more postoperative edema and subsequent fibrosis which may potentially change the aerodynamic quality of the larynx resulting in voice changes. Furthermore, such extensive dissections also put the laryngeal nerves at risk on both sides with higher chances of injuries [18]. Meticulous dissection and careful identification of the laryngeal nerve at surgery help to reduce the rate of nerve damage during thyroidectomy [19,20,21]. In our practice, the safety and speed of thyroidectomy has continued to improve with the advent of electrosurgical devices such as the Ligasure© vessel sealing device [2, 22, 23]. With the use of this device, complex dissections can be rapidly accomplished in a relatively bloodless field.

Of the factors associated with early voice changes, abnormal postoperative endoscopy was the only significant predictor identified on multivariate analysis. This pattern indicates the significant contribution of vocal cord paralysis to the occurrence of voice changes. This association was no longer seen at 3 months, apparently due to the recovery of some of the early postoperative vocal cord palsies. It should be noted however that voice changes may sometimes occur in the absence of nerve injury [24, 25].

This study found no association between abnormal preoperative laryngoscopy and postoperative voice changes. Although about 9.3% of patients had various benign vocal cord lesions, this did not reflect in the VHI-10 values preoperatively or after surgery. It is noteworthy; however, that the VHI-10 being a self-reporting tool may be insensitive in detecting subtle long-standing voice changes that an individual might have adjusted to. Consequently, patients with subtle long-standing voice changes may return normal scores.

Although this study focused primarily on voice changes, its interrelatedness with nerve injury and the clinical significance of nerve paralysis makes it an important subject. Serial endoscopic assessment of the vocal cords as performed in this study is quite novel for a Nigerian series. The proportion of patients who were asymptomatic despite having endoscopic evidence of nerve injury highlights the importance of objective vocal cord assessment in order to avoid under-reporting. Earlier studies in Nigeria on outcomes of thyroidectomy have reported varying rates of recurrent laryngeal nerve injury, ranging from 2.8 to 9.7% [26, 27]. The majority of these are however retrospective in design, with limited description of the methodology used in determining the occurrence of nerve injury. None of these also reported performing routine endoscopic assessment of the vocal cords postoperatively. These differences in study design limit the comparability of our findings with local data. Relative to high income countries, however, the nerve injury rate reported in this study is higher than the 0.3–3% reported in some series [28]. Although the majority were temporary, the proportion of patients with persistent nerve injury at one year (5.5%) exceeds the threshold. While the data in this study are not robust enough to adequately determine the predictors of nerve injury, factors such as operative volume, gland size and the practice of routine nerve identification are important factors to consider. In this study, of the three patients with persistent nerve injury at one year, two were operated by less experienced surgeons. In our setting, goiters tend to assume enormous sizes before patients present to the hospital for treatment. In an earlier series from our institution, the mean thyroid weight was 120 g which exceeds reports from high income countries [2, 29]. Furthermore, routine identification of the recurrent laryngeal nerve, which is an important step in the operation was not standardized in this study. All these factors have been associated with nerve injury and their contribution to nerve injury in our practice needs to be evaluated [30,31,32]. Building on our findings, another study which is evaluating the predictors of recurrent nerve injury, incorporating these variables is currently underway.

Findings from this study must be interpreted within its limits. The use of the VHI-10, though validated is based on patients’ self-report which is subjective. Other variables such as the size of the gland and degree of fibrosis and lifestyle behaviors such as smoking which may affect outcomes were not included in the analysis. Follow-up of patients with voice changes in this study was limited to three months. A longer follow-up period could have helped determine the long-term voice profile of patients in this study. This study only documented the occurrence of voice changes but did not evaluate its impact on everyday life. This would have been of greater clinical significance. Multivariate analysis in this study was carried out on a limited sample, a more robust analysis with a larger sample size will certainly shed more light on the subject.

Within these limits however, we conclude that post-thyroidectomy voice changes are commoner in the early postoperative period with a steady decline over time. Abnormal endoscopic finding is a predictor of its occurrence.