Introduction

Hypothyroidism is a common endocrine disorder resulting from low thyroid hormone levels. Untreated hypothyroidism leads to low quality of life (QoL) due to symptoms and can be fatal in severe cases [1]. The most common symptoms of hypothyroidism include cold intolerance, dry skin, hair loss, constipation, fatigue, sleep disturbance, weight gain, irregular menses, and cognitive impairment such as memory loss. This clinical presentation can vary with age, sex, and time between onset and diagnosis [2].

The standard treatment for patients with hypothyroidism is thyroid hormone replacement therapy with levothyroxine (LT4) [3]. LT4 monotherapy is effective at restoring thyroid levels and improving hypothyroid symptoms in most patients. However, 10–15% of patients with hypothyroidism treated with LT4 complain of persistent hypothyroid symptoms even when their thyroid hormone levels are normalized [4]. While the etiology of these symptoms is under investigation, their presence affects patients’ QoL, satisfaction with care, and patient-clinician relationship [5]. Patients affected by persistent symptoms are often labeled as having “unexplained” medical symptoms and struggle to find care responsive to their needs. In addition, clinicians caring for patients with persistent symptoms face the challenge of not having evidence-based recommendations to guide their next steps [6].

There is a need to systematically examine and appraise the evidence on interventions for patients with hypothyroidism and persistent symptoms to guide decision-making and future research efforts. The purpose of this systematic review was to summarize the available evidence on pharmacologic and non-pharmacologic interventions to improve symptomatology in patients with hypothyroidism who experience persistent symptoms despite achieving biochemical euthyroidism on thyroid hormone replacement.

Materials and methods

A protocol was developed to perform this study. This manuscript is reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines [7] (Figure 1).

Fig. 1
figure 1

Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow diagram of the studies selection process

Eligibility criteria

We included randomized controlled trials (RCTs) and comparative observational studies that assessed pharmacological and non-pharmacological interventions for hypothyroid adults with persistent symptoms despite biochemical euthyroidism. To be considered for inclusion, studies must have reported the effect of the intervention on QoL or hypothyroid symptoms. There were no restrictions on language. We excluded studies that detailed data on pregnancy or planned pregnancy. Animal studies, case reports, case series, reviews, and letters were also excluded.

Persistent symptoms despite biochemical euthyroidism were defined as patients experiencing fatigue, memory loss, concentration disturbance, slow thinking, dizziness, depression, weight gain, cold intolerance, skin dryness, and constipation after failing to respond to the first treatment line. The thyroid function test had to be within normal limits, with a TSH range of 0.2–4.5 mIU/L.

Data sources and searches

A comprehensive search of several databases from January 1st, 2000, to March 11th, 2022 was conducted. The databases included Ovid MEDLINE(R) and Epub Ahead of Print, In-Process & Other Non-Indexed Citations, and Daily, Ovid EMBASE, Ovid Cochrane Central Register of Controlled Trials, Ovid Cochrane Database of Systematic Reviews, and Scopus. The search strategy was conducted by an experienced librarian with input from the study’s principal investigator. Controlled vocabulary supplemented with keywords was used to search for the treatment of hypothyroidism with persistent hypothyroid symptoms in adult patients. References of selected studies were searched to identify additional publications. Furthermore, articles suggested by our expert investigators were reviewed to consider for inclusion. The full search strategy is described in Supplementary Material.

Study selection

The references from the search strategy process were uploaded into the systematic review software program Rayyan. Reviewers working independently in duplicate screened titles, abstracts, and full text articles for eligibility using standardized criteria. Before the title and abstract screening phase, a pilot test was conducted with 20 articles to ensure adequate inter-rater agreement; a Kappa statistic of >0.70 was set as an appropriate inter-rater agreement. Studies included by at least one reviewer at the title and abstract phase were considered for full-text screening. Before starting the full-text screening, a second pilot test was conducted with 10 articles between the reviewers. Any disagreements between reviewers were resolved by consensus.

Data collection and management

For all included articles, one reviewer extracted the study data into a spreadsheet in excel and a second reviewer checked the data for accuracy and completeness. The variables extracted were: (i) study characteristics (author name, publication year, country, period when the study was conducted, study design); (ii) patient characteristics (sample size, age, sex, body mass index [BMI], TSH, free T4, free T3, cause of hypothyroidism, follow-up duration, high-density lipoprotein, triglycerides, waist circumference, systolic blood pressure and fasting blood glucose; (iii) pharmacological interventions (LT4, combination therapy with LT4 and liothyronine [LT3]); (iv) non-pharmacological interventions (surgery, exercise, diet, natural products); (v) primary outcomes (impact on hypothyroid symptoms, QoL); (vi) secondary outcomes (adverse events); and (vii) risk of bias of included studies.

Statistical analysis

A narrative synthesis of the included studies was conducted due to inconsistencies in intervention, comparators, and outcome assessment. The statistical program R Studio for R software was employed to perform the analysis (Supplemental Table S1).

Risk of bias in individual studies

Study quality was assessed by two independent reviewers using the Cochrane risk of bias tools for parallel and cross-over RCTs [8], and the nine-star Newcastle Ottawa Scale (NOS) tool for cohort studies, according to the study design [9]. A third reviewer resolved any disagreements.

Results

Study characteristics and risk of bias

A total of 277 records were identified through our electronic database search and 7 studies fulfilled the inclusion criteria (Figure 1) [10,11,12,13,14,15,16]. Five were RCTs [10,11,12, 15, 16] and two were comparative observational studies [13, 14]. One study was only available as a conference abstract [16]. Two studies assessed the natural health products ginger and L-carnitine [10, 11], two studies evaluated total thyroidectomies [12, 13], and three studies assessed combination therapy with LT3 and LT4 [14,15,16]. Supplemental Table S2 summarizes additional details of these interventions. The studies originated predominantly from the United States (28.5%) (Table 1), and the overall risk of bias was judged to be high in all studies due to selection bias, lack of blinding assessment, and ascertainment of outcomes (Supplemental Table S3).

Table 1 Characteristics of included studies

Overall characteristics of participants according to treatment group

A total of 455 participants were included across all studies. In the intervention group, there were 166 (78.6%) females with an overall mean age of 47.5 (±2.8) years. The overall mean baseline BMI of participants in the intervention group was 26.2 kg/m2 (±2.4), waist circumference 96.3 cm (±10/51) and the overall mean initial TSH, FT4, and FT3 were 1.6 mIU/L (±0.9), 1.5 mIU/L (±0.6), and 4.3 mIU/L (±1.4), respectively. Follow-up duration was between 1 month and 18 months. The most common causes of hypothyroidism were autoimmune disease (n = 141; 67.1%) and post-thyroidectomy (n = 37; 17.6%). The characteristics of the comparator group were similar and are shown in Tables 2 and 3. Symptom and QoL ascertainment were done using validated and internally generated scales (Supplemental Digital Content Table S1).

Table 2 Patients’ baseline characteristics
Table 3 Patients’ metabolic characteristics

Impact of interventions on hypothyroid symptoms and QoL

Supplements

Ashraf et al. evaluated the effect of LT4 + ginger supplementation (500 mg twice daily) vs. LT4 + placebo on hypothyroid symptoms as assessed by the Thyroid Symptom Rating Questionnaire (ThySRQ scale) [10]. After 30 days, ginger supplementation resulted in a clinically significant improvement of tiredness, weight gain, cold intolerance, constipation, dry skin, appetite, memory loss, concentration, and feeling “giddy or dizzy.” No significant improvements were observed in hair loss, nail fragility, hearing, hoarseness, speech, depression, or feeling down (Table 4). Similarly, Hyun An et al. studied the impact of LT4 + L-carnitine vs. LT4 + placebo on fatigue [11]. L-carnitine, administered at 990 mg twice daily, did not significantly reduce disabling or physical fatigue at 12 weeks. However, the group treated with L-carnitine showed significant improvement in “mental fatigue” (problems concentrating, thinking clearly, finding the correct words, and memory). In a subgroup analysis, both physical and mental fatigue significantly improved in patients younger than 50 years. The mental score was also more markedly improved among patients with postsurgical hypothyroidism compared to patients with other causes of hypothyroidism [11].

Table 4 Outcome before and after intervention in patients with hypothyroidism and persistent symptoms a) symptomatology, b) quality of life

Combined LT4/LT3 therapy

Rodriguez et al. showed that combination therapy with LT4 and LT3 daily at a ratio of 5:1 (LT4:LT3) for 6 weeks did not lead to significant differences in fatigue (assessed by the Piper Fatigue Scale), depression (assessed by the Beck Depression Inventory-II), feeling sad or nervous, forgetfulness, slow thinking and attention problems (assessed by visual analog scaling) when compared to LT4 monotherapy [15]. On the other hand, Michaelsson et al. showed that combined LT4 and LT3 (given twice daily) at a ratio of 17:1 did not change tiredness, weight loss, sensitivity to cold, dry/itchy skin (assessed by ThyPRO-39) at 3 and 12 months when compared with data from a reference population [14]. However, they showed that overall ThyPRO-39 scores (inversely proportional to the QoL) significantly decreased on combined LT4/LT3 therapy despite stable TSH. This improvement in QoL occurred across several domains, including emotional susceptibility, impaired social and daily life, and cognitive complaints. Another study conducted by Fadeyev et al. using the 36-Item Short Form Survey (SF-36) showed improved QoL in patients taking LT4 and LT3, especially in social functioning and mental health [16]. Also, in the combination therapy group, there was a significant reduction in the severity of depression using the Hospital Anxiety and Depression Scale (HADS) [16].

Surgery

Guldvog et al. randomized patients with serum antithyroid peroxidase (TPO Ab) titers greater than 1000 IU/ml to total thyroidectomy or medical management with thyroid hormone replacement to maintain euthyroidism [12]. They demonstrated that chronic fatigue scores, as measured by a standardized fatigue scale, decreased at 6, 12, and 18 months in the surgical intervention group. Furthermore, general health scores on the SF-36 were higher at 18 months in the surgical group compared to the medical therapy group across most domains, including physical functioning, emotional score, mental health, and vitality. In contrast, an observational study by Thatipamala et al. showed that patients with Hashimoto’s thyroiditis (with an average preoperative TPO Ab of 260 IU/ml) who underwent surgery had similar total SF-36 scores compared with normative data of age-matched healthy controls after a mean follow-up of 18 months [13].

Adverse events

Overall, 31 (12%) adverse events were observed in the intervention group, while 18 (10.8%) events were reported in the control group. Studies assessing thyroidectomy as the intervention had the highest rate of adverse events (e.g., transient, or permanent hypocalcemia, recurrent laryngeal nerve palsy) [12, 13]. A study assessing L-carnitine + LT4 as the intervention had nausea as the highest adverse event [11]. The other interventions resulted in fewer adverse events (Table 5).

Table 5 Intervention and comparator complications

Discussion

Around 10-15% of patients with hypothyroidism report persistent hypothyroid symptoms despite adequate LT4 replacement and achievement of biochemical euthyroidism. The most common symptoms are fatigue, weight gain, neurocognitive impairment, and cold intolerance [4]. Their presence is tightly correlated with a perceived lower QoL as assessed by validated tools. Although this is a well-described clinical problem, the optimal management of persistent symptoms remains unclear. To complement pharmacological treatments for hypothyroidism and address persistent symptoms or enhance QoL, patients and clinicians often explore non-pharmacological interventions, including dietary supplements. These supplements, such as ginger and L-carnitine, are chosen for their potential to support thyroid function and improve overall energy metabolism, offering a natural adjunct to traditional therapy. While these supplements are popular among patients seeking alternative or supplementary treatments, their effectiveness and safety must be evaluated with careful consideration, guided by evidence-based recommendations.

Non-pharmacological interventions explored include ginger supplementation, L-carnitine supplementation, and surgical management with thyroidectomy. Ashraf et al. randomized 53 patients to either ginger supplementation or placebo for 1 month [10]. At the end of follow-up, the intervention group had significant improvement in several hypothyroid symptoms and QoL scores, and no side effects were reported [10]. Previously, Mahassni et al. had shown that ginger supplementation lowers TSH levels in patients with hypothyroidism [17]. A potential mechanism is ginger’s antioxidant properties, conferring a protective effect against endocrine-disruptive chemicals such as bisphenol A (BPA) [18]. On the other hand, Hyun An et al. showed improvement in fatigue severity, physical fatigue, and mental fatigue scores in patients treated with L-carnitine for 12 weeks compared to placebo [11]. L-carnitine is essential for mitochondrial fatty acid oxidation and energy production and supplementation reduces fatigue in cancer patients [19]. Of note, 36.6% of patients in the intervention group developed side effects, mostly gastrointestinal [11].

Regarding thyroidectomy as a therapeutic option for persistent hypothyroid symptoms, two studies have shown conflicting results [12, 13]. The RCT conducted by Guldvog et al. showed that, in patients with Hashimoto’s thyroiditis with high TPO Ab titers, total thyroidectomy plus thyroid hormone replacement resulted in significantly higher health related QoL scores and lower fatigue scores compared to continued thyroid hormone replacement alone after 18 months [12]. The resulting dramatic decline in TPO Ab titers suggests that thyroidectomy may modulate the immune response and reduce inflammatory mediators responsible for non-thyroidal symptoms [20]. Nonetheless, the non-blinded nature of the study (no placebo surgery performed) raises concern about a placebo effect, and the strict inclusion criteria argue against this study’s generalizability. On the contrary, a retrospective study conducted by Thatipamala et al. evaluating the impact of thyroidectomy on persistent hypothyroid symptoms showed no significant difference in any of the eight domains assessed by the SF-36 questionnaire compared to a healthy cohort [13]. The major limitations of this study were the lack of pre-operative SF-36 responses in the intervention group, the small sample size (19 patients) and the low survey response rate. Importantly, total thyroidectomy was associated with surgical complications in both studies, including transient or permanent hypocalcemia and recurrent laryngeal nerve palsy.

In the pharmacologic arena, three studies evaluating the effect of combination LT4 and LT3 therapy on persistent hypothyroid symptoms despite biochemical euthyroidism were included; patients in the control group remained on LT4 monotherapy. Two of them were mainly positive and showed improvement in QoL measures: the open label prospective cohort study by Michaelsson et al. [14] and the cross-over RCT by Fadeyev et al. [16] both from Europe. Conversely, the RCT conducted by Rodriguez et al. [15] in the U.S. did not show a significant difference in depression scores, fatigue, working memory and concentration disturbance between groups.

Some clinical implications from this systematic review include consideration of a ginger supplementation trial in patients reporting non-remitting hypothyroid symptoms despite adequate LT4 replacement, given its potential benefit and favorable safety profile. Despite conflicting data, a combination of LT4 and LT3 may improve symptoms and QoL in some patients. Finally, there may be a role for thyroidectomy in patients with Hashimoto’s thyroiditis, very high TPO Ab titers, and persistent symptoms.

From a research perspective, the lack of large, high-quality studies prevents clinicians from providing definitive recommendations to this group of patients. Moreover, further bench and translational research efforts are critical to better understand the mechanism behind these manifestations and ultimately look for more specific therapeutic targets. Due to our limited understanding of the mechanisms and best strategies to improve the quality of life of patients with persistent hypothyroid symptoms, it is important for clinicians to validate and listen to their patients’ symptoms, evaluate patients for non-thyroid explanations for ongoing symptoms, and provide support.

Limitations and strengths

Our study has some limitations, such that some observations should be interpreted with caution. The main limitation of this review is that it can only draw an overview of alternatives for refractory hypothyroidism, as well, some of the included studies were retrospective in nature. Significant heterogeneity among studies existed in terms of inclusion/exclusion criteria, type of intervention and explored outcomes limited our ability to conduct a comprehensive meta-analysis, Similarly, sample sizes were small in most of the studies, particularly the RCTs. In addition, some RCTs were not blinded and open. Finally, the short intervention period and follow-up duration of the included studies did not allow us to assess the long-term effects and durability of these interventions.

The strengths of our study derive from an in-depth database search, which allowed us to include a broad array of studies from a methodological, geographical, and interventional standpoint. Interventions span from supplements to surgery, with attention to the increasingly popular combination LT4/LT3 therapy. Finally, a wide variety of difficult-to-measure outcomes such as fatigue, QoL, depression, social life impairment and cognitive dysfunction are explored using validated questionnaires, providing clinicians with objective data to support treatment recommendations.

The substantial prevalence of patients enduring persistent symptoms despite standard hypothyroidism treatment underscores an urgent necessity for further research. This gap, accentuated by a scarcity of studies comparing potential interventions, demands a multifaceted research approach. Key to this endeavor is the comprehensive examination of the underlying mechanisms contributing to these persistent symptoms. Such investigations, spanning both bench and translational research, are vital for uncovering new therapeutic targets and paving the way for more effective treatment options.

Moreover, the development of a systematic framework to facilitate comparative effectiveness research is imperative. This framework should encompass the identification of patients grappling with persistent hypothyroid symptoms, along with the establishment of the most appropriate scales for evaluating the efficacy of various interventions. Additionally, there is a pressing need for the infrastructure capable of supporting multicenter clinical trials. These trials must not only be well-designed but also inclusive of a diverse patient population to ensure the generalizability of findings.

Conclusion

There is no high-quality evidence supporting any intervention for persistent symptoms in hypothyroidism. Available evidence, limited by the risk of bias, inconsistency, and heterogeneity, suggests that some persistent symptoms, particularly fatigue, could improve with ginger and thyroidectomy. There is urgent need for the evaluation of interventions to address persistent symptoms in patients with hypothyroidism.