Introduction

Familial adenomatous polyposis (FAP, OMIM #175100) is an autosomal dominant inherited syndrome with an incidence of 1 in 6850 to 1 in 23,700 [1]. It is caused by a germline mutation in Adenomatous Polyposis Coli (APC) gene, a tumor suppressor gene located in the 5q21 region, and has a high penetrance for colorectal cancer. Almost every mutation carrier develops colorectal cancer by 40 years of age [1]. Other gastrointestinal malignancies in FAP patients include duodenal ampullary adenocarcinoma and gastric adenocarcinoma. Improved life expectancy in patients with FAP in recent years has shifted the prevalence of its extra-intestinal manifestations, including pancreatic cancer, hepatoblastoma, brain tumor, benign tumors such as adrenal adenoma, osteoma, desmoid tumor, and dental abnormalities—all of which contribute to the morbidity and mortality in FAP patients to various degrees [1].

Thyroid cancer (TC) was first described as an extra-intestinal malignancy of FAP in Crial’s report in 1949 [2]. Subsequent reports estimated the prevalence of TC in FAP between 0.4 and 1.2% [3, 4] whereas the prevalence of TC in general population was reportedly 0.14% [5]. The recent studies however suggested that its prevalence may be as high as 11.8% [6]. This would have an implication for cancer screening in FAP patients if it reflected a true prevalence. The histologic characteristics of FAP-associated TC were not described until the 1990s. In 1994, Harach et al. first reported an unusual feature of cribriform pattern and solid areas with spindle cell component in thyroid pathology [7]. This was later known as cribriform-morula variant of papillary thyroid cancer (CMV-PTC) and was found to occur in 0.16% of the patients with papillary TC [8]. While a proportion of CMV-PTC in FAP-associated TC had been approximated to 0–12.5%, some authors estimated its prevalence to be as high as 80% and suggested this pathologic variant as a surrogate marker for the diagnosis of FAP [3, 9].

Despite the fact that TC has occurred in the background of underlying benign thyroid disorders in FAP [7, 10], the prevalence of benign thyroid disorders in this patient population has been poorly described. Characterizing the prevalence of the benign thyroid lesions will expand the syndrome’s phenotypic spectrum. FAP, once known for just a cancer predisposition syndrome, is virtually multisystem disorder with both nonmalignant and malignant conditions of various organs; the familiarity of treating physicians to these extra-intestinal manifestations can therefore alleviate the patients’ morbidity and mortality. In conclusion, the purpose of this study is to evaluate the prevalence of thyroid diseases in FAP and describe the clinical, pathological, and genetic characteristics of FAP-associated TC by reviewing existing cohort studies.

Methods

Search strategy

In consultation with a medical librarian (JY), we used PubMed and EMBASE databases to search literatures that were published before 11 July 2016. Search terms were provided in Supplementary Text 1. We limited the search only to human study published in English. Duplicated references were removed by reference manager software (EndNote®X7 Thomson Reuters, Philadelphia, PA, USA). The reference lists of all included studies were examined to identify additional studies not retrieved through online searches. A citation search was also conducted through Web of Science to identify articles referencing any of the included studies.

Eligibility criteria and data extraction

Cohort studies which conducted in cancer registries involving FAP patients and demonstrated the prevalence of TC and/or benign thyroid disorders were eligible for inclusion. Both observational cohort studies and screening cohort studies were included. Cross sectional studies, case reports, case series, letters, conference abstracts, systematic reviews, studies that included at-risk family members for the analysis, and studies with unknown pathology of TC were excluded. If one cancer registry was investigated for more than one study, we included only the latest publication. Studies that focused exclusively on pediatric population will be excluded given the possible unrepresentativeness of general FAP patients due to an age-dependent penetrance of the disorder. Studies that TC was diagnosed by autopsy will also be excluded since it is known that postmortem evaluation commonly detects incidental thyroid cancer in asymptomatic individuals [11].

Three reviewers (JC, TP, and AA) independently screened the titles and abstracts and obtained full manuscripts for relevant studies. They also read the full text of selected articles and excluded studies that did not meet eligibility criteria. Disagreements among them were resolved by consensus. When they did not reach the consensus, the fourth author (SK) was involved to make a final decision. One author (SO) subsequently extracted the following data: first author, publication year, origin of cancer registry, definition of FAP, study design, duration of follow-up, and patients’ age and gender. Numbers of patients with the following diagnoses were recorded: FAP, TC, benign thyroid masses diagnosed by prospective surveillance screening ultrasound, and endocrinologic thyroid disorders including hypothyroidism, hyperthyroidism, and thyroiditis. The following details regarding TC were obtained: methods of TC diagnosis, age at diagnosis of FAP and TC, bilaterality, multicentricity, and pathology. The location of APC mutation was classified with reference to the mutation cluster region (MCR; codon 1286–1513): 5′ end (proximal to the MCR), within the MCR, and 3′ end (distal to the MCR) [12]. Another author (SK) cross-checked all data and, again, disagreements between them were resolved by consensus. The third author (TP) became involve, if necessary, to make a final decision.

Assessment of risk of bias within studies

Two authors (SO and SK) assessed the risk of bias by utilizing Newcastle–Ottawa Scale for observational study (Supplementary Text 2) [13]. With a maximum score of 9, the scale assesses risk of bias in three domains: (1) selection of the participants; (2) comparability of study groups; and (3) ascertainment of outcomes of interest. Studies with scores ≥ 7 were considered as having low risk of bias, scores of 4–6 as having a moderate risk of bias, and scores of ≤ 3 as having high risk of bias. Where discrepancies arose, articles were re-examined and consensus was reached by discussion and the opinion of a third reviewer (TP).

Statistical analysis

Given the expected considerable heterogeneity between studies, we employed a random-effects model with Logit transformation to estimate the pooled prevalence of TC, benign thyroid masses, and endocrinologic thyroid disorders with 95% CI. I2 index and Cochran’s Q test were utilized to measure the heterogeneities across studies [14]. Using a mixed-effects model [15], we further explored the potential sources of heterogeneity by analyzing the pre-specified subgroups with following moderators: study design (retrospective vs. prospective), use of screening ultrasound for TC diagnosis, number of the study population (N < 200 vs. ≥ 200), and the publication year (before or during 2000 vs. after 2000). TC characteristics were presented as means ± standard deviations (SD) or crude percentages. Sensitivity analysis was performed using one-study remove method to detect the impact of each study on the combined effect. Publication bias was assessed by funnel plot inspection, Egger’s test, and Begg’s test [16, 17]. Analyses were performed by using Comprehensive Meta-Analysis software version 3.3.070 (https://www.meta-analysis.com).

Results

Characteristics of included studies

9821 FAP patients from 12 cohort studies published between 1987 and 2016 were included. Steps of literature search were illustrated in Fig. 1. The characteristics and Newcastle–Ottawa Scales of individual studies were provided in Table 1 and Supplementary Table 1, respectively. According to the Newcastle–Ottawa Scales, most studies had moderate risk of bias while one study had high risk of bias [3]. The median Newcastle–Ottawa Scales score was 4. The details of each registry and their respective definition of FAP were further provided in Supplementary Table 2.

Fig. 1
figure 1

Literature search flow diagram

Table 1 Characteristics of individual included studies

Thyroid disease in FAP

From a total of 120 patients who had TC, the crude unweighted mean of TC prevalence was 1.2% (ranging 0.4–11.8%). Meta-analytic pooled prevalence of TC in FAP (Fig. 2A) was 2.6% (95% CI 1.3–4.8%). Characteristics of patients with TC were provided in Table 2. Female was presented in 95 ± 3.9% of the affected patients. In FAP patients who had TC, the mean age of FAP diagnosis was 29 ± 11.4 years old, whereas that of TC diagnosis was 31 ± 11.8 years old. Approximately one-third (34 ± 9%) of TC patients had presented with TC prior to the diagnosis of FAP. 45.6 ± 12.9% of TC was bilateral and 58.8 ± 9.6% was multicentric.

Fig. 2
figure 2

Meta-analysis of prevalence of thyroid cancer (TC, A), benign thyroid masses (BTM, B), and endocrinologic thyroid disorders (Endo, C) in patients with familial adenomatous polyposis (FAP). *Benign thyroid masses were detected using screening ultrasound

Table 2 Clinical characteristics of familial adenomatous polyposis (FAP)-associated thyroid cancer (TC) in each study

Papillary TC was the most common pathology (100/120; 83.3%); 26 out of 100 specimens (26%) were reported as CMV-PTC. The remaining specimens were reported as follicular type (4/120; 3.3%), medullary type (1/120; 0.8%), and mixed type (6/120; 5%). The pathology was unavailable in 9 patients (7.5%). Data on the location of APC mutation in TC were available in 24 patients (data not shown): 19 patients (79.2%) had mutation at the 5′ end; 4 patients (16.6%) at the 3′ end; and only one patient (4.2%) harbored the mutation in the MCR.

Five studies (n = 411) proactively performed screening thyroid ultrasound [9, 22,23,24,25]. In 368 individuals who received ultrasound, 179 of them had benign thyroid nodules–making a pooled prevalence of 48.8% (95% CI 33.8–64.0%), as shown in Fig. 2B. Three studies (n = 321) reported the prevalence of endocrinologic thyroid disorders, with a subsequent pooled prevalence of 6.9% (95% CI 4.5–10.3%; Fig. 2C) [6, 23, 24].

Heterogeneity and subgroup analysis

When evaluating the prevalence of TC, there was a high level of heterogeneity across all studies (Q = 117.23, df = 11, P < 0.001, I2 = 90.61). The results of subgroup analyses were provided in Table 3. We found that the prevalence of TC was significantly higher in (a) studies that used screening ultrasound (6.9%, I2 = 0) compared to the non-use group (1.4%, I2 = 85.6); (b) studies with study population < 200 (6.3%, I2 = 0) compared to those with study population ≥ 200 (1.2%, I2 = 92.3%); and (c) studies published after 2002 (5%, I2 = 83%) as opposed to those published before or during 2002 (1%, I2 = 83.8%). The difference in the prevalence was not observed when classifying studies by study type.

Table 3 Mixed-effects meta-analysis of thyroid cancer prevalence stratification

Sensitivity analysis and publication bias

Pooled prevalence was recalculated iteratively after each study was removed; subsequent pooled prevalence was not significantly different from one another and ranged between 2.2 and 3.0%, indicating robustness of the included studies. The funnel plot depicted in Fig. 3 was almost symmetric, indicating the absence of publication bias. This was confirmed by the Egger’s test (P = 0.10) and Begg’s test (P = 0.15).

Fig. 3
figure 3

Funnel plot: the x-axis corresponds to the logit of prevalence and the y-axis corresponds to the standard error of the studies

Discussion

To our knowledge, this present meta-analysis is the largest group of pooled cases of thyroid disease in FAP patients, including 9821 individuals from 12 cohort studies. It confirmed a higher prevalence of TC, up to 19-fold, in this patient population when compared to the prevalence of TC in the United States (2.6 vs. 0.14%, respectively) [5]. Our results corroborated earlier observation that TC in FAP tended to occur in young female, presented with papillary TC as the most common pathology, had a high rate of mulcentric and bilateral diseases, and preceded the diagnosis of FAP in approximately one-third of the cases [3, 9, 18, 21].

Generally, TC shows a male-to-female ratio of 1:3–1:4 and has a mean age at diagnosis of 49 years for women and 53 years for men [5, 26]. We found that FAP-associated TC had a more pronounced gender discrepancy (male-to-female ratio of 1:19) and occurred at a younger age (mean age at diagnosis of 31 years). It was postulated that female aged less than 35 years with FAP had a 160-time higher risk of developing TC [18], and that papillary TC in FAP and in sporadic cases are, in fact, different disease entities [27]. Nevertheless, the decision to refer all young female patients diagnosed with papillary TC for colonoscopy remains inconclusive [28]. According to previous literature, CMV-PTC histologic subtype was considered a frequent and specific finding in FAP and its presence should alert clinicians to perform a colonoscopy and/or APC gene analysis [29, 30]. A recent study however showed that only 39% of the published CMV-PTC cases harbored germline APC mutation [29], whereas at least 26% of all patients with FAP-associated TC in our study demonstrated CMV-PTC subtype. We therefore hypothesized that the sensitivity and specificity of CMV-PTC variant for diagnosing FAP may not be as high as previously thought. Additionally, CMV-PTC is rare and its diagnosis is challenging even for experienced cytologists [31]. Considering the multicentricity and bilaterality, intraoperative multicentric and bilateral diseases were found respectively in 50 and 32% among all papillary TC cases in a study by Gerfo et al. [32], while our analysis showed that the proportions of multicentricity and bilaterality in FAP-associated TC were respectively 58.8 and 45.6%. Hence, our results support younger age at diagnosis and almost exclusive occurrence in female as major different characteristics of FAP-associated papillary TC, when comparing to sporadic cases. On the other hand, the diagnostic utility of histopathologic features (multicentricity, bilaterality, and the presence of CMV-PTC variant) to differentiate the sporadic and familial forms of TC is uncertain and requires further investigations.

In non-selected cases of FAP, more than 60% of mutations in APC gene are found in the MCR, between codons 1284 and 1580 [33]. Mutation at the 5′ end of APC gene (i.e. proximal to the MCR) is common in patients with FAP-associated TC [33, 34]. Our result showed that out of 24 mutations, 19 of them located in the 5′ end proximal to the MCR. Septer et al. recently analyzed the largest pool of APC mutation data (n = 48) in FAP-associated TC cases, emphasized an increasing risk of TC if the patient harbors a mutation in codon 1061 or any mutation proximal to codon 528, and concluded that annual surveillance ultrasound should be performed from age 18 onwards if the patient has these mutations [33]. By combining our data to those of Septer et al., genotypes of FAP-associated TC were available in 52 patients; 44.2% of them were localized at the 5′ end, proximal to codon 938, whereas only 17.5% of the mutations in the reference population with FAP occurred in this region [33]. This, therefore, stresses a high frequency of the mutations in FAP-associated TC in the 5′ end of APC gene and confirms its genotype–phenotype correlation. The pathomechanism of this phenomenon is unclear, but there has been a suggestion that an activation of RET/PTC1 oncogene in thyroid tissue may play an additional role in the carcinogenesis of TC in FAP patients [35].

We found a high pooled prevalence of benign thyroid masses (48.8%) in FAP patients who underwent screening thyroid ultrasound. Without using routine ultrasound, the occurrence of benign thyroid masses in FAP were only 0.2–6% [6, 36]. This discrepancy is similar to that of general population when using thyroid ultrasound results in higher incidence of thyroid masses when compared to manual palpation [37]. Few studies have reported the occurrence of thyroiditis, hypothyroidism, and hyperthyroidism in FAP [6, 10, 23, 24] while we found that 6.9% of FAP patients had endocrinologic thyroid disorders. Although the number is not justified for regular thyroid function testing, the treating physicians should be aware of this phenotypic spectrum. In normal population, the link between TC and benign thyroid disorders is not negligible [38, 39]. Unfortunately, the disease trajectory and clinical outcome of these benign conditions in FAP are unknown. The occurrence of FAP-associated TC in the background of lymphocytic thyroiditis, as well as Grave’s disease, has been characterized [7, 10]. Herraiz et al. have described ultrasonographic changes in two FAP patients after a follow-up at an interval of 27 months: one with an enlargement of a pre-existing nodule and another one with the development of two new benign nodules [9]. Their data may confer the slow rate of growth of the nodules; however, a larger, prospective study is necessary to describe their natural history and potential associations with TC, and an appropriate interval for follow-up. Until the risk is better delineated and guideline is established, periodic ultrasound and/or laboratory follow-up to monitor these benign thyroid disorders are crucial. It was suggested that an interval of greater than one year is possibly safe for a follow-up ultrasound once thyroid nodules are detected [9].

Increasing diagnosis of TC has been well recognized while the mortality from TC has remained stable [26]. Most new cases can be attributed to the diagnosis of papillary TC with small or non-palpable lesions due to advanced diagnostic techniques and increasing surveillance in young or middle-aged population [26, 40]. Our study corroborates with this finding by showing that the publication year after 2002 and the use of surveillance thyroid ultrasound are associated with a higher prevalence of TC [40]. Papillary TC is the least aggressive histologic subtype and most patients will never develop clinically significant symptoms [26, 40]. Despite its often multicentricity and nodal involvement in FAP-associated TC, the condition shows an indolent course [27]; in fact, recent study has even suggested a possibility of spontaneous regression of CMV-PTC in FAP after the age of 35 years [31]. While the observed-to-expected morbidity ratio (i.e. standardized morbidity ratio) of TC in FAP was reportedly 19.2 in one study [41], various studies indicated that mortality of FAP-associated TC is extremely low and patients tended to die from another causes [41,42,43].

To detect FAP-associated TC, current recommendations controversially include both annual clinical examination and a referral, if nodules are present [1], and routine use of screening ultrasound [6, 9, 22]. The American College of Gastroenterology recommended FAP patients to undergo annual screening thyroid ultrasound [44]; however, this recommendation has a low quality of evidence. The uncertainty was re-addressed by the American Thyroid Association that the use of ultrasound may lead to an early diagnosis of FAP-associated TC, and there is insufficient evidence that these interventions would reduce morbidity and mortality [30]. In general population, the United States Preventive Services Task Force found inadequate evidence on the benefits of screening and recommended against TC screening in asymptomatic adults. The sensitivity and specificity of thyroid ultrasound to detect one or more malignant features were reportedly 94.3 and 55.0%, respectively [45]. Hence, universal thyroid ultrasound approach may lead to not only a burden and significant anxiety to the patients undergoing the screening procedure, but also false positive results due to its low specificity [36]. Among few studies investigating the consequences of thyroid ultrasound screening in FAP, Feng et al. have shown that TC in patients undergone thyroid ultrasound (n = 15) may have a more favorable morbidity profile when compared to those in nonscreening-detected cases (n = 18) [24]. However, the study may be too small to draw a conclusion. Based on aforementioned findings, the current evidence on the overdiagnosis and overtreatment of TC (particularly unaggressive papillary subtype), and the lack of cost-effectiveness analysis for the use of ultrasound surveillance in FAP-associated TC, we recommend against the use of surveillance ultrasound to detect any thyroid abnormalities in every FAP patients. Until a larger study is conducted, screening thyroid ultrasound among FAP patients with high risk of TC, such as young female with the APC mutation proximal to the 5′ end, seems reasonable.

Several limitations should be considered when interpreting our results. First, risk of bias across studies exists because all recruited studies are heterogeneous. Apart from the use of thyroid ultrasound, sample size of the study, and publication year, which affect the prevalence of TC according to the subgroup analyses, other possible factors include demographic difference in each study and operator dependency of ultrasound. Furthermore, although most studies used routine thyroid ultrasound to diagnose TC [9, 22,23,24,25], a study by Steinhagen et al. detected TC cases from palpable nodules and non-screening ultrasound [6]. Second, the included articles were observational studies and risk of biases within studies exist—both selection bias and information bias. Most of them had moderate risk of bias and none of them complied all items in the Newcastle–Ottawa Scales. Third, two studies conducted during 1990s did not provide details on FAP definition [3, 4]; however, the fact that all selected studies were derived from either hospital-based or population-based cancer registries is a strength of our study due to presumptive robust methodology and systematic data collection as exact cancer identification are generally required [46]. However, one must consider the referral bias in each cancer registry since not all patients with FAP end up in a registry and it is more likely that only complicated patients were recruited. The definition of FAP in each registry and inclusion of other similar phenotypes (such as attenuated FAP and MUTYH-associated polyposis syndrome) may further complicate between-study heterogeneity, as shown in Supplementary Table 2. Fourth, not all included papers reported the occurrence of CMV-PTC especially the older ones; thus, the prevalence of this particular histology may be underestimated. Fifth, the present study did not include unpublished studies and studies published in non-English languages. This may under- or overestimate the actual prevalence of TC-associated FAP. Finally, the number of studies and overall number of patients when analyzing the prevalence of benign thyroid disorders were relatively small.

In conclusion, benign thyroid disorders are common in FAP but their clinical significance and an appropriate follow-up interval are yet undetermined. We described the characteristics of FAP-associated TC and concluded that younger age at diagnosis and almost exclusive occurrence in female are major differentiating characteristics of FAP-associated and sporadic forms of TC. We confirmed the genotype–phenotype correlation of TC in FAP patients and stressed a high frequency of the mutations in the 5′ end of APC gene. We demonstrated that the prevalence of TC in FAP patients is 2.6%, which is 19-fold higher than general population; however, the absolute prevalence is still very low. Its increased prevalence in recent years can also be explained by the detection method bias—this is consistent with the global trend of overdiagnosis and overtreatment of TC. Based on the controversies from current guidelines, a low absolute prevalence, a low specificity of thyroid ultrasound to detect TC, a high proportion of papillary TC with resultant low morbidity and mortality, a lack of cost-effectiveness analysis, and a potential burden for patients if the universal screening approach is employed, we suggest that only a selective subgroup of FAP patients with a high risk of developing TC, such as young female with APC mutation at the 5′ end, might benefit from routine surveillance ultrasound.