Introduction

Cancer antigen 125 (CA125) is a high molecular weight glycoprotein encoded by the MUC16 gene [1]. It is expressed under normal conditions in epithelial tissues (e.g., breast, lung, genitourinary tract) and overexpressed in epithelial cancers [2, 3]. Circulating CA125 is elevated in more than 80% of women with epithelial ovarian cancer and is the best biomarker to date for the early detection of ovarian cancer [4, 5]. However, the sensitivity and specificity of CA125 as an early detection marker are limited [6], and recent large-scale randomized screening trials reported no significant mortality benefit with CA125 screening versus usual care [7, 8]. Pretreatment CA125 levels are associated with survival and changes in levels have been shown to predict recurrence [9, 10]. Although CA125 is commonly used to monitor women with ovarian cancer for progression, a recent study suggested that active surveillance using CA125 leads to a lower quality of life without increasing survival time [11].

In women without ovarian cancer, CA125 varies with age, race, body mass index (BMI), oral contraceptive (OC) use, hysterectomy, parity, and breast cancer history [1214]. In women diagnosed with ovarian cancer, CA125 levels are predominantly determined by the extent of disease but also some of the same factors that influence the biomarker in healthy women [15]. Understanding how CA125 varies in women with ovarian cancer both due to the tumor characteristics and independent of tumor characteristics could improve our ability to interpret CA125 values in women with ovarian cancer and provide insight into how CA125 may be associated with progression of disease. Here, we evaluate associations between tumor characteristics, reproductive, and lifestyle characteristics and preoperative CA125 levels in women with ovarian cancer from 13 studies participating in the Ovarian Cancer Association Consortium.

Materials and methods

Study population

This study included women with ovarian cancer from 13 studies participating in the Ovarian Cancer Association Consortium (OCAC), a collaborative group established in 2005 with goal of discovering new genetic variants associated with ovarian cancer [16, 17]. Studies included in this analysis were the Alberta Ovarian Tumor Types Study (AOV) [18, 19], Australian Ovarian Cancer Study (AUS) [20], Belgium Ovarian Cancer Study (BEL) [21], Hawaii Ovarian Cancer Study (HAW) [22, 23], Dr. Horst Schmidt Kliniken (HSK) [24, 25], Hospital-based Epidemiological research Program at Aichi Cancer Center (JPN) [26], Women’s Cancer Program at the Samuel Oschin Comprehensive cancer Institute (LAX) [27], Malignant Ovarian cancer Study (MAL) [28], Mayo Clinic Ovarian Cancer Case–Control Study (MAY) [29, 30], New England Case Control Study (NEC) [31], Oregon Ovarian Cancer Registry (ORE) [32, 33], Danish Pelvic Mass Study (PVD) [34], and Scottish Randomized Trial in Ovarian Cancer (SRO) [35, 36]. In total, there were 5,538 women with preoperative CA125 values in OCAC. We excluded 147 women with non-epithelial tumors or tumors with unknown origin, 277 patients with borderline tumors, 22 patients with tumors of unknown morphology, and one patient with in situ disease. This resulted in a total of 5,091 women with invasive epithelial ovarian cancer and available CA125 levels. All studies included in this analysis had obtained written informed consent from all study participants and had approval from ethics committees.

Information about demographic, reproductive, lifestyle, and tumor characteristics was collected by individual studies and submitted to a coordinating center that compiled a core dataset, including age at diagnosis, age at menarche, race, family history of breast cancer or ovarian cancer, personal history of endometriosis, menopausal status, hysterectomy, tubal ligation, height, weight 1 year prior to diagnosis, smoking, ever use of OC, history of pregnancy, tumor stage, grade, and histology. Pretreatment CA125 levels were either measured directly as part of an individual study (BEL, JPN, MAL, PVD) or abstracted from medical records (AOV, AUS, HAW, LAX, MAY, NEC, SRO). Information about type of CA125 assay used by different studies is listed in the Supplemental Table 2.

Statistical analysis

We used probit scores to standardize CA125 levels, which varied across studies [37, 38]. Probit scores were calculated using the following equation: Ф −1 = [i/(n + 1)], where Ф is the cumulative distribution function for a standard normal distribution, i is the rank of each participant within a study and n is the number of participants in each study. We estimated the association between exposures of interest and CA125 using univariate and multivariate linear regression.

Epidemiologic and tumor characteristics considered in relation to pretreatment CA125 levels include stage (I, II, III, IV, unknown), histologic subtype (serous, endometrioid, clear cell, mucinous, other), tumor grade (well differentiated, moderately differentiated, poorly differentiated, and undifferentiated), self-reported race (white, black, Asian, other, presumed white, unknown), family history of ovarian cancer (no, yes, unknown), family history of breast cancer (no, yes, unknown), prior history of breast cancer (no, yes, unknown), BMI (<18.5, 18.5–<25, 25–<30, ≥30, unknown), ever OC use (no, yes, unknown), ever pregnant (no, yes, unknown), tubal ligation (no, yes, unknown), prior hysterectomy (no, yes, unknown), and endometriosis (no, yes, unknown), age at menarche, height, and weight 1 year prior to diagnosis. For the purpose of this analysis, race was grouped in three categories: presumed whites have been grouped with whites, black, Asian, and others were grouped as non-white, and unknown were grouped with missing. Residual disease was classified as: no macroscopic disease, macroscopic disease ≤1 cm, macroscopic disease >1 and ≤2 cm, macroscopic disease >2 cm, macroscopic disease of unknown size, tumor not ressected, and unknown.

In univariate models, we adjusted for age at diagnosis (continuous). In order to identify CA125 predictors that are independent of tumor characteristics (stage, histology, and grade), we constructed multivariate models additionally adjusted for stage and a variable for combined histology and grade: high-grade (moderately and poorly differentiated, and undifferentiated) serous, low-grade (well differentiated) serous, high-grade endometrioid, low-grade endometrioid, mucinous, clear cell, and other/unknown). In order to investigate the independent contribution of individual predictors to CA125 levels, we simultaneously adjusted for all the factors that were significant predictors of CA125 in multivariate models. For each predictor, we report the original parameter estimates (coefficients) as well as the percent change in CA125 levels [calculated as [exp (coefficient) − 1]*100]. All the analyses were performed using SAS v9.4 (SAS Institute, Cary, NC). All p values were two-sided, and a significance threshold of p < 0.05 was used.

Results

This analysis included a total of 5,091 women diagnosed with epithelial ovarian cancer from a mixture of case–control (population or hospital based) or case only (registry or clinical trial) studies in the USA, Canada, Europe, Asia, and Australia between 1992 and 2016 (Table 1). Cases were predominantly high-grade, advanced stage, and invasive serous though the proportion varied between studies. Among high-grade serous cases, median CA125 levels varied between studies, ranging from 259 U/ml (SRO) to 1590 U/ml (JPN).

Table 1 Characteristics of studies included in the pooled analysis of factors associated with pretreatment CA125 at diagnosis, Ovarian Cancer Association Consortium

In age-adjusted models, height, weight 1 year before diagnosis, age at menarche, hysterectomy, OC use, smoking, and prior history of breast cancer were not significantly associated with pretreatment CA125 levels. Older age at diagnosis, history of pregnancy, tubal ligation, family history of breast cancer, and family history of ovarian cancer were associated with higher CA125 levels, while a personal history of endometriosis was associated with lower CA125 levels (Table 2). After additionally adjusting for tumor characteristics, BMI > 30 kg/m2 was associated with 9.8% (95% CI 1.7, 18.5%) higher CA125 levels, while race (non-white vs. white) was associated with 15.3% (95% CI 4.3–27.4%) higher CA125. Since the majority of non-white participants were Asian, we performed an analysis restricting non-whites to Asians. In the model adjusted for age and tumor characteristics, compared to white women, Asian women had a 16.5% (3.1, 31.7%) increase in CA125 levels. To further address the issue of collinearity between race and study characteristics, we excluded sites that consisted of only one or predominantly one race (BEL, HSK, JPN, MAL) or had no information on race (PVD, SRO), and observed that non-white race was associated with 30.7% (95% CI 18.1, 44.5%)(p < 0.0001) higher CA125 levels after adjusting for age, histology, and grade. Since similar analyses have been previously published in the NEC study [15], we excluded NEC participants and observed similar associations for BMI > 30 kg/m2 (p = 0.004) and race (p = 0.001).

Table 2 Associations* between demographic, lifestyle, and reproductive characteristics and pretreatment CA125 levels among women with ovarian cancer in the Ovarian Cancer Association Consortium

We constructed a multivariate model adjusted for all the factors that were significantly associated with CA125 levels in the age-adjusted models (Table 3). Compared to high-grade serous tumors, CA125 levels were significantly lower for low-grade serous, high-grade endometrioid, low-grade endometrioid, mucinous, clear cell, and other/unknown subtypes (p < 0.0002). CA125 levels increased with stage of disease (p < 0.0001). The percent change for BMI > 30 kg/m2 compared to BMI 18.5–25 (9%) and non-white versus white race (14%) was similar to the model adjusted for age and tumor characteristics.

Table 3 Multivariate adjusted associations between demographic, lifestyle, and reproductive characteristics with pretreatment CA125 levels*

In analyses conducted separately for premenopausal and postmenopausal women, we observed no association between CA125 levels with BMI > 30 kg/m2 (p = 0.50) and race (p = 0.73). Among postmenopausal women, BMI > 30 kg/m2 was associated with 10.8% (95% CI 1.2, 21.2%) higher CA125 levels, while non-white race was associated with 17.7% (95% CI 3.5, 33.8%) higher CA125 levels (Supplemental Table 3). In order to address variation in CA125 measurements within studies, we evaluated the significant associations in studies that measured CA125 on all participants using a single assay (BEL, JPN, MAL, PVD). We observed a significant association between BMI > 30 kg/m2 with CA125 levels (p = 0.02), while the association with race was no longer significant (p = 0.20). When we additionally adjusted for residual disease, we observed that BMI > 30 kg/m2 was no longer significantly associated with CA125 levels (7.6%, 95% CI −0.2, 15.9%), while the association with non-white race remained significant (16.8%, 95% CI 5.8, 28.9%, p = 0.002).

To address the differences between tumor types (including differences in CA125 values), we performed sensitivity analysis restricted to high-grade serous tumors. BMI > 30 kg/m2 was no longer associated with CA125 levels in the age-adjusted model (p = 0.32) or the model additionally adjusted for stage p = 0.62). Non-white race remained significantly associated with CA125 levels both in age-adjusted (p = 0.05), and in age and stage adjusted model (p = 0.04). Furthermore, compared to high-grade serous cases younger than 50 years of age, those older than 70 years of age had 13.3% lower (95% CI −23.3, −2.0%) in CA125 levels.

Discussion

This pooled analysis included 13 studies in the Ovarian Cancer Association Consortium with pretreatment CA125 which were either measured or abstracted from medical records as well as detailed epidemiologic and clinical data on more than 5,000 women with invasive epithelial ovarian cancer. Our results suggest that BMI > 30 kg/m2 and race might be associated with CA125 levels, after adjusting for tumor-related characteristics (stage, histology, and grade). We observed predictors of CA125 that are consistent with previously published results, including tumor characteristics (histology, grade, stage) [15], as well as epidemiologic factors (age, high BMI, history of pregnancy, family history of breast cancer, family history of ovarian cancer, endometriosis, tubal ligation, and race) [12, 13, 15]. Most of the previously described epidemiological predictors of CA125 were identified in healthy women [12, 13] and in one study among women with ovarian cancer cases [15]. We hypothesized that the association between epidemiologic factors and CA125 levels is partially independent of, and partially mediated by tumor characteristics. For example, high BMI is associated with increased levels of CA125 in healthy women [12], and BMI also increases risk of endometrioid subtype of ovarian cancer, which itself is associated with lower CA125 levels [15]. By adjusting for tumor characteristics, we identify characteristics that may influence CA125 above and beyond tumor characteristics.

Higher CA125 levels with more advanced disease as well as differences by histologic subtypes has been described previously [15]. While high-grade serous tumors are known to have the highest CA125 levels, differences in CA125 levels between the less common subtypes may not be appreciated. However, the findings of histology and grade-specific estimates of CA125 should be balanced with the possibility that there is some misclassification between subtypes. A recent comparison of grade assessment by two gynecologic pathologists on more than 500 ovarian cancer cases in the Surveillance Epidemiology and End Results Residual Tissue Repository reported only 49% agreement between the pathologists [39]. Similarly, recent studies using molecular markers to distinguish ovarian cancer subtypes suggested that histologic subtype is often misclassified [40]. Most commonly, high-grade serous ovarian cancers are misclassified as high-grade endometrioid. In our study, contamination of the endometrioid subgroup with high-grade serous cases could lead to an overestimate of the CA125 levels for some endometrioid cases.

For epidemiologic factors, most of the significant predictors of pretreatment CA125 that we observed in this pooled analysis in both univariate models (age, parity, family history of breast or ovarian cancer, race) and after accounting for tumor characteristics (BMI, race) have been previously described in the New England Case Control (NEC) study [15]. The results were similar after excluding participants from the NEC study. These data suggest that personal characteristics and exposures beyond tumor characteristics influence CA125 levels in women with ovarian cancer. Interestingly, almost all of these variables were also predictors of CA125 in healthy women who participated in one of the largest randomized ovarian cancer screening trials [12, 13], suggesting that these factors influence CA125 regardless of disease status. Similarities between CA125 predictors and ovarian cancer risk factors in combination with studies showing CA125 can impair immune function [41] suggests that CA125 may have a role in carcinogenesis in addition to being a marker of its progression.

The clinical assay used to measure CA125 varied over time and by site. A few studies measured pretreatment CA125 as part of their study (BEL, JPN, MAL, PVD), while the others abstracted pretreatment CA125 values from medical records. To account for some of this variability, we used a probit score approach which ranks CA125 values within each study to account for variability attributable to between-study differences. However, this approach does not account for any additional variability in the CA125 within study, which is likely more of an issue at sites where CA125 values were abstracted from medical records.

The strengths of our study include its large sample size, detailed epidemiologic and tumor data, and the inclusion of a large number of non-serous histologic types. Questionnaires and clinical data were originally collected for the purposes of large-scale genetic studies at a data coordinating center [42]. For many variables, data have been harmonized across study sites for epidemiologic analyses [4345].

While our study was limited by the inclusion of existing CA125 values rather than prospective measurements, we observed expected associations between tumor characteristics and pretreatment CA125 levels as well as additional factors that predicted levels. However, validation is needed in a large study using a single assay. In addition, a diverse study population is needed to robustly determine how CA125 varies by race. Identification of predictors of CA125 will aid in the interpretation of its levels for prognosis and screening as well as provide new insights into how CA125 may be involved in the pathogenesis of the disease.