1 Introduction

The ultimate goal of health care is to restore or preserve functioning and well-being related to patients’ health, known as the health-related quality of life (HRQOL, Rodenberg, 2006). In 2014, the American Cancer Society (ACS) reported that there were as many as 14.5 million cancer survivors living in the USA. The SEER-Medicare Health Outcomes Study showed that patients with lung, colorectal, or prostate cancer had significantly worse HRQOL in terms of mental and physical health outcomes compared with cancer-free control subjects (Reeve et al., 2009, 2012, 2015) due to effects from both the disease and the treatments used to fight cancer. Moreover, there are disparities in HRQOL outcomes among cancer survivors across race and ethnicity. Gallicchio et al. (2013) examined the self-reported HRQOL scores among breast cancer survivors by race and found that black women reported significantly more physical functioning limitations than whites. Culver et al. (2002) identified more severe level of “psychological distress” (which includes depression and anxiety) in the Hispanic breast cancer survivors versus matched non-Hispanic white survivors. There is a growing recognition that racial/ethnic differences in HRQOL exist among cancer survivors. Overall, compared with non-Hispanic whites (NHW), blacks and Hispanics whites (HW) exhibit poorer HRQOL outcomes of general physical, mental health, and role functioning. Our goal is to identify and differentiate the effect of modifiable risk factors that contribute to the racial/ethnic disparities in HRQOL in the USA in order to inform targeted precision medicine efforts. For this purpose, we demonstrate a novel multiple mediation analysis method that enables exploring mechanisms that underlie these disparities and provides information to guide screening methods and interventions that optimally reduce them. In this paper, we focus the research on exploring the ethnic disparities in anxiety and depression among cancer survivors using data collected from the Measuring Your Health (MY-Health) study that was funded by the National Health Institute (NIH). To avoid the confounding effects from race (most Hispanics are whites), this research is stratified to compare between Hispanic and non-Hispanic whites only.

The MY-Health study collected information on HRQOL measures developed by the Patient-Reported Outcomes Measurement Information System (PROMIS) among cancer survivors enrolled in the study. Four population-based cancer registries of the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) Program participated in the study. The MY-Health study database was linked with the cancer registry data on tumor characteristics and cancer treatments, based on which we conduct research to identify and order factors that contribute to the health disparities.

We apply a novel mediation analytic method to differentiate the relative effects from different risk factors on ethnic disparities in anxiety and depression. A third variable can intervene in the relationship between an explanatory variable and a response variable through many forms. In this paper, we focus on the two forms: mediation or confounding. Although they are conceptually distinctive, MacKinnon (2000) claimed that these effects are statistically similar in the sense that all of them measure the change of association between the exposure (X, e.g., ethnicity) and response (Y, e.g., anxiety or depression score) variables when considering a third variable (denoted as M). Therefore, the statistical methods developed for mediation framework can be used for confounding effect analysis, although the scientific interpretations of the analysis might be different. In general, indirect effect (IE) refers to effect through X to M, and then to Y, and direct effect (DE) refers to effect from X directly to Y after adjusting for all related factors. The relationship among the variables can be shown by the graphical model in Fig. 1. Note that \(X - M_i - Y\) is a path between X and Y, where \(M_i\) could denote one mediator/confounder, or a vector of mediators/confounders within which there are complicated associations. Mediation analysis refers to the statistical techniques attempting to make inferences on mediation/confounding effects (effects from X to Y through different paths). In general, there are two groups of mediation analytic methods, linear structural equations and counterfactual frameworks, that can be applied to differentiating risk factors that contribute to the ethnic disparities in anxiety/depression (Alwin & Hauser, 1975; Judd & Kenny, 1981; Robins & Greenland, 1992; Mackinnon & Dwyer, 1993; Have et al., 2007; Albert, 2008; Vanderweele & Vansteelandt, 2009; VanderWeele, 2009). Exploring ethnic disparities in HRQOL outcomes requires a statistical mediation method that meets the following challenges: (1) the response, exposure, and mediators/confounders can take any formats: continuous, binary, or multi-categorical; (2) transformations among exposure, mediators/confounders, and other covariates should be allowed to account for potential nonlinear associations and interactions among variables; (3) if we can differentiate individual mediation/confounding effect from each mediator/confounder, then we gain knowledge on what are the most important mediators/confounders that explain the exposure–outcome relationship; (4) joint effects from highly correlated variables can be calculated, and (5) can make inferences on estimates of mediation/confounding effects (calculate variances and confidence intervals).

Fig. 1
figure 1

Graphical model of mediation/confounding effects. Note that there can be lines that connect between mediators, M, and Z.

We apply a novel mediation analysis method with multiple mediators/confounders to the MY-Health data to explore ethnic disparities in depression and anxiety among cancer survivors in this paper. The rest of the paper is organized as follows: Sect. 2 introduces the MY-health study and data measurements. Section 3 reviews mediation analysis methods and the novel MMA method and its algorithms for statistical inferences. We also introduce the R package, mma, for data analysis. Results for the exploration of ethnic disparities in HRQOL are discussed in Sect. 4. And finally, we make conclusions, discuss limitations of the research, and point out future research directions in Sect. 5.

2 PROMIS and the MY-Health Study

2.1 PROMIS

Health-related quality of life is defined in different ways. The most commonly used one is by Patrick and Erickson (1993): “Health related quality of life is the value assigned to duration of life as modified by the impairments, functional states, perceptions, and social opportunities that are influenced by disease, injury, treatment, or policy.” The Patient-Reported Outcomes Measurement Information System (PROMIS) is an NIH-funded initiative designed to provide a standard set of patient-reported outcome measures using modern psychometric theory that provides scores referenced against the general US population. PROMIS uses IRT-calibrated item banks across many symptom and functional domains (Jensen et al., 2015).

PROMIS has created many item banks that cover a range of function and symptom issues prominent in cancer survivorship such as physical function, pain interference, fatigue, sleep disturbance, depression, anxiety, ability to participate in social roles, and cognitive function (NIH, 2017b). For cancer patients, PROMIS measures can provide insight into symptom and functional issues during initial treatment and follow-up care.

PROMIS measures use a t-score metric for all domains, with a mean of 50 and a standard deviation of 10. Physical function, pain interference, fatigue, sleep disturbance, depression, and anxiety are calibrated, so a score of 50 reflects the average response in the US general population, and 10 points reflect a standard deviation for the US population. Reference values specific to cancer survivors are also available. The meaning of a high score depends on the domain it measures. For example, a higher fatigue score means higher fatigue severity, but a higher physical function score indicates better functions. The NIH’s Assessment Center (http://www.assessmentcenter.net/) provides the PROMIS instrument that can automatically calculate domain scores when survey questions are answered online. If researchers choose not to use the Assessment Center instrument, the Web site also provides manuals on how to measure the domains. The manuals provide instructions on how to calculate raw scores from the questions answered and tables that convert raw scores to scaled t-scores. The manuals also give instructions on calculating standard errors of the estimated t-scores (NIH, 2017a, c).

2.2 MY-Health Study

The Measuring Your Health (MY-Health) study was supported by NIH and includes investigators from the Georgetown University, the Fred Hutchinson Cancer Research Center, and three SEER cancer registries (California, New Jersey, and Louisiana). The main goal of this study was to validate eight PROMIS domains in a broad, diverse community-based cancer population. Patients were identified and contacted 6–13 months postdiagnosis and completed a 6-month follow-up.

The study group chose seven cancer sites (i.e., breast, cervical, uterine, prostate, colorectal, non-Hodgkins lymphoma, and nonsmall cell lung). Survey participants were randomly selected using a stratified sampling method. The strata are designed based on four racial/ethnic subgroups: black, non-Hispanic white, Asian/Pacific Islander (API), and Hispanic, and three age-based subgroups: age 21–49, 50–64, 65–84. The stratified random sampling method was used to ensure diversity in the surveyed population. The participants were recruited through the three participating cancer registries (Jensen et al., 2016, 2017).

Measurement equivalence across the ethnic groups (Hispanic whites and non-Hispanic whites) is an essential prerequisite for exploring the ethnic disparities in the PROMIS measures. Reeve and Terest (2016) reviewed papers that evaluated the measurement equivalence of the PROMIS and have provided strong evidence supporting through differential item functioning (DIF) the measurement equivalence of the PROMIS in ethnically, socio-demographically diverse groups. Specifically for the measurements of anxiety and depression in the MY-Health study, Teresi et al. (2016a, b) have shown that none survey item had a high magnitude of DIF, therefore supporting the use of depression and anxiety measures in the MY-Health study across ethnically diverse groups.

MY-Health survey population consists of 5506 cancer patients who completed the baseline survey. Among them, 2928 individuals are either NHW (2160) or HW (667). Ethnicity is used as the exposure variable in our study, with the standardized anxiety PROMIS score and depression PROMIS score as the outcomes for two separate analyses. The data cleaning process is described briefly in the supplementary material. We first check whether the PROMIS scores are different among different ethnic groups. ANOVA was used for the tests, and the results are shown in Table 1.

Table 1 ANOVA to compare anxiety and depression PROMIS scores by ethnicity.

Yost et al. (2011) stated that a difference of around 3.0 in PROMIS scores between groups is considered as important. The difference in the average score between NHW and HW is 3.58 for anxiety and 2.90 for depression. We conclude that the differences in the anxiety and depression PROMIS scores between NHW and HW are clinically meaningful. Therefore, we performed mediation analyses on the dataset to identify potential specific risk factors that may be used to explain this identified difference.

3 Methods

3.1 General Definitions of Mediation/Confounding Effects for Multiple Mediators/Confounders

Yost et al. (2014) proposed general definitions of mediation/confounding effects. These definitions are related to conventional mediation analysis (e.g., Mackinnon & Dwyer, 1993; Mackinnon, 2008), but are more general in that they are consistent for different types of predictors or outcomes. The definitions are based on three basic assumptions (VanderWeele & Robinson, 2014; VanderWeele et al., 2014) that were widely used in defining mediation/confounding effects:

Assumption 1

No unmeasured confounder for the exposure–outcome relationship;

Assumption 2

No unmeasured confounder for the mediator–outcome relationship;

Assumption 3

Mediator \(M_i\) is not causally prior to other mediators/confounders \(M_{-i}\).

With these assumptions, given other factor(s) \({\mathbf {Z}}\), Yu et al. (2014) define the mediation/confounding effects through the rate of change in Y when X changes from x by a \(u^*\) unit, where \(u^*\) is the smallest unit of X, such that \(u^*+x\in \mathrm{domain}(X)\), for any \(x\in \mathrm{domain}(X)\). Based on the notations, the average total effect is defined as

$$\begin{aligned} \mathrm{ATE}_{{\mathbf {Z}}} = E_{x^*}\left[ \lim _{u\rightarrow \ u^*}\frac{E(Y(x^*+u)|{\mathbf {Z}})-E(Y(x^*)|{\mathbf {Z}})}{u}\right] , \end{aligned}$$

and the average direct effect not through \(M_i\) (the ith mediator) is

$$\begin{aligned} \mathrm{ADE}_{\backslash M_i\mid {\mathbf {Z}}} = E_{x^*} E_{M_i} \left[ \lim _{u\rightarrow \ u^*}\frac{E[Y(x^* + u, M_i, {\mathbf {M}}_{-i}(x^* + u))\mid {\mathbf {Z}}] - E[Y(x^*, M_i, {\mathbf {M}}_{-i}(x^*))\mid \mathbf{Z }]}{u} \right] , \end{aligned}$$

where \({\mathbf {M}}_{-i}\) denotes the vector of mediators excluding \(M_i\). The definition of direct effect not from \(M_i\) is analogous to the natural direct effect [defined as \(E\{Y(x^*,M(x))-Y(x,M(x))\}\) at two different levels of the exposure variable x and \(x^*\) (Robins & Greenland, 1992; Pearl, 2001)] that allows for natural variation in the levels of the mediator between subjects. Instead of allowing \(M_i\) to vary conditionally on a fixed level of the predictor, we allow it to vary at its marginal distribution. The average direct effect measures the average changing rate in the potential outcome with the modification of X, where the distribution of \(M_i\) does not change with X (marginal distribution), while distributions of all other mediators/confounders change with X (conditional distributions). Therefore, we call the rate of change in Y the direct effect of X on Y not from \(M_i\), which includes the direct effect of X on Y and the indirect effects through mediators/confounders other than \(M_i\). The definition of indirect effect through \(M_i\) is defined straightforwardly as \(\mathrm{AIE}(M_i |{\mathbf {Z}})=\mathrm{ATE}_{{\mathbf {Z}}}-\mathrm{ADE}_{\backslash M_i\mid {\mathbf {Z}}}\). In summary, the general indirect effect of a mediator is defined as the changing rate in the outcome when the relationship from the exposure to the mediator/confounder is deactivated. With the general definition, the cross-world assumption, which requires that no measured or unmeasured effect of the exposure that confounds the mediator-outcome relationship (Avin et al., 2005), is not needed to identify the mediation/confounding effect. As an example, Fan (2012) discussed the situation when there are exposure–mediator interaction effects on the outcome for binary and continuous exposures. We include the result in the supplementary material.

Applying the definitions to the special case in our analysis, the exposure variable is the binary ethnicity (NHW, \(X=1\) or HW, \(X=0\)). The average total effect is \(E(Y|{\mathbf {Z}},X=1)-E(Y|{\mathbf {Z}},X=0)\), and the average direct effect of X on Y not through \(M_j\) is \(\mathrm{ADE}_{\backslash M_j|{\mathbf {Z}}}={\mathrm{DE}}_{\backslash M_j|{\mathbf {Z}}}(0)=E_{m_j}\{E_{{\mathbf {M}}_{-j}|X=1}[E(Y|{\mathbf {Z}},M_j=m_j,{\mathbf {M}}_{-j},X=1)]-E_{{\mathbf {M}}_{-j}|X=0}[E(Y|{\mathbf {Z}},M_j=m_j,{\mathbf {M}}_{-j},X=0)]\}.\)

Compared with conventional definitions of the average total effect that look at the average differences in the expected Y between \(x = a\) and \(x = a^*\) (for example, Mackinnon & Dwyer, 1993; VanderWeele, 2009; Vansteelandt & Daniel, 2017), the definitions by Yu et al. (2014) are based on the rate of change. The motivation is that first the exposure levels a and \(a^*\) do not have to be preset, thus generalizing the definitions from binary to multi-categorical or continuous exposures, and second, the effects will not change with the unit of X. Yu et al. (2014) have shown that the proposed method is equivalent to the conventional method in the single continuous mediator case. They also established the relationship between the proposed definitions of direct or indirect effect and the natural direct or indirect effects in the single binary mediator case. The mediation analysis is generalized so that it can handle binary, multi-categorical or continuous exposure, mediator, and response variables. In addition, algorithms were provided by Yu et al. (2014) so that general predictive models including multivariate additive regression trees (MART) and Cox proportional hazard models can be used in addition to generalized linear models for fitting variable relationships.

In this paper, mediation analysis is used to explore the racial disparity in HRQOL measurements. There are controversies in interpreting the race effect since it is impossible to establish causal effect of race. However, “effect of race” can be defined. VanderWeele and Robinson (2014) extensively discussed the challenges and different interpretations of race effect. We use their interpretation where “the effect of race involves the joint effects of race-associated physical phenotype (e.g., skin color), parental physical phenotype, genetic background, and cultural context when such variables are thought to be hypothetically manipulable and if adequate control for confounding were possible” (VanderWeele & Robinson, 2014). Combined with the method proposed above, direct effect of race is interpreted as the remaining racial disparity if distributions of various risk factors across racial groups could be equalized. The indirect effect from a certain risk factor (mediator/confounder) is the change in the health disparity if the distributions of the risk factor can be set as the same across racial groups, while distributions for other risk factors are kept as observed. With this interpretation, the hypothetical manipulation on race is not required. Instead, the interpretation was performed by framing around more manipulable risk factors, such as environmental and healthcare facility variables. For detailed discussion on explaining “race effect,” the readers are referred to VanderWeele and Robinson (2014).

3.2 Multiple Mediation Analysis for Non-/Semi-Parametric Predictive Models with Binary Exposure

When (generalized) linear regression is insufficient to describe the relationships among variables, we seek to use the non-/semi-parametric predictive models. Algorithms 1 and 2 that derived directly from the definitions of mediation/confounding effects provide the method to calculate mediation/confounding effects when the exposure variable is binary with the assumption that the sample size at each exposure level is large. More general mediation analysis with any types of exposures is discussed in a separate paper (Yu & Li, 2017). The R package mma was built using the algorithms to make inferences on mediation/confounding effects.

The algorithms are based on the assumption that the prediction model of E(Y) has the form

$$\begin{aligned} E(Y_{i}) = f(x_i,M_{1i},\ldots ,M_{pi}), \text{ for } i=1,\ldots ,n, \end{aligned}$$
(1)

In the R package mma, MART and generalized linear models (GLM) can be chosen to build f.

Algorithm 1

Estimate the total effect:

  1. 1.

    Randomly draw N vectors of mediators/confounders from the sub-population where \(X=0\), denoting as \((M_{1j1},\ldots , M_{pj1})^T\), for \(j=1,\ldots ,N\).

  2. 2.

    Randomly draw N vectors of mediators/confounders from the sub-population where \(X=1\), denoting as \((M_{1j2},\ldots , M_{pj2})^T\), for \(j=1,\ldots ,N\).

  3. 3.

    \(\mathrm{TE}=\frac{1}{N}\left[ \sum _{j=1}^Nf(1,M_{1j2},\ldots ,M_{pj2})-\sum _{j=1}^Nf(0,M_{1j1},\ldots ,M_{pj1})\right] .\)

We can also calculate the total effect using \(E(Y|x=1)-E(Y|x=0)\) directly from the observations.

Algorithm 2

Estimate the direct effect not through \(M_k\):

  1. 1.

    Use the samples generated by Steps 1 and 2 of Algorithm 1.

  2. 2.

    Combine the vectors \(\{M_{kj1}\}_{j=1}^N\) and \(\{M_{kj2}\}_{j=1}^N\) and randomly permute the combined vector, denote the new vector as \(\{{\tilde{M}}_{kj}\}_{j=1}^{2N}\). \(\{{\tilde{M}}_{kj}\}_{j=1}^{2N}\) forms a sample of \(M_k\) from its marginal distribution.

  3. 3.

    \(\mathrm{DE}_{\backslash M_k}\) is estimated by \(\frac{1}{N}\left[ \sum _{j=1}^Nf(1,M_{1j2},\ldots ,M_{k-1,j2},{\tilde{M}}_{kj},M_{k+1,j2},\ldots ,M_{pj2})\right. \left. -\sum _{j=1}^{N}f(0,M_{1j1},\ldots ,M_{k-1,j1},{\tilde{M}}_{k,(N+j)},M_{k+1,j1},\ldots ,M_{pj1})\right] .\)

Due to the randomness brought in by sampling, the two algorithms are repeated, and the average results from the repetitions are estimates of mediation/confounding effects. For the analysis in this paper, we did 20 permutations.

3.3 The R Package mma

The mma package, available on the Comprehensive R Archive Network (CRAN) (Yu & Li, 2017), was generated based on the above two algorithms for mediation analysis. It has two sets of functions for multiple mediation analysis. One is the step-by-step process, where the function data.org helps identify the mediators/confounders and covariates. Then the organized data sets are read into the function med to estimate the mediation effects [indirect effect (IE), total effect (TE), direct effect (DE)]. Finally, the function boot.med helps to report summary statistics of the estimated mediation/confounding effects, where the standard deviations and confidence intervals are calculated using the bootstrap method. An alternative process combines the three steps in one function, mma. For both sets of functions, linear or nonlinear predictive models can be chosen for mediation analysis. By default, generalized linear models are used to model the associations among variables. If nonlinear method is chosen, MART and smoothing splines are used to model the relationships.

MART, originally proposed by Friedman (2001), is an ensemble technique that aims to improve the performance of a single model by fitting many models and combining them for prediction. MART employs two algorithms: “regression tree” from Classification And Regression Tree (CART) (Breiman et al., 1984) and “boosting” which builds and combines a collection of models, i.e., trees. We choose MART as a modeling method in mediation analysis since first, MART is able to capitalize on the nonlinear relationships between the dependent and independent variables with no need for specifying the basis functions. Second, due to the hierarchical splitting scheme in regression trees, MART is able to capture complex and/or high-order interaction effects. And third, as a tree-based method, MART can handle mixed-type predictors (i.e., quantitative and qualitative covariates) and missing values in covariates. Moreover, MART is able to handle time-to-event (survival) outcomes (Yu et al., 2009). Smoothing spline is a functional estimate that balances between the “goodness-of-fit” and “smoothness.” Smoothing spline is fitted to a set of spline basis functions, typically by least squares with a penalty on the “roughness.” Since spline bases are used for model fitting, nonlinear relationships can be fitted among variables. Also, the method can be easily extended to deal with data sets of hierarchical structure (Gu, 2013).

The mma package provides generic functions to help explain the results from mediation analysis. For our research, any variable that is significantly related to ethnicity and is significantly associated with the outcome when other variables are included in the model is treated as a potential mediator. Variables that significantly associate with the outcome when other variables are adjusted, but not significantly related to the ethnicity are included in the mediation analysis as other covariates. Variables that are not related to the outcome when other variables are considered are excluded for further analysis. The function data.org tests associations among variables and identifies mediators/confounders and covariates. Its results can be summarized to show selected variables and the test results (p values) for each associations of interests. Moreover, the outputs from the function boot.med or mma can be summarized to show the inferences results on mediation/confounding effects (estimates, standard deviations, and confidence intervals). The graphic function, plot, helps researchers visualize the complicated relationships and explain the ethnic differences in health outcome. The readers are referred to Yu and Li (2017) for details on how to use the mma package and Yu et al. (2017) for an example of applying the mma package.

4 Results

We explore the ethnic disparity in HRQOL outcomes (anxiety and depression) using the mediation analysis with multiple mediators/confounders (Yu et al., 2014) using the MY-Health study on cancer patients.

4.1 Selection of Potential Mediators/Confounders

Potential mediators/confounders are the variables that can potentially explain the ethnic disparities. Usually, mediators/confounders are significantly related to ethnicity and with the HRQOL outcome (Baron & Kenny, 1986). Our first step of analysis was to clean the MY-Health data set and identify potential mediators/confounders.

The MY-Health data set was well maintained by the research group. We performed the following two steps for data cleaning. Firstly, some variables were created. We created the variable comorbidities to indicate the comorbidity conditions of patients using the “Self-Reported Comorbidity” section of the MY-Health survey. The comorbid conditions (e.g., heart attack and asthma) were reported by patients with the possible answers for each condition “Yes,” “No,” or “Unsure.” The variable comorbidities record the total number of comorbidities a patient reported to have experienced. Its values range from 0 to 13. We also regrouped some variables into more convenient categories. For example, “surgery of the primary site” was site specific indicating the type of surgery. For example, tumor destruction and resection were code as 10–19 and 20–80, respectively. We combined the values into the variable “surgery,” which indicates whether surgery was performed as part of the treatment. Secondly, missing data were evaluated. Variables like “age to USA” and “years lived in USA,” which do not apply to patients born in the USA, tended to have many missings. For such cases, we included the variable “US born” as an indicator of whether the patient was born in the USA and excluded “age to USA” and “years lived in USA” for further analysis. Based on literature reviews and data availability, we included all variables listed in Table 2 other than the exposure variable and outcomes, as potential mediators/confounders. In the table, “social support” is a PROMIS score that measures the companionship, and emotional, information, and instrumental supports one can get. The variable “spirituality” measures one’s spiritual support. The variable “income” was recoded to five ordered categories from “less than $10,000” to “$200,000 or more.” Education was also ordered to five categories from “lower than high school” to “graduate degree.” Since both variables are ordered and MART was used to fit models in which linear relationship assumption is not required, we treated both variables as continuous. The variables, their variable formats and data sources are listed in the table.

Table 2 Variables, formats, and data sources.

To understand the relationships among variables and to identify potential mediators/confounders and covariates, we tested the significance of two associations: (1) between ethnicity and the potential mediator/confounder; and (2) between the potential mediator/confounder and the outcome, when other variables are controlled. For this initial selection process, we set the significance level at 0.1 to reduce the risk of falsely ignoring important variables. To test the first relationship, we use ANOVA for continuous mediators/confounders and Chi-square test for categorical mediators/confounders. For the second relationship, we use the type-3 tests in a linear model with all potential variables. Table 3 shows the test results and identified potential mediators/confounders (with \(*\)) and covariates (with \(^-\)) for outcomes anxiety and depression, respectively. Note that the significant exposure–mediator relationship and mediator–outcome relationship are not prerequisites for variables to be included in the mediation analysis. Since the variables employment, insurance, and income are highly correlated, we forced the three variables into the analysis as potential mediators/confounders and estimated the joint effect from them.

Table 3 Potential mediators/confounders and covariates.

From Table 3, we found that mostly, the mediators/confounders and covariates were similarly selected for both anxiety and depression. The differences are that “days from diagnosis” was chosen as a potential mediator/confounder for depression, but not for anxiety. Also, “radiation” and “hormonal therapy” were covariates for depression but not for anxiety, while “grade of cancer” was chosen as a covariate for anxiety but not for depression.

4.2 Mediation/Confounding Effects Explaining the Ethnic Disparities in HRQOL Outcomes

Tables 4 and 5 show the estimated direct and indirect effects in explaining the ethnic disparity in anxiety and depression, respectively, using the generalized linear models and nonparametric models. The confidence intervals were calculated based on 1000 bootstrap samples. The relative effect (RE) is defined as the ratio of the indirect or direct effect over the total effect. In the tables, joint effect is the joint effect of income, employment, and insurance. For both outcomes, more ethnic disparities are explained by nonlinear models since they account for potential nonlinear relationships. To explain the results, we focus on using the nonlinear models. Figure 2 shows the relative effects for the outcomes from the nonlinear models. Although “days to diagnosis” was selected as a potential mediator/confounder for depression score, it does not have a significant indirect effect in explaining the ethnic difference. Overall, the significant mediators/confounders are the same for both depression and anxiety scores and the order of mediation/confounding effects are roughly the same. In explaining the mediation/confounding effects, we focus on the anxiety outcome. Results for the depression outcome can be similarly deduced.

Table 4 Summary of mediation/confounding effect estimations for ethnic disparity in anxiety PROMIS score.
Table 5 Summary of mediation/confounding effect estimations for ethnic disparity in depression PROMIS score.
Fig. 2
figure 2

Estimated relative effects (RE) on the ethnic disparities in anxiety (left) and depression (right) by nonlinear models. Note that “de” refers to the direct effect, the ethnic disparities cannot be explained by mediators/confounders, and “joint effect” is the joint indirect effect from employment, insurance, and income.

Compared with Hispanic whites, non-Hispanic whites have an average lower anxiety PROMIS score (\(\mathrm{TE}=-3.08\)). If the “US nativity” could be set equivalent among different ethnic groups, the ethnic disparity in anxiety would reduce by \(22.5\%\). Other variables such as social support (\(21.6\%\)), education (\(18.5\%\)), age at diagnosis (\(16.7\%\)), and the joint effect of income, insurance, and employment (\(5.4\%\)) also significantly explain the ethnic difference. An interesting variable is spirituality, which have a positive indirect effect (opposite to the total effect) and a significant negative relative effect (\(-\,19.7\%\)). This is observed since the anxiety score decreases with spirituality support, which was on average higher among Hispanics compared with non-Hispanic whites. Spirituality support is a protective factor. If the spirituality support is kept equivalent between non-Hispanic white and Hispanic whites, the ethnic disparity would increase \(19.7\%\).

The mma R package provides visual aids to understand the relationships among variables. For example, Fig. 3 describes how age mediated the ethnic disparity in anxiety. The top plot shows the relationship between age and anxiety score. The line was fitted using the Multiple Additive Regression Trees (MART). We can see that the average anxiety was the highest when the patients are diagnosed with cancer at an age younger than forty. After that, the anxiety score decreased sharply with age, until about the age 78, at which the decreasing rate became level. The lower two plots show the age distribution among non-Hispanic (middle plot) and Hispanic (lower plot) population. We found that non-Hispanic whites were diagnosed with cancer at a relatively older age than were Hispanic whites. Therefore, non-Hispanic whites have an average lower anxiety score. If the age distributions could be set as equivalent across the ethnic groups, the ethnic disparity would decrease by \(16.7\%\). Similarly, Fig. 4 shows how US nativity helps explain the ethnic disparity in anxiety score. In conclusion, US-born patients have an average lower anxiety score than foreign-born patients, and there are more foreign-born Hispanic whites (\(58.25\%\)) than non-Hispanic whites (\(5.83\%\)). Therefore, the variable US born explained part of the ethnic disparity. The supplementary material provides the graphs for mediators/confounders to explain how each of them explains the ethnic disparity in anxiety score. Overall, having social support, being employed, male, having private insurance, no chemotherapy, high income, high education, old age, US born, and few comorbidity are related to low anxiety. And in general, there are higher proportion of non-Hispanic whites with such properties.

Fig. 3
figure 3

Indirect effect of age (of diagnosis with cancer) on anxiety.

Fig. 4
figure 4

Indirect effect of US born (whether the patient was born in USA or not) on anxiety.

After accounting for all the mediators/confounders, there is still 27% of the ethnic disparity in anxiety score that cannot be explained (the direct effect). This could be due to variables that can explain the difference was not all included in this study, such as the environmental factors.

5 Conclusions and Future Research

Mediation analysis is used most often in health sciences and psychological research, with different behavioral and socio-demographic variables used as the mediators/confounders. Its application in a study such as this one is to identify which factors could be important in explaining the relationship between a predictor and an outcome from a large pool of candidate mediators/confounders. Both the MY-Health study in general and our assessment using the data aim at taking a broader approach to study cancer survivors’ experiences in terms of HRQOL. MY-Health did not limit their study by cancer site, instead included seven different cancers (female breast, uterine and cervical cancer, prostate cancer, and male and female colorectal cancer, non-Hodgkin’s lymphoma, and nonsmall cell lung cancer), at three different age stratification groups (21–49, 50–64, 65–84). A stratified random sampling scheme was adapted in order to ensure a racially and ethnically diverse survivor population (Hispanic, non-Hispanic white, black, and Asian/Pacific Islander). The results of this study have allowed us to shed light on the factors that mediate the ethnic disparities in two very serious mental symptoms, anxiety and depression. The number of cancer survivors in the USA is growing rapidly as treatments for cancers improve. There will be a growing number of survivors, and our study helps show how we can care for their psychological needs along ethnic lines. Clinicians should focus on screening those survivors with less education, who were diagnosed at a younger age, with no insurance or having only government insurance, with multiple comorbidities, less socially supported, and having lower levels of spirituality. Researchers should focus more on the modifiable factors as they develop interventions and long-term care strategies. If an intervention is chosen to improve the HRQOL of cancer patients, the proposed method can also be used to evaluate the intervention. The two comparison populations will then be the groups with and without intervention separately. The effect of the intervention is measured by the total effect between the two populations.

However, there remained disparities in the anxiety and depression scores after accounting for all the mediators/confounders we collected. It is likely due to that we unknowingly omitted some important variables or the variables were not collected for the MY-Health study. This is considered to be the most common cause of specification error by Judd and Kenny, and they note that this problem is often not easy to fix (Judd & Kenny, 1981). There are many variables in the MY-Health data set that we omitted since either we did not consider them relevant when designing our analysis (e.g., type of radiation treatment administered, SEER region, or the different MY-Health exclusive scores such as financial well-being), or we found through exploratory analysis that did not seem to be relevant (survey mode: by mail or phone). It is possible any one of these variables may actually be a crucial mediator/confounder, and needs further exploration. Moreover, we believe geographic variables about build (e.g., walkability) and social environment (e.g., census tract poverty level) may have an influence on anxiety and depression, but were not collected for this study. Our next step is to build the environmental risk factor data set and link it with the MY-Health study. We will develop a multilevel mediation analysis to account for mediators/confounders from different levels and sources.