Introduction

The rapid economic growth of China has resulted in a historically unprecedented surge in urbanization. Increasing numbers of rural inhabitants have joined this exodus to the cities in search of an improved quality of life. According to the National Bureau of Statistics, in 2015, rural–urban migration in China involved approximately 278 million people (including migrant workers, who numbered 169 million), which may constitute the most massive migration in the history of humankind (Meng et al. 2010).

Chinese rural–urban migrants are usually engaged in 3D (i.e., dirty, dangerous, and demanding) work that urban residents are seldom willing to perform. They often work more extended hours at a high intensity and with less protection than native residents do, while they frequently live in worse conditions and have less support from their families (Meng et al. 2010). These factors significantly increase the risk of rural–urban migrants becoming sick while living in cities. Moreover, due to the discrimination involved in the “hukou” household registration system, rural–urban migrants are often systematically excluded from urban public resources, one of which is access to healthcare. Consequently, rural–urban migrants receive less coverage under formal medical schemes and encounter more barriers when applying for reimbursement of treatment expenses, even if they have specific insurance (Hu et al. 2008).

Although migrant workers face a higher risk of poor health and lower chance of accessing and affording treatment, studies face challenges when asserting that the health status of the migrant workers is worse than that of urban residents. Indeed, most studies have found that migrants exhibit better health than urban natives because young, healthy individuals having a higher capability and propensity to migrate. The “healthy immigrant effect” (henceforth referred to as the HIE) is a well-established phenomenon in the field of international migration, which describes the observation that immigrants are generally healthier than natives (Antecol and Bedard 2015).Footnote 1 A more nuanced definition includes a second aspect: after immigrants move to their new country of residence, the health advantage gradually deteriorates (Aglipay et al. 2013) in what referred to as the years since migration (henceforth referred to as the YSM) effect (McDonald and Kennedy 2004).

Numerous empirical studies have investigated the effects of HIE and YSM in the international migration context. Most of them inevitably come from developed countries, since they are the countries receiving the most immigrants. For developing countries, especially for the vast countries such as China, internal migration is often from rural to urban areas rather than the immigration patterns. Accordingly, some researchers have sought to determine whether internal migrant workers are healthier than urban residents and whether this health advantage deteriorates over time in ChinaFootnote 2. However, most studies have mainly focused on self-reported health or physical health indicators, such as diseases and disability (Tong and Piotrowski 2012), while the mental health status of migrant workers has been paid less attention.

Intuitively, it may be predictable that the healthy migrant effect is evident among Chinese rural–urban migrants according to measures of self-evaluated or physical health. After all, in most cases, people who choose to migrate have better health or exhibit a positive outlook about their health. However, in the context of mental health, the scenario may differ substantially. Migrant workers who work in unfamiliar environments without the company of family members may feel lonely and anxious. Moreover, rural to urban migrant workers who perform low-skilled tasks may commonly be discriminated against by native residents. Therefore, they may tend to have more anxiety disorders, depression, and low self-esteem than urban native residents, although they are physically healthier. Hence, the evidence of the healthy migrant phenomenon regarding physical health in China may not apply to mental health. This association needs to be empirically tested, preferably using national-representative survey data; therefore, the original motivation of this research was to focus on the mental health of rural to urban migrants in China.

Furthermore, China is undergoing a period of epidemiologic transition: the reason for most deaths in recent decades is noncommunicable instead of infectious disease, and with the rapid development of industrialization, some health problems have emerged, for instance, mental health disorders (Lu et al. 2012). The mental health problem of migrant workers is a crucial issue of immediate significance.

Therefore, in this study, we first contribute to the literature by using more representative nationwide survey data focused on rural–urban migrants in China, the Rural–Urban Migrants in China (RUMiC) database, to explore the HIE in regard to mental health in China.Footnote 3 Our results show that the mental health level of migrant workers is better than that of native urban residents based on the regression analysis controlled for individual characteristics. This is consistent with the stylized facts of the HIE, as we mentioned.

Second, due to the unique data framework of the RUMiC database, we were able to use not only the samples from the Rural-to-Urban Migrants Survey (UMS) and the Urban Household Survey (UHS) but also that from the Rural Household Survey (RHS), which allowed us to assess potential channels of the healthy migrant phenomenon. Concretely, we classified rural samples in the RHS into three categories: (1) people who have never migrated, (2) people who have returned to their hometown, and (3) people who had jobs in urban cities but were temporarily living in a rural area during the survey fieldwork period. Consequently, we were able to discuss the mechanisms of self-selection by migrants and the “salmon effect”, as well as the measurement of the healthy migrant phenomenon itself. We found a substantial, significant self-selection effect and a weaker, insignificant “salmon effect” as possible mechanisms causing this phenomenon.

Finally, we determine that the socioeconomic status (SES) disparity between migrants and urban natives plays a crucial role in whether a healthy migrant effect on mental health exists in China. For the segmentation of the hukou household registration system in China, migrant workers possessed relatively lower SES as measured by household income, educational attainment, and job characteristics than their urban counterparts, which drastically impaired their health status.

Conceptual Background and Literature Review

HIE Framework: From International Immigration to Internal Migration

As mentioned earlier, the health immigration effect (HIE) describes a phenomenon in which immigrants are generally healthier than natives. Two concepts of defining the HIE are commonly found in the literature. One involves directly comparing the mean of the health outcomes between natives and immigrants, which is also called the “unconditional” HIE (e.g., Gotsens et al. 2015; Kwak and Rudmin 2014). The other is based on multiple regression analysis and expresses the HIE as “immigrants are healthier than their native counterparts” (e.g., Antecol and Bedard 2015). The word “counterparts” reminds us that the latter definition requires controlling for covariates, such as demographic variables, work, income, and education.

There is a large body of literature about the HIE in immigration studies. Researchers often compare the health of immigrants to that of natives at the destination in terms of various indicators of health status for most developed countries, including the US, Canada, Australia, the UK, Germany, Sweden, and others, which are often the most major immigrant-receiving countries.Footnote 4 Moreover, most studies, whether unconditionally comparing or conditionally comparing the mean of health outcomes, provide some evidence that HIE does exist, which means that immigrants are both conditionally and unconditionally healthier than native residents.

However, only a few studies have been performed concerning internal migration; most are conditional comparisons. For example, examining the economically motivated movement from rural to urban areas in Indonesia, Lu (2008) applied the HIE to internal migration and determined that the migration process is complex and diverse while the healthy migrant phenomenon does exist in Indonesia. Subsequently, Lu (2010) used the Indonesian Longitudinal Data for 1997 and 2000 to examine the YSM effect of rural–urban migration on physical and mental health. She concluded that migration has little impact on physical health for several years but that the YSM effect does apply to mental health among migrant workers in Indonesia.

Some scholars have also used the HIE to explore the problems faced by rural to urban migrants in China. For instance, using the data from CHNS (China Health and Nutrition Survey), Tong and Piotrowski (2012) suggested that, in terms of self-reported health, high blood pressure, heart disease, and ADLs (activities of daily living), migrants are generally healthier than native residents. Niu (2013) also analyzed aspects such as self-reported health, chronic diseases and disability and identified the health advantage of migrant workers over native urban residents in China by using data from the third issue of the Chinese Women’s Social Status Survey. In other words, she confirmed the existence of HIE in China as well. Meanwhile, she found that migrants who eventually returned to their hometown had worse health conditions than the rural residents who never migrated, which supports the theory of the YSM effect. Using data from the eight district migrant workers survey in Beijing, Yuan (2009) revealed the diminishing health situation of migrant workers after their immigration and the formation of the vicious cycle of decreasing health status and decreasing income, both of which support the existence of the YSM effect. Utilizing the 16-province tracking data from the ministry of agriculture fixed observation points of China, Qin et al. (2015) also showed similar results. Furthermore, Zhou and Lu (2016) demonstrated that the YSM effect exists among manufacturing and construction workers more often than it does among those in white-collar jobs, such as administrative, technical, management and service jobs.

In summary, numerous studies on the healthy migrant phenomenon in China have investigated self-reported health and physical health, whereas evidence concerning mental health is relatively limited.

Mechanisms of the Healthy Migrant Effect

Many previous studies have provided three potential channels to explain why the HIE exists in China (Chen 2011).

Health Care Access and Health Behaviors

Studies have indicated that barriers to health service access result in immigrants obtaining relatively less information concerning diseases that they have, thereby causing them to provide relatively high ratings of their health condition, which can result in overestimation of subjective health indicators of the HIE (McDonald and Kennedy 2004). However, empirical conclusions regarding this mechanism are unclear because some studies have reported that increasing health care access can effectively improve health levels. Immigrants commonly have less access to public medical care systems, which may impair health and reduce HIE further. Hence, the influence of medical factors on HIE is hard to determine (Antecol and Bedard 2015). In addition to the healthcare mechanism, health behaviors may play another potential role in explaining HIE (Kennedy et al. 2015). For example, a healthy premigration lifestyle may have involved high levels of physical activity and low-calorie diets. These health behaviors are more conducive to a better health status.

Selective Migration Effect

In most cases, migrants tend to be healthier people. Therefore, new immigrants enjoy a health advantage over native residents. This mechanism can account for the HIE among domestic migrant workers in China: healthy rural residents tend to migrate for work (Tong and Piotrowski 2012; Zhou and Lu 2016). In international migration, another mechanism affects potential migrants, namely, the immigration screening processes of host countries can reject applicants with relatively inferior health. Although this mechanism does not exist for internal migrants in China, strict household registration policies can be considered a form of migration screening and allow only relatively privileged migrant workers being permitted to settle; these strict registration policies are enforced in megacities such as Beijing and Shanghai and force unhealthy migrant workers to return to rural areas. This strict household registration policy can also explain the third HIE mechanism, namely, the “salmon effect”.

Salmon Effect

The term is used to describe the patters that immigrants who are relatively less healthy are more likely to return to their country of origin; consequently, the average health condition of immigrants in the host country is superior. However, the empirical results are mixed. Fong (2008) reported that Chinese sojourners in the United Kingdom and Ireland returned to China because of personal or familial health crises. Van Hook and Zhang (2011) determined that immigration was less strongly associated with health in the United States. Nevertheless, we contend that the “salmon effect” can explain the healthy migrant phenomenon in China. For example, Niu (2013) reported that migrant workers who returned to rural areas were less healthy than those who did not. Moreover, Zhou and Lu (2016) stated that “migrant workers devoted themselves to the development of the city, but when they become less healthy, the city was unwilling to accept them and pushed them back to rural areas.”

Healthy Migrant Effect and Mental Health

Relative to studies using physical health as the primary outcome variable, few have been conducted on the HIE and mental health. Alegria et al. (2008) and Cook et al. (2009) have assessed that Latino immigrants in the United States exhibit a lower risk of developing psychiatric disorders than do US-born Latino immigrants. Similarly, adopting psychiatric disorders and anxiety disorders as indicators to measure mental health, Menezes et al. (2011) and Aglipay et al. (2013), respectively, confirmed that the HIE is evident in Canada. Rivera et al. (2016) used the GHQ-12 questionnaire as an indicator of psychological well-being and revealed that a group who migrated to Spain less than 10 years prior exhibited better mental health. They also observed no HIE in the group who had migrated more than a decade prior, which supports the existence of the YSM effect. Bergeron et al. (2009) reported that immigrants in Canada suffered diminishing mental health because of cultural, language, and climate-related problems that had occurred following migration. Straiton et al. (2014) indicated that men from non-English speaking backgrounds were exposed to an increased risk of mental health problems, which also verified the action of the YSM effect on mental health.

Compared with the studies above, the mental health of Chinese rural to urban migrants has received less attention. To the best of our knowledge, only Wen et al. (2010) and Chen (2011) used data from Shanghai and Beijing, respectively, to study the healthy migrant phenomenon in relation to mental health in China. They presented mixed and inconsistent pieces of evidence. Wen et al. (2010) confirmed the healthy migrant phenomenon impact on mental health by using survey data that only covered Shanghai, whereas Chen (2011) reported the opposite results by using survey data solely from Beijing. The contradictory conclusions reflect not only a considerable heterogeneity between the two different regions of China but also unusually insufficient evidence of the phenomenon regarding mental health.

Data, Methods, and Variables

Rural–Urban Migration in China (RUMiC) Survey Data

This study used data from the RUMiC survey, which implemented a stratified sampling method using random mapping to conduct detailed investigations of five thousand rural–urban migrant families in fifteen large and medium-sized cities in China. The survey obtains individual and family characteristics, including occupation, income, consumption, and daily routine. In addition to information about the migrant families, it also obtains data from samples of five thousand urban residents in the fifteen cities above and eight thousand rural residents in ten provinces. Thus, the RUMiC survey data comprise three datasets: the “Rural-to-urban Migrants Survey (UMS)” dataset, the “Urban Household Survey (UHS)” dataset, and the “Rural Household Survey (RHS)” dataset.

We use the pooled cross-sectional data of the RUMiC 2008–2009 data, which covers the years 2007 and 2008. The advantages of the RUMiC survey are as follows: (1) It focuses on rural–urban migration issues, and its purpose corresponds to that of this study. (2) The data sampling method is distinct from those of previous surveys of migrants in that it applies both workplace and place of residence to determine the sampling frame. Therefore, we were able to include migrant workers who live at their workplace, who have typically been omitted from previous surveys. (3) More importantly, the data include native urban residents, migrant workers, and rural residents based on a nationally representative sample.

Methods

This study measured the healthy migration phenomenon by comparing migrant workers (from the UMS dataset) with urban residents (from the UHS dataset). We evaluated the healthy migrant effect through an ordinary least squares regression model. Our baseline estimating equation was as follows:

$${\text{mh}}_{\text{idt}} = \beta {\text{Mig}}_{\text{idt}} + \gamma X^{\prime}_{\text{idt}} + D_{\text{id}} + T_{\text{it}} + \mu_{\text{idt}}$$
(1)

where mhidt represented the level of mental health of individual i in area d at year t. Migidt indicated whether the individual is a migrant worker (1 = migrant worker, 0 = urban resident). X′idt represented a series of control variables. Did and Tit represented the region fixed effect and the year fixed effect, respectively. μidt was the error term. In particular, β was the critical coefficient of concern, representing the coefficient of the healthy migrant effects.

Equation (1) was not only used for the basic framework but was also adapted for subsequent analysis of the mechanisms. To analyze the selective migration effect and the “salmon effect”, we classified samples in the RHS into three groups: (1) rural residents who never had any migratory experiences and continuously lived in rural areas; (2) migrant workers who had migration experiences but who returned to their rural hometowns in the relevant year; and (3) migrant workers who were temporarily living in the city during the period when the survey was conducted. These groups were referred to as nonmigration residentsR, return workersR, and migrant workersR, respectively. The superscript R represents that all three above are subsets among the RHS dataset.

Specifically, we used two different samples for the comparison in Eq. (1). The first was composed of migrant workers (from the UMS) and native urban residents (from the UHS) and was used to measure the healthy migrant phenomenon. Second, to assess the selective migration effect, we used the nonmigration residentsR, migrant workersR, and return workersR samples to determine the “salmon effect”. Therefore, the study not only analyzed the healthy migration phenomenon but also attempted to explore the mechanisms behind the phenomenon; it may provide a more comprehensive conclusion than many other studies regarding the HIE, particularly those focused on rural–urban migrants in China.

We used the RHS sample to evaluate selective migration effect and “salmon effect” rather than comparing migrant workers (from the UMS) and rural residents (from the RHS) for the following reasons: (1) Taking the mobility patterns of rural to urban migrants into account, the sampling design of the UMS and UHS in the RUMiC was different from that of RHS in RUMiC. The samples of the UMS and UHS were derived from fifteen cities located in more coastal areas and were intended to be representative of the host places of migrants. In contrast, the RHS was chosen from rural areas in ten provinces and was designed to be representative of the source places. (2) Migrant workers in the UMS dataset were surveyed in fifteen cities and originated from all regions of China, whereas the rural residents in the RHS dataset originated from only ten provinces as sources. Therefore, comparing the samples from the UMS and those from the RHS is inappropriate for discussing the selective migration effect and the “salmon effect”. Although the urban residents in the UHS dataset originated from only fifteen cities, we were still able to compare them with migrant workers in the UMS from the entire country because of the definition of the HIE. (3) We chose GHQ-12 as the proxy of mental health, which is a subjective index. However, Chinese people living in different circumstances (urban area vs. rural area) may use different criteria when assessing their subjective health conditions because they face different reference groups (Mu 2014). Therefore, compared migrant workers and urban residents to measure the healthy migrant effect because they were all investigated in the same urban cities at almost the same time. However, when we examined the selective migration and the salmon effects, samples that were all investigated in the rural areas were more suitable.Footnote 5

To illustrate the samples used in this paper more clearly, we present some details of the data in Table 1.

Table 1 Sample structural

Key Variables

Different studies have used different indicators, such as psychiatric disorders (Cook et al. 2009; Menezes et al. 2011), anxiety disorders (Aglipay et al. 2013), and psychological distress (Chen 2011), to measure the status of mental health. Following the examples of Rivera et al. (2016) and Gotsens et al. (2015), we adopted the GHQ-12 questionnaire as a comprehensive indicator of mental health. GHQ-12 consists of 12 items with a 4-point Likert-type scale (ranging from 0 to 3). The score is used to generate a total score ranging from 0 (most healthy mental state) to 36 (least healthy), which is one of the most extensively used indexes for measuring common mental disorders. Its application in empirical research is well documented in various countries and various types of populations (Rivera et al. 2016). Therefore, we used the GHQ-12 questionnaire as the dependent variable to assess overall mental health level.

We also used other variables such as household incomes per capita, years of schooling, employment, ownership and contract type of employment, and occupation as control variables to evaluate the SES of the individuals. Additionally, demographic variables (gender, age, and ethnicity) were also controlled. We limited our migrant sample to individuals who were 18–60 years old, had a rural hukou and were engaged in “seeking a job and doing business” in cities. Table 2 provides the necessary information on these migrants as well as their counterparts in the cities and the native urban residents.

Table 2 Descriptive statistics: urban residents and migrant workers

Table 2 shows that the mean GHQ-12 scores for migrant workers and native urban residents were 7.589 and 7.481, respectively. The mental health level of migrant workers was worse than that of native urban workers (significance level: 10%). This result appears to cast doubt on the existence of the HIE. However, when we compared the other variables in Table 2, we found that migrant workers were more likely to be male and younger (approximately 10 years) than the natives.

Moreover, migrants were also much lower than urban residents in terms of household income per capita, education level, shares of employment in the state sector and permanent contracts. Columns (5) and (6) in Table 2 present raw the differences and standard errors of these variables, respectively, by a t test of the mean differences between the two groups. Most values of the mean differences in these variables are significant at the 0.1% significance level. In other words, migrant workers and urban residents were, to some extent, two distinct groups competing in a segmented labor market. This indicates that we should control these significant covariates to make the comparison of migrants and urban residents comparable when studying the healthy migrant phenomenon in China.

Results

Main Findings of the Healthy Migrant Effect

Table 3 presents the multiple regression results according to Eq. (1). The baseline regression only included essential demographic characteristics, such as age, gender, and ethnicity, as well as the year and province dummy variables (Model 1). The result indicates that the migrant workers’ mental health was significantly worse than that of urban residents. This means that when we only matched on demographic characteristics, such as gender, age, and minority, plus a controlling year and region dummy variables, migrant workers’ mental health levels were significantly inferior to those of urban residentsFootnote 6.

Table 3 OLS regression: migrant workers versus urban natives

Then, we gradually added some “conventional” controlling variables to the baseline regression (Model 1). At first, when the income variable was added, the key coefficient declined from 0.528 to 0.362, which reflects that migrants have much lower income than urban residents and income was positively associated with mental health (Model 2). Then, when we further controlled the education level, the result was reversed; the sign of the coefficient turned from positive to negative. This means that migrant workers’ mental health was significantly better than that of urban residents (Model 3).

In other words, when we matched the two groups and compared people with identical demographic characteristics, year, region, income level, and education level, the stylized fact of a healthy migrant effect appeared. In turn, when we did not control the income variable and education level, the migrant workers’ mental health was worse than that of urban residents because the income and education of the migrant workers were much lower than in urban residents.

To yield a deeper understanding of the healthy immigration effect, we gradually added more variables about employment, such as the status of employment and other job characteristics, if the respondent was employed. This specification allowed us to examine the effects of income, education, employment, and job characteristics separately so that we could understand the healthy migration effect better by pinpointing which aspect of SES was vital in explaining the mental health difference. As shown in Model 4, when employment status is added to the regression model as an control variable, the absolute value of the critical coefficient barely changed (from − 0.235 to − 0.232). That means that employment status could explain almost nothing. However, when we added detailed job characteristics, the critical coefficient turned from − 0.232 to − 0.342 (Model 5). This indicates that migrant workers engaged in different types of work than the urban residents, which tended to hurt the mental health of migrant workers. Additionally, from the changing magnitude of the critical coefficients between the five models, we can see that income and education played a relatively substantial role in explaining the reversal of the results.

In summary, the results reveal that migrant workers’ mental health is significantly superior to that of urban residents if we control the SES variables. Then, one may ask why the descriptive statistics showed that migrant workers’ mental health was worse. The results from the regression analyses suggest that it is because migrant workers and urban residents are two distinct groups with significant disparities in SES. The migrant workers possessed relatively inferior SES, which was negatively related to their mental health; thus, the stylized fact of healthy migrant effect—migrant workers exhibiting a better average mental health level—appeared only after equally controlling for SES.

Robustness Check

The Order of Inclusion of Controlling Variables

Our previous analysis is based on a stepwise regression method.Footnote 7 However, caution should be exercised in drawing inferences about the importance of particular covariates in relation to mental health based on the stepwise type of approach. In other words, the order of adding variables into the regression may matter for the result.

To test the robustness of our analysis in the last section, we tried five different orders of inclusion, which specifically changed the order of three important variables, education, income, and job information, and re-estimated the model to see if the results changed substantially. The results are shown in the Online Appendix Table 2. As we can see from the table, the basic pattern shown in Table 3 is relatively robust; both results present the fact that the sign of the coefficient was positive at first, then as we added more controlling variables, the sign eventually became negative, no matter which orders we chose.

The Nonlinear Estimation Model

Another concern about our results is the specification of our estimation model. As we introduced, the dependent variable was the mental health measured in the GHQ-12, which consists of 12 items with a 4-point Likert-type scale (ranging from 0 to 3). The score range from 0 (most healthy mental state) to 36 (least healthy), for which the value is always a nonnegative and integral. This indicates that our linear estimation model may be overly restrictive.

Taking the possible nonlinear relationship between mental health and migration status into account, we also used a negative binomial regression model, which is often applied to count outcomes, to re-estimate the model as a robustness check. The results are shown in Online Appendix Table 3. The sign, magnitude, and significance of the estimate coefficients in the table are similar to those in Table 3, which proves that the specification of our model did not substantially change the results.

The Restriction of Recent Migrant Workers

Ideally, health status should be observed at the time of migration when measuring the healthy migration effect, since the time after movement could lead to possible health changes arising from migration. Namely, the YSM effect may confound the comparison between migrant workers and urban residents. Therefore, we restricted the sample of migrants to the recent migrants as a robustness check.

We generated a new variable, “YSM”, which we defined as the number of years since the migrant workers came to the city. Then, we repeated the same stepwise regressions as in Table 3 but restricted the migrant sample to YSM ≤ 3 years, YSM ≤ 2 years, and YSM ≤ 1 year separately. As shown in Table 4, the coefficients are only slightly higher than those in Table 3, which means that recent migrants suffered worse mental health than permanent migrants. In general, the results are consistent with the main findings in Table 3.

Table 4 Robustness check by restricted to recent migrant workers

Potential Mechanisms of the Healthy Migration Effect

Since we have confirmed that the mental health level of migrant workers in China is consistent with the stylized facts in the HIE, after controlling the SES characteristics, the next question is what are the mechanisms behind the HIE phenomenon? As summarized in the literature review section, three channels behind the healthy migrant effect have been presented thus far: health care access and health behaviors, the self-selection effect, and the “salmon effect”.

Mechanism 1: Health Care Access and Health Behaviors

To identify the effect of health care access and health behaviors on HIE, we incrementally put more variables, which can be seen as the proxies of health care access and health behaviors, into the regression equation (presented in Model 5 of Table 3). The results are shown in Table 5.

Table 5 Analysis of mechanism: health care access and health behaviors

First, we added the variables “health care expenditure over the last 3 months” and “whether you have health insurance” as proxies for health care access to assess this mechanism. We adhered to the concept that a newly added variable is the pathway variable for explaining the healthy migrant phenomenon if the magnitude and significance of the coefficient of the migrant worker’s variable declines after the new variable is added (Cutler and Lleras-Muney 2010). After the two variables were added, the coefficient of the migrant effect became − 0.299 (Model 6), which was not considerably different from the result presented in Model 5 of Table 3 (− 0.342). In other words, the health care factor only explained a small share, which is 14.4% ((0.299–0.342)/0.299 = − 14.4%), of the healthy migration phenomenon mechanism.

Considering that “whether you have health insurance” may be insufficient to describe the social benefits in the medical service gap between the two groups, we added the logarithmic reimbursement variable, which was the amount of medical treatment or expense covered by any social insurance in the last year. The results are presented in Model 7, which exhibit minimal change compared with Model 6. As stated, health care access has both positive and negative effects on individuals’ subjective health levels. Consequently, the effect of healthcare on HIE is challenging to determine (Antecol and Bedard 2015). In this study, we could not identify whether health care access had no effect at all or whether the positive and adverse effects offset each other.

Finally, we added smoking habits and the prevalence of obesity (body mass index, BMI ≥ 28) as a proxy for health behaviors (Kennedy et al. 2015).Footnote 8 As shown in Models 8 and 9, the key coefficients changed slightly. Interestingly, the coefficients of smoking and obesity were all negative, which means they were related to better mental health. The unexpected signs for smoking and obesity could indicate that smoking and obesity damage physical health; however, they may improve one’s mood, thereby enhancing mental health.

The health care expenditure over the past 3 months not only measures the supply of healthcare services to some extent but also undoubtedly reflects the demand for healthcare services; hence, it is going to be directly related to mental health status. To see if our results are robust, we dropped the health care expenditure variable and re-estimated the model. As shown in Model 10 and Model 11, the results were similar to the previous ones, which means that our results were immune to the potential endogeneity of health care expenditure.

In summary, our study does not yet indicate that health care access and health behaviors are potential mechanisms for explaining the role of mental health in the healthy migrant phenomenon in China. Nevertheless, we should note that the variable “health care expenditure over the last 3 months” contained only general medical information and that the RUMiC data lack more specialized psychological and medical information related to mental health, as well as the distant between the residence and the nearest the health care facilities, which may also affect the health status of migrant workers. Therefore, the relationship between health care access and the healthy migrant phenomenon requires further analysis.

Mechanism 2: Selective Migration Effect

This study also investigated the selective migration effect and “salmon effect”. First, we compared samples of migrant workersR and nonmigration residentsR to examine the selective migration effect; the regression results, presented in Column 1 of Table 6, indicate that migrant workersR had a lower risk of developing mental health problems than nonmigration residentsR did; this difference passed the 5% level of the significance test. Hence, the selective migration effect was supported.

Table 6 Analysis of mechanism: selective migration and salmon effect

As a supplementary statement, the coefficients of other controlling variables mostly adhered to our intuition: the young, employed individuals with higher incomes and higher education exhibited significantly superior mental health.

Mechanism 3: “Salmon Effect”

To investigate the “salmon effect”, we ran a regression similar to the one in the previous subsection but only used the migrant workerR and return workerR samples at first. As shown in Column 2 of Table 6, migrant workersR exhibited insignificantly superior mental health levels compared to return workersR. This indicates that the “salmon effect” exerted a weak impact on the healthy migration phenomenon for migrant workers’ mental health in China. The result seems to correspond to some empirical evidence on international immigrants. That is, the effect of the salmon mechanism was relatively weaker than the effect of the selective migration effect (Van Hook and Zhang 2011). There are two opposite views to explain why migrant workers return to their hometown: worsening health conditions and familial reunion.

To estimate the selective migration effect and the “salmon effect” simultaneously, we also pooled all samples including the migrant sample, the nonmigrant sample, and the return migrant sample together and ran a new regression with two dummy variables that represented nonmigrant residents and return workers. The results are presented in column 3 of Table 6. We can see that the magnitude and significance of the two coefficients are very similar to the results from separate samples in the previous columns.Footnote 9

In summary, a selective migration effect and a relatively weaker “salmon effect” are found on the mental health of migrant workers in China. Our suggestive evidence derived from comparing urban residents and rural–urban migrants who have already self-selected to migrate to cities coincides with findings reported in the literature on the healthy migrant phenomenon. Therefore, we conducted a regression analysis under the control of individual characteristics, and we obtained results indicating that the mental health of migrant workers was better than that of urban residents. However, owing to the more inferior socioeconomic status of the migrant workers, their mental health status is worse than others when we did not control the SES.Footnote 10

Discussion and Conclusion

Using a nationwide representative dataset, RUMiC 2008 and 2009, this study examined the relation of the healthy migrant phenomenon with mental health in China. Our results confirmed that HIE does exist under the setting of internal rural to urban migrants in China and found that socioeconomic status disparities between migrants and urban natives plays a vital role in the phenomenon. In addition, we provide some suggestive evidence that self-selection and the “salmon effect” are potential mechanisms in explaining the healthy migrant phenomenon in China.

Our findings on mental health correspond to the evidence presented in the literature on the HIE. However, if SES factors are not controlled, migrant workers possessed inferior mental health conditions due to their inferior SES. The evidence reflects the severe disparity in social and economic characteristics between urban residents and rural residents; this can be attributed to the systematic discrimination involved in the “hukou” household registration system.

It is well known that the household registration system in China divides people into two distinctly different identities, “agriculture population” (rural hukou) and “non-agriculture population” (urban hukou). For a long time, people could not change their identities from one to another, except by several channels, such as going to college, joining the military, or having a marriage across the rural and urban divide. This meant that a person who had no “urban hukou”, even if he or she is living and working in a city, could not enjoy the social benefits in education, medical services, and social security that urban residents had.

Due to the discrimination involved in the “hukou” system, rural to urban migrants are often systematically excluded from urban public resources, one of which is access to healthcare (Gong et al. 2012). One reason is that the basic health insurance schemes used to be divided by “hukou”, Urban Resident Basic Medical Insurance (URBMI) for urban residents and the New Rural Cooperative Medical Scheme (NRMS) for rural residents, causing extensive fragmentation in the health insurance system. The identity-based schemes limit migrants’ access to healthcare services in their job locations because it is difficult for them to use a specific health insurance account across regions or schemes. In addition, the reimbursement levels and benefits packages differ among regions, and the separate operation mechanisms and different administrative institutions lead to weak interconnections within the system, thereby causing significant inequity issues for migrants (Shan et al. 2018). Migrants receive less coverage under formal medical schemes, and they encounter more barriers when applying for reimbursement of treatment expenses (Hesketh et al. 2008). As a result, they consistently underuse health services in their destination cities, causing potential short-term and long-term health problems (Mou et al. 2009). In this way, the migrant workers’ health advantage gradually deteriorates after migrants move to a new place of residence (Aglipay et al. 2013).

Several limitations of this study must be noted. First, the conclusions of our study are primarily descriptive and illustrative and do not represent canonical causal effects. Second, due to lacking detailed medical information and the data compatibility issue in RUMiC, our result about potential mechanisms for explaining the role of mental health in the healthy migrant phenomenon in China may be not conclusive. Finally, this study involves no panel data, so we could not examine mechanisms such as premature death (Antecol and Bedard 2015) and the dynamic YSM effect. Therefore, the mechanisms we proposed represent suggestive evidence rather than solid causal evidence. These limitations motivate us to address these shortcomings in our future research.