Introduction

Far-reaching economic reforms in China in the early 1980s resulted in the collapse of the rural Cooperative Medical Scheme (CMS). Since then, rural residents have been excluded from the social security system. A major consequence of these events was the creation of financial barriers, which resulted in a substantial reduction in access to basic health services, especially for the rural poor. For example, a national health survey reported that 38 % of the respondents did not seek medical services when they were sick, and 70 % refused hospitalization despite a physician’s recommendation, which creates concern that excessive cost is the primary reason for these decisions (MOH, 2003). In response, a new program—the New Cooperative Medical Scheme (NCMS)—was introduced in 2003 to provide a government-subsidized health insurance scheme that is intended to promote health care utilization in rural China and interrupt a cycle of illness-based poverty. The potential distributional gains from such subsidized health insurance schemes are, however, usually accompanied by efficiency losses, which occur because of distortions in both the participants’ behavior and in the allocation of resources to the health care sectors as a whole (Trujillo et al. 2005). As a result, it is imperative for us to carefully evaluate the effectiveness of these programs concerning the extent to which they achieve their desired objectives.

Previous works have examined the effectiveness of the NCMS in improving health care access and reducing the economic burden in small pilot studies and in large population-based samples. These studies, however, yielded mixed results. A study that assessed NCMS pilot programs in 2003 in selected counties, for example, found that enrolment in the NCMS was associated with a 20–30 % increase in the use of outpatient and inpatient services but was not associated with lower overall out-of-pocket expenditures (Wagstaff et al. 2007). A second study that used hospital financial accounting data (Gu and Fang 2007) found that medical expenditures did not decrease significantly following NCMS implementation. A third study observed similar findings but could not confirm an overall increase in the use of formal medical servicesFootnote 1 or an improvement in the health status of the population (Lei and Lin 2009).

Program effectiveness is often assessed by using outcome measures that are summarized across all participants, yet the impact may differ in meaningful ways according to key characteristics in the population (Khandker et al. 2010). Rural China has been previously rendered a ‘three-world’ characterization that reflects its diversity concerning geography, economic development, public resources, society and culture, population health indicators and demand. This characterization includes the coastal, high-income region (the first world), the central, middle-income region (the second world), and the western, low-income region (the third world) (Liu and Rao 2006). The people who live in poor rural areas experience difficulties in accessing basic health care services, and even a relatively low amount of medical expenses can cause financial hardship and aggravate poverty. By contrast, people from the central regions have the greatest variation in levels of income and financial vulnerability to health care costs, whereas the people from the eastern regions tend to protect themselves against catastrophic medical expenses that often result from episodes of hospitalization. Given the diverse socioeconomic conditions and health care demands in each region, the NCMS has not been designed as a universal rural health insurance program. With central guidelines, local governments are authorized to retain considerable discretion over the benefit package design and local implementation of the NCMS. An expectation that differential effects may arise following implementation of the NCMS in rural China is further supported by regional heterogeneities. The extent to which important differences in NCMS effectiveness exists in these socially and culturally distinct regions of China, however, has not yet been established. This study explores the potential for the NCMS’s policy differences in its impact across rural China and thus provides valuable feedback to facilitate future planning and policy development.

The objectives of this paper, therefore, are to explore the variation in the determinants of enrolment and the impact of the NCMS in each of China’s three large geographic regions, specifically in promoting health care utilization and reducing outpatient and inpatient expenditures through insurance reimbursement. We also explore the differences in both the utilization and expenditures by income groups. The remainder of this paper is organized in sections with the second section devoted to the descriptions of the NCMS program. The third section elaborates the analytical methods, data source and study sample, and the fourth and fifth sections present the results and discussion, respectively. A final section concludes this paper.

The New Cooperative Medical Scheme in rural China

The central government issues three broad guidelinesFootnote 2 as basic rules for the NCMS nationwide. (1) Participation in the NCMS is voluntary and based on household unit. (2) The NCMS is conducted at the county-levelFootnote 3 rather than the village or township levels. (3) The NCMS focuses on catastrophic illnesses and receives contributions from both governments (central and local) and individuals. After following these guidelines, local governments can arrange the implementation of the NCMS based on their fiscal conditions and local needs.

Financing mechanism of the NCMS

The financing of the NCMS contains both individual and government contributions. In the early stages of the NCMS, the premium of each enrollee was a minimum of 10 Yuan (per year), which was contributed by households. Enrollees who lived in the central and western provinces of China received a subsidy of 20 YuanFootnote 4 on the premium (US$ 3.1) (per enrollee, per year); 10 Yuan is from the local government and another 10 Yuan was a matching subsidy from the central government. For the relatively more affluent eastern provinces, the local governments were required to subsidize the same amount of 20 Yuan (MOH, 2003). Since 2006, the government’s subsidies were raised to 40 Yuan per enrollee per year, and simultaneously, the subsidy program was extended to include the eastern provinces (China Ministry of Health and Ministry of Finance, 2008). In 2009, the government’s subsidies were revised to 80 Yuan for all three provinces and the per capita premium was formally required to be no less than 20 Yuan per year.

Localities have the right to adjust their own insurance protection levels according to the local economy and specific needs. The central government only formulates the minimum requirement on individual contributions. The financial levels across different regions have shown significant discrepancies at the initial stage of implementation of the NCMS, followed by distinct per capita net income levels (see Table 1).

Table 1 Resources of funding in the NCMS

Reimbursement modes of the NCMS

There are four major reimbursement modes in the NCMS, but only one mode can be adopted in one county; therefore, an enrollee cannot freely choose. The reimbursement can only be claimed in the county where a person is enrolled. The most commonly adopted mode, which is implemented in approximately 66 % of the rural counties in China, is “inpatient and household medical saving accountsFootnote 5 mode. Under this mode, inpatient services are reimbursed according to a certain formula (that is set by each county), whereas only registration or threshold fees for entering hospitals are paid for by household medical saving accounts. The second mode is “inpatient only”, which accounts for 17 % of the counties. This type is popularly used in the majority of economically developed areas, such as the Jiangsu, Fujian and Zhejiang provinces. The third mode is “inpatient and catastrophic care”, with separate deductibles and caps in inpatient and catastrophic expenditure reimbursement. This third mode is used in 11 % of the counties. The fourth mode is “inpatient and outpatient pooling”, which has been adopted in 6.7 % of the counties. Under this mode, the reimbursement of inpatient care is the same as the first mode, but the outpatient services are compensated according to a certain formula through collective funds, usually with no deductible and no reimbursement cap (Du and Zhang 2007). In addition, some counties provide extra free physical check-ups (once per year) for the enrollees who have not used any medical services in that year.

All of the modes reflect “catastrophic illness-oriented” policy requirements (see Table 2). Because of limited financing levels, the reimbursement ratio is still not high; nationally, the average level is approximately 25 %. However, the actual amount of reimbursement that is obtained by the enrollees in the eastern regions is approximately two times the reimbursement of the central regions and approximately three times the reimbursement of the western regions. Overall, the financing of the NCMS is still low, and reimbursement is not high. Under this small budget, a concern is whether and how effectively the NCMS can achieve its goals. For the past 30 years, the Chinese government has provided no financial support to rural residents for the purchase of health care services (Liu and Rao 2006). Theoretically, health insurance can lower the effective prices of health care at the time when the consumer purchases it. Therefore, given these considerations, we would like to expect that the NCMS, to some extent, can promote medical utilization in rural residents, especially in the western regions. However, the role of the NCMS in reducing out-of-pocket expenses may be very limited.

Table 2 Inpatient reimbursement in the NCMS

Data and methods

Data source and study sample

The panel data from the 2004 and 2006 waves of the China Health and Nutrition Survey (CHNS) were used in this study. The CHNS is jointly conducted by the Carolina Population Center at the University of North Carolina, the National Institute of Nutrition and Food Safety, and the Chinese Center for Disease Control and Prevention. A stratified, multistage, random cluster process was used to obtain a study sample from nine of the 34 provincial administrative regionsFootnote 6 (Fig. 1). The counties were stratified by income, and a weighted sampling scheme was used to randomly select four counties in each province. Then, three randomly chosen villages were selected in each county. In the CHNS, three of the provinces (Liaoning, Jiangsu and Shandong) are in eastern China,Footnote 7 four provinces (Heilongjiang, Henan, Hubei and Hunan) are in the central areas, and two provinces (Guizhou and Guangxi) are western provinces.

The CHNS sample was not designed to be representative of China but to be randomly selected, to capture a range of economic and demographic circumstances and to provide data from randomly selected households in each province (Popkin et al. 2010). The CHNS provided uniquely rich details for our analysis, and the data to create a representative provincial level sample could be obtained. Although in the text of this paper we refer specifically to these provinces, previous studies have shown that the characteristics of CHNS households and individuals are comparable with the characteristics of the national sample (Entwisle and Chen 2002; Du et al. 2002).

Fig. 1
figure 1

CHNS survey in China (Shadow areas). Source China Health and Nutrition Survey www.cpc.unc.edu/projects

Because the NCMS targeted at rural residents only,Footnote 8 the residents of urban areas and county-town neighborhoods in the CHNS data were excluded. In the analysis of health care utilization, we further restricted our sample to individuals in a household who are aged \({\ge }15\) years and reported that they currently have no other forms of health insurance. Children’s health would be difficult to interpret, given that the intensity of regular check-ups naturally decreases as they get older. Moreover, the CHNS survey did not distinguish insurance types from service usage; therefore, we lack sufficient information to determine whether health care utilization relates to the NCMS or other health insurance types.

A quasi-experimental design

“What would have happened to the people who enrolled in the NCMS if they had not enrolled?” Because one cannot simultaneously observe the outcomes of an individual who is both a ‘participant’ and a ‘non-participant’ in the same program, one must create a state of unconfoundedness Footnote 9 to enable statistical comparisons. Propensity score matching (PSM) is a tool that is commonly used in quasi-experimental studies to reduce selection bias in observational data, assuming that all observations are randomly selected for participation.

The propensity score is defined as the conditional probability of participating in a program given certain observable variables (Rosenbaum and Rubin 1983). In our study, the propensity score that is generated for each household by using a logit model with potential confounders may have affected the choice to enroll in the NCMS. Based on the estimated score, Kernel matching (KM), a process of non-parametric estimation that uses the weighted averages of all individuals in the control group, was used to construct the counterfactual outcome. Kernel weight is a decreasing function of the absolute difference in propensity score between the treated and untreated units (Smith and Todd 2005); a major advantage of kernel matching is the lower variance that is achieved because of more information that is used as a result (Caliendo and Kopenig 2008). However, this ‘advantage’ could turn into a drawback because of ‘bad matches’. Therefore, the proper imposition of the common supportFootnote 10 condition is significant for kernel matching (Heckman et al. 1997, 1998), which increases the accuracy of the matching.

The Propensity Score Matching (PSM) estimator is thus the mean difference in outcome over the common support, which is appropriately weighted by the propensity score distribution of the participants. In particular, we are interested in the average treatment effect on the treated (ATT), because it refers to the difference between the expected outcomes value with and without the treatment for the people who actually participate in the treatment. The calculation of kernel weights \((\upomega )\) is specified as follows (Khandker et al. 2010):

$$\begin{aligned} \upomega (i, j)_{KM} =\frac{K\left( {\frac{P_j -P_i}{a_n}} \right) }{\sum \nolimits _{k\in C} {K\left( {\frac{P_k -P_i}{a_n}}\right) }}, \end{aligned}$$

In this equation, \(P_{i}\) and \(P_{j}\) are the propensity scores for the participant (in this study, enrolled-household) i and for the non-participant, j. The K(.) is a kernel function, and \(a_{n}\) is a bandwidth parameter. \(P_{k}\) is the propensity score of the ‘matched’ participant from the control groups. In our study, the Gaussian kernel and a bandwidth of 0.06 are used.

The analysis considers only the individuals who were actually enrolled in the NCMS program at the time of the survey in wave 2006 as the treatment group because the CHNS questionnaire does not ask enrollees whether they have remained in the program without interruption between 2004 and 2006. Moreover, the insurance period of the NCMS relates to an annual basis. Therefore, the evaluation excludes all of the observations from the NCMS-pioneer counties (piloted in 2003/2004) in wave 2006 and uses enrollees from the ‘follow-up NCMS’ counties (piloted in 2005/2006) as a treatment group to obtain a better (although a more conservative) measure of the impact. In this way, by using wave 2004 information as the pre-intervention variables to predict the probabilities of enrolment, this method also guarantees that all covariates were strictly exogenous in the propensity score estimation equation.

The control group is constructed from the non-NCMS counties in wave 2006. Given that no information involves whether the NCMS affects behaviors of both non-enrollees and health care providers in NCMS counties, we therefore use observations from non-NCMS counties in 2006 as the only control group to avoid any mixed impact interference. In effect, it is appropriate to adopt PSM in a setting where both groups (treatment and control) come from their respective economic environment (Jalan and Ravallion 2003). The identification of the treatment and control groups are shown in Table 3.

Table 3 Identification of treatment and control group

In addition, considering the time interval between the actual interview in the CHNS each round and the coverage period of the NCMS every year,Footnote 11 our analysis reflects only the short-term impact of enrolment and makes no attempt to estimate the effects on health because these may take longer to realize.

Bounding approach

Matching and balancing remove biases because of the non-overlapping support region and the differences in the propensity score distributions of the treatment and control households (Johar 2009). However, matching and balancing do not remove bias that results from unobserved and, therefore, unmeasured variables. If unobserved variables simultaneously affect participation and the outcome variable, a ‘hidden bias’ may arise for which matching estimators are not robust (Rosenbaum 2002). To address whether the inferences that are derived from our propensity-matching analysis of the participation effects may be affected by unobserved factors, we adopt the bounding approach that is proposed by Rosenbaum (2002).

If a hidden bias u exists, in theory, two individuals (i and j) with the same observed covariates (say x) would have differing chances of participation. The odds ratio for receiving participation is given by the following:

$$\begin{aligned} \frac{P_i (1-P_j)}{P_j (1-P_i)}=\frac{\exp (\beta x_i +\gamma u_i )}{\exp (\beta x_j +\gamma u_j)} \end{aligned}$$

where u is the unobserved variable, and \(\gamma \) is the effect of u on the participation decision. Therefore, if there are either no differences in the unobserved variables \((u_{i}=u_{j})\) or if the unobserved variables have no influence on the probability of participation (\(\gamma =0\)), the odds ratio is one, which implies the absence of hidden or unobserved selection bias (Becker and Caliendo 2007). Rosenbaum (2002) suggests that the following bounds on the odds-ratio, that either of the two matched individuals will choose to participate is given by:

$$\begin{aligned} \frac{1}{e^{\gamma }}\le \frac{P_i (1-P_j)}{P_i (1-P_i)}\le e^{\gamma } \end{aligned}$$

Both matched individuals have the same probability of participating only if \(e^{\gamma }=1\). Using this logic, \(e^{\gamma }\) is a measure of the extent to which a study is free of ‘hidden bias’. In practice, for binary outcomes, Aakvik (2008) suggests the use of the non-parametric Mantel–Haenszel (MH, 1959) test statistic \(Q_{MH}\) in assessing the null hypothesis \(\gamma =0\) for binary outcomes. The test involves comparing the number of participants who benefited from program attendance and whether the program has no effect:

$$\begin{aligned} Q_{MH} =\frac{\left| {Y_1^D -\sum _s^S {E(Y_{1s}^D)}} \right| -0.5}{\sqrt{\sum _{s=1}^S {\hbox {var}(Y_{1s}^D)}}} \end{aligned}$$

where \(Y_{1s}^D \) is the number of positive outcomes for participation in stratum s of the sample and is the total number of positive outcome in stratum s. Under the null hypothesis, there is no participation effect, and a positive outcome is equally likely for participant and non-participant units. The strata reflect the estimated propensity score. For a fixed \(e^{\gamma }>1\) and \(u\in \{{0,1}\}\), Rosenbaum (2002) shows that the test statistic can be bounded by two known distributions, which move apart from one another and reflect uncertainty regarding the test statistic in the presence of unobserved selection bias.

Dependent variables

Enrollment status is from the report of the household heads, which is set as a binary variable to capture the probability of household unit participation in the NCMS. With the outcome evaluation, we consider the following three dichotomous variables that relate to the previous 4 weeks to measure health care utilization: (1) whether the individual has used outpatient care;Footnote 12 (2) whether the individual has been in the hospital; and (3) whether the individual has used preventive care.Footnote 13 Meanwhile, the study investigates the discrepancy of outpatient utilization among different medical service providersFootnote 14 and income groups.

Health care expenditures were estimated at the household level and calculated as net out-of-pocket expenses after excluding reimbursement. A catastrophic health expenditure (CHE),Footnote 15 which is used to indicate economic vulnerability to impoverishment that is related to excessive health care expenses, was considered present when household health care costs exceeded a specific proportion of total household income (Wagstaff and van Doorslaer 2010). Thresholds of both 10 and 20 % of income were used in the sensitivity analyses, which are described in further detail below.

Independent variables

Household and individual level characteristics were obtained from the responses to the CHNS survey to predict the probability of household enrolment in the NCMS. These variables included household head attributions, household socio-economic and socio-demographic characteristics, household opportunity cost to access health care, and household risk attributes. (For the details of each variable group, please see Table 6.) Household sanitary conditions, including access to safe sources of drinking water and toilet facilities, and the location of the household may also affect the health status of household members and the likelihood that health care services must be accessed (Gertler et al. 1987; Dor et al. 1988). Each of these variables was represented by using a dummy indicator. The economic status of the household was measured by using household per capita incomeFootnote 16 and a computed index of household assets, which served as a proxy for accumulated wealthFootnote 17 (Trujillo et al. 2005). The index was based on a factor analysis of items that assessed the availability of 28 household goods, including household electrical appliances, a mode of transportation (e.g., car, motorcycle, bicycle, and tractor), equipment, and farm machinery.

In rural China, young adults, males, and people with higher educational attainment or good health are more likely to migrate to urban areas for employment (Wu 2010). In many cases, these people are the primary breadwinners of their households. Both potential incentives and disincentives to enroll in the NCMS may exist for these people, and the potential of the NCMS scheme to households with migrant workers has not yet been researched. Administratively, these people are not permitted to register for employer-based health insurance in the urban areas because of the limitations of the Chinese registration system (called “Hukou”), nor are they eligible in other rural areas where they are not registered. Migrant workers can only enroll in the NCMS in their home district (where their Hukou is located) and can only claim reimbursement for medical expenses in their home district. High opportunity costs (e.g., transportation costs, lost working days) and a complicated procedure of declaration on the reimbursement would result in a lower willingness to enroll for migrant workers; their decision will further affect household enrolment. However, to the contrary, enrollment in the NCMS may help the vulnerable groups (such as females, the elderly, and children) that are left at home to obtain better financial protection from induced medical expenditures, which would ease the concerns on the families of migrant workers. Given these considerations, we generated a binary indicator to examine whether the households with migrant workers are more likely to enroll in the NCMS.

Table 4 Regional distribution of the ‘follow-up’ NCMS counties (piloted in 2005/2006) in wave 2006

Results

As a result, restricted to the followed households with complete enrollment information, we end up with a sample of 738 households in the treatment group and 1320 households in the control group (all from wave 2006). On average, the household enrolment rate across all three regions in the NCMS at the time of the follow up in 2006 is 69.78 %. Table 4 provides comparisons of the features of the NCMS in eastern, central and western rural China. Table 5 describes the percentage distribution of the NCMS enrollees (treatment group) with households that are ranked by per capita income. Approximately 21 % of the individuals from the lowest income group are enrolled in the NCMS. For people in the two middle groups, the number is approximately 60 % together. This pattern indicates that the richest class does not capture the coverage and accords with the NCMS initiative to cover rural residents.

Table 5 Percentage of rural people living in households enrolled in the NCMS program in wave 2006, by per capita income

Overall (across three regions in 2006), the average 4-week out-patient and inpatient utilization were 11 and 0.1 %, respectively, and were similar for both the enrolment and non-enrolment groups. The descriptive statistics for the selected independent variables that were studied in the three regions are listed in Table 6.

Table 6 Descriptive statistics for independent variables between treated groups and control groups across three regions in China

Propensity score estimation

Given the rule of ‘household enrollment as a unit’ in the NCMS, the members of a household should have identical propensity scores because they have the same exposure status. Households in the treatment group are matched to households in the potential control group, and the matching equation is a function of the conditions in wave 2004 (pre-intervention). The propensity score is separately estimated for the three regions in our sample. Table 7 presents the propensity score that forms the basis of the matching that is subsequently conducted. The Pseudo R-squared for the regional models ranged between 0.112 and 0.589. The sample size of the treatment and matched groups—concerning individuals—are different because the household sizes vary. Our samples for the three regions passed the balancing test as described by Becker and Ichino (2002).Footnote 18 The marginal effects of the coefficients in the logit models are also shown; thus, a change in any one of the independent variables impacts the probability of the willingness to be enrolled.

Table 7 Propensity score estimation (logit), by regions

As expected, we found discrepancies in the household determinants to enroll in the NCMS. In eastern China, the results indicate that household heads who have a ‘married’ status, poor sanitation environment (drinking water and toilet), high ratio of children, chronically ill family members and more family members who report their health as ‘fair’ are more likely to enroll in the NCMS. These results reflect health-related adverse selection. In particular, the living environment can affect health status significantly; for example, a one percent increase in households with a poorer condition of drinking water would increase the enrollment rate by 31 % in the eastern regions. In the central regions, economy-related adverse selection was observed. However, relatively wealthy households are found to have a higher likelihood of enrollment in the NCMS. In 2009, the rural per capita net income in the central regions was 4864.8 Yuan, which accounts for 72 % of the eastern regions and is 1.32 times the rural per capita net income of the western regions (Green book of China’s Rural Economy, 2011). A ‘middle-level-economic’ position may cause people from the central regions to have the greatest variation in the levels of financial vulnerability to health care costs. Meanwhile, other types of health insurance industry in the central regions (even in the western regions) lag far behind the eastern areas; therefore, wealthier households in the middle regions seek more protection by enrolling in the NCMS.

In the western regions, the households with a poor living environment (e.g., no toilet facilities) preferred to enroll in the NCMS. However, income is not a strong determinant, although considered household head with employment, it would increase the probability of enrollment by 17.6 %, two other income-related factors (asset index and household per capita income) have no significant role in promoting enrollment. In fact, under the strong support of government subsidies, more low-income households in the western regions have little difficulty in paying the premium.Footnote 19 Enrolment in the NCMS reduces the threshold fees when accessing rural healthcare services; therefore, it may promote health care utilization. In addition, the households with a higher ratio of migrant workers are found to be less likely to enroll in the NCMS in the western regions. The results indicate that a one percent point increase on migrant workers in the household would decrease the probability of enrollment by 26 % in the western regions. Because of limited earnings, 74 % of the migrant workers in China have no health insurance. Only 32 % of migrant workers will seek formal medical care utilization when they are sick, 38 % of them will buy medicine from pharmacies, and 20 % will visit private doctorsFootnote 20 (Jia 2008). The high health care costs in coastal cities (eastern regions), an inconvenient reimbursement procedure and small repayment make the NCMS less attractive to migrant workers. Moreover, because of regional heterogeneities, source areas and destination areas could issue different NCMS policies, and normally, destination areas can provide preferential conditions. Obviously, migrant workers are more likely to choose the NCMS of destination areas. However, because the household members of migrant workers are not covered in the insurance and because of the rule of ‘household enrolled as a unit’, this again limits migrant workers’ options.

Impact estimation

After estimating the propensity score for the treated and control groups, we restricted the observations in common support areas to calculate the ATT.Footnote 21 Table 8 reports the impact estimates of the NCMS on medical care utilization and medical expenditures. Table 8 includes the percentage average change on the treated group (ATT) under different regions.

Table 8 Propensity score matching results—estimated average treatment effect on the treated (ATT)

Derived from individual-level information (adult, age \(\ge 15\)), the NCMS has a statistically significant positive impact on promoting the overall outpatient utilization for both the eastern (8 %) and western regions (7 %). By contrast, the NCMS in the central regions showed less impact on outpatient utilization because there are slight positive impacts on outpatient utilization in village-level (3 %) and county-level (1 %) medical care institutions. The NCMS has a relatively larger positive impact on the western regions and not only increases outpatient utilization (7 % increase in outpatient utilization as a result of the NCMS enrollment) but also promotes inpatient utilization (2 % increase) and preventive care (5 % increase). Concerning the choice of different health care providers, the result of outpatient utilization in the western regions indicates that the NCMS can only significantly promote rural people’s return to town-level hospitals (3 % increase), not to village-level or county-level institutions, which is shown in the other regions.

In contrast, the NCMS has no statistically significant impact on reducing either outpatient or inpatient expenditures. The NCMS has not only failed to diminish the financial burden that is caused by paying for healthcare services but also increased the incidence of catastrophic expenditure for the western regions. However, it is difficult to judge whether a higher or lower incidence is good without comparing the medical quality that is provided. The higher incidence may be induced by services that were not previously covered; thus, the cost improves survival. Alternatively, this result can be explained by reasons such as moral hazard or inappropriate policy formulation. For example, an inappropriate deductible or coinsurance could result in the poor being excluded from the benefit while it incentivizes the rich to use more health care services.

Table 9 reflects the selected impacts by household per capita income. Given our propensity score estimation, which was separately derived for the different regions to analyze heterogeneous impact based on income groups, we implemented nearest neighbor matching with a radius of 0.01. For each household in the treatment group, we selected a control-household with the nearest value of the propensity score. In this way, we try to limit the matched groups that come from the same regions. Table 9 indicates several interesting differences in each income centile concerning the impact of the NCMS on medical care utilization, cure expenditures and the incidence of catastrophic expenditure. The NCMS seems to favor the richest groups, considering that statistically significant impacts are found in reducing inpatient expenditures and decreasing the incidence of catastrophic expenditures (24 %, when 20 % is used as a threshold).

Table 9 Estimated average treatment effect on the treated (ATT) on medical care utilization and medical expenditure, by income group

Sensitivity test results

Considering that the main impacts are mostly found in the binary outcomes, here we do not check the sensitivity for continuous outcome variables.Footnote 22 The bounds are calculated by using the routine of Becker and Caliendo (2007), and the matched groups are found by radius matching with a radius of 0.01.Footnote 23 The test may be unsuitable for kernel matching, which uses the entire sample as the matching pair (Johar 2009). The upper bounds adjust the MH test statistics downwards when the ATT is overestimated, and the lower bounds adjust the MH test statistics upwards when the ATT is underestimated.

For the NCMS program, the direction of the hidden bias is not obvious. The true effect may be the reverse of the estimated impact if the hidden bias dominates in the opposite direction. Nevertheless, given that most of the ATT for medical care utilization have positive signs and assuming that we have underestimated the true treatment effect, the bounds are somehow less interesting. If individuals with a low value of unobserved variables are overrepresented in the treatment samples, then the true effects will be larger and more significant than estimated, which is undesirable. Therefore, we shall focus on the overestimation cases.

The testsFootnote 24 for medical care utilization suggest that at the small level of \(e^{\gamma }\), the enrollment of individuals in the NCMS will have significant positive effects on outpatient utilization (in the eastern region), outpatient utilization at village-level clinics (both in the eastern and central regions), and inpatient utilization and preventive care utilization (in the western regions). However, the significant impact of the NCMS on outpatient utilization at county-level hospitals no longer exists in both the eastern and central regions even when \(e^{\gamma }=1\). This result means that there are definitely unobserved characteristics between the treatment and control groups. In the western regions, the significant impact on outpatient utilization and the outpatient utilization at town-level hospitals are still found when \(e^{\gamma }=1\), but the positive results may be reversed if the treatment groups are allowed to differ from their counterparts in the control groups concerning the unobserved bias by 25 and 20 % or more, respectively.

Our estimation is robust regarding the significant and positive effectiveness of the NCMS on the incidence of catastrophic expenditure in the western regions. This effect is because only when the unobserved bias between the treatment and control groups achieves 35 % or more, the estimated impact will be reversed. Meanwhile, for the heterogeneous impacts on income groups, the test indicates that the significant positive impacts of the NCMS on outpatient and inpatient utilization for the second-poor income group (centile \(= 2\)) are robust, especially for inpatient utilization even when \(e^{\gamma } =2\) is stable. However, significant impacts of the NCMS on reducing catastrophic expenditures for the richest and second-poor groups disappear at \(e^{\gamma }=1\). These checks suggest that it is unlikely that the results of the significant positive impacts on medical care utilization are reversed.

Discussion

The implementation of the NCMS is a major reform in rebuilding the health security system in rural China. This paper attempts to investigate the NCMS’s likelihood of achieving its stated goal of promoting medical care utilization and reducing financial burdens based on heterogeneous regional characteristics and income groups. Propensity score matching is used to alleviate the bias that arises from observed heterogeneity, and the bounding approach answers whether the inference concerning treatment effects may be altered by unobserved factors.

The results confirm that the NCMS has promoted outpatient utilization for rural residents, especially for relatively poor groups and in the western regions of China. The NCMS can promote poor rural households to visit a doctor for minor diseases at their early stages before they turn into serious diseases; therefore, the NCMS helps to control health risks. Because poor households normally face larger negative health shocks (Ursula 2005; Lindelow and Wagstaff 2005), the financial loss of the poor regarding disease incidence will cause higher welfare loss. The poor should have relatively more benefits in the security system that is provided by the NCMS than their wealthy counterparts. Similar to all insurance schemes, the NCMS program has no significant positive impact on promoting medical care utilization in the poorest group, which implies that the development of the NCMS may still need the support from the medical assistance scheme.

The results also present the different utilization structures of medical care institutions. China has a six-tiered structure of government, health administration and health care. From county to village, the health service system is the so-called ‘rural three-tier’ system.Footnote 25 At the village level, there are usually one to three village doctors who provide outpatient care for common diseases, maternal and child health care, and immunization services. Village health stations are normally the first places where rural patients seek medical care. Given transaction costs and timeliness of medical treatment, rural people are more likely to seek health care services from basic medical care institutions in their communities. In this regard, the estimation results in the eastern and middle regions show a positive impact of NCMS enrollment in promoting outpatient services in village-level clinics. However, because of differences in regional development, the village-level institutions in the western regions still have a serious shortage of health resources (MOF, 2006), which may lead to the dysfunction of village health stations. Therefore, in the western regions, the positive impact of the NCMS on improving outpatient services is shown in town-level hospitals.

Our estimation of improving outpatient utilization at county-level hospitals for the eastern and central regions is not robust to hidden bias. However, this finding is still somehow consistent with the allocation of funds on the ground. Because funds are distributed and concentrated more at the village-level clinics in the central regions and at town-level hospitals in the western regions, our results indicate that more health care utilization occurred at the village-level clinics in the central regions and at town-level hospitals in the western regions. Therefore, policy formulation that refers to reimbursement levels can play an important role in influencing consumer behavior and can guide the rational utilization of health care resources.

Generally, the results suggest that given its relatively restricted financing and narrow benefit coverage, the NCMS has positively improved the utilization of medical services. This result is not surprising and more generally consistent with international experiences on subsidized and community-based health insurance programs. Examples include the SUBS program in Colombia, which has been estimated to increase preventive and ambulatory care, and the health insurance program in Vietnam is associated with higher rates of utilization (Sepehri et al. 2006). A program in rural Senegal also appears to have increased utilization (Jütting 2003). By contrast, there is little evidence that the NCMS has reduced outpatient or inpatient expenditures or the incidence of catastrophic expenditures. Concerning this point, the international experiences also report very mixed results. The impact research regarding health insurance schemes in three African countries, namely, Kenya, Senegal and South Africa, shows that insured households are less likely to face catastrophic expenditures than uninsured households in Senegal; however, in South Africa, this result only occurs for the richest quintile, and in Kenya, no significant impact emerges (WHO 2006). The SEWA program in Gujarat (India) has decreased the incidence of catastrophic expenditures from 35.6 to 15.1 % (Krause 2000), whereas the Seguro Popular program in Mexico has reduced the risk of catastrophic out-of-pocket expenditures (Gakidou et al. 2006).

The results still show that the NCMS not only failed to reduce the financial burden for rural residents, but it also increased the incidence of catastrophic expenditures for the western regions. The following reason may rationalize the findings. Health insurance can lower the effective prices of health care at the time when consumers purchase it, which induces the incentives of both consumer and health care supplier to over-pursue or over-provide medical care (Arrow 1963; Zeckhauser 1970; Culyer and Newhouse 2000). In this study, one possible explanation may lie on the supply-side: healthcare providers in China are paid by a ‘fee-for-service’ and face a fee schedule that strongly encourages demand shifting to expensive drugs and high-tech care for which the payment margins are higher (Yip and Hsiao 2008; Liu and Mills 1999). Therefore, although the NCMS can help to reduce the price of health care services, in fact it increases the cost of medical care through the providers’ immoral behavior, such as an unnecessarily longer hospitalization and the use of more high-tech care. Given such a scenario, as the NCMS develops, the access to healthcare for a majority of low- and middle-income groups will again be impeded at different levels. Our estimation concerning reducing the incidence of catastrophic expenditures shows consistency in that higher income households receive disproportionately more benefits, which would imply that low-income participants in the NCMS subsidize the rich. Although these results in our study are sensitive to unobserved variables, if not checked, the sustainable development of the NCMS program will be in question.

If the key rationale behind the introduction of the NCMS program were to promote medical care utilization and relieve the financial burden of households that is caused by illness, then the results that are presented here suggest little improvement regarding welfare. The theory suggests that the welfare gains concerning access and risk reduction that come from reducing the cost of care must be weighed against the potential welfare losses that arise from demand- and supply-side moral hazards (Wagstaff et al. 2007). Although the data used in the paper did not explain the extent of unnecessary care, concerns remain from the supply-side. Therefore, it is necessary for the NCMS to find better ways to control health care suppliers’ moral hazards, such as the reform of fee-for-service (FFS) payment systems to Diagnosis Related Group (DRG) payment methods for hospitals and the risk-adjusted capitation method for primary-care providers. Other controls are to seek more effective third-party purchasers and specify standardized drug lists and regulatory regimes.

Nevertheless, the study has some limitations. First, because of data limitations, we focus on a limited set of outcome variables. Notably, we do not consider the impact of the NCMS on health outcomes per se because the reference period is only 4 weeks. Second, we do not examine how the impact of the scheme varies with the design and implementation characteristics because of ambiguous information regarding the reimbursement modes and the fact that both the design and implementation are likely to vary endogenously along many dimensions. Finally, the estimates that are reported in the paper may not be an accurate reflection of the gross impact of the scheme. Despite these limitations, however, we believe the study makes an important contribution to the existing literature regarding the impact of the NCMS scheme that is differentiated by regions and income groups. This study will serve as a background to address in future research the issues that are raised here.

Conclusions and policy implications

The findings of this study have several important policy implications. First, there is a need to continuously increase financing for the NCMS program and improve the benefit package design of the NCMS. Current financing levels of the NCMS remain at a comparatively lower level. Government subsidies have demonstrated strong contributions in reducing the access barriers, which promote health care utilization for rural people, not in reducing either outpatient or inpatient expenditures. Thus, the NCMS has not achieved effective coverage yet. Local governments are mainly responsible for the collection and management of funds and designing the benefit packages. Therefore, local governments should have better knowledge on how to mobilize possible financial and health resources and establish suitable reimbursement coverage that is based on healthcare needs and utilization. Notably, it is possible for governments to attempt to integrate NCMS policies in regions, which helps to establish a broader financing pool and favors the enrollment of households with migrant workers. Consequently, NCMS policies can achieve the gradual transition of integrating policies from a ‘local-level’ NCMS policy to a ‘regional-level’ policy and then to a national policy.

In addition, given that the value of health insurance is rooted in the unpredictability of medical spending (Culyer and Newhouse 2000), the reimbursement of the NCMS is also required to be dynamically evolutional following economic changes and development. The central government is encouraged to provide more financial support to less developed areas to supply social protection for poor people. Our findings also show an increase in prices that is induced by a possible over-pursuing or over-providing medical care. Although our paper did not examine this aspect in depth, the necessity for the employment of more effective measures to conduct risk management and cost control in the NCMS has been foreseeable.