1 Introduction

Economic migrants are individuals who have emigrated from one region to another primarily because of their own economic opportunities. One of the standard propositions in the migration literature is that economic migrants are favorably self-selected for labor market success (Chiswick 1978; Borjas 1987; Chiswick 2000).

However, besides pursuing higher income themselves, some economic migrants may also migrate for better opportunities of their offspring. When examining the labor market performance of second-generation immigrants,Footnote 1 earlier literature finds that the second generation experiences high educational attainment and labor market achievement in the receiving economies whereas substantial heterogeneity exists by parental region of origin (Chiswick 1988; Boyd and Grieco 1998; Chiswick and DebBurman 2004; Card 2005). Most studies link education and earning advantages as well as cross-origin discrepancies among the second generation to intergenerational transmission of human capital (Card et al. 2000; Bauer and Riphahn 2007). Very few papers look at the human capital investment by immigrant parents for their children. If immigrants migrate partly for the economic well-being of their decedents, they might emphasize children’s education more and be more willing to invest in school than the stayers if resources allow.

Correspondingly, this paper investigates how immigrants value education for their children and the economic mechanism for differential evaluation of school. The value placed on education is assessed through households’ residential location choices. Parents have long exercised choice of their children’s schools through residential location choices in what is often referred to as Tiebout sorting.Footnote 2 About half of the parents in the 1993 National Household Education Survey reported that the schools their children would attend influenced their decision of where to live (McArthur et al. 1995). The close link between school quality and residential location has been verified by a number of studies (Barrow 2002; Clapp et al. 2008; Hasting and Weinstein 2008).

This paper explores where immigrant households choose to live within the Los Angeles Metropolitan Area and how location characteristics, including school quality, neighborhood sociodemographics, house features, and other local amenities affect their decisions. Immigrant households are defined by the migration status of household heads. The analysis then compare the choices of immigrants to native-born Americans’ and examines decisions cross immigrants by parent characteristics. The main dataset employed is the 2000 Census. The 1999 Academic Performance Index (API)Footnote 3 is used to measure public school quality.

Residential location choice is modeled as a conditional logit model, which enables the researcher to examine immigrant household preferences over a broad range of housing and neighborhood characteristics and how these preferences vary by household characteristics. Since the unobserved characteristics of jurisdictions may be correlated with local school quality, I follow the identification strategy of Barrow (2002) and compare the role of public schools in the location choices of households with and without children, reasoning that unobservable non-school attributes affect both types similarly, while households with children care more about public schools. Specifically, an interaction term between the API and an indicator for having a child under 18 years of ageFootnote 4 is included in the model to capture the differential effects of school quality on the residential location choices of the two types of households.

Regression results suggest that school quality is positively and significantly related to location choices of immigrant households with children. The importance of school quality in residential locations increases with household income and householder’s education. These results are robust to addressing the omission of private school choices, unobserved constraints on choice sets, differential preferences toward non-school amenities, heterogeneity in mobility, and alternative school quality measure. Relative to their native counterparts, the weight allocated to school quality by immigrant parents is of similar magnitude. Yet low-income immigrant households appear to value school quality more than low-income native households.

To understand the underlying mechanisms that drive the heterogeneity in the emphasis on school quality, I compare the behaviors of immigrant parents. First, both the age and the number of children matter to the weight placed on school quality. Parents of school-aged children value school more than parents of children under six. The number of school-aged children is positively associated with the weight assigned to school quality, and the relationship is more evident among financially restricted immigrant households. Second, favorable self-selection among immigrants by origin could be another source of heterogeneity in weights. Motivation appears to play a positive role in how immigrant parents value schools. That is, immigrants who have overcome longer distances and more language barriers tend to value education more for their offspring. Last, potential returns to education may be an economic driver for immigrant parents to invest on their children’s school quality in residential locations. The origin-specific returns to education for immigrants in Los Angeles are positively related to the probability that immigrant parents select areas with better public schools.

The remainder of the paper is organized as follows: Section 2 presents the empirical model and the identification strategy; Section 3 describes the data used to estimate the model; Section 4 examines the weight placed on school quality by immigrants; Section 5 compares immigrants and natives; Section 6 further explores the economic mechanism for immigrants to value school for their children; and Section 7 concludes the paper and discusses the policy implications.

2 A model of residential choice

2.1 Conditional logit model

The residential location decision of each household is modeled as a discrete choice of a single residence. The conceptual experiment motivating this analysis considers a household moving into the Los Angeles Metropolitan Area and deciding where to live. The household may compare house features, local amenities, and community demographics, and select the location that best matches its ideal.

Because households who have lived at the same location for years may have a disincentive to move and their residential location may not well reflect their current demand for public education and other local amenities, I examine the location choices of households that recently moved from outside of Los Angeles to the current location. Given the high cost of moving, movers from out of the area are more likely to undergo some exogenous move-inducing shocks, such as job relocation, and re-sort when they move to the area.

More formally, I assess households’ residential location choices using the conditional logit model introduced by McFadden (1974). Suppose that each household selects its residential location from N mutually exclusive alternatives to maximize its utility. The indirect utility function of household h that resides in location j is of the form:

$$ U_{hj}=V_{hj}+\varepsilon_{hj}, $$
(1)

where V h j stands for the component of indirect utility of household h that depends on the location characteristics observed by the household, such as characteristics of houses (e.g., size, age, and type), public goods (e.g., public school quality, crime rate and number of metro stations), neighborhood sociodemographics (e.g., ethnic composition, age structure, fraction of immigrants, and socioeconomic status), as well as housing price that accounts for the cost to live in location j. ε h j represents household h’s unobservable tastes in choosing where to live.

The probability that household h selects location j is

$$ p_{hj}=\Pr \left( U_{hj}\geqslant U_{hk}\right) ,\forall k=1,2,...,N. $$
(2)

Assuming that the error term ε h j is independently and identically distributed with a standard Type I extreme value distributionFootnote 5 across alternatives, the probability that household h chooses location j can be derived as:

$$ p_{hj}=\frac{\exp \left( V_{hj}\right) }{{\sum}_{n=1}^{N}\exp \left( V_{hn}\right) }. $$
(3)

I further assume that the observed component V h j can be approximated by a linear function of choice-specific attributes:

$$ V_{hj}=\beta_{h}Y_{j}. $$
(4)

Y j denotes location characteristics, and β h is the set of household-specific parameters. As different types of households may demand different bundles of public goods, β h may be formulated as

$$ \beta_{h}=\beta_{0}+\sum\limits_{r=1}^{R}\beta_{r}X_{hr}. $$
(5)

X h r ,r=1,2,...,R represents the characteristics of household h that do no vary across communities, such as household income, householder’s education, and family composition.Footnote 6 That is, household characteristics are assumed to affect residential location choices through their influence on households’ valuation of location attributes.

2.2 Identification strategy

I first examine how immigrant households value school quality in residential location choices. One major problem arising is that local school quality is potentially correlated with unobserved location characteristics. If this is the case, the coefficient on school quality may capture the effects of some non-education factors.

To address this problem, I compare the residential location choices of immigrant households with and without children following the identification strategy exploited by Barrow (2002). Having young children in a household indicates a direct demand for schooling services, whereas households without children are only indirectly affected by neighborhood schools. At the same time, location attributes other than schools may influence the two types of households similarly. Therefore, the differential effect that school quality has on the residential location choices of households with and without children potentially identifies the value parents place on school quality.

Accordingly, I rewrite Eq. 4 as

$$ V_{hj}=\alpha_{1h}S_{j}+\alpha_{2}S_{j}\cdot chd_{h}+\beta_{h}Z_{j}+\gamma_{h}e_{j}. $$
(6)

S j measures the school quality of location j. c h d h is a binary indicator taking on value of one if household h has children under 18 years of age and zero if not. So the weight placed on school quality by households without children is α 1h , and that by households with children is α 1h + α 2. Z j stands for attributes of location j which are observed by both households and econometricians, and e j stands for the location attributes that are only observed by households but not observed by econometricians. The household-specific parameters are assumed to take the form of a linear function of household characteristics as Eq. 5.

The estimate of α 1h is biased if the unobserved neighborhood characteristics included in e j are correlated with school quality S j . But α 2 will be consistent as long as any unobservable attributes are equally valued by households with and without children.Footnote 7 In particular, if households without children put no value on school quality per se, the estimated coefficient on the interaction term between school quality and presence of children provides an unbiased estimate of the true valuation of school quality by households with children. The estimate of the direct effect of school quality only captures the effects of unobserved amenities that are correlated with school quality.

To further assure that the two types of households have similar tastes for non-school location attributes, I use the method of propensity score to trim the sample. The propensity score for having children under 18 is estimated from a series of household sociodemographic characteristics and householder individual characteristics that may be correlated with demand for local public goods and residential location choices. Compared to restricting the sample by a certain household characteristic, such as age, the method of propensity score takes more household characteristics into consideration and balances their effects.

The propensity score is estimated using a probit model:

$$ \Pr \left( chd_{h}=1\right) ={\Phi} \left( \lambda X_{h}+\epsilon_{h}\right) . $$
(7)

X h represents the characteristics of household h, such as household income, linguistic isolation, family size, cross-state mover status, race, and householder’s age, gender, educational attainment, and marital status. 𝜖 h denotes the unobserved characteristics that are relevant to having children. I eliminate the observations with a predicted propensity score lower than 0.1 or higher than 0.9, and conduct the analysis on the remaining sample of more comparable households with and without children.Footnote 8

In addition, to precisely capture the heterogeneity among parents with children in different age range, I generate three binary indicators: 1) \( chd_{h}^{0-6}\) which is unity if household h has children less than 6 years of age, and zero otherwise; 2) \(chd_{h}^{6-12}\) which is unity if there are children aged 6–12, and zero otherwise; and 3) \(chd_{h}^{12-18}\) which is unity if there are children aged 12–18, and zero otherwise. Usually, children under 6 do not utilize grade schooling service; children aged 6–12 go to elementary schools; and children aged 12–18 have a need for secondary school quality. It is possible that when migrating to a new area, households with children below school age do not care about school district as much since they may intend to move again when the children get older and school choice become more relevant. The value placed on elementary school and secondary school may also vary due to parents’ beliefs about how different level of education determine one’s future economic success.

Accordingly, I interact the three indicators with the school quality measure, and estimate the following equation on the trimmed sample:

$$ V_{hj}=\alpha_{1h}S_{j}+\alpha_{21}S_{j}\cdot chd_{h}^{0-6}+\alpha_{22}S_{j}\cdot chd_{h}^{6-12}+\alpha_{23}S_{j}\cdot chd_{h}^{12-18}+\beta_{h}Z_{j}+\gamma_{h}e_{j}. $$
(8)

Now the differential weights that parents assign to school quality for children of different age range relative to childless households are captured by α 21, α 22, and α 23.

2.3 Potential mechanisms

To better understand the impacts of immigration on provision of public education in host societies, it is essential to know the economic mechanisms explaining why immigrant parents value school quality for their children.

One hypothesis to test is that the number of children matters to the priority parents place on schools in residential decisions. Immigrants in general have higher fertility rate than natives. The number of children can be related to the evaluation of school quality in a few ways. First, larger family size results in less resource available per child. If the number of children affects residential choices through income constraints, households with more children may be less able to afford to live in good school districts. Second, parents make trade-offs between the quantity and quality of children when they make fertility and human capital investment decisions (Hanushek 1992). The households with more children are likely to be the ones who value education less. Third, given the housing price a household has to pay to live in a neighborhood, more school-aged children in a household may indicate that moving to a location with good public schools is more cost-efficient.

The second hypothesis is that the weight assigned to school quality by parents is associated with a selection model of migration. When migrating to seek labor market success, immigrants face different costs for migration, such as physical distance, language barriers, cultural shocks, and skill mismatches. Therefore, the ones overcome more obstacles are supposedly more motivated. The discrepancies in motivation may also apply to how immigrant parents value the future success of their offspring. If so, more motivated immigrants may emphasize school quality more.

The third hypothesis is that parents are more willing to invest in children’s schooling if the future returns are high. The heterogeneity in education and earning across immigrant groups have long been observed (Chiswick 1988; Bratsberg and Terrell 2002; Card 2005). It is possible that when immigrant parents expect high future returns to education, they have a stronger incentive to pay the premium in housing price so as to send their offspring to better public schools.

Given the established patterns using the estimation strategy in the previous sections, I conduct supplementary analysis to test the above hypotheses among immigrants with children only. The linear indirect utility of household h for the conditional logit model is formulated as follows:

$$ V_{hjc}=\alpha_{1h}S_{j}+\varphi S_{j}\cdot A_{hc}+\beta_{h}Z_{j}+\gamma_{h}e_{j}. $$
(9)

V h j c is the utility of immigrant household h from country c living in area j that depends on observed location attributes. A h c is a measure based on the hypothesis tested, namely, measures for number of children in the first case, origin country characteristics relevant to selective migration in the second, and returns to education in the last. The definitions of other notations are the same as before.

Specifically, I assume A h c to be the log number of children under 18 and/or the log number of school-aged children in the first case because the attention and economic resources allocated to each additional (school-aged) child usually diminish with the total number of (school-aged) children in a household.

Measures of self-selection include distance to the U.S., if English is an official language, percent of refugees, income inequality, and GDP per capita in the origin country. Earlier studies show that immigration is larger, ceteris paribus, when the source country and the destination country are geographically adjacent or the language and culture in the destination country is familiar (Lewer and Van den Berg 2008). Yet the story may not hold for those who migrated to the U.S. as refugees (Cortes 2004). The classical literature of migration (Chiswick 1978; Borjas 1987) also argues that self-selection among immigrants depends on the income distribution in their home countries relative to the United States or other destinations. That is, less dispersed income in the origin predicts that individuals at the right tail of home country income distribution migrate to the U.S., and vice versa. The quality of immigrants may matter to their school and residential location choices. In addition, I include the relative share of national origins among immigrants and the years since migration. Though the representation of origin countries could be result from selective migration, it may serve as a pull factor as well. The origin representation may also link to ethnic network and thus affect immigrants’ labor market outcomes (Borjas 1995b; Damm 2009). While the years spent in the U.S. may shape immigrants’ preferences toward education over time, they are correlated with waves of migrations of different set of countries.

Accordingly, α 1h stands for the base weight household h assigns to school quality in residential location choice, and the economic incentive among immigrant parents is reflected by φ. Since the school quality S j may be correlated with unobserved location characteristics e j , it is likely that the main effect of school quality α 1h is biased. However, if all households with children value e j equally, φ is more likely to be unbiased as it captures the variation related to school quality in location choices across households or across origins.

3 Data

3.1 Main samples

The main dataset employed in this paper is the 5 % Integrated Public Use Microsample Series (IPUMS) version of the 2000 Census. In the public use Census data, household location is identified at the Public Use Microdata Area (PUMA) level. I examine the households in the Los Angeles Metropolitan Area. This area covers two counties - Los Angeles County and Orange County, and is divided into 84 PUMAs. I further restrict my sample to households that earn a positive income and have moved to their current location, from outside of the Los Angeles Metropolitan Area, within the past five years.

I study the population in the Los Angeles Metropolitan Area for three reasons. First, unlike many other states, school spending is not directly related to local property taxes in California. The State Supreme Court’s decision in the case Serrano v. Priest (1971) mitigates the problem of controlling for the effective tax rate as it relates to school expenditure across regions. In this case, spending on public education in a district is less endogenous to the composition of households. Second, the smallest geographic area in the public version of the Census is the PUMA which generally follows county or city boundaries and consists of 100,000+ residents. Since Los Angeles is densely populated, all the PUMAs in this area are geographically small so that the choice of residing in a certain PUMA is less constrained by the location of employment. Third, the area has a diverse population with a high proportion of immigrants and high discrepancy in school quality across districts.

The above criteria were met by 11,821 immigrant households in the 2000 Census. Summary statistics of household characteristics of the entire immigrant population in the Los Angeles Metropolitan Area and the immigrant movers are reported in the first two columns in Table 1. Relative to the whole population, the movers are younger, slightly better educated with smaller family size, and migrated to the U.S. more recently. The reported adjusted household income is the total household income divided by the family equivalent scale (Citro and Michael 1995). The racial composition of the movers is similar to that of the whole population: more than half of the immigrant households are Hispanic, and Asians make up the second largest group. About 34 % of the movers have children under 18 in the household.

Table 1 Summary statistics on the census sample

For the main analysis on immigrant households, I restrict the sample of immigrant movers by the propensity score to have children so that the preferences toward location characteristics other than school quality are more likely to be equal. Table 2 shows the estimates from the regression of having children on observed household characteristics and Fig. 1 depicts the distributions of the propensity score for households with and without children respectively. Among all the variables, marital status appears to be the strongest predictor for having children. Family size and the number of families in the household also explain a sizable proportion of variation in having children.Footnote 9

Table 2 Propensity to have children under 18 in the household
Fig. 1
figure 1

Propensity score for having children under 18 years of age

I further compare the distribution of household characteristics that may be key determinants for both the fertility and the residential location decisions, namely, household income, householder’s age, marital status, and family size by propensity score. Figure 2 depicts the averages of these characteristics within propensity score neighborhoods by group. Except the left tail of adjusted household income and the right tail of marital status, households with and without children with similar propensity scores appear analogous. The householder’s age of parents deviates from that of non-parents to a certain extent in the middle range of propensity score, yet there is no consistent trend.Footnote 10These figures may indicate the propensity score matches the characteristics of households with and without children effectively.

Fig. 2
figure 2

Family characteristics by propensity score

Column 3 Table 1 reports the summary statistics for the sample of immigrant households trimmed by the propensity score. The trimmed sample accounts for approximately 70 % of immigrant movers and 10 % of all immigrant households. The trimmed sample has lower adjusted income and larger number of children, but resembles the untrimmed sample in most of the other respects.

3.2 School quality

The school quality measure employed is the 1999 Academic Performance Index (API) of public schools from the California Department of Education. The API Report is part of California’s Accountability Progress Reporting which starts in 1999. The report measures the academic success of California’s nearly 10,000 public schools in over 1,000 school districts and local educational agencies. A school’s API is a number that ranges from 200 to 1,000 and is calculated from the results for each school’s students on statewide tests. The API Reports are publicly available to parents and guardians as indicators of school performance.

I employ the APIs of high schools as a measure for local school quality. Since the number of high schools in each school district is much smaller than the number of elementary or middle schools, using high school quality largely mitigates problems related to discrepancies of school quality within a district. Also, since school quality is highly correlated across levels,Footnote 11high school quality well represents the school quality of all grade levels in a district.

Admittedly, some households in my sample moved to their current location prior to 1999. When they evaluated location attributes and made a decision about where to live, school quality may have been different from that measured by the 1999 APIs. Nevertheless, school quality is likely to be stable over a five-year period.Footnote 12 As earlier data are not collected, the APIs in 1999 are the best available indicator for school quality when households chose their current locations.

According to the data obtained from the California Department of Education, there are 233 public high schools in 67 school districts in the Los Angeles Metropolitan Area. However, the PUMAs defined in the public use Census data do not tend to line up with school district or attendance zone boundaries. It is possible that several small school districts are contained in one PUMA, while a large school district like the Los Angeles Unified School District consists of several PUMAs.

The aggregation of the individual school API to the PUMA level proceeds as follows. First, except for the Los Angeles Unified School District,Footnote 13 the mean API of each school district weighted by school enrollments is calculated. Second, I average the district mean APIs to the PUMA level weighted by the population in the intersections of each PUMA and overlapping school districts. Third, lacking data for attendance zone for each school, I simply calculate the mean API weighted by school enrollment for every PUMA within the Los Angeles Unified School District. Table 3 reports the summary statistics for PUMA mean APIs.

Table 3 Summary statistics on the neighborhood characteristics

3.3 Housing prices

The Census has collected an array of measures related to housing: a binary variable whether the unit is owned or rented, the corresponding rent or owner-reported value, property tax payment, number of rooms, number of bedrooms, type of structure, the age of the building, and etc. Because house values are self-reported, it is difficult to ascertain whether these prices represent the current market value of the property, especially if the owner purchased the house many years ago. A second deficiency of the house values reported in the Census is that they are top-coded at $500,000. In the Los Angeles Metropolitan Area, it is not unusual that the top-code is binding. Therefore, I employ the reported monthly rent instead. Presumably, rents are subject to less misreporting than house values, even though renters who have occupied a unit for a long time may receive some sort of tenure discount (Bayer et al. 2007).

In order to get a more accurate measure for market rent that is comparable across PUMAs, I first run a hedonic price regression on all the households with cash rent in the Los Angeles Metropolitan Area in the 2000 Census. Specifically, I regress the reported gross rent on the tenure of the current renter, a full set of PUMA dummy variables, and a series of house characteristics, including number of rooms, number of bedrooms, number of units in the structure, whether there is a kitchen, and the age of the building. I utilize the estimated PUMA fixed effect as a measure for overall housing price of each PUMA.

There may be two empirical issues related to the housing price measure to address. First, the housing price measure derived from reported rents may suffer from the problem of endogeneity. An analogous problem is commonly discussed in the empirical industrial organization literature in which market shares and prices are simultaneously determined while consumer level data is often unavailable (Berry 1994). Relative to a market-level analysis, the use of household level data reduces the simultaneity problem (Barrow 2002). Also, as my sample accounts for only less than 5 % of all the households dwelling in Los Angeles, it is likely that the sample is not representative of net market demand shifts. Second, housing price not only accounts for the cost to live in a certain neighborhood, it also capitalizes the value of local amenities to the marginal homebuyer. This may lead to a positively biased estimate of price and complicates the interpretation of the coefficient on school quality, since the difference in the value across marginal residents should already be reflected in the price. Yet in both cases, the approach of netting out the weights estimated for households without children mitigates the problem, and the interaction between school quality and having children under 18 may not be biased. The question studied is thereby interpreted as, given the price a household has to pay to live in a community, whether households with children sort to expensive communities where the public bundle is skewed toward good schools.

3.4 Neighborhood characteristics

The data for neighborhood characteristics, including sociodemographic characteristics, house features and local amenities are from various sources.

The sociodemographic characteristics of each PUMA, including the racial distribution, age structure, percentage of immigrants, percentage of urban areas, percentage of unemployed, and median household income, are extracted via the Missouri Census Data Center’s Dexter Data Extractor. Some other characteristics which are not covered by Dexter Data Extractor, such as median educational attainment, fraction of private school enrollments among households with children, percentage of owned houses, density, and total number of house units are directly calculated from the 2000 Census. Similarly, the PUMA average house characteristics, including the age of the building and the number of bedrooms are also calculated from the 2000 Census data.

The conventional monocentric urban model assumes all employment is located at the center of a circular city encompassed by a suburban ring (Straszheim 1987). Previous literature on residential location choices often uses the distance to the city center, or the Central Business District (CBD) , to proxy for access to employment. However, with the decline of central cities and the growth of suburbs, more than one CBD has emerged in populous metropolitan areas like Los Angeles. Hence, I measure the job access of different PUMAs using the average commute time to work among all employed individuals in each PUMA.

The data of crime rates for Los Angeles County and Orange County are from the Criminal Justice Statistics Center Databases of California. Other local amenities data, including parks, metro stations, hospitals, and colleges are derived from the Geographic Information System documents provided by the Cal-Atlas Geospatial Clearinghouse.

The summary statistics for the neighborhood characteristics are also reported in Table 3.

3.5 Selective migration

Six origin-level characteristics are employed to measure selective migration: distance to the United States,Footnote 14 whether English is an official language,Footnote 15 fraction of refugees, income inequality relative to the U.S., per capita GDP, and the relative share of national origin among immigrant population in the Los Angeles Metropolitan Area.Footnote 16

As the Census does not collect information about immigration status, I use the fraction of refugees among migrants from each national origin instead so as to incorporate the distinction in motivation for migration between refugees and economic immigrants (Cortes 2004). The data of refugees and asylums granted lawful permanent residents are from the 2000 Yearbook of Immigration Statistics provided by the U.S. Department of Homeland Security.Footnote 17

I calculate the ratio of occupational income at the 90th percentile to the 10th percentile in each country as a measure for income inequality. The ratios are obtained from the Occupational Wages around the World (OWW) Database by Freeman and Oostendorp. The dataset collects occupational wages for 161 occupations in 171 countries and regions from 1983 to 2008. To capture the persistent dispersion by nation relative to the U.S., I estimate the country fixed effects from a regression of the ratio on a specification that includes year effects.Footnote 18 I also include the origin country GDP per capita so as to control for the mean of the income distribution of each country. The GDP data are from the Statistics on World Population, GDP and Per Capita GDP by Maddison.

Table 4 Panel A presents the summary statistics for these factors.Footnote 19

Table 4 Summary statistics on the origin characteristics of immigrants

3.6 Returns to education

Because an individual’s return to education is endogenous to the quality of the school he or she attends (Card and Krueger 1992; 1996), I use the returns to education among the parental generation, i.e. first-generation immigrants and link these to how immigrant households value schools when choosing residences. It is possible that immigrant parents’ perceptions of potential future earnings of their children come from their own experiences in the labor market.

Since the local labor market in the Los Angeles Metropolitan Area is unique compared to other parts of the U.S., I derive the static ”local” returns to education in the area by immigrants’ country of origin. The ”local” returns to education are estimated from the 2000 Census data based on the same regression function proposed by Bratsberg and Terrell (2002).Footnote 20 Table 4 Panel A presents the summary statistics.

It is worth mentioning that variation in returns to education of first-generation immigrants could partly result from self-selection. Table 4 Panel B displays the relationship between returns to education and the measures for selective migration. Distance from the source country to the U.S. is positively correlated with the returns to education, indicating that more motivated immigrants enjoy better labor market achievements in the U.S. Immigrants who have English as their native language and who are from wealthier countries also tend to earn more. The share of national origin shows a negative relationship with the returns to education, perhaps as a result of the high fraction of Hispanic immigrants in the Los Angeles Metropolitan Area. The six selection measures together explain about 53 % of the cross-origin variation in the returns to education among first-generation immigrants. Some other factors, such as origin-specific attitudes toward education and the demand side of the U.S. labor market, may also be a determinant. Therefore, I also test controlling for the factors that may affect selective migration when studying the role of returns to education in how parents value school quality.

4 Immigrant values on school quality

4.1 The role of school quality

Table 5 presents the results for different specifications of the conditional logit model. The dependent variable is an indicator for residential location choice among 84 PUMAs.

Table 5 Conditional logit model of residential location choices

The first regression includes only school quality as measured by the API score, the interaction term between the API and the child indicator, adjusted rent as the proxy for the cost of living in a given location, and the number of house units in each PUMA to account for size differences across neighborhoods. Estimates from this parsimonious specification imply that immigrant households without children assign negative weight to school quality in residential location choices while households with children value school quality significantly more. Both types of households tend to avoid areas where housing price is high. As discussed above, it is possible that both the rent and API are correlated with other neighborhood characteristics, and thereby capture households’ preferences over those characteristics.

The second regression takes more location attributes into account, including neighborhood sociodemographics, average house characteristics, and local amenities. When other location characteristics are controlled for, both coefficients on the API and the rent become positive. The parent-non-parent difference in the weight placed on the API stays positive and significant, implying immigrant parents value school quality in choosing where to live.

To rule out the possibility that the API-child interaction picks up differential preferences toward non-school characteristics between households with and without children, column 3 include the interactions between the child indicator and other location attributes in addition. The estimated main effects of location characteristics resemble those in column 2, and so is the interaction effect of the API on households with children. The estimates being insensitive to including additional interaction terms may support the validity of the assumption for identification that parents and non-parents view unobserved non-school attributes similarly.

Moreover, the fourth column assumes that the set of weights assigned to location characteristics vary by household income and householder’s educational attainment. I add a series of interactions between these two household characteristicsFootnote 21 and the location attributes, including the API score, to the regression. The estimate on API-child interaction is not affected.

Again, column 5 includes a series of interactions between location characteristics and the child indicator when adjusting for effects of household income and householder’s education. Similar estimates are produced.

To quantify the implications for choice of residence (Ai and Norton 2003), I calculate the difference in the probability change to live in a certain PUMA between a hypothetical household with children and a hypothetical household without children if the API of that PUMA increases by one standard deviation based on the estimates in column 4. The adjusted household income and householder’s education are assumed to be at the sample means. The APIs of other PUMAs and all the non-school location attributes are held constant. The parent-non-parent difference is calculated repeatedly for all the 84 PUMAs. Table 5 displays the average and standard deviation in percentage points. The average percentage change is also reported. The simulated results suggest that a one standard deviation increase of the API in a PUMA raises the probability that an immigrant household with children select that PUMA relative to an immigrant household without children by .17 percentage points on average, representing a 14 % change.

4.2 Different effects by socioeconomic status

A selection model of migration (Chiswick 1978; Borjas 1987) suggests that migrants are highly inspired for labor market success regardless of their socioeconomic background. If so, household income or householder’s education should not be predictive for how immigrants evaluate education.

However, despite the value on schools, both household income and householder’s education play a pivotal role in residential choices through budget constraint. Low-income households have more restricted choice sets compared to their wealthier counterparts. Householder’s education may also capture the parental tastes for education - better educated individuals may favor education more.

As these two household characteristics may lead to heterogeneity in the parent-non-parent difference in the weight assigned to school quality, I examine the sample by households’ socioeconomic status.

First, I split the trimmed sample into five household income quintileFootnote 22 groups and run separate regressions using the specification presented in column 4 Table 5 on each group. Table 6 reports the results. The base effect of the API score is not significant in all income quintiles. The API-child interaction exhibits a monotonic relationship with income: the importance of school quality grows as household income grows. The interaction effect of the API on households with children is all positive, and it is statistically significant in the upper four quintiles. Admittedly, the difference in value placed on school quality across income quintiles is more likely to reflect the de facto choice sets of immigrant households of different income levels instead of differential tastes for children’s education.

Table 6 School quality and residential location choices by group

Second, I examine how the value placed on school quality varies by the educational level of the householder. Five categories are generated: 1) high school dropouts, 2) high school graduates, 3) attended some college, 4) college graduates, and 5) postgraduates. To isolate the effect of education from that of income constraints, I include an additional three-way interaction of the API, the child indicator, and the standardized adjusted household income in each regression. The weights allocated to other location characteristics are assumed to vary only by household income within each educational level. Table 6 displays the regression results by education. Consistent with the findings of Barrow (2002) and Bayer et al. (2007), the magnitude of the parent-non-parent difference in value on school quality is positively associated with educational attainment of householders.

4.3 Robustness

I explore modifying the benchmark regressions in Table 5 in various ways to test whether the finding that immigrant parents value education is robust.

First, to address the concern that the private school options may partly break the link between residential location and school to attend, I test including the percentage of households who send children to private schools in each PUMA as a proxy for households’ propensity to choose private schools over public schools. Second, in case the choice sets of certain immigrant groups are mischaracterized, I focus on naturalized immigrant households who resemble natives more and are probably better informed in residential selection, and exclude households with household heads themselves are in school so that the residential choices of the remaining sample are not restricted by the school location of household heads. Third, to assure that immigrant households with and without children have homogenous preferences toward unobserved non-school location attributes, I test two subsamples: households with householders aged 35 to 54, and households who rent. I also test accounting for the difference between households who would potentially have children in the near future and other households without children. Fourth, to address the potential differences in mobility, I examine out-of-state movers and movers who moved within Los Angeles Metropolitan Area. Last, I employ an alternative school quality measure, the SAT score. All robustness checks produce reassuring results. More details are provided in Appendix A.3.

5 Immigrant v.s. native value on school quality

Earlier studies find that second-generation immigrants outperform the children of natives in education and wages. Even children of the least-educated immigrant origin groups have closed most of the education gap with the children of natives (Chiswick and DebBurman 2004; Card 2005). One relevant hypothesis to test is whether immigrant parents emphasize schools more in residential locations than native parents. If so, this may shed light on the higher labor market achievements of the second generation.

Immigrant and native households are distinct in many aspects. The two groups may not share the common preferences over the unobserved location attributes or face the same choice set of residential locations. For instance, immigrants may prefer to cluster with immigrant households, but native households do not. Compared to the native-born, some immigrant households are more constrained by limited English skills and lack of access to public goods.

Therefore, instead of contrasting immigrant parents with native parents directly, I compare the parent-non-parent difference among the two groups. Presumably, the households with and without children within each group have similar unobserved preferences over non-school location attributes. I again employ the propensity score trimming method to balance the two groups.

The sample of native households is drawn based on the same criterion for the immigrant sample. That is, only households who earn a positive income and moved to the Los Angeles Metropolitan Area within the past five years are included. I estimate the propensity score for being an immigrant household from observable household and householder characteristics, and remove observations with estimated propensities below 0.1 or above 0.9 from the sample.Footnote 23 , Footnote 24 Then I run the regression in column 4 Table 5 on the two groups separately.

Table 7 reports the weight assigned to school quality by immigrant households versus native households as a whole, by income quintile, and by educational level.Footnote 25 In general, the interaction effects of the API are of similar size on immigrants and natives. The weight on school quality is positively associated with household income and householder’s education for both groups. Yet among the households in the lowest income quintile and the households with household heads who attended some college, immigrant parents place significantly higher weight than their native counterparts.

Table 7 Residential location choices: immigrants vs. natives

I also explore disaggregating the sample of immigrant and native households with both parents present into smaller groups by the migration status of the household head and the spouse.Footnote 26 No significant differences are detected across these smaller groups.

6 Mechanisms for differential values on school quality

This section investigates the potential economic mechanisms that drive differential evaluation of education among immigrant parents. Three aspects are examined: age and number of children in the household, selective migration, and future returns to education.

6.1 Age and number of children

In this section, I examine the differential demand for the level and quantity of schooling services of immigrants due to the age and the number of children in the household.

I start with estimating Equation 8 using the trimmed sample of immigrant households as a whole and by income quintiles. This model specification allows the weight placed on school quality vary by the age structure of children. Similar to column 4 Table 5, I control for household income and householder’s education. The regression results are presented in Table 8.

Table 8 Value on school quality and age of chidren

For households with lower income, having children under six lowers the weight assigned to school quality. It is likely that children of this age range have no direct needs for schooling services but leads to tighter budgets. Households with school-aged children, no matter in elementary or secondary schools, basically value school quality more than households having no children or very young children.

In the highest two income quintiles, households with children of all ages assign positive weight on school quality with no significant difference observed across the age groups. This may suggest that parents do not take children’s age into calculation to choose where to live when the resources are abundant.

Though the above analysis provides interesting insights into the heterogeneity across the age structure of children, one caveat is that the three age groups are not mutually exclusive. Households with multiple children may belong to more than one category. It is difficult to disentangle the values over education for children of different ages in such households. Also, the number of categories that households fall into is related to the number of children in the households. So a household fits into more categories may also face a tighter budget constraint.

Accordingly, I move on to study the relationship between differential values on school quality and the number of children.

I focus on households with children only and replace the API-child interaction in the specification in column 4 Table 5 with an interaction term between the API score and the log number of children under 18 in the household. That is, I assume that the value on school quality embodied in residential location choices is linearly related to the log number of children. The estimates are shown in column 1 Panel A in Table 9. Presumably due to contradictory effects of number of children, the interaction term is not significantly different from zero. In column 2, I add two additional interactions: one between the API and the log number of children between 6 and 12; and one between the API and the log number of children between 12 and 18. When the these two interactions are added, the total number of children is negatively and significantly related to the weight on the API; and both measures for school-aged children are positively and significantly related to the weight on the API. As the value placed on school quality may vary in a systematic manner across origins, I further control for country fixed effects in column 3 so that the numbers of children is unlikely to capture the country effects through differential ethnic social norms on fertility. Similar estimates are produced.

Table 9 Value on school quality and number of chidren

These results may suggest that immigrant households with more school-aged children value school quality more when deciding where to live since they can benefit more from better schools. When origin fixed effects are controlled for, increasing the number of children aged 6–12 from mean to one standard deviation higher leads to an increase of 0.2 percentage points (11 %) in choosing a PUMA with top 10 % APIs; and increasing the number of children aged 12–18 from mean to one standard deviation higher leads to an increase of 0.2 percentage points (15 %) in choosing a PUMA with top 10 % APIs. On the other hand, given a fixed number of direct consumers of public schooling services, the total number of children is negatively related to the weight placed on school, presumably because of tighter budget or lower general value on children’s education.

I also examine the relationship between weight on school and number of children by income quintile. The results are displayed in Panel B Table 9. The pattern of the estimates among the lower three quintiles resemble those estimated from the whole sample. But among the upper two quintiles, the total number of children, or number of children of a certain age range does not show much explanatory power for the weights assigned to school quality. It is possible that when less financially restricted, immigrant parents choose to locate in good school districts in spite of the number of children benefited. Nonetheless, the much smaller group size leads to larger standard errors.

6.2 Selective migration

One potential explanation for immigrants to place the more weight on school quality than their native counterparts is that migrants have higher aspirations for labor market success (Chiswick 1978; Borjas 1987; Chiswick 2000). Since the motivation of native-born population is difficult to quantify, I examine the varying motivation and quality among immigrants through selective migration. The effects of the selection on weight assigned to school quality are estimated using Eq. 9. That is, I interact the six selection measures listed in Table 4 and years since migration with the API. Only households with children are included in this analysis.

Table 10 displays the regression results. All the location attributes are controlled for. The weights placed on these attributes are allowed to vary by household income and householder’s education so that the measures for selective migration are unlikely to pick up the effects of income or education.Footnote 27 The first column only investigates the effects of distance to the U.S. and whether English is an official language in the origin country. Fraction of refugees, income inequality measure and per capita GDP, share of national origin, and years since migration to the U.S. are introduced to the model one by one.

Table 10 Value on school quality and selective migration

The distance between the source country and the U.S. is positively associated with the propensity that households choose to live in areas with good public schools, while English being an official language is negatively related to the odds of living in better school districts. This finding supports the argument by Chiswick and DebBurman (2004) that immigrants from non-English speaking countries manifest a higher demand for investment in education so as to increase the transferability of origin country skills. The coefficient on the interaction between the API and GDP per capita is positive and significant, which may suggest that households originated from wealthier countries are likely to locate in neighborhoods of high school quality.

Table 11 presents the coefficients estimated using the specification in column 3 Table 10 by income quintile. For all income quintiles, distance to the U.S. and English as an official language show some effects on the value assigned to school quality, and they all have the same sign as those estimated from the whole sample of immigrant parents. The interaction effect of API and origin GDP per capita is positive and significant in most cases. Both the national origin share and the years migrated are significant and positive in the lowest income quintile. This may suggest that, among the low-income households, those who have a bigger ethnic network and those who have a better knowledge of the U.S. tend to locate in better school district. Nevertheless, as number of households in the upper three quintiles declines, most of the estimates become less statistically significant due to larger standard errors.Footnote 28

Table 11 Value on school quality and selective migration by income quintile

In addition, I estimate the effect of selective migration among immigrant households with and without children under 18 using a two-step model (Card and Krueger 1992). The first step is to estimate the origin-specific value on school quality by interacting the API-child interaction with a set of origin dummies from all immigrant households involved in earlier analysis.Footnote 29 In the second step, I regress the estimated origin-specific API-child interaction effect on the seven measures by weighted least squares,employing the inverse of the sample variance of the origin-specific parent-non-parent difference, estimated in the first step, as the weight. Although the two-step regression model does not compare to the one-step regression perfectly for a non-linear model like the conditional logit model, it helps illustrate the diversity in the value placed on school quality by origins and enables verification of the approximate extent to which selective migration captures an important component of the country effects. The coefficients estimated by a two-step model have analogous magnitude to those derived from a conditional logit model. The R-square suggests that the seven variables together explain approximately 20 % of the variation in the value put on school quality among immigrants across origins. This may imply that selective migration may partly explain the emphasis immigrants place on school quality in the U.S.

6.3 Returns to education

The effect of ”local” returns to education on the value placed on school quality in location choices by immigrant parents is also estimated by Equation 9. Table 12 displays the regression results for the link between the returns to education and the value placed on school. The first three columns only include the interaction between the API and returns to education, and the latter three control for additional origin characteristics as the returns to education among immigrants are in part determined by their self-selection.

Table 12 Value on school quality and returns to education

For the whole sample of immigrant households with children, the estimated coefficient on the interaction between the API and returns to education is positive and significant no matter the measures for selective migration are controlled or not. Based on the estimates in column 5, if the returns to education increase by one standard deviation from the mean, the average propensity for immigrant parents to choose a PUMA with top 10 % API score in the Los Angeles Metropolitan Area would increase by about 0.6 percentage points, accounting for a 50 % change. Given potential heterogeneity in the location choice set across households with different income levels, I estimate the relation between returns to education and value on school by income quintile. There is no clear pattern of the effect of returns to education across income quintiles. In general, returns to education are positively associated with how households value school quality regardless of household income. Its effect is the significant in most subgroups.Footnote 30

I also utilize a two-step regression model to test the effect of returns to education on values placed on school quality. Different from the previous section, in the second step, I regress the estimated origin-specific API-child interaction effect on the returns to education. Figure 3 depicts the estimated origin-specific weights on school quality against the immigrants’ returns to education as well as the regression line obtained in the second step. The slope of the regression line is 143. The R-square suggests about 28 % of the heterogeneity in the weights placed on school quality across origins is explained by returns to education. If other origin characteristics are also included in the second step, the eight variables together explain more than 48 % of the variation in weight on school. Hence it may be concluded that returns to education partly influence how households value school quality, and parents are more willing to invest on education if the potential payoff for their children is higher in the future.

Fig. 3
figure 3

Weight placed on school quality v.s. returns to education

Last, provided that cities are not isolated economies and immigrants flow between cities, I test replacing the ”local” returns to education with the U.S. ”universal” returns to education estimated by Bratsberg and Terrell (2002).Footnote 31 The regression results resemble those in Table 12.

7 Conclusion and discussion

This paper studies whether and why immigrants value school for their children in host countries. The values placed on school quality are assessed through households’ residential location choices.

The empirical analysis suggests that immigrant parents exercise school choice through the choice of residential locations in the United States. When allowing for income heterogeneity in the weights assigned to school quality, the importance of schools increases as household income or householder’s education increases.

I explore the potential reasons why immigrants are willing to invest on school for their offspring. First, households with school-aged children incline to value school quality more than households with children below school age. The number of school-aged children is positively associated with the weight assigned to schools in residential location choice. Second, selective migration explains a part of the heterogeneity in value placed on school among immigrant parents. Immigrants who migrated to the U.S. from a far away or a non-English speaking country are more likely to reside areas with better schools. These results may indicate that immigrants who are more motivated for labor market success tend to emphasize their children’s education more. Last, higher expected returns to education lead immigrant parents to invest more in children’s education. When allowing the value placed on school quality in location choices to vary by country of origin, the origin-specific returns to education show a positive relationship with the probability of immigrant households with children selecting areas where school quality is high. The measures for selective migration and that for returns to education explain approximately 20 % and 28 % of the cross-origin variation in weights put on education respectively.

The paper has a number of policy implications for immigration and public education. First, the Immigration and Nationality Act of 1965 which proposed a preference system on immigrants’ skills and family relationships has resulted in a remarkable increase in the number of immigrants from Asia and Latin America to the U.S. Despite the overall declining entry earnings of the immigrants subject to the 1965 Amendments (Borjas 1987; 1995a; Jasso et al. 2000), the paper verifies a close link between favorable self-selection and aspirations for human capital investment in their offspring among the new immigrants.Footnote 32 On the one hand, the higher incentive to invest in human capital may lead migrants with low initial skills, such as those coming under family-tie categories, to catch up faster to the native-born population (Duleep and Regets 1999). On the other hand, since the 1965 act also favors skills, a sizable proportion of current immigrants have matched or even surpassed the majority of natives in socioeconomic status soon after arrival (Yu 2003), and are thereby not financially restricted in exercising their children’s school choice through residential selection. Hence, the change in the immigration policy may lead to an increasing demand for educational quality in the U.S. Relevant questions that arise may include whether the inflow of immigrants who place a high value on school increases the provision level (quality) of public schooling services or causes it to deteriorate by lowering public expenditure per student.

Second, the paper provides additional evidence on immigrant human capital investment and assimilation (Chiswick 1978; Duleep and Regets 1999). The relationship between motivation for migration and value placed on school quality by immigrant parents may help explain the dynamics in the labor market of immigrant receiving societies, more specifically, the higher achievements in education and the labor market of second-generation immigrants (Chiswick 1988; Boyd and Grieco 1998; Chiswick and DebBurman 2004). Due to the increasing importance of immigrants and their descendants in the U.S. workforce and the critical role of education in labor market success, it is necessary to understand the educational and earning advantages of the second generation and relate them to the immigration regulations.

Third, the findings in this paper shed light on the effects of school quality on the residential sorting of immigrants. Relative to earlier studies on location choices of immigrants that are mostly conducted at a broader level (Jaeger 2004; Chiswick and Miller 2004), such as choices of regions or metropolitans, this paper provides insights into immigrants’ choices of communities within a metropolitan area. Estimates of a wider range of underlying preference parameters help understand how immigrant households sort in the local housing market, which in turn determines the pattern of residential segregation and the matching of households to schools. Residential sorting not only affects the spatial assimilation of immigrants themselves, but also influences the dynamics in ethnic enclaves, the local labor market, and public good provision in destination economies. The change in the sociodemographics of neighborhoods and the body of students in local schools as well as the matching of immigrant households to local public schools are closely related to the impact of immigration on public education.

Finally, in addition to the predominant understanding of the ethnic clustering of immigrants, the paper finds that similar to the natives, immigrant parents care about public school quality in residential location choices. From a policy perspective, the result may imply that the provision of public goods, specifically, public schooling services, could be used as a tool by the government to regulate the residential sorting of immigrants and influence their spatial assimilation. Since immigrants are in general less likely to choose private schools than natives, they are more likely to benefit from public education reforms. In particular, providing more educational options that relax the strict link between residential location and school choice, such as enacting school choice programs and intradistrict open enrollment, may reduce both residential and school segregation and improve the overall welfare of immigrant households as well as other groups. Moreover, as both immigrant and native parents value school quality, an improvement of the general level of school quality may also help both groups.