Introduction

There is a great deal of research in economics, sociology and geography assessing the degree to which immigrants live or work in segregated environments—aka enclaves—and the impact of doing so. These studies generally argue that living in an enclave offers a host of benefits, such as familiar institutional structures that may ease social integration, but possibly at the expense of broad job and information networks that may connect with economic integration. Some researchers (see, Portes and Jensen 1989; 1992; Zhou and Logan 1989) point to positive economic returns to working in an enclave. Most researchers, however, argue that working in an enclave may offer job opportunities unavailable in the general labour market, but possibly at the expense of occupational mobility and wages (Musterd et al 2008; Kaplan and Douzet 2011; Raaum et al. 2006; Strömgren et al. 2014). Thus, while immigrants often work in firms with high concentrations of immigrants, they typically have poor earnings (see, e.g. Pendakur and Woodcock 2010; Barth et al. 2012).

Recently, in a study that considers both workplace and residential segregation, Strömgren et al. (2014) show that workplace segregation is less severe than residential segregation in Sweden. These two types of segregation (where immigrants live and where they work) are often linked in explanations of poor economic outcomes. An implicit assumption underlying these studies appears to be that living in an enclave increases the probability of working with a co-ethnic population (see, e.g. Bolt et al 2010). Indeed, many researchers have assumed that these two facets of segregation are two sides of the same coin.

In this paper, we use Swedish register data from 2007 to directly assess the degree to which immigrants who live in residential enclaves also work in labour market enclaves. These data provide information on the country-of-birth composition of both each immigrant’s neighbourhood and each immigrant’s workplace as well as individual level socio-demographic characteristics. This means that we can directly measure the correlation between immigrant residential segregation and immigrant workplace segregation at the individual level.

We find that there is a very little correlation between residential and workplace immigrant segregation in Sweden. Unconditionally, the observed correlation at the individual level between indices of residential and workplace segregation is about 0.24. Strömgren et al. (2014) also find a positive correlation. Conditional on individual characteristics that affect the segregation of individuals, such as education and years of residence in Sweden etc., the estimated correlation is only about half as large, equal to 0.14. If we narrow our definition of concentration to equal the proportion of immigrants who share the same place-of-birth (the preferred definition of an enclave among sociologists), the unconditional correlation is 0.11 and conditional on the characteristics of immigrants is only 0.04. If residential and workplace correlations really are two sides of the same coin, these correlations would be closer to 1. Thus, our results suggest that policy should target both types of segregation separately rather than assume that residential and workplace segregation is essentially one and the same.

We next discuss the state of the literature on residential and workplace concentration of immigrants in Sweden and around the world, and then move on to our empirical methodology and a detailed analysis of our results.

Residential and Workplace Concentration

Recent research in Canada (Hiebert 2009), the USA (Card 2009; Iceland 2009), Sweden (Bevelander and Pendakur 2012), Denmark (Andersen 2010) and other immigrant-receiving countries has established that immigrants often concentrate in residential neighbourhoods.

Sociologists list a broad range of reasons why immigrants may choose to live in neighbourhoods dominated by co-ethnics (see Bonacich and Model 1980a; Wilson and Portes 1980; Breton 1974; Hansen et al. 2010). Within the context of labour markets, cultural communities may be closely connected to labour market enclaves for three reasons (see Bonacich and Model 1980a; Wilson and Portes 1980; Akbari and Aydede 2011). First, labour market enclaves may offer a degree of social comfort through language and shared identity that is not available outside the enclave. Second, ethnically defined enclaves may buffer the effects of ethnically based discrimination on the part of mainstream society. Third, Breton (1974) introduces the concept of “institutional completeness,” which in part describes the variety of services available within an ethnic or cultural enclave. Enclaves that are institutionally complete offer a wide variety of services and employment opportunities to group members. Large enclaves are more likely to be institutionally complete than small enclaves. We may then expect workers in large enclaves to earn more than workers in small enclaves because of the greater degree of choice that exists. Pendakur and Pendakur (2002) assess the labour market impact of three types of enclaves in Canada (ethnic, linguistic and ethno-linguistic) and conclude that the size of the ethnic enclave is important in reducing earnings differentials across minority groups.

Within the enclave literature are two long-standing debates revolving around two interrelated conceptual issues. The first concerns the scope (or definition) of the enclave itself: does it revolve around the place of work or place of residence? And, what is the level of analysis? Is it the city or the neighbourhood; is it the firm, the workplace or the work team? The second issue concerns the direction of impact—does working within an enclave have a positive or negative effect on socio-economic outcomes, such as employment probabilities or earnings?

Looking first at issues of scope, Portes and Jensen (1989) define ethnic enclaves on commercial grounds, focusing on businesses dominated by an ethnic minority group (see also Bernabé Aguilera 2009). The argument here is that these business enclaves can improve employment outcomes for members of the ethnic group (see Portes and Jensen 1989; Light 1984; Waldinger 1993; Zhou and Logan 1989). The counter argument suggests that such enclaves are actually exploitative, with benefits flowing primarily to middlemen within the enclave, resulting in lower wages and poorer working condition (see, e.g. Bonacich and Modell 1980b; Sanders and Nee 1987). Gilbertson and Gurak (1993) looking at outcomes for Colombian and Dominican men in New York conclude that working in an enclave also results in lower benefits such as health care and insurance (pp: 218). Raaum et al (2006) assess earnings in Norway and conclude that earnings are affected by neighbourhood characteristics.

Many studies focus on place of residence, perhaps because place of residence is often available on public-use datasets, but the characteristics of the workplace are often not so available (Xie and Gough 2011). But, there are also theoretical rationales for concentrating on where people live as compared to where they work. Sanders and Nee (1987) assess outcomes for Cubans in Florida and Texas, running regressions for selected municipalities. They thus define the enclave from a residential/municipal as opposed to a commercial perspective. Portes and Jensen (1992) countered by arguing that while business owners may have started out living in a residential enclave, they often move out as economic circumstance improves. Nonetheless, the idea of residential enclaves has persisted. Davis (2004), for example, analyse the American 1990 Public Use Microdata Set (PUMS) using this definition of enclaves and concludes that Cuban immigrants earn more working outside the enclave as compared to inside the enclave. Grönqvist (2006) defines enclaves at the municipal rather than neighbourhood level to examine outcomes for immigrant children concluding that growing up in an enclave can negatively affect schooling, but not earnings.

Xie and Gough (2011) use a life course approach and model enclaves both at a residential and place of work level (using language at work as a proxy for working in the enclave). They find few significant results using either definition.

Enclaves in Sweden?

In 2007, Sweden’s foreign born population was nearly 14 % of the total. Of this, nearly 50 % were born outside Europe. About 70 % of all immigrants in Sweden live in the three largest cities of Sweden (Stockholm, Gothenburg and Malmoe), and between 25 and 35 % of the residents of these cities are immigrants (Bevelander 2010).

There has been some work on link between work and residential enclaves. Musterd and Andersson (2006) show that the composition of the neighbourhood has a significant but moderate effect on the employment prospects of immigrants (see also Musterd et al 2008). Hedberg and Tammaru (2012) looking at outcomes in Sweden over time find, not surprisingly, that the probably of employment increases while the probability of living in an immigrant neighbourhood. Andersson and Hammarstedt (2012) assess the link between enclaves and self-employment, concluding that the presence of enclaves increases the propensity to be self-employed.

In an attempt to measure the causal effect of the “enclave”, Edin et al. (2003) measured the effect of the dispersal policy towards new arrived refugees between 1985 and 1989 applied by the Swedish government in contrast to refugees that came during 1981–1984, who could freely choose where to settle. They conclude that living in an enclave seems to increase earnings.

Overall, these studies suggest that there is a link between living in an enclave and working in an enclave. However, they do not test the correlation directly. Rather these studies look at the impact of either living or working in an enclave on job prospects. In this paper, we look specifically at the controlled correlation between living and working in an enclave. Unlike the previous studies, this approach allows to assess the degree to which living in an enclave may affect the probability of also working in an enclave—two distinctly different environments.

Data and Methodology

Our data are drawn from the 2007 STATIV database, which includes a record for every legal resident in Sweden and matches information of the population register to information from the employment register. These data are matched to workplace identifiers for all employed workers. Because we have (nearly) the universe of all workers in Sweden, we are able to completely characterize the immigrant/place-of-birth composition of each workplace and each residential neighbourhood. This enables us to model both neighbourhood segregation and workplace segregation.

These data include information for a total of 4,720,641 individuals aged 25 to 64 years spread across 9231 neighbourhoods and 412,262 workplaces. We restrict our sample to include only working non-Nordic immigrants who live in a municipality with more than 1000 residents and more than 50 immigrants. We drop people working in a workplace with only one worker. We also drop all respondents who do not report a level of schooling or a year of immigration. This leaves us with 279,936 immigrants living in 8085 neighbourhoodsFootnote 1 and working across 45,250 workplaces with more than one person (top three lines of Table 1). In all, we use about 5 % of all observations.

Table 1 Descriptive statistics

The remaining lines of Table 1 provide statistics at the individual level, starting with the dependent variables and continuing with the regressors. For all our dependent variables, the standard deviation is similar in magnitude to the average level, so there is a lot of variation to “explain”.

Our objective is to use these microdata on individual workplace and residential location choices to illuminate the question of whether or not these choices are correlated. The simplest way to investigate this is to measure the raw (or Pearson) correlation coefficient at the individual level between a measure of workplace segregation and a measure of residential segregation. But the raw correlation coefficient leaves something to be desired—it does not control for the characteristics of individuals.

To control for the characteristics of individuals, we specify an equation for each outcome—workplace and neighbourhood segregation—based on individual characteristics and ask whether or not the error terms in these equations are correlated. We have data with many observations of individuals in each workplace and neighbourhood. Let R i and W i be the residential and workplace concentration, respectively, of individuals i = 1,…N. These variables capture the degree to which individual i lives in a neighbourhood or works in a workplace with few or many individuals similar to themselves. We will use two concentration measures, based on definitions of who is similar to whom. In the first, denoted immigrant concentration, we say two workers are similar to each other if they are both immigrants. Then, for an immigrant, R i is equal to the proportion of the residents of person i’s neighbourhood who are immigrants and W i is the proportion of workers at person i’s firm who are immigrants. In the second, denoted place of birth concentration, we say that two workers are similar to each other if they are immigrants from the same country.

Let X i be a vector of characteristics of person i and assume a linear model for both residential and workplace concentration with bivariate normally distributed errors:

$$ {R}_i={X}_i{\beta}_1+{u}_{1i} $$
(1)
$$ {W}_i={X}_i{\beta}_2+{u}_{2i} $$
(2)
$$ \begin{array}{l}{\mathrm{u}}_{1i}\hfill \\ {}{\mathrm{u}}_{2i}\hfill \end{array}\sim N\left(\begin{array}{ll}{\sigma}_1^2\hfill & \rho {\sigma}_1{\sigma}_2\hfill \\ {}\rho {\sigma}_1{\sigma}_2\hfill & {\sigma}_2^2\hfill \end{array}\right) $$
(3)

The raw (Pearson) correlation coefficient measures the correlation between R i and W i . The “controlled” correlation coefficient ρ measures the correlation between u 1i and u 2i , which controls for observables X and asks the correlation between what remains after the effect of X is taken out.

Another way to think about the controlled correlation is that it breaks the raw correlation into two pieces: one driven by observables and another driven by unobservables (given by ρ). Indeed, because u 1i and u 2i are independent of X by assumption, the square of the raw correlation decomposes into the square of ρ and the square of the raw correlation between X 1 β 1 and X 1 β 2.

In all regressions, we include as X i the following regressors: highest level of schooling (seven categories), sex (two categories), age (four categories in 10-year intervals), years in Sweden less 10 and its square. In some regressions, we additionally include 37 country-of-birth dummies.

In the empirical work that follows, we will present raw and controlled correlations for models in which concentration measures are based on either immigrant status or country of birth. All estimated standard errors are clustered at the level of workplace times neighbourhood. Estimation of the model (1)-\ is by maximum likelihood seemingly unrelated regression.

Findings

Table 2 shows the correlation between residential and workplace concentration at the broadest level. Here, we give a two-way cross-tabulation of an indicator variable of immigrant concentration being above or below the median value in the sample for both residential and workplace concentration. If there was no correlation, each cell would hold 25 % of the sample. In our data, the diagonal elements of the cross-tabulation hold about 60 % of the sample, and the off-diagonal elements only about 40 % of the sample. The next lines show that this is roughly the same when we consider place of birth concentration. This distribution is quite close to the “no correlation” distribution: we would only have to move 10 % of the immigrants from one cell to another to achieve an even distribution. So, at this level, we see only a slight correlation. At the top of each panel of Table 2, we report the raw Pearson’s correlation coefficient for residential and workplace immigrant concentration and for residential and workplace place of birth concentration. The raw Pearson’s correlation for immigrant concentration is 0.237 and for place-of-birth concentration is 0.114. This suggests that the proportion of immigrants living in an enclave explains (in a statistical sense) about 6 % (the square of 0.24) of the variance in a firm’s proportion of immigrant workers. For place-of-birth concentration, the explanatory power is close to 1 %. These correlations are low enough that one could not profitably use policy affecting one to manipulate the other.

Table 2 Cross-tabulation of residential and workplace concentration

The Pearson’s r is uncontrolled—we do not know how large the correlation is conditionally on control variables such as age, sex and schooling. If the correlation were much higher conditional on observable variables, which were fixed from the point of view of the policy-maker, then there might yet be a role for policy that focused on just one type of segregation. The controlled correlation provides this information.

Table 3 gives selected parameter estimates from four sets of seemingly unrelated regression models that assess the relationship between the propensity to live in an enclave and the propensity to work in an enclave (Tables A1 to A4 in the Appendix provide all parameter estimates). The left-hand columns give estimated controlled correlations for models that control for age, education, sex and years since migration along with either the proportion of immigrants living in the neighbourhood or working in a firm, or the proportion of people who share the respondent’s place of birth living in the neighbourhood or working in the firm. The right-hand columns give estimated controlled correlations for models that control for these variables plus dummy variables for 37 places of birth. The upper row gives estimates for models where the dependent variables are immigrant concentrations, and the lower row gives estimates for models where the dependent variables are place of birth concentrations.

Table 3 Selected estimates from immigrant and place-of-birth concentration regressions

The model for immigrant concentration without place of birth controls results in a controlled correlation equal to 0.178. Since this number is somewhat smaller than the raw Pearson correlation, the controls eat up some of the positive correlation between residential and workplace segregation. That is, the covariates tend to push both segregation measures in the same direction.

The square of the controlled correlation is about 4 %, suggesting that the overall correlation of 0.24 (top panel of Table 2) which explains 6 % of the variance can be divided into two parts: about 2 percentage points for the observables X and about 4 percentage points for the unobserved variables. Turning to the model for immigrant concentration with place of birth controls, we see a controlled correlation of 0.142. Here, even less correlation is left after we control for the observed characteristics and places of birth of immigrants.

Sociologists studying segregation tend have a working hypothesis that immigrants cannot be lumped together, but rather that we should look for segregation at the level of place-of-birth groups. For example, Portes and Jensen (1989) and Bernabé Aguilera (2009) focus on the importance of home-country networks in economic success. This narrower definition is analogous to most definitions of residential ethnic enclaves in sociology and geography. We consider this in the lower row of Table 3.

In Table 2, we saw that the raw Pearson correlation coefficient is small, equal to 0.114, indicating that living in a neighbourhood with a lot of co-ethnics only explains about 1 % of the variance in co-ethnic concentration in workplaces. In the second line of Table 3, we see that the estimated controlled correlations are only 0.072 and 0.042 for models that do not and do control for place-of-birth, respectively. In the upper line of Table 3, we saw controlled correlations between the proportion of residential neighbours who are immigrants and the proportion of co-workers who are immigrants of around 0.15. When we instead ask for the controlled correlation between the proportion of residential neighbours who are from one’s own country of birth and the proportion of co-workers from that country of birth, we see the correlation drop to around 0.04. Thus, when we consider workplace and residential concentration where enclaves are defined on the basis of country-of-birth, there is essentially no correlation between residential and workplace concentration.

Recall that the correlation coefficient is a scale-free statistic, so it is not influenced by the fact that the overall levels of concentration are lower when we look at a narrower concentration definition. From a policy standpoint, these correlations are close enough to zero to suggest that residential coethnic concentration and workplace coethnic concentration are each caused by fundamentally different drivers.

Recent Immigrants

Now, we investigate whether or not patterns are different for recent immigrants. Table 4 presents estimates analogous to the Pearson correlations in Table 2 and the estimates in Table 3. We estimated the models for the subsample of immigrants arriving in 1996 or after (with 10 years or less of residence in Sweden) from the 14 source countries that had more than 100 recent immigrants each in the sample.

Table 4 Selected estimates from immigrant concentration regressions, recent immigrants

Looking at the results for recent immigrants, we see that the Pearson correlation is similar as for all immigrants. However, when we look at the controlled correlation, we see that difference between it and the Pearson correlation is smaller than for the entire population of immigrants. This difference is only 0.014 (0.228–0.214) when we do not control for place of birth, about one-third of the difference seen for the full population of immigrants seen in Table 3. The difference is 0.031 (0.228–0.197) when we do control for place of birth, less than half the difference seen for the full population of immigrants reported in Table 3. This means that the observable characteristics of recent immigrants are much less informative as to the correlation of residential and workplace segregation than for other immigrants.

These findings are revealing in two ways. First, recent immigrants do not display a higher correlation between living and working in an enclave than less recent immigrants—the Pearson correlations are the roughly same. If integration really were unidimensional so that living and working in an enclave were highly correlated, we would expect to see this most strongly for recent immigrants. That we do not see this provides further evidence that integration is not unidimensional.

Second, the observable characteristics of recent immigrants eat up less of the covariation between living and working in an enclave than do those of less recent immigrants. One interpretation is that over time the unobserved characteristics of immigrants become less important to that correlation.

Correlations by Country of Birth

Table 5 presents just the Pearson and controlled correlations for seemingly unrelated regression models analogous to the right-hand columns of Table 3, where the dependent variables are defined off of place of birth. The left-hand columns give estimates for 37 places of birth for immigrants. The right-hand columns provide estimated coefficients for recent immigrants for the eight largest recent immmigrant country-of-birth groups (these had at least 130 people each in our sample).

Table 5 Summary statistics from 45 seemingly unrelated regressions assessing the correlation between living and working in an ethnic enclave by place of birth, Sweden, 2007

The results provided in Table 5 point to some real differences in the relationship between living and working in an ethnic enclave by different place-of-birth groups.

Overall, however, in evaluating the correlation separately for each birthplace, we have allowed for the possibility that the small correlation reported in Table 2 masks important variation across places of birth. However, this does not appear to be the case. No birthplace shows a large correlation, and consequently, our finding that residential and workplace segregation are largely separate phenomena seems true regardless of birthplace or recency of immigration.

Most European groups (with the exception of Poland, the Netherlands, Germany and Greece) have controlled correlations that are close to zero. The controlled correlations for immigrants from the Netherlands and Greece are the highest (0.21 and 0.11, respectively) suggesting that 1 to 4 % of the variance in where these immigrants work is explained by where they live. Most non-European groups have quite low controlled correlations (less than about 0.1). Immigrants from Turkey and Morocco, for example, have controlled correlations of 0.08. Immigrants from Somalia have the highest controlled correlation among the non-European groups (0.14). Turning to results for recent immigrants, we see that the Pearson and controlled correlations are not particularly large in comparison with those of all immigrants. Indeed, the controlled correlation for Syrian immigrants is negative (−0.14).

Conclusions

In this paper, we have assessed the link between living and working in an immigrant enclave with a goal toward determining the correlation across the two domains. We find that the correlation between residential segregation and workplace segregation for individuals is generally low. Thus, as the concentration of immigrants or co-ethnics increases in a neighbourhood, the probability of working in an ethnic enclave does not increase. That said that the controlled correlation does tend to be higher for more recent groups. Even here, though, the correlations are less than 0.22, suggesting that only about 4 % of the variance in workplace concentration is a related to variance in residential concentration.

These finding run counter to literature which suggests that neighbourhood and workplace enclaves are closely tied (see, e.g. Beckhusen et al. 2013; Kaplan and Douzet: 2011) but does not exclude the possibility that earnings are lower for residents in immigrant and minority enclaves. These finding suggest that policies which target segregation in the workplace will likely not affect segregation at the residential level (or vice versa) since the two appear to operate separately.