1 Introduction

In recent years, the number of people forcibly displaced has rapidly increased, ushering in a new age of diaspora. According to the United Nations refugee agency, as of 2017, there are 68.5 million forcibly displaced people worldwide, and 25.4 million of them are officially registered as refugees by the United Nations High Commissioner for Refugees (UNHCR) and the United Nations Relief and Works Agency (UNRWA). Especially, Europe is at the center of giant waves. In 2015 alone, more than one million migrants and refugees crossed into Europe, precipitating the European refugee crisis. While some opponents intransigently raise their voices to close the borders and worry about the myriad social and economic problems expected to occur with their exodus to Europe, others firmly stand against such pessimistic voices and accentuate the positive aspects, the main point of which is the expectation that the influx of refugees will invigorate the rapidly aging continent with an active, younger working population.

Whether the set of advantages will far outweigh that of disadvantages and fulfill the sanguine expectations of pro-refugee proponents depends on how the nations that are in charge help those refugees settle down in the initial stages and enable them to become economically self-sufficient as soon as possible without burdening their host populations that much. The specified goals of the United States Refugee Admission Program (USRAP) do well in reflecting the importance of the successful labor market integration of refugees into their host communities. Furthermore, some scholars argue that the successful labor market integration extends well beyond economic issues because bad labor market outcomes for refugees may contribute to their political radicalization (Dagnelie et al. 2019). Therefore, refugees’ smooth labor market integration is of critical importance. For this reason, it is crucial to analyze and understand the uniqueness of the labor market conditions that refugees encounter, and in doing so, contextual heterogeneity should also be taken into careful consideration.

This paper opens with the indisputable fact that refugees are in a highly unfavorable position relative to their host populations, which should be construed as the key underlying feature of refugees as job seekers. Hence, it is obvious that they are much more vulnerable to unexpected external shocks affecting labor market conditions. Additionally, what causes further complications stems from the fact that the magnitude of such vulnerability can be heterogeneous depending on certain factors, especially the cause and agent of a shock. Such heterogeneity is the starting point of the investigation of this paper.

This study questions whether the series of four coordinated terrorist attacks planned and perpetrated by Al-Qaeda against the U.S. on September 11, 2001 triggered any significant changes in the labor market outcomes of refugees resettled in the U.S..Footnote 1 While many studies have investigated the impacts of the September 11 attacks on labor markets both in and outside of the U.S., possible consequences on the labor market outcomes of refugees, who are distinguished from economic migrants or usual nonnatives, have been far too neglected, with not a single individual study investigating them, to the best of the author’s knowledge. Hence, this paper provides the first empirical analysis of the impacts of the attacks on refugees as job seekers. More importantly, this research also sheds new light onto whether the impacts stemming from the attacks were heterogeneous depending on a refugee’s ethnicity or religion, considering that the attacks were planned by the Islamic extremist group and perpetrated by its African and Arab hijackers. In light of the fact that refugees usually compete with one another as job seekers in their resettlement regions, if African and Arab refugees, due to the cause and agent of the September 11th attacks, were more affected by the negative impacts stemming from the attacks, it is a sensible hypothesis that the less affected refugees may have rather benefited from the attacks, which is to be addressed in this study.

In terms of econometric methods, this paper criticizes common, yet naïve, single-level approaches, which ignore the irrefutable fact that refugees form unique networks in their resettlement regions, influence one another, and interact with enclaves composed of their previously resettled compatriots. The important effect of such clustering on nonnatives’ labor market outcomes is already corroborated by Edin et al. (2003) and Damm (2009b). Especially, their networks are important in that they provide information on labor market conditions and opportunities for recent refugees (Munshi 2003; Beaman 2012). Hence, it is a natural inference that those networks make refugees not independent but heavily clustered, and the use of a single-level approach, which naïvely considers all observations to be independent, is improper in this context.

Such clustering of refugees makes the conventional assumption of independently and identically distributed (hereafter, i.i.d.) observations implausible, which is essential for the consistency of many econometric methods, such as maximum likelihood estimation. Especially, in analyzing binary responses, naïve negligence on the violation of i.i.d. often leads to severely biased estimates. In contrast to many previous single-level studies that overlooked this important aspect, the present study relies on a multi-level approach, which i) allows for the plausible violation of the i.i.d. assumption, ii) defines what to cluster over, and iii) controls for cluster-specific heterogeneity in a nonlinear context.

While nonlinear multi-level econometric methods, as claimed by Cameron and Trivedi (2005), have not been well understood, it should be noted that colossally different estimates may be obtained depending on the choice of econometric models when binary multi-level analyses are required (Guo and Zhao 2000; Rodriguez and Goldman 2001; Browne and Draper 2006). Therefore, considering the importance, usefulness, and prevalent unfamiliarity of nonlinear multi-level models, the author hopes that this study provides reliable methodological guidelines for further refugee-related labor market analyses.

The remainder of this paper is structured as follows. Section 2 provides theoretical considerations with a literature review on the September 11 attacks and their socio-economic impacts from a labor market perspective. Details on the data used for this study and explanations on empirical frameworks are given in Sect. 3, and Sect. 4 discusses econometric methods for nonlinear multi-level analyses.Footnote 2 The estimation results of this investigation are presented in Sect. 5, while Sect. 6 explains the robustness checks that have been undertaken.Footnote 3 Section 7 concludes.

2 Theoretical considerations

Compared to their host populations, refugees are far disadvantaged in the labor markets of their destinations, and they are highly vulnerable to negative labor market shocks due to many reasons. A lack of local language skills, cultural differences, career interruptions, undervalued previous work experiences, and unappreciated educational attainments are just a few examples. In addition, one of the biggest hindrances that refugees may face while seeking work is direct and indirect discrimination. They are likely targets of discrimination and widely thought to heavily suffer from prejudice (Burawoy 1976; Piore 1980; Evans and Kelley 1991). What is more, such discrimination can be intensified by negative incidents, which, in severe cases, can even lead to extreme xenophobia.

The four coordinated airplane hijackings and suicide attacks on the U.S. by Al-Qaeda on September 11, 2001 were one such example of an extreme incident. The September 11 attacks arguably exacerbated discrimination toward refugees resettled in the U.S.. In other words, the attacks must have aggravated refugees’ vulnerability as job seekers. Furthermore, it is predictable that the September 11 attacks may have triggered intensified antipathy against persons perceived to be Africans, Arabs, or Muslims due to their ethnic or religious similarity to the terrorists that perpetrated the attacks.Footnote 4 According to Singh (2002), polls conducted by various advocacy groups found that between 20 and 60% of Arab Americans and Muslim Americans said they personally experienced discrimination after the September 11 attacks. They also reported increased incidences of discrimination at work (Kaushal et al. 2007).

From a broader perspective, it is plausible that the September 11 attacks induced a substantial increase in anti-immigrant and anti-foreigner rhetoric as well as anti-Arab and anti-Muslim sentiments. Schüller (2016) found that the attacks caused an immediate shift to more negative attitudes toward immigrants and resulted in a considerable decrease in concerns over xenophobic hostility. Considering this more extensive aspect in conjunction with the aforementioned disadvantages that refugees face, I assert that confining our attention only to Arabs and Muslims and viewing them as the only adversely affected individuals is partial and fragmentary. Rather, analyses should be broadened to include all nonnatives (i.e., all refugees in the case of this study) regardless of their relative similarity to the terrorists who perpetrated the September 11 attacks. This is one of the distinguished contributions of this study compared to other previous papers that only investigated the effects of the attacks on Arabs and Muslims such as Dávila and Mora (2005), Sheridan (2006), Kaushal et al. (2007), Braakmann (2009), and Rabby and Rodgers (2011).

The increased animosity toward refugees, or foreigners in general, engendered by the September 11 attacks may have adversely affected their labor market outcomes in the U.S.. On the labor demand side, such negative impacts may have been manifested in a variety of ways as follows. Most importantly, employers’ preferences may have been influenced: in addition, consumers’ prejudices against refugees may have become aggravated by the attacks. Even inter-employee discrimination among coworkers is possible. In a word, increased hostility after the attacks may have reduced the demand for refugee labor, which accordingly may have decreased refugees’ employment probability. On the labor supply side, the fact that the individuals investigated by this study are refugees obviates the need to consider the elasticity of the supply of labor. Different from economic migrants, refugees have no other options such as returning to home countries than settling down and sustaining themselves by finding employment in their resettlement regions.Footnote 5

Meanwhile, although refugees generally face disadvantages in labor markets and this situation may have been exacerbated by the September 11 attacks, the level of such disadvantages specific to each refugee can be heterogeneous. Considering the cause and agent of the September 11 attacks, it is a sensible guess that refugees with a particular ethnicity, despite all the other important factors held constant, were more (or less) affected than refugees with another ethnicity. In other words, the degree of post-attack prejudice and animosity, which could be causative of labor market discrimination against refugees, may have significantly differed across respective ethnic groups. To put it differently, post-attack anti-refugee sentiment may have been selectively moderated for some ethnic groups and discriminately amplified for others.

In the case of developed economies, it is often argued that foreign-born workers’ primary labor market competitors are not their host populations but other foreign-born job seekers. Since the work of Friedberg and Hunt (1995), a number of studies have corroborated that foreign-born workers do not have a considerable adverse impact on the wages and employment opportunities of the native-born population of the receiving country (Peri 2007; D’Amuri et al. 2010; Peri 2011; Manacorda et al. 2012; Ottaviano and Peri 2012).Footnote 6 This is because the jobs they compete for are most closely substitutes for the jobs held by other foreign-born workers, which tend to be more complementary to the jobs held by natives.Footnote 7 The main reason for this labor market separation resides in the fact that foreign-born workers and native workers have different skills and characteristics, and nonnatives, especially in developed economies, may take jobs that natives do not want (Peri 2014; Foged and Peri 2016). Applied to the case of refugees, this argument implies that refugees compete with one another as job seekers. Beaman (2012) provides empirical evidence corroborating this aspect that refugees’ mutual competition decreases their probability of obtaining employment. Dagnelie et al. (2019) also underlines that a refugee’s employment is negatively affected by the number of refugees already employed, which substantiates the presence of such mutual competition among refugees in their labor markets. Hence, if refugees with a particular ethnicity were less affected by the negative impacts stemming from the September 11 attacks compared to those with another ethnicity, the less affected presumably benefited from the attacks, which selectively increased animosity against the more affected ethnic groups. This is what the present paper seeks to address by examining and quantifying differences in the estimated impacts of the September 11 attacks on labor market outcomes across different ethnic subgroups of refugees resettled in the U.S. Religious subgroups are taken into consideration as well.

3 Data and empirical frameworks

3.1 Data

The repeated cross-sectional data used for this paper describe refugees resettled in the U.S. between 2001 and 2004. Those refugees were distributed to destination cities by a refugee resettlement agency, and this exogenous distribution precludes individual refugees from any systematic sorting.Footnote 8 The data were collected by the International Rescue Committee (IRC), a large resettlement agency that helps refugees. In total, the data set contains 1478 adult refugees randomly chosen from a full administrative data set, 65% of whom became employed within 90 days after their arrival in the U.S.. This data set is unique in terms of several key aspects as follows.

First, all individuals in the data set are refugees, not economic migrants. A refugee refers to persons unable to stay in the country of their nationality due to serious persecution or a well-founded fear of persecution. In order to be granted refugee status for immigration purposes, the foreign national must meet clearly defined requirements, the main point of which is that the refugee has a credible and reasonable fear of severe persecution. Since refugees are forcibly displaced, their decision function of leaving their home countries is generally less sensitive than that of economic immigrants. Hence, the typical selection issue (i.e., selection into defection) is not considered to be serious in the case of this study.Footnote 9 Furthermore, this feature also plays an important role as an identification condition for estimating the impacts of the September 11 attacks as such a rigid decision function guarantees that refugees who arrived before and after the attacks are similar in terms of both observables and unobservables. More explanations on identification conditions are provided in Sect. 3.3.Footnote 10

Second, sample respondents did not have any family members who had already settled down in the U.S. and who could thus assist in their resettlement. In other words, they had no consanguinity-based ties in the U.S. by the time they arrived. This feature makes the effect of individual-level covariates, including the regressor of the September 11 attacks, much clearer, since they have no family members to rely on for their initial resettlement. In many empirical studies on refugees’ labor market outcomes, having family members established in the host country before a refugee’s arrival contaminates estimation results in the form of omitted variable bias unless relevant information is recorded: this data set is completely free from such an issue. Moreover, this aspect also makes it even more important to allow for cluster-specific heterogeneity as their within-cluster (i.e., nationality-based local networks in their resettlement cities) reciprocity is expected to be much stronger compared to refugees who have family members already living in the U.S. that can help them to settle down.Footnote 11

In terms of the dimensions of covariates, various individual characteristics of refugees were recorded at the time of placement, enabling this research to allow for many controls and avoid omitted variable bias as much as possible, such as age, household size, ethnicity, religion, date of birth, initial English language level, previous work experience, education received before coming to the U.S., and so forth. Summary statistics on these extensive covariates are presented in Table 14 in Online Supplement B.4.

Observations on refugees’ job acquisition status, the primary dependent variable of this study, were collected at 90 days after each refugee’s arrival in the U.S..Footnote 12 Such short-term records can be seen as one of the shortcomings of this data set; however, as previously mentioned, since I accentuate the importance of refugees’ early integration into host communities’ labor markets in the initial phase upon arrival, job acquisition measurement at 90 days after arrival is not improper. Also, it is often said in labor economics that the period of 90 days is not too short for refugees to find jobs (Beaman 2012; Dagnelie et al. 2019). In addition, such a short-term nature makes it even clearer whether the September 11 attacks triggered any changes in the early employment of refugees with significant heterogeneity across different ethnic subgroups, which is the primary question of this study.

Another shortcoming of this data set resides in the fact that it is only composed of male refugees: however, I think this matter is not much problematic in the context of this study due to the reasons as follows. First, the key question of this paper is whether or not the September 11 attacks caused ethnicity-specific labor market impacts. Hence, if all ethnic subgroups to be compared were in the same circumstance in terms of their sample composition (i.e., only males), the findings would be still meaningful. Second, while gender may play an important role in affecting labor supply decisions, males usually show comparable patterns in obtaining employment across nations: hence, the labor market outcomes of male refugees can be objectively compared.Footnote 13 Third, this study delves into increased prejudice and discrimination as the main source of negative labor market impacts stemming from the September 11. As the paper of Kaushal et al. (2007) argues, there may be less prejudice toward women because women are usually not associated with terrorism. Thus, in studying the labor market impacts of the September 11 driven by employers’ prejudice and discrimination, the shortcoming of using the sample only composed of male refugees would not be that much critical.

3.2 Empirical frameworks

The key objective of this study is to shed new light onto whether the impacts stemming from the September 11 attacks were heterogeneous depending on a refugee’s ethnicity, an important aspect in consideration of the fact that the attacks were planned by Al-Qaeda and perpetrated by its African and Arab hijackers. For the reason that refugees, as discussed in Sect. 2, usually compete with one another as job seekers in their resettlement regions, if African and Arab refugees were more affected by the negative impacts of the attacks due to the cause and agent of the attacks, it is a sensible hypothesis that refugees of other ethnicities may have benefited from the attacks. This paper seeks to address such a facet by examining and quantifying differences in the estimated impacts of the September 11 attacks on refugees’ labor market outcomes across different ethnic subgroups as follows.

$$\begin{aligned} {{{y_{i}^{m}}={\theta _{0}}+{\theta _{1}\text {Post911}_{i}}+{\sum \limits _{k=2}^{5}\pi _{k}D_{k,i}}+{\sum \limits _{k=2}^{5}\phi _{k}}({\text {Post911}_{i}\cdot D_{k,i}})+{\mathbf {q}_{i}^{\prime }{\varvec{\varphi }}}+{{\tau }_{m}}+{u_{i}}}} \end{aligned}$$
(1)

In (1), the dependent variable \(y_{i}^{m}\) is the labor market outcome of refugee i resettled in city m. The regressor \(\text {Post911}_{i}\) is a dummy variable and coded one if a refugee’s job search periods are affected by the September 11 attacks: zero otherwise. \(D_{k,i}\) is also a dummy variable coded one if refugee i’s ethnicity belongs to a certain ethnic subgroup k (i.e., \(i\in k\)), while there are five categories: Africans, Arabs, Asians, Europeans, and South Americans, among which Africans are the baseline reference group in (1). \(\mathbf {q}_{i}\) is a vector of individual characteristics that include age, household size, religion, initial English language level, previous work experience, education received before coming to the U.S., issuance of a social security number, and so forth. In addition, \({\tau }_{m}\) is resettlement city fixed effects, and \(u_{i}\) represents unmeasured factors affecting refugee i’s labor market outcome. Most importantly, this study delves into the heterogeneity of \(\phi _{k}\), and the marginal effect of the attacks on \(y_{i}^{m}\) is separately estimated for each ethnic subgroup on the basis of \(\theta _{1}\) and \(\phi _{k}\).

On the other hand, it is plausible, as mentioned earlier in Sect. 1, that refugees form unique clusters in their resettlement regions, interact with their cluster members, and share some cluster-specific unobservable factors, which can affect a refugee’s labor market outcome \(y_{i}^{m}\). This facet means that \(u_{i}\) in (1) consists of two components: one caused by clustering and the other purely stemming from idiosyncratic elements. If the former is correlated with regressors, \(u_{i}\) in (1) can cause coefficient estimates to be biased. Therefore, such non-independence among observations caused by clusters should be modeled by decomposing \(u_{i}\) into cluster-specific unobservables \(\alpha _{c}\) and idiosyncratic elements \(\varepsilon _{i}\) as follows:

$$\begin{aligned} {y_{i}^{c,m}}= & {} {\theta _{0}}+{\theta _{1}\text {Post911}_{i}}+{\sum \limits _{k=2}^{5}\pi _{k}D_{k,i}}\nonumber \\&+{\sum \limits _{k=2}^{5}\phi _{k}}({\text {Post911}_{i}\cdot D_{k,i}})+{\mathbf {q}_{i}^{\prime }\varvec{{\varphi }}}+{{\tau }_{m}}+{\alpha _{c}}+{\varepsilon _{i}}, \end{aligned}$$
(2)

where \(y_{i}^{m}\) in (1) is changed to \(y_{i}^{c,m}\) in an attempt to highlight the fact that refugee i belongs to cluster c. With regard to \(\alpha _{c}\), one important caveat should be made that we, in general, cannot simply put cluster-specific dummies in a bid to control for \(\alpha _{c}\) when \(y_{i}^{c,m}\) is binary because of the incidental parameters problem (Neyman and Scott 1948; Hall and Severini 1998).Footnote 14 This is why nonlinear multi-level models should be used for the present study, which is discussed in full detail in Sect. 4 and Appendix A.1.

3.3 Identification conditions

Three conditions are required to properly identify the heterogeneous impacts of the September 11 attacks on refugees’ labor market outcomes. First, it should be tenable that overall labor market conditions in the U.S. remained stable during the sample period of four years. If fulfilled, this condition makes it more persuasive that no other serious shocks affecting refugees’ labor market outcomes occurred, aside from the September 11 attacks. The descriptive statistics from the U.S. Department of Labor depict that the unemployment rate slightly increased starting at the beginning of 2001 due to the recession in the U.S.: however, during the sample period, the difference between before and after the attacks was less than one percentage point. Therefore, the first condition is not considered to be seriously violated.

Second, the primary aim of this study is to address whether the impacts of the September 11 attacks on refugees’ labor market outcomes were heterogeneous depending on a refugee’s ethnicity. Therefore, the time trends for each ethnic subgroup before the September 11 attacks should not be significantly different, similar to the common (pre-treatment) trend requirement for the difference-in-differences approach. If validated, this common trend, as a necessary condition, makes it more plausible that the heterogeneous impacts were absent prior to the attacks, and hence they, if found significant after the attacks, can be convincingly attributed to the attacks.Footnote 15

Third, as previously mentioned in Sect. 3.1, the fact that the individuals investigated by this study are not economic migrants but refugees plays an important role as an identification condition. Since refugees are forcibly displaced, their decision function of leaving their home countries is generally less sensitive than that of economic immigrants. Such a rigid decision function makes it probable that refugees that arrived before and after the attacks are not much different in terms of both observables and unobservables. Hence, the typical selection issue is not expected to be much serious in the context of this study.

4 Econometric methods

4.1 Non-independence caused by cluster-specific heterogeneity

As previously mentioned in Sect. 3, it is highly plausible that refugees composing the sample are not independent for the reason that they form unique clusters in their resettlement regions, interact with their cluster members, and share some cluster-specific unobservable factors. If so, naïve single-level approaches, merely assuming independence among observations and not allowing for cluster-specific heterogeneity, can lead to biased estimates. For such a reason, a multi-level approach is essential in this study.

For multi-level analyses, suppose the ith observation in the overall sample is the jth observation (i.e., level-one) in the cth cluster (i.e., level-two), while there are C clusters in total, respectively, having \(N_{c}\) observations.Footnote 16 Using the binary employment outcome at 90 days after arrival as its main dependent variable, this study uses the following two-level specification:

$$\begin{aligned} \Pr (y_{jc}=1\mid \mathbf {x}_{jc},\alpha _{c})=\Pr (y_{jc}=1\mid \mathbf {v}_{jc},\mathbf {i}_{c},\alpha _{c})=E(y_{jc}\mid \mathbf {v}_{jc},\mathbf {i}_{c},\alpha _{c}), \end{aligned}$$
(3)

where observable covariates \(\mathbf {x}_{jc}\) are partitioned into \(\mathbf {v}_{jc}\) and \(\mathbf {i}_{c}\) for reasons to be elaborated in a later section. \(\mathbf {v}_{jc}\) is a vector of level-one covariates varying within clusters, while \(\mathbf {i}_{c}\) is that of level-two covariates invariant within clusters. Most importantly, \(\alpha _{c}\) is cluster-specific heterogeneity in the form of random intercepts. Unlike \(\mathbf {v}_{jc}\) and \(\mathbf {i}_{c}\) that are observed, \(\alpha _{c}\), cluster-specific fixed effects commonly shared by all the members of a certain cluster c, is not directly observed in the data per se, and it substantially differs depending on what to cluster over. Before discussing what to cluster over, it is worthwhile pinpointing how clustering affects estimations.

Non-independence among observations caused by clustering affects two distinct aspects of estimations. First, if unobserved cluster-specific heterogeneity \(\alpha _{c}\) is correlated with covariates \(\mathbf {x}_{jc}\), which implies \(E\left( \alpha _{c}|\mathbf {x}_{jc}\right) \ne 0\), its existence concerns the estimation of coefficients. Hence, in such a case, ignoring the presence of \(\alpha _{c}\) leads to omitted variable bias and causes inconsistent coefficient estimates. This issue can be circumvented by directly modeling clusters and including their fixed effects \(\alpha _{c}\). However, in the case of nonlinear response models, another complication arises in the sense that we, in general, should not simply put cluster dummies in a bid to control for \(\alpha _{c}\) because of the incidental parameters problem (Neyman and Scott 1948). This difficulty is further discussed in Appendix A.1.

The second aspect that needs attention is the existence of within-cluster error correlation, which affects variance estimates. If observations are clustered, which can cause errors to be non-independent, the usual variance formula underestimates the true variance matrix, assuming positive within-cluster error correlation (Cameron and Trivedi 2005). Pertaining to this matter, the solution is simpler: the variances of the regression parameters need to be adjusted in consideration of clustered errors.

The previous two paragraphs can be summarized as follows. If \(\alpha _{c}\) is a pure random effect uncorrelated with covariates \(\mathbf {x}_{jc}\), then only the variances of the regression parameters need to be adjusted, and a single-level approach is not much problematic. However, if \(\alpha _{c}\) is correlated with \(\mathbf {x}_{jc}\), then the regression parameters based on the omission of \(\alpha _{c}\) are inconsistently estimated, and a multi-level approach, which allows for the presence of \(\alpha _{c}\) in a true model and considers \(E\left( \alpha _{c}|\mathbf {x}_{jc}\right) \ne 0\), is necessary. On the positive side, whether \(\alpha _{c}\) is correlated with \(\mathbf {x}_{jc}\) or not is testable: the test results are discussed in Online Supplement B.2.

4.2 What to cluster over

As described in the previous section, non-independence among observations caused by clustering necessitates considering two distinct fronts—coefficient estimates (affected by \(\alpha _{c}\)) and variance estimates (affected by within-cluster error correlation). The former aspect should be dealt with by directly modeling clusters (i.e., see (1) and (2)), and the latter one should be taken care of by using cluster-robust variance matrices. While both econometric measures require a clear-cut decision on what to cluster over, there is no broadly accepted guidance encompassing both. Thus, it is often not clear what to cluster over and how to define clusters. Moreover, there is not even a formal test of the level at which to cluster.

Therefore, empirical thoughts should play a role in determining what to cluster over. To begin with, what can cause the existence of \(\alpha _{c}\) should be taken into account. As mentioned in Sect.  3, since the refugees investigated by this study do not have any family members who had already settled down in the U.S. and who could thus assist in their resettlement, the importance of considering compatriot-based networks and enclaves should be more distinguished, and the unobserved features of such networks that can affect \(y_{jc}\) are expected to lead to the existence of \(\alpha _{c}\). Therefore, concerning how to model clusters and their fixed effects \(\alpha _{c}\), which is one of two factors broaching the subject of what to cluster over, this study assumes that refugees i) from the same home country and ii) being assigned to the same resettlement city form their unique networks and interact with existing enclaves of their compatriots in that city. This is in accordance with how clusters are defined in Beaman (2012) and can be expressed as follows:

$$\begin{aligned} \Pr (y_{imh}=1\mid \mathbf {v}_{imh},\mathbf {i}_{mh},\alpha _{mh})=\Pr (y_{jc}=1\mid \mathbf {v}_{jc},\mathbf {i}_{c},\alpha _{c}). \end{aligned}$$
(4)

In (4), h denotes a refugee’s home country, and m represents a refugee’s resettlement city in the U.S.. c is a refugee’s cluster defined by the unique combination of h and m (i.e., clustering at the city-nationality level). This decision on what to cluster over is also supported by the argument of Topa (2001) and Damm (2014) that there are residence-based job information networks stratified by where a nonnative comes from.

The second aspect to be considered concerns within-cluster error correlation, the issue of which sometimes cannot be completely resolved even by modeling clusters and including their fixed effects \(\alpha _{c}\).Footnote 17 In this study, the same logic described above in (4) is applied to how to adjust the variances of the regression parameters as well. On the other hand, we, however, cannot rule out the possibility that refugees form more aggregated networks, for example, based on whether they come from the same continent given that they reside in the same resettlement city. This leads to the likelihood that within-cluster error correlation is made within bigger, more aggregated clusters (e.g., clustering at the city-continent level) although \(\alpha _{c}\) itself is only caused by subordinate lower clusters. Hence, this plausible error correlation within more aggregated clusters is taken into consideration in Online Supplement B.3.2 for a robustness check. It is motivated by the fact that different assumptions (i.e., different clustering levels) can be made about each of \(\alpha _{c}\) and within-cluster error correlation: it is further encouraged by the suggestion of Pepper (2002) and Cameron and Miller (2015) that the consensus, when it comes to the use of cluster-robust variance matrices, is to be conservative by using bigger and more aggregated clusters.

4.3 Nonlinear multi-level approaches for clustered observations

Once what to cluster over has been determined, the next step is to decide which multi-level model to use, and in doing so, the nonlinear feature of the primary dependent variable of this study should be taken into consideration.Footnote 18 As previously discussed, non-independence caused by clustering affects two aspects of estimations—coefficient estimates through \(\alpha _{c}\) and variance estimates through within-cluster error correlation. However, economists have been mainly focused on the latter without paying enough attention to the former.

When the ith observation in the overall sample is the jth observation in the cth cluster, the most general specification for a binary multi-level approach is

$$\begin{aligned} \begin{aligned} y_{jc}^{*}&=\mathbf {x}_{jc}^{\prime }\varvec{\beta }+u_{jc}\\&=\mathbf {v}_{jc}^{\prime }\varvec{\delta }+\mathbf {i}_{c}^{\prime }\varvec{\eta }+u_{jc}\\&=\mathbf {v}_{jc}^{\prime }\varvec{\delta }+\mathbf {i}_{c}^{\prime }\varvec{\eta }+\alpha _{c}+\varepsilon _{jc}, \end{aligned} \end{aligned}$$
(5)

where \(y_{jc}^{*}\) is a latent continuous variable—all the other notations are the same as defined above. Whether \(y_{jc}^{*}\) crosses a threshold or not decides the observed binary outcome \(y_{jc}\) in the following way:

$$\begin{aligned} y_{jc}=1[\mathbf {v}_{jc}^{\prime }\varvec{\delta }+\mathbf {i}_{c}^{\prime }\varvec{\eta }+\alpha _{c}+\varepsilon _{jc}>0], \end{aligned}$$
(6)

where \(1[\cdot ]\) refers to the indicator function. Multi-level approaches are distinguished from single-level approaches in the sense that they decompose \(u_{jc}\) into \(\alpha _{c}\) and \(\varepsilon _{jc}\) and do not set \(\alpha _{c}=\alpha \) for those belonging to different c.

Generalized linear models (GLM), which can be applied to an entire class of models with dependent variables following a distribution from the exponential family, can be extended to analyze multi-level clustered data as shown below:

$$\begin{aligned} \Pr (y_{jc}=1\mid \mathbf {v}_{jc},\mathbf {i}_{c},\alpha _{c})=E(y_{jc}\mid \mathbf {v}_{jc},\mathbf {i}_{c},\alpha _{c})=g(\mathbf {v}_{jc}^{\prime }\varvec{\delta }+\mathbf {i}_{c}^{\prime }\varvec{\eta }+\alpha _{c}), \end{aligned}$$
(7)

where g refers to link functions (Neuhaus and Kalbfleisch 1998; Neuhaus and McCulloch 2006; Brumback et al. 2010). The model (7) above, which is often called a generalized linear mixed model (GLMM), stands on the assumption that the effects of level-one covariates \(\mathbf {v}_{jc}\) are the same across all clusters; hence, \(\alpha _{c}\) solely captures cluster-specific heterogeneous features.Footnote 19

Concerning the choice of g in (7), the cumulative distribution function of \(\varepsilon _{jc}\) in (6) should be considered. If probit is used as a link function based on the assumption

$$\begin{aligned} \varepsilon _{jc}\mid \mathbf {v}_{jc},\mathbf {i}_{c},\alpha _{c}\sim N(0,\sigma _{\varepsilon }^{2}), \end{aligned}$$
(8)

(7) leads to

$$\begin{aligned} \Pr (y_{jc}=1\mid \mathbf {v}_{jc},\mathbf {i}_{c},\alpha _{c})=\Phi (\mathbf {v}_{jc}^{\prime }\varvec{\delta }+\mathbf {i}_{c}^{\prime }\varvec{\eta }+\alpha _{c}), \end{aligned}$$
(9)

where \(\Phi [\cdot ]\) is the standard normal cumulative distribution function. If logit is to be used as an alternative link function in place of probit, we can replace \(\Phi [\cdot ]\) with \(\Lambda [\cdot ]\), which refers to the logistic cumulative distribution function. By the same token, if the linear probability model (LPM), which does not use a cumulative distribution function, is to be used, we can simply substitute the identity link for \(\Phi [\cdot ]\).Footnote 20

Another key decision to be made concerns which model to use between two dichotomized multi-level models—the fixed effects (FE) model and the random effects (RE) model. As mentioned above, multi-level approaches are distinguished from single-level ones in that they consider \(u_{jc}=\alpha _{c}+\varepsilon _{jc}\) and do not set \(\alpha _{c}=\alpha \), and different assumptions on the correlation between \(\alpha _{c}\) and covariates lead to those two different models.

The random effects model, which regards \(\alpha _{c}\) as purely random intercepts whose distribution does not depend on any covariates, uses both within- and between-cluster variations and accordingly enables estimating the effects of both individual-specific level-one variables \(\mathbf {v}_{jc}\) and cluster-invariant level-two variables \(\mathbf {i}_{c}\). However, the required assumption that cluster-specific unobservables are not correlated with regressors is very restrictive and often implausible, and the violation of this assumption leads to inconsistent estimates. On the other hand, the fixed effects model, which also regards \(\alpha _{c}\) as random unobservables but allows for the possible correlation between \(\alpha _{c}\) and covariates, can provide unbiased estimates of level-one variables \(\mathbf {v}_{jc}\). However, the fixed effects model is less efficient than the random effects model and cannot provide estimates for cluster-invariant level-two variables \(\mathbf {i}_{c}\) because it only uses within-cluster variations. In a nutshell, each of these two models takes an opposite stand concerning the common econometric trade-off between efficiency versus consistency. Considering each model’s advantages and disadvantages, this study uses both models. Full details on nonlinear multi-leveleconometric methods are discussed in Appendix A.1.

5 Estimation results

5.1 The impacts of the September 11 attacks on refugees’ employment

Before starting a full discussion on main estimation results, I present two diagnostic tests and their results that adduce convincing evidence on why the multi-level approach is indispensable for this study. For the sake of brevity, their details are discussed in Online Supplement B.2.

Table 1 compares estimates from different model specifications based on the Chamberlain–Mundlak’s correlated random effects probit model, while Table 2 makes comparisons of those based on the conditional logit fixed effects model. Note that the estimated coefficients in Tables 1 and 2 are different from marginal effects due to the nonlinear nature of both models. The signs of coefficient estimates, however, show whether a certain regressor has a positive or negative marginal effect on the response probability (Wooldridge 2003). Therefore, a discussion based on the signs and statistical significance of coefficient estimates is presented first, and it is followed by explanations on marginal effects and odds ratios in later sections.

Table 1 CRE—Coefficient estimates on employment
Table 2 CML FE—Coefficient estimates on employment

Above all, Tables 1 and 2 show apparently that the coefficient estimates of the September 11 attacks variable (i.e., Post 9/11) are significantly negative, meaning that the attacks appear to have lowered refugees’ employment probability significantly. Moreover, the magnitudes, in both Tables 1 and 2, are far more substantial than those of the English level 3 variable and of the summer variable, two other important factors arguably affecting refugees’ labor market outcomes.

Column 1 of Tables 1 and 2 already contains a wide range of control variables such as basic demographic information, education levels, English levels, and job market experiences before coming to the U.S.. However, the interaction terms between the September 11 attacks and the ethnicity variables are omitted.

Column 2 of Tables 1 and 2 additionally includes the interaction terms between the September 11 attacks and the ethnicity variables, the baseline reference group of which is African refugees. The inclusion of these interaction terms considerably increases the estimated magnitudes of the September 11 attacks by 67% in Table 1 and 74% in Table 2, respectively. This is because the inclusion of such interaction terms makes the coefficient estimates reflect the impact of the September 11 attacks specifically to African refugees. Moreover, although all interaction terms show positive signs, their magnitudes and statistical significance remarkably vary, implying highly heterogeneous moderation effects depending on a refugee’s ethnicity.

Columns 3 and 4 of Tables 1 and 2 intend to check whether the impacts of the attacks differ depending on a refugee’s religion. The attacks-religion interaction terms, not fully shown for the sake of brevity, are both marginally and jointly insignificant when included without (Column 3) and with (Column 4) the attacks-ethnicity interaction terms.

This insignificance of the attacks-religion interaction terms is understandable, since job seekers often are not necessarily required to disclose their religion while searching for a job. On the other hand, ethnicity is, of course, easily exposed through many channels, the most explicit one of which may be visual identification and names, as shown by the experiment of Widner and Chicoine (2011). Moreover, race and gender information is officially required to be collected under the relevant law in the U.S., while religion information is not mandated.Footnote 21 Therefore, it can be concluded that a refugee’s religion neither intensifies nor alleviates the impacts of the September 11 attacks. Instead, the observed heterogeneous moderation effects can be solely attributed to a refugee’s ethnicity.

On the other hand, the summer variable, a seasonal shock to refugees’ labor market, also seems to have negatively affected their employment. This is unambiguous, as many employment decision makers take vacation during the summer, which delays the recruiting process. Interestingly, the interaction terms between the summer and ethnicity variables, not shown for the purpose of brevity, turn out to be both marginally and jointly insignificant when tested with and without the attacks-ethnicity interaction terms. Taken together, these results suggest that summer was an equally negative shock, whereas the September 11 attacks, due to their contextual reasons, were not equally adverse for all refugees.

Table 3 compares coefficient estimates from different models based on the final specification.Footnote 22 Note that Columns 2, 3, and 4 of Table 3 cannot be directly compared, since the link function of each model is different. However, both of Columns 1 and 2 are based on probit and thus are directly comparable. The key difference resides in the fact that Column 1 is from a naïve single-level approach, whereas Column 2 is from a multi-level approach. As the substantial differences between Columns 1 and 2 in Table 3 and the results of the Mundlak test in Table 1 show, it is apparent that the naïve single-level approach results in biased estimates, primarily underestimating the coefficients of the attacks-ethnicity interaction terms and failing to show clear heterogeneity across ethnic subgroups. On the other hand, all the other multi-level estimates, despite different link functions, consistently show evident heterogeneity depending on a refugee’s ethnicity, with similar statistical significance.

Table 3 RE, CRE, CML FE, and LPM—Coefficient estimates on employment

The estimates from all multi-level models in Columns 2, 3, and 4 of Table 3 suggest that Asian and European refugees were significantly less affected by the negative impacts of the September 11 attacks compared to African refugees. However, such significant differences are not observed in the case of Middle Eastern and South American refugees. In sum, the multi-level estimates imply that only Asian and European refugees were significantly cushioned by their ethnic backgrounds from the post-attack anti-refugee sentiments.

The heterogeneity of these interaction terms necessitates separately examining the marginal effects of the September 11 attacks for each ethnic subgroup. Columns 1 and 2 of Table 4 present the average marginal effects (AME) of the September 11 attacks on a refugee’s employment probability. The marginal effects in Column 1 are estimated by the Chamberlain–Mundlak’s correlated random effects probit model, and those in Column 2 are estimated by the linear probability model with cluster fixed effects.

Table 4 Marginal effects and odds ratios of the September 11 attacks on employment

A huge external shock to the labor market of refugees resettled in the U.S., the September 11 terrorist attacks significantly lowered African refugees’ employment probability by 17 to 22% points. Furthermore, the attacks reduced Arab refugees’ employment probability by 15 to 19% points. In the case of European refugees, the null hypothesis of no impact cannot be rejected with the magnitudes very close to zero due to the strong ethnicity-based moderation effect.

The more surprising finding resides in the fact that the September 11 attacks, against common expectations, substantially increased Asian refugees’ employment probability by 11–13% points. Despite such substantial magnitudes, the lack of statistical significance is not unusual and can be attributed to the fact that the average marginal effects of the attacks are determined by the sum of two estimates with opposite signs. Moreover, according to Greene (2007), the inference on heterogeneity should be made not with marginal effects but with coefficients.Footnote 23 Therefore, it should be noted that the heterogeneous impacts stemming from the September 11 attacks are already corroborated in Table 3.

Estimates from the conditional logit fixed effects model are shown in Column 3 of Table 4 in the form of odds ratios. The odds ratio of the September 11 attacks is 0.23 and 0.31 for refugees from Africa and the Middle East, respectively. This means that the odds of being employed significantly decreased by a factor of 0.23 and 0.31 in multiplicative terms. For the other ethnic subgroups, the results are statistically insignificant. However, the odds ratio for Asian refugees, albeit insignificant, suggests that the September 11 attacks, when all other important covariates are controlled for, made Asian refugees’ odds of being employed 2.78 times higher. The odds ratio for European refugees is close to one, which means almost no impact in multiplicative terms.

Based on these estimation results, it can be concluded that the September 11 attacks had varying impacts on each ethnic subgroup of refugees seeking employment in the U.S. Due to the context of the attacks, their negative impacts on employment were significant only in the case of African and Arab refugees. A plausible explanation for such heterogeneous impacts is that the September 11 attacks triggered increased animosity selectively against African and Arab refugees due to their ethnic similarity to the terrorists. Also, employers may have perceived hiring African or Arab refugees as risky and costly, due to either security concerns or uncertainty over the permanency of their stay in the U.S.. Therefore, the shock of the attacks was not equally negative for all refugees, and the degrees of post-attack prejudice and discrimination must have differed across ethnic subgroups. Figure 1 sums up these findings in the form of changes in predicted employment probabilities for each ethnic subgroup.Footnote 24

Fig. 1
figure 1

Changes in predicted employment probabilities

Above all, the most interesting finding lies in the fact that the September 11 attacks gave Asian refugees better career opportunities, as the attacks appear to have substantially increased their employment probability. Nevertheless, it is unlikely that Asian refugees experienced an advantage over American citizens; instead, they arguably held an advantage over other refugees as their direct job market competitors. As discussed in Sect.  2, Friedberg and Hunt (1995), Peri (2007), Ottaviano and Peri (2012), and Foged and Peri (2016) argue that foreign-born workers’ primary labor market competitors, in developed economies, are not their host populations but other nonnatives. While refugees compete with one another as job seekers, any selectively increased antipathy against African and Arab refugees triggered by the September 11 attacks may have boosted American employers’ preference for hiring Asian refugees for jobs that all refugees had equally competed for prior to the attacks.

Meanwhile, Table 4 leaves one question open concerning why Arab refugees were slightly less affected than African refugees, although they were likely perceived as being more similar to the September 11 hijackers, the group of which was composed of Arabs that outnumbered and only a few Africans. One possible explanation is that the strong social networks and solidarity of Arab Muslims in the U.S. may have exclusively helped their members find jobs, while they were not required to disclose their religion, presumably unfavorable information, to employers when searching for a job. To check the plausibility of this elucidation, the marginal effects of the September 11 attacks on employment were separately estimated for each ethnicity-religion subgroup. As shown in Table 5, it seems that the negative impacts of the attacks affected Arab Muslim refugees far less than Arab non-Muslim refugees. In the case of Arab non-Muslim refugees, the negative marginal effects are even greater than those of African refugees from a magnitude standpoint. In other words, being an Arab Muslim was not a double disadvantage; rather, it may have functioned as a buffer against the negative impacts stemming from the September 11 attacks. Interestingly, such Muslim solidarity does not seem to have worked among African refugees. Further studies regarding this aspect would be worthwhile.

Table 5 Marginal effects of the September 11 attacks on employment

5.2 Interstate regional heterogeneity

Following the September 11 attacks, political researchers found increases in both individuals’ trust in government and conservative political attitudes (Ford et al. 2001; Bonanno and Jost 2006; Hersh 2013). Moreover, such increases were found to be linked to authoritarian political views (Jost et al. 2003; Huddy et al. 2005; Huddy and Feldman 2011). For this reason, it is likely that the Americans most mentally and emotionally affected by the attacks became more Republicans relative to those less affected, and the gubernatorial elections held between 2001 (after the attacks) and 2004 may have reflected this facet. Therefore, in a bid to examine interstate regional heterogeneity in terms of the labor market impacts stemming from the September 11 attacks, the 16 resettlement cities were classified into two types, as shown in Table 16 in Online Supplement B.4. Regions where Democratic candidates won were categorized as liberal areas, while regions where Republican candidates won were categorized as conservative areas. Following this, three-way interaction terms were included for estimating marginal effects in each category.

Table 6 presents the results in light of regional heterogeneity and reveals some noteworthy empirical findings as follows. African refugees were uniformly affected by the negative impacts of the attacks, yet Arab refugees were far more adversely affected in conservative regions than in liberal regions. In other words, Arab refugees were the hardest hit group in conservative regions. For European refugees, the impact magnitudes were universally very close to zero. Finally, the comparative benefit that Asian refugees enjoyed in employment was far greater in conservative regions. Taken together, these results mean that there was remarkable interstate regional heterogeneity when it comes to the impacts stemming from the terrorist attacks.

Table 6 Marginal effects of the September 11 attacks on employment—Regional heterogeneity based on gubernatorial election results

In a bid to confirm such regional heterogeneity, each region’s friendliness to nonnatives was additionally used as another classification criterion. For this purpose, the General Social Survey conducted in 2000 was used, which contained a relevant question.Footnote 25 Based on this, the 16 resettlement cities were categorized into two types (i.e., nonnative-friendly regions versus nonnative-unfriendly regions), and three-way interaction terms were included for estimating marginal effects in each type.Footnote 26 The results, presented in Table 17 in Online Supplement B.4, are in line with what is shown in Table 6.

5.3 The impacts of the September 11 attacks on refugees’ wage levels

Notably, the September 11 attacks had a substantially positive impact on the employment probability of Asian refugees. As refugees compete with one another in looking for employment, it could be argued that Asian refugees benefited from the attacks probably due to employers’ selectively increased antipathy against African and Arab refugees. If that is the case, however, why did European refugees not benefit from the attacks? On the other hand, it is possible that European refugees conceivably benefited from the attacks in terms of employment quality (i.e., higher earnings), not employment probability. To answer this question, whether the September 11 attacks affected refugees’ earnings is investigated.

Before proceeding to the wage analysis to determine whether the September 11 attacks improved (or worsened) the quality of jobs held by refugees, a sample selection problem must be thoroughly checked because refugees with recorded wage observations are selected samples based on values taken by the dependent variable of the selection (i.e., employment) equation, which could lead to the dependence of the employment equation and the wage equation in their unobservables (i.e., error terms) and hence cause wage estimates to be inconsistent if overlooked.

Based on the well-known bivariate maximum likelihood (i.e., often abbreviated as ML) sample selection model and the Heckman two-step estimator from Heckman (1979), the existence of the sample selection problem was checked. The results, not shown for the sake of brevity, are comforting in that both models indicate the absence of the selection issue.Footnote 27 Accordingly, I now proceed to the question of whether the September 11 attacks improved or worsened the quality of jobs held by refugees, which is measured by real hourly wages.

As can be seen in Table 7, only European refugees saw a statistically significant rise in their real hourly wages at the 10% significance level, in addition to the greatest magnitude. Despite its weak statistical significance, which requires further investigations with more observations, the estimate of European refugees, from a magnitude standpoint, is economically substantial in that their 0.491 dollars per hour increase in wages is 37% of a standard deviation of the wage distribution of all employed refugees during the same period. This result indicates that the quality of jobs held by European refugees substantially improved after the September 11 attacks. For Asian refugees, whose employment probability markedly increased after the attacks, their wage level increase experienced the second-highest magnitude but without statistical significance even at the 10% level. Meanwhile, African, Middle Eastern, and South American refugees saw neither a statistically significant nor a substantial increase in their wages. Taken together, these results suggest that the quality of jobs improved only for European refugees after the attacks: such a rise in wages may have offset the otherwise possible increase in the employment probability of European refugees. Asian refugees, on the other hand, did not reap the additional benefit of getting better quality jobs.

Table 7 Coefficient estimates of the September 11 attacks on wage levels

Then, one necessary question worth following up is what caused such different patterns of benefits between Asian and European refugees. A likely cause for the divide is that Asian and European refugees had different preferences in the labor market of refugees—easier and quicker job acquisition for Asian refugees and better quality jobs for European refugees. These heterogeneous preferences can be attributed to the possibility that their expectations for future labor market outcomes differed during the initial resettlement period, in which they received financial aid and diverse care from the US government as legally accepted refugees and thus did not have impending concerns about their livelihoods yet.Footnote 28

If most European refugees tend to have greater expectations of finding better jobs after the initial period of support due to some reasons, such as cultural and racial similarities with the mainstream American society, the level of their reservation wages is likely higher during the initial resettlement period than that of most Asian refugees, who have lower expectations for their future labor market outcomes. Accordingly, some European refugees with sanguine expectations may have intentionally delayed employment if proposed market wages fell short of their high reservations wages. Despite the fact that this wait-and-see strategy takes time, they, with a lack of urgency stemming from the financial aid and diverse care from their host government, may have still preferred waiting for good quality jobs rather than hastily accepting mediocre employment offers that would only make them no longer qualify for the aid and care.

If this explanation is reasonable, the selection into employment of European refugees should be negatively made because such wait-and-see behaviors and intentional delay in obtaining employment of those with higher reservation wages can cause the wage level of employed refugees to be lower than that of non-employed ones.Footnote 29 Furthermore, those with higher reservation wages usually have a higher rate of unobserved capabilities and skills, which are unobservable components in both the employment equation and the wage equation. Hence, their wait-and-see strategy for targeting high paying jobs, which increases wage outcomes but decreases employment probability, can cause the employment equation and the wage equation to be negatively correlated in their unobservables (i.e., error terms), leading to the negative selection into employment. Thus, in order to check the plausibility of this inference, the selection parameter of European refugees (i.e., \(\rho _{1}\)) and that of non-European refugees (i.e., \(\rho _{2}\)) were separately estimated based on a single likelihood function. See Appendix A.2 for estimation-related details.

The result of \(\hat{\rho }_{1}<0\) and \(\hat{\rho }_{2}>0\), shown in Table 8, corroborates that European refugees and other refugees are distinguishable in terms of their contrasting selection patterns into employment. The finding that \(\rho _{1}\) of European refugees is estimated to be negative, albeit statistically insignificant, bolsters the inference mentioned above and explains why European refugees, in contrast to Asian refugees, benefited from the attacks in terms of employment quality, not employment probability. The statistical insignificance of \(\hat{\rho }_{1}\), which may be attributed to a limited number of European refugees in the sample, motivates future research with more observations. On the other hand, \(\hat{\rho }_{2}>0\) with strong statistical significance is expectable in that positive self-selection into employment is considered common and natural in usual labor markets, which means that those with high earnings potential and a higher rate of unobserved capabilities and skills are more likely to take up employment (Smith and Ward 1989; Blau et al. 1990s; Blundell et al. 2007; Olivetti and Petrongolo 2008).Footnote 30

Table 8 Selection parameter estimates

6 Robustness checks

6.1 Unobserved time-varying factors

As was suggested by Kaushal et al. (2007), in order to isolate the effects of the September 11 attacks from seasonal, cyclical factors and time-varying unobservables as much as possible, a group-comparison approach with regression adjustments was additionally used as a strategy for checking the robustness of the estimates of this study. This approach can be regarded as a difference-in-differences (i.e., hereafter abbreviated as DD) procedure with repeated cross-sections. As highlighted in previous sections, the sample observations are not independent but heavily clustered: hence, cluster-specific heterogeneity should still be taken into consideration in using the DD approach.

The empirical strategy required for the DD approach is that the employment probability should become affected for some groups (i.e., treatment group) by the September 11 attacks, while it is expected to be unaffected for other groups (i.e., control group) (Åslundslund and Rooth 2005). Thus, properly defining treatment and control groups is of paramount importance for the DD approach. According to the estimation results shown in Sect.  5, it seems to be a tenable argument that European refugees’ employment probability was neither significantly nor substantially affected by the attacks. Therefore, European refugees can be regarded as controlled observations in the context of this study; on the other hand, African, Middle Eastern, and South American refugees can be thought of as (negatively) treated observations.

However, for the reason that we cannot rule out the possibility that all refugees, even including European refugees, were exposed to the impacts of the attacks after they had been perpetrated, it should be carefully noted that the DD estimate here between the treated (i.e., African, Middle Eastern, and South American refugees) and the controlled (i.e., European refugees) should be construed as the effect stemming from the combination of two factors: the September 11 attacks and the ethnic disadvantage of being Africans, Arabs, or South Americans. Accordingly, in the strict sense, European refugees are not free from the first factor but only free from the second factor, and the treatment should be regarded as the concurrent combination of those two factors. Many previous studies with the DD approach interpreted their DD estimates as the effects of the September 11 attacks: however, unless it can be guaranteed that their controlled observations were completely free from any impacts of the attacks, interpreting their DD estimates as the impacts of the attacks as one sole factor is erroneous.

The DD approach requires two underlying assumptions for identification. First, the effects of time should be common across treated and controlled observations (Cameron and Trivedi 2005). This assumption is testable and should be satisfied when either panel or cross-section data are investigated. Thus, the common time trend assumption was thoroughly checked, and the null hypothesis of the parallel time trend before the attacks could not be rejected. Second, this section uses the DD approach with repeated cross-sections: therefore, the composition of the treatment group and that of the control group should be stable before and after the attacks so that unobserved individual features should play no role. As previously mentioned in Sect. 3, this condition is not expected to be severely violated since the observations of this study are not economic migrants but refugees, whose decision function of leaving their home countries is expected to be far less sensitive than that of economic migrants. Based on these two assumptions, the DD model with a time trend \(\omega _{t}\) is specified as shown below, where post refers to the periods after the September 11 attacks.

$$\begin{aligned}&\Pr (y_{jc,t}=1\mid \text {Treatment},\text {Post},\mathbf {v}_{jc},\mathbf {i}_{c},{\omega }_{t},\alpha _{c})\nonumber \\&=g(\beta _{1}\text {Treatment}+\beta _{2}\text {Post}+\beta _{\text {DD}}(\text {Treatment}\times \text {Post})+\mathbf {v}_{jc}^{\prime }\varvec{\delta }+\mathbf {i}_{c}^{\prime }\varvec{\eta }+{\omega }_{t}+\alpha _{c}) \nonumber \\ \end{aligned}$$
(10)

While all the other notations are the same as defined earlier, g refers to a certain binary link function, such as \(\Lambda [\cdot ]\) (i.e., the logistic cumulative distribution function) in the case of the conditional logit fixed effects model and \(1[\cdot ]\) (i.e., the identity link function) for the linear probability model. While both models support the robustness of the main findings discussed in Sect. 5, this section presents the results based on the latter for the reason that the only direct interpretation of the former is using odds ratios in multiplicative terms, which is less intuitive than additive marginal effects (Kleinman and Norton 2009; Karaca-Mandic et al. 2012).

Table 9 shows the DD estimates between European refugees as controlled observations and African, Middle Eastern, and South American refugees as treated observations. The negative \(\beta _{DD}\) estimate implies the negative effect of the combination of two factors—the September 11 attacks and the ethnic disadvantage of being Africans, Arabs, or South Americans. It suggests that the treatment lowered the employment probability of the treated by 17 % points in additive terms.

Table 9 DD Estimates of the treatment on employment—Treatment group: African, Middle Eastern, and South American refugees

On the other hand, Table 10 shows the DD estimates between European refugees as controlled observations and Asian refugees as (positively) treated observations. Again, the DD estimates capture the effect of the combination of two factors— the September 11 attacks and Asian ethnicity. The loss of statistical significance may be attributed to the greatly reduced number of observations. The positive \(\beta _{DD}\) estimate corroborates that the treatment increased the employment probability of the treated by 13% points in additive terms.

Table 10 DD Estimates of the treatment on employment—Treatment group: Asian refugees

Taken together, the DD estimates in Tables 9 and 10 support the fact that the estimates of this study presented in Sect. 5 are robust when seasonal, cyclical factors and time-varying unobservables are taken into account.

6.2 Additional robustness checks

In addition to unobserved time-varying factors tackled in Sect. 6.1, further issues for robustness are a nonlinear projection for cluster heterogeneity in the case of the Chamberlain–Mundlak’s correlated random effects probit model (i.e., discussed in Appendix A.1.2) and more aggregated clusters with regard to what to cluster over (i.e., discussed in Sect.  4.2). This study establishes the robustness of its main findings in these dimensions in Online Supplement. See Online Supplement B.3.1 for a nonlinear projection and B.3.2 for more aggregated clusters.

7 Conclusion

This paper investigates the impacts of the September 11 terrorist attacks on refugees as job seekers in the U.S.. To the best of the author’s knowledge, this paper is the first study of the attacks’ impacts on refugees that qualify for official refugee status and hence are distinguished from economic migrants or general nonnatives. Moreover, this investigation sheds unprecedented light on the heterogeneity of the impacts by ethnic subgroups, remembering that the attacks were planned by the Islamic extremist group and perpetrated by its African and Arab hijackers. Quantifying the heterogeneity of the impacts is a significant addition to previous studies, which simply presumed that the attacks had homogeneously negative impacts.

In terms of econometric methods, this study, the observations of which are not independent but clustered, shows the enormous importance of considering cluster-specific heterogeneity by using nonlinear multi-level models. The multi-level estimates of this study suggest that the September 11 attacks did not uniformly shock all sub-populations of refugees: rather, they presented a unique, substantial opportunity for Asian refugees and a serious threat to African and Arab refugees. One unexpected finding, on the other hand, is that the employment probability of European refugees remained stable, whereas that of Asian refugees markedly increased after the attacks. However, in terms of employment quality, measured by real wages, European refugees were the only ones who benefited from the attacks. Such different patterns of benefits between Asian and European refugees seem to have been caused by their different reservation wages and heterogeneous expectations for future labor market outcomes, as evidenced by their contrasting patterns of selection into employment during the initial resettlement period.

This research provides the first comprehensive assessment of the heterogeneous impacts of the September 11 terrorist attacks on refugees’ labor market outcomes and establishes an econometric framework for controlling for cluster-specific unobservables by disentangling several econometric complications that are intertwined, such as non-independent and clustered observations, the binary dependent variable, and the incidental parameters problem. In addition, various robustness checks show the insensitivity of the findings of this research. Hence, this study makes meaningful strides toward enhancing our understanding of the heterogeneous labor market impacts of the September 11 terrorist attacks on refugees as job seekers.