1 Introduction

In 2015, the United Nations identified financial inclusion as a key target in its development agenda, with many countries having since focused on increasing access to financial services and credit. Simultaneously, researchers have documented the impact of increased financial inclusion on various economic outcomes, such as economic growth and inequality in various geographic areas (Barajas et al. 2015; Dabla-Norris et al. 2015; Hajilee et al. 2017; Buera et al. 2021). In this work, we focus on the impact of credit access on international migration for a set of African economies. Theoretically, as workers gain access to credit, the borrowed funds generate a positive income effect, which loosens budget constraints and aid workers in covering fixed moving costs, required to migrate to foreign labor markets with higher wages (Rapoport 2002; Marchal and Naiditch 2020). On the other hand, access to credit can help workers undertake investment at home, which increases the opportunity cost of migrating abroad (Bazzi 2017).Footnote 1 Alternatively, if a worker’s decision to migrate to a different labor market is part of an insurance contract to mitigate family risk of domestic income shocks (such as droughts), then access to credit capable of smoothing consumption when shocks arise may reduce the incentive to migrate (Stark and Lucas 1988). Additional considerations that shape the relationship between credit and migration are income and education (Bazzi 2017). For example, individuals with high income levels may not be constrained when it comes to financing moving costs; therefore, access to borrowing would not increase the likelihood of migration. Ultimately, the effect of borrowing on migration is not definitive a priori and warrants empirical investigation. Moreover, understanding the link between migration and credit is crucial to evaluating the effect of programs that increase credit access on labor markets. We thus aim to provide new evidence on the relationship between credit access and migration.

The existing empirical literature on the effect of individual borrowing on migration is limited and sometimes mixed evidence emerges. A negative effect of increased borrowing on domestic migration is identified for Thailand (Poggi 2019), a positive one is found for China (Cai 2020) and India (Singh 2018), while no effect is found in the case of Mexico (Angelucci 2015).Footnote 2 In the case of international migration, a positive relationship is found for the poor and low-skilled, from Mexico to the USA (Angelucci 2015; Kaestner and Malamud 2014); lack of finance is associated with lower propensity to emigrate from Indonesia (Mahendra 2014), while no association is found for the case of migration from Comoros to the richer neighboring French island of Mayotte (Gazeaud et al. 2023).

Our study contributes to the literature on the relationship between migration and credit by expanding the geographical scope and investigating 17 Sub-Saharan economies. Based on the Gallup World Poll (GWP), our study examines emigration desires instead of actual migration and identifies a negative result. Migration desires have been extensively used in the literature in place of actual migration (Dustmann and Okatenko 2014; Van Dalen et al. 2005a; Liebig and Sousa-Poza 2004). Evidence shows that migration intentions and aspirations are good predictors of future actual migration (Van Dalen and Henkens 2008, 2013; Creighton 2013). More generally, according to psychology theories of reasoned action and planned behavior, individual’s intention predicts actual decisions and behaviors (Fishbein et al. 1975; Ajzen 1991; Hale et al. 2002; Ajzen and Fishbein 2005). In fact, a 2012 report by GWP shows that 28.9 percent of the 386 million aspiring migrants ended up migrating (Docquier et al. 2015). Additionally, using migration intentions is less sensitive to policies of receiving countries and country-specific migration costs.Footnote 3

Our results build on the work of Van Dalen et al. (2005a) and Liebig and Sousa-Poza (2004), who use survey data to study the desire to migrate, by going a step further and testing whether having borrowed with interest has an effect on the migration incentive. Our work also builds on the work by Dustmann and Okatenko (2014), who study the impact of various amenities on migration propensity by proposing that access to borrowing is also a factor that keeps workers at home.Footnote 4

Our baseline empirical model assumes that respondents prefer to migrate if their utility from migration exceeds that of staying. If utility, defined over consumption, is linear and consumption is constrained by income and borrowed funds, then a preference for migration will also be determined by income and borrowed funds. However, borrowing itself is determined by access to financial services and associated costs. Our model implicitly assumes that financial services themselves do not provide utility directly to the respondents, except in their ability to provide necessary liquidity through borrowed funds. In terms of data, we proxy access to financial services with bank account ownership and variables documenting the costs of banking services such as fees and cumbersome paperwork. The recent increase in bank account ownership by adults, that reached 34% in Sub-Saharan African countries, is a result of improvements in access to banking services through mobile phones. This, in turn, is mainly driven by advances in mobile technology (World Bank, 2020), a fact arguably exogenous to the desire to move. Our implicit exclusion restriction, that access to financial services should only affect access to credit rather than migration intentions, passes a battery of tests for validity including the cutoff F-statistic suggested by Stock and Yogo (2005) and tests for overidentifying restrictions. Nonetheless, there is the possibility that the uptake of bank accounts is not purely exogenous, violating our exclusion restriction. Therefore, we employ identification through heteroscedasticity, a method developed by Lewbel (2012), as a second strategy to deal with endogeneity. The aforementioned strategy exploits non-spherical disturbances in the residuals of the first-stage regression to construct a valid instrument when exogenous instruments may not be available. As a third identification strategy, we employ a geographical instrument, which calculates the average number of respondents in the country who report having borrowed, excluding the region of the respondent, then use the aforementioned average as an external instrument for credit access. All our empirical models verify the negative association between borrowing and migration desire.

We calculate the marginal effect of having borrowed on wanting to migrate from a bivariate probit model for each country in our sample. Additionally, we show that the negative effect of borrowing on migration is larger for those with primary, secondary or tertiary education, indicating that perhaps access to credit can cement skilled workers’ attachment to the home county. We also show that those with higher income are less likely to report a desire to migrate, especially when they have borrowed. Finally, we find that conditional on being able to borrow, an increase in bank account ownership reduces the desire to move, more so when respondents also report that their assets are safe.

In Sect. 2, we discuss the dependent variable, main explanatory variable, and other controls and present descriptive statistics. In Sect. 3, we discuss the empirical strategies and we present our results. In Sect. 4, we discuss marginal effects. Finally, in Sect. 5 we conclude.

2 Data and variable definitions

The Gallup World Poll, now an annual survey, conducted in about 160 countries worldwide, is representative of more than 99% of the world’s population. The survey polls at least 1000 respondents in each country (2000 for the larger countries) and asks more than 100 questions. Gallup asks the same questions, every year, in the same way in the main questionnaire. All samples are probability-based and nationally representative, allowing for direct country comparisons. However, certain questionnaire waves often carry additional questions. Since we are interested in the relationship between desired migration and access to credit in developing economies, the sample we use is limited to Sub-Saharan countries, for which the survey waves conducted in 2009 and 2010 included questions about the desire to move and whether respondents borrowed. Our sample is dictated by the simultaneous availability of both variables (information on the desire to migrate and history of borrowing), along with the necessary controls, for the same years and countries. In the following subsections, we detail how we construct the variables.

2.1 Migration intentions—dependent variable

The survey asks the respondents, “Ideally, if you had the opportunity, would you like to move permanently to another country, or would you prefer to continue living in this country?”. The answers available are, “Like to move to another country, Like to continue living in this country, and Do not know.” We create a dummy variable we label as “desire to move” based on the aforementioned question such that the variable takes the value 1 if the respondent answers that they would like to move and zero otherwise. We drop observations of respondents who refuse to answer.

2.2 Borrowing in Sub-Saharan Africa—main explanatory variable

The survey asks the respondents, “Over the past year, have you borrowed money from any financial institution, i.e., bank, community savings group/savings club, microfinance institution, moneylender?”. We create a dummy variable, we label as “Borrowed” based on the above survey question, taking the value 1 if the respondent answers “yes” and zero otherwise.

Although we cannot determine which institution respondents received funds from, we can investigate general trends about Sub-Saharan Africans’ lending and borrowing practices. Sub-Saharan Africa’s financial system remains largely underdeveloped and leaves a big bulk of the population outside any formal borrowing system. Therefore, most resort to informal borrowing from friends and family. According to the World Bank, in 2013, only 5% of households were able to secure credit from commercial banks across the continent. Some countries, such as Kenya, Ethiopia, Ghana, and Uganda, rise above the continent average, with more than 10% of adults having borrowed from formal sources. Alternatively, borrowers employ semi-formal lending channels, such as savings clubs or rotation savings and credit associations, which require memberships to receive loans as a mechanism to screen applications (Aryeetey 2005). Data on savings groups are scarce; however, some estimates by FinScope on adult participation in such groups in 2016 and later range around 37% in Uganda, 13% in Kenya, 12% in Nigeria, and 16% in Tanzania (Hoop et al. 2020), while GWP finds that in 2008 the regional median of respondents reporting that they depend “a little” on community savings programs is 20%, while a mere 5% reports to depend on such programs “a lot” (English 2008).Footnote 5 Although such clubs are mainly used to acquire funds for consumption, some farmers utilize the funds to finance working capital (Aryeetey 2005). Occasionally, some borrow from moneylenders, who typically charge high interest rates and require quick repayments, thus representing a solution of last resort. In terms of borrowing from microfinance firms, the industry remains relatively small in the continent (Van Rooyen et al. 2012). The aforementioned facts about formal, semi-formal, and informal lending channels in Sub-Saharan Africa are reflected in GWP survey responses for wave years of 2009 and 2010 (same time period as our sample). For example, when asked an open-ended question about how they would borrow if they needed money to start a business, 42% of Sub-Saharan African respondents reported that they were most likely to resort to their families (Marlar 2010b). 15% reported that they would resort to banks, 10% stated that they would resort to savings club, 4% answered that they would utilize microfinance,Footnote 6 while 2% reported that they would resort to money lenders. It is evident that family and friends remain an important source of borrowing for respondents, compared to formal and semi-formal sources. But a different trend emerges for bank account owners: 50% state that they would go to a bank if they needed to borrow, versus 24% who would resort to family instead. In contrast, for those individuals without a bank account, only 8% would go to a bank, if they needed money, and 46% would ask their family members. This is not surprising; in general, bank loans require bank accounts as a prerequisite (Marlar 2010b). But it is worth noting that for the survey time period, only 1 in 5 Sub-Saharan Africans reports having a banking account. Barriers to open a bank account include multiple forms of identification or documents, such as pay slips (which are hard to come by for those in the informal sector), in addition to often prohibitively high associated costs. For example, in Sierra Leone the cost of maintaining a bank account was the equivalent of that country’s GDP per capita in annual fees (Beck et al. 2008).

The importance of bank accounts for formal loan generation was documented by Allen et al. (2021) in the case of Kenya’s Equity Bank, which significantly expanded its customer base both in KenyaFootnote 7 and in the neighboring countries. To grow its client base, the bank opened branches in low-income and underserved areas and expanded its operations in mobile banking. The bank also reduced minimum balance requirements and maintenance fees for deposit accounts. To encourage new clients to open bank accounts, the bank offered products tailored to their needs, such as small loans, the application for which simply required a National ID card, and accepted a wider range of options for collateral. Allen et al. (2021) find that the bank’s branching strategy increased the probability of residents having both bank accounts and loans, especially for those living in rural and arid areas.

This trend of increasing bank account uptake (in Kenya and elsewhere in Africa) is expected to continue as the rise of Fintech alters how traditional financial services are offered by incumbent banks. The region stands as a global leader in mobile money adoption and usage and advances in Fintech will take it beyond its main use as an e-payment and transfer system (Sy et al. 2019). New services such as micro lending, crowdfunding, and peer-to-peer lending are taking shape in the region. For now, the banking sector in Sub-Saharan Africa remains inefficient and charges extremely high interest rates to compensate for risk.

2.3 Control variables

We control for the demographic characteristics of each respondent. First, we control for marital status and whether the respondent is self-employed. We also control for household income per capita in international dollars, which is calculated by dividing each respondent’s reported household income by the total number of occupants in their household.Footnote 8 Other controls are gender, age and its quadratic term and household size. We also control for education through dummy variables for primary and secondary and tertiary education. Following (Dustmann and Okatenko 2014), we use correspondence analysis to create a wealth index as an additional control, based on questions regarding whether respondents own a TV and a cellphone and whether they had enough money for housing and food over the previous year. We list the questions in detail in Appendix A.

Since the desire to migrate is highly correlated with respondents’ satisfaction with general amenities and domestic conditions (Dustmann and Okatenko 2014), we also control for the degree of satisfaction respondents express toward their home country using three indices. The first index we use is the National Institutions Index, which measures the confidence in national institutions, in particular the military, judicial system, the national government, and the honesty of elections. Secondly, we include a Corruption Index, which reflects perceptions about corruption in both business and government. Finally, we use a Law-and-Order Index that measures the level of security respondents perceive for themselves and their families, based on confidence in local police and past experiences with crime. We also control for respondents’ perception regarding the safety of their assets using a question that asks “If someone wants to start a business in this country, can they trust their assets and property to be safe at all times?”.

The previous literature has shown that receiving remittances can affect migration plans (Piracha and Saraogi 2017; Van Dalen et al. 2005a, b) and spur on development by reducing investment constraints (Stark and Lucas 1988). We thus control for whether respondents received remittances using the following two GWP survey questions: i) “Considering all the activities you engage in to make a living, how much do you depend on receiving money from family members working in other countries?” and ii) “Considering all the activities you engage in to make a living, how much do you depend on receiving money from family members working elsewhere in the country?”. Note that we also control for remittances indirectly by including income in our regressions. When recording income for the survey, respondents are “instructed to include all income from all wages and salaries in the household, remittances from family members living elsewhere, and all other sources.”

Having migrant relatives abroad can also increase migration desire even in the absence of remittances, since it can signal the existence of networks abroad. Therefore, we also control for whether respondents report having a family member abroad by including a dummy corresponding to their answer to the question: “Do you have relatives or friends who are living in another country whom you can count on to help you when you need them, or not?”.

2.4 Descriptive analysis

In this section, we discuss general trends we observe and the characteristics of households in the sample for which we have observations on migration intentions, borrowing, and the control variables listed in Sect. 2.3.Footnote 9

For each country, the share of individuals who report i) a desire to migrate and ii) having borrowed from a financial institution is shown in Table 1. We plot the relation between the two aforementioned variables in Fig. 1 and include a linear trend. A cursory examination of the aforementioned linear trend indicates a positive correlation between the two variables.Footnote 10 Our data point to differences across countries.

Fig. 1
figure 1

Country shares: migration desire and reported borrowing

Examining Table 2, we find that individuals in the highest two quintiles of income, with secondary education or higher, and those employed, represent a higher fraction of the total number of people reporting to have borrowed and willing to migrate.

Table 1 Country shares of dependent variable and main explanatory variable
Table 2 Household characteristics

3 Estimation

In this section, we outline our strategy for estimating the effect of borrowing on migration intentions. We discuss possible biases that can arise and then outline our identification strategies.

We assume that respondents have a linear utility function defined over consumption. Respondents will prefer to migrate if their utility of migrating exceeds the utility of staying in the home country, given their relevant budget constraint. We define the perceived net gain in utility of migration for respondent i in country j as the latent variable \(\triangle m^*_{ij}\). Since utility is linear, it will equal consumption, which in turn is determined by whether the respondent borrowed (\(b_{ij}\)) in addition to observable variables such as income, wealth, and employment status. Utility can also be determined by demographic factors such as age and marital status and by the degree of satisfaction with the home country conditions. The following equations govern the latent net benefit of migration and the binary intention to migrate (\(m_{ij}\)) reported by the respondent, respectively:

$$\begin{aligned} \triangle m^*_{ij}&=x_{ij}' \beta _1 + b_{ij} \alpha + c_j' \delta _1 +u_{ij} \nonumber \\ {\textrm{prob}}[m_{ij}=1]&={\textrm{prob}}[x_{ij}' \beta _1 + b_{ij} \alpha + c_j' \delta _1 +u_{ij}>0], \end{aligned}$$
(1)

where \(\beta _1\) is the vector of coefficients associated with the vector of control variables \(x_{ij}\), including income, wealth, employment, demographic characteristics, and variables measuring a respondent’s satisfaction with the home country. \(\alpha \) is the coefficient associated with borrowing \(b_{ij}\). \(c_j\) are country dummies and \(\delta _1\) is the vector of the associated coefficients. Finally, \(u_{ij}\) is the random error.

Estimating Eq. 1 for migration intentions using ordinary least squares (OLS) is likely to suffer from omitted variables bias and reverse causality issues, causing endogeneity. Respondents’ ability and attitude toward risk (which cannot be measured from our survey data) can affect both borrowing and migration decisions. If an omitted variable, such as ability, is positively correlated with both migration and borrowing, then an OLS estimate of \(\alpha \) would be biased upward since \(Cov(b_{ij},u_{ij})>0\). For example, a respondent with high ability can fill out migration applications and loan applications in a more competent way than a low-ability respondent with identically measured characteristics (such as age and education level), making them both more likely to migrate and more likely to have received a loan. Having the aforementioned example in mind, estimating \(\alpha \) using OLS will lead to an inflated coefficient. On the other hand, an omitted variable can be negatively correlated with borrowing and positively correlated with migration.Footnote 11 In this case, \(Cov(b_{ij},u_{ij})<0\) and the bias in an OLS estimate of \(\alpha \) may be downward. Therefore, the net direction of bias is unknown.

Reverse causality may arise if the intention to migrate affects borrowing decisions. For example, those who wish to migrate might borrow to cover migration costs. Alternatively, those who wish to migrate may avoid borrowing if the cost and obligation of a loan hinder their relocation plans. In such cases, migration intentions, \(m_{ij}\), and borrowing, \(b_{ij}\), explain each other and thus: \(m_{ij}={x'}_{ij}\beta _1 +b_{ij} \alpha + u_{ij}\) and \(b_{ij}=\lambda m_{ij}+x_{ij}'\beta _2 + z_{ij}'\gamma + v_{ij}\). The aforementioned reverse causality leads the OLS estimator to be biased. As Wooldridge (2015) shows, the sign of the asymptotic bias in the OLS estimator would be the same as the sign of \(\frac{\lambda }{(1-\lambda \alpha )}\), if we drop x for simplicity. Assuming that \(\lambda \alpha <1\), then if \(\lambda >0\), implying that migration intentions lead to more borrowing, we will find the OLS estimator of \(\alpha \) biased upward. Therefore, if we expect that \(\alpha <0\), an upward bias would lead us to underestimate the effect of borrowing on migration intentions (OLS estimate of \(\alpha \) would be smaller in absolute terms). On the other hand, if \(\lambda <0\), implying migration intentions reduce borrowing, then the OLS estimator would be biased downward, leading us to overestimate the effect of borrowing on migration when \(\alpha <0\) (estimate of \(\alpha \) would be larger in absolute terms). In the special case of \(\alpha =0\), implying no effect of borrowing on migration desire, the OLS estimate would yield a positive or negative effect of borrowing on migration, depending on the sign of \(\lambda \).

Due to the possible biases detailed above, we test whether the variable “having borrowed” is exogenous. We first apply the Durbin and the Wu–Hausman tests, both with the null hypothesis that the variable “having borrowed” is exogenous. Given the significance of both tests at the 5% level, we reject the null of exogeneity of the borrowing decision.Footnote 12 We thus treat the aforementioned variable as endogenous. We tackle endogeneity through the use of several identification strategies, outlined in detail in Sects. 3.1, 3.2, and 3.3.

3.1 Baseline instrumental variable estimation

In this section, we detail our baseline IV approach. Our ability to identify the causal effect of borrowing on migration intention relies on a set of variables we label \(z_{ij}\), that explain an individual’s ability to borrow without affecting migration intentions. Therefore, we specify a corresponding latent model of borrowing behavior. We assume that the latent variable \(b^*_{ij}\) is the amount a person can borrow which is again determined by the observable characteristics we labeled as \(x_{ij}\). However, borrowing is also determined by access to financial services, which we label as \(z_{ij}\).

We define the latent borrowing variable and the binary variable that is generated from the latent equivalent using the following equations:

$$\begin{aligned} b^*_{ij}&=x_{ij}' \beta _2 +z_{ij}' \gamma + c_j' \delta _2 \ + v_{ij} \nonumber \\ {\textrm{prob}}[b_{ij}=1]&={\textrm{prob}}[x_{ij}' \beta _2 +z_{ij}' \gamma + c_j' \delta _2 \ + v_{ij}>0], \end{aligned}$$
(2)

again, \(\beta _2\) are the associated coefficients for \(x_{ij}\), the vector of control variables. \(c_j\) are country dummies for each country j in our sample and \(\delta _2\) are the associated vectors of coefficients and \(v_{ij}\) is the random error. \(\gamma \) is the vector of the associated coefficients of \(z_{ij}\).

In our discussion of the state of the financial system in Sub-Saharan Africa in Sect. 2.2, we detailed examples of how having a bank account is required in order to apply for small loans and how having low fees and less paperwork can increase access to credit. Therefore, we construct a dummy for those who report having a bank account. Then, we use a survey question asking respondents who reported having no bank account, “Why don’t you have a bank account?” to create a cost of banking dummy variable. The dummy tracks whether respondents report that “bank charges and commissions are too high” or whether respondents report “cumbersome paperwork” is required by banks.

Intuitively, the premise of our identification strategy is that individuals do not derive utility directly from financial services, and therefore, access to financial services should not determine whether someone intends to migrate, except through these services’ capacity to provide some needed liquidity to smooth consumption or finance investment. In other words, we assume that access to financial services per se is not a driver of migration intentions, but the credit and liquidity provided by financial services may affect migration intentions. Therefore, variables that reflect the availability of financial services could theoretically constitute valid instruments. Estimating Eqs. 1 and 2 using two-stage least squares (2SLS) regression with the IV’s described above, we show that in addition to having a significant p value, our F statistic from the first-stage regressions exceeds 10, a threshold suggested by Stock and Yogo (2005) for the reliability of instruments. Additionally, the Sargan and the Basmann tests for overidentifying restrictions alongside Wooldridge’s test are all not statistically significant. Wooldridge’s score test is robust when the errors are not i.i.d., which is the likely case for our data. The results of the 2SLS estimation are presented in Table 4, and the first-stage results are presented in Table 5. Additionally, We run two more tests suggested by Stock et al. (2002) to check whether the IVs are weak, which we discuss in Appendix in B and report the results in Table 9. Nonetheless, overidentification tests do not guarantee the validity of instruments, particularly since our two instruments are highly correlated, with only the bank account ownership variable being highly significant. Therefore, in Sects. 3.2 and 3.3 we conduct the analysis with alternative identification strategies.

Our findings suggest that being able to borrow in the home country reduces the desire to leave the home country and migrate. A direct comparison between the 2SLS estimates in Table 4 and the OLS estimates in Table 3 suggests that the OLS results are biased upward and underestimate the extent to which borrowing reduces migration intentions. Therefore, the 2SLS may have corrected for reverse causality and omitted variables that are positively correlated with both borrowing and migration, as per our discussion in Sect. 3. Given that both our dependent variable and main explanatory variable are binary, a nonlinear model such as a bivariate probit (biprobit) may be more appropriate to fit the data. In Appendix C, we discuss the details of the biprobit model and present the results in Table 10. Our biprobit results confirm the negative association between borrowing and migration intentions identified above.Footnote 13

Economic theory offers many explanations for the negative relationship between borrowing and attachment to the home country. Most likely, borrowing at home reduces the intensity of the “push factors,” which the literature defines as factors relating to the conditions at home that propels workers to leave (in contrast, “pull factors” are related to better conditions abroad, typically higher income). If borrowed funds are used for entrepreneurial activities, then these funds present an opportunity for higher income in the future at home and perhaps more satisfactory working conditions, raising the opportunity costs of migration and reducing its appeal. Similarly, if the borrowing was used to purchase assets domestically, then the attachment to the home country may increase (or perhaps the obligation to pay back debt may increase the likelihood of reporting wanting to stay home). Alternatively, if the borrowed funds were used to smooth consumption, then families no longer need to send a migrant abroad as a strategy to diversify risk and insure against negative economic outcomes.

Even though our IVs pass the standard tests for validity, threats to our identification strategy still exist. First, those who want to migrate may be more likely to open a bank account in the home country in order to send remittances back home. However, remittances are more commonly sent through mobile transfers, that can reach rural areas and typically don’t require further identification mechanisms, such as proof of permanent address. Alternatively, a threat to identification could arise if those who wish to migrate might avoid costly bank account ownership, while those with no migration intention open bank accounts to facilitate their transactions at home. A second threat to identification is that the presence of banks in certain areas may be correlated with other amenities or commercial activities in those areas. In other words, both bank presence (and bank account ownership) and migration intentions can be driven by some other economic factors. This problem may be mitigated by adding country fixed effects and the various indices measuring the country’s quality of institutions, corruption, and law and order. Third, the possibility that having a bank account is correlated with the respondents’ entrepreneurial activity, which in turn can reflect a respondent’s intent to migrate. One way we dealt with this concern is by controlling for whether the respondent owns a business in the country of origin or is fully self-employed. If any of these threats materialize in our data, then our estimates may not be valid and may be biased in either direction. In order to verify our findings, we employ two other identification strategies detailed in the next two sections.

Table 3 Ordinary least squares
Table 4 Two-stage least squares
Table 5 First-stage results of the two-stage least squares from Table 4

3.2 Identification through heteroscedasticity

In order to verify the robustness of our results in Sect. 3, we propose the use of a different instrumentation methodology, à la Lewbel (2012). To illustrate this method briefly, we define m as the outcome variable, b as the endogenous variable, and x as the set of exogenous variables. Next, we write the empirical model using the following two equations: \( m=x'\beta _1+b \alpha +u\) and \(b=x'\beta _2+v\), where the errors u and v may be correlated and \(\alpha \) and \(\beta _1\) are coefficients we wish to estimate. Standard IV techniques, like the one used in the previous section, are based on the assumption that at least one element in x belongs in the b equation, but not in the m equation. Such an exclusion restriction raises the possibility of making the wrong assumption (that perhaps all the elements of x appear in the m equation). Lewbel (2012) suggested a new method, which offers identification through a simple linear 2SLS estimator for the coefficients of interest (\(\alpha \) and \(\beta _1\)). This method is based on exploiting information embedded in the heteroscedasticity of v in order to construct valid IVs. The two steps of the Lewbel (2012) method areFootnote 14:

  1. 1.

    Use OLS to estimate the first-stage residuals given by \(\hat{v}=b-x'\hat{\beta _2}\).

  2. 2.

    Define w as some or all elements in x and then using x and \((w-\tilde{w})\hat{v}\) as instruments, estimate the coefficients \(\alpha \) and \(\beta _1\), using linear 2SLS. Note that \(\tilde{w}\) is the mean of w.

In his initial work, Lewbel (2012) does not show that the identifying assumptions are satisfied for when m or b is binary, but later on, Lewbel (2018) shows that they can be satisfied when the aforementioned variables are discrete, thus applicable to our case.

The key additional assumptions for identification using Lewbel (2012) are that \({\textrm{Cov}}(w,vu)=0\) and \({\textrm{Cov}}(w,v^2)\ne 0\), where w is some or all elements in x Baum and Lewbel (2019). One condition that supports these assumptions is that w must be correlated with the squared error in stage 1 (\(v^2\)) so that the instrument is correlated with the endogenous variable b. To verify this condition in our estimation, we run the Breusch and Pagan (1979) test which has a null hypothesis of homoscedasticity. We reject the null in favor of heteroscedasticity for our vector of w variables (listed in Table 6) in the first-stage residuals. We also find that Hansen J test for overidentification restrictions is insignificant providing further evidence of the validity of the instrument. Results are presented in Table 6 and deliver a negative relationship between migration and credit.

Under identification through heteroscedasticity, the estimated coefficient for the effect of borrowing on migration is -.076 (as can be seen in Table 6, column 1). On the other hand, using the external instruments, the biprobit estimate of the average marginal effect of borrowing on migration is – .0946 and the 2SLS estimate is – .146 (as can be seen in column 1 of Tables 8 and 4, respectively). The absolute value of these three estimates exceeds the absolute value of the OLS estimate (which is – .0385, as can be seen in column 1 of Table 3). Therefore, the various estimation strategies generate different magnitudes for the marginal effect of borrowing on migration. If the Lewbel coefficient is the more accurate estimate, this would indicate that our identification strategy using external instruments in Sect. 3.1 did not sufficiently deal with endogeneity or omitted variables issues, such that the estimated coefficients using external instruments (in 2SLS and biprobit) overestimate the impact of borrowing on migration.

Table 6 IV estimation using Lewbel (2012)

3.3 Geographical instrumental variables

Our third methodological approach in tackling endogeneity is using the “leave one out instrument,” widely utilized in the literature. We instrument for having borrowed, using the average number of individuals who have borrowed in one’s home country, excluding the respondent’s own region. Therefore, the instrument varies at the regional level. Such a geographical instrument has been used by Bai et al. (2019), Dang and La (2019), Ansell (2008) and Acemoglu et al. (2019). The identification strategy assumes that regional differences in borrowing levels are driven by the same shock process, which implies a strong first-stage effect (Bai et al. 2019). In other words, differences in regional borrowing levels have a common cause such that regional variation in average borrowing of other regions would instrument for actual borrowing in the respondent’s region (and the regional variation would in turn identify the effect of borrowing on migration for respondents). For example, if low borrowing is driven by the lack of financial regulation or high interest rates, then the variation in regional borrowing (other than the respondent’s region) would pick up these effects and would be correlated with the respondent’s own borrowing levels. A key identification assumption is that international migration desire in any region is determined by the region’s conditions, rather than variations in borrowing elsewhere. We argue that this condition holds in our data. Indeed, Dustmann and Okatenko (2014) show that international migration desire itself is driven by local conditions of respondents’ regions such as quality of local labor markets and amenities, which we control for.

Nonetheless, a threat to identification can arise if a respondent is aware of regions with better borrowing opportunities and therefore plans to migrate internally instead of internationally, suggesting that borrowing in other regions can reduce international migration but raise internal migration. However, given the large upfront costs of international migration and the higher risk associated with it, internal and international migration cannot be considered perfect substitutes, an observation that would support our identification strategy. Our instrument passes the cutoff F-statistic suggested by Stock and Yogo (2005), exhibits strong first-stage results and delivers similar conclusions to the previous identification methods detailed above. The results are presented in Table 7.Footnote 15

As can be seen in Table 7, column 1, the estimated coefficient for the effect of borrowing on migration is – .35, which deviates in magnitude from our previous estimates in Sects. 3.1 and 3.2 (although all the estimates have a similar negative sign). This deviation casts a doubt on the validity of the geographical IV. Comparing the results of our different identification strategies, we find that identification through heteroscedasticity delivers the weakest estimate for the impact of borrowing on migration (\(-.0758\)), while the marginal effect delivered by biprobit and 2SLS estimates points to a stronger impact of borrowing on migration desire (– .0946 and – .146, respectively).Footnote 16 All identification strategies deliver a stronger effect than that predicted by OLS (– .0385).

Table 7 Two-stage least squares—geographical instrument estimation
Table 8 Average marginal effects from bivariate probit

4 Marginal effects

Using the results of the biprobit model, we calculate the average treatment effect of having borrowed on the desire to migrate and list our estimates in column 1 of Table 8. For the full sample, we find that an individual who borrows is on average about 9.5 percentage points less likely to report a desire to migrate. We also calculate the average treatment effect of having borrowed for each country in our sample and report it in column 1 of Table 8. South Africa has the smallest average marginal effects, while Ghana and Malawi have the highest. Examining the descriptive statistics in Table 1, we find that for Ghana and Malawi, the average number of respondents reporting an intention to move is among the highest. Therefore, the larger marginal effect in these countries may indicate that the impact of credit on migration is stronger when a substantial fraction of the population desires to move.

Next, we focus on the effect of credit access on skilled workers’ migration desires since brain drain is a concern for developing countries (Czaika and Parsons 2017; Bhagwati and Hamada 1974; Miyagiwa 1991; yiu Wong and Yip 1999). Additionally, international migration involves costs for physical relocation and requires information collection and paperwork submission in order to migrate. Such hurdles are more likely overcome by skilled workers rather than unskilled (Mayr and Peri 2008). Therefore, skilled workers may be better able to translate their desire to move into a reality.Footnote 17 We find that despite being more likely to report a positive intention to migrate, when they access credit, this desire decreases. Note that the coefficient on education levels is positive perhaps highlighting the brain drain phenomenon. However, access to credit reduces the migration desire of skilled workers in our data set. We report the average marginal effect of borrowing on the predicted probability of reporting a desire to move when the respondent has primary, secondary or tertiary education in column 2 of Table 8.

Previous studies show that initially as income rises, migration rises, but then at high levels of income, migration decreases (Dao et al. 2018; Clemens 2014; Abramitzky et al. 2013). Intuitively, as income increases enough to finance moving costs, migration increases, but at high levels of income, migration decreases as the opportunity cost of emigration increases (since income can be allocated to investment and consumption smoothing at home Bazzi 2017). In our estimations, as income rises, migration desire decreases since the coefficient on income is negative (as can be seen in Table 10). Our estimations show that borrowing, which provides additional funds, further decreases migration intentions. Therefore, for our sample, respondents may be more likely to use the funds for investment or consumption smoothing rather than migration.

4.1 Banking access and migration

We investigate the extent to which access to banking can change the marginal effect of borrowing on migration desire. We find that the marginal effect of borrowing for those with a bank account is larger in magnitude (i.e., absolute terms) than for those without a bank account. Additionally, for those who feel that their assets are safe and have a bank account, the marginal effect of borrowing has an even larger magnitude than for those who do not. The changes in the marginal effects are shown in Fig. 2 and are based on the biprobit estimates from column 1 in Table 10.

Fig. 2
figure 2

Change in marginal effect of borrowing on reporting a desire to migrate: with/without bank account, assets safe/unsafe*. *Based on biprobit estimates from column 1 in Table 10. Vertical lines represent 95% confidence internals

5 Conclusion and discussion

Our work documents a direct link between borrowing and the desire to migrate in Sub-Saharan Africa. Having borrowed reduces the likelihood of reporting wanting to migrate, especially for those with higher levels of education and those who have a lower level of income initially. The likelihood of reporting a desire to migrate is also lower for those who borrowed while having a bank account and perceive that their assets are safe. We document this direct effect using Gallup survey data for the years 2009 and 2010. In order to deal with endogeneity issues, arising because of possible unobservable factors and/or reverse causality, we implement several identification strategies. All techniques, namely traditional IV estimation, Lewbel’s (2012) instrumental variable method, the “leave one out IV” identification and bivariate probit, confirm the negative effect.

Theoretically, there are different channels through which borrowing could affect migration. First, borrowing offers additional liquidity and as a result eases budget constraints through an income effect. Hence, borrowing can facilitate migration of households or individuals who are liquidity-constrained. However, individuals may use borrowed funds to purchase assets or undertake investments, which can increase the opportunity cost of migration and keep them at home. Of course, there is the possibility that borrowing imposes collateral or other guarantee restrictions, burdening individuals and cementing their presence in the country of origin. Alternatively, if borrowing in the country of origin is used to smooth consumption against negative shocks, then being able to borrow reduces migration. This last channel is discussed in the seminal work of Stark and Lucas (1988), who show that as a part of a bargaining problem or due to altruistic reasons, families invest in helping a member migrate as a way to mitigate risks of domestic markets.Footnote 18 In this case, migration and remittance are a part of a family contract that fills in the gap for the lack of formal insurance mechanisms at home.

Although we cannot ascertain the motivation for borrowing in our data set, given that we find a negative effect on migration, generally it is likely that the funds provided an income boost or liquidity for individuals to smooth consumption, or invest, which in turn can increase respondents’ lifetime utility at home, and consequently reduce migration intentions.

In conclusion, we should note that our analysis comes with limitations. Although the overidentification tests did not reject the validity of our external IVs, they still may be invalid. Therefore, caution should be exercised over the implications of our results. If our instruments are in fact invalid, then the magnitude of the biprobit and 2SLS estimates may be biased (in either direction). The differences in the magnitudes of the coefficients across our specifications in Sects. 3, 3.2, and 3.3 may also be a cause for concern. Future research should identify stronger identification strategies, limiting the possibility of bias in the estimates.