1 Introduction

The extent of concentration of poor people in some neighborhoods of a city is found to have medium- and long-term adverse effects on health outcomes (Ludwig et al. 2011, 2013), job opportunities (Conley and Topa 2002), and well-being (Ludwig et al. 2012) for those exposed to it. The level of poverty concentration in a city is measured by the concentrated poverty index (Wilson 1987; Jargowsky and Bane 1991). This index indicates the share of poor population in a city who lives in neighborhoods with a poverty incidence greater than or equal to a certain threshold (e.g., \(20\%\) is used for identifying high-poverty neighborhoods, whereas \(40\%\) is set for identifying extreme-poverty neighborhoods). Several studies examine the trends and drivers of concentrated poverty across American metropolitan areas. Massey et al. (1991) find that residential segregation is the main driver of spatial concentration of poverty in American urban areas. Quillian (2012) investigates changes in concentrated poverty in American metro areas by developing a model that incorporates residential segregation variations. Iceland and Hernandez (2017) analyze the trends in concentrated poverty in American metropolitan areas over the 1980–2014 period and identify the variation in the segregation of poor people as a key driver of the change in concentrated poverty.

Albeit widely used in the empirical literature, the concentrated poverty index does not fulfill some desirable properties. Andreoli et al. (2021) have axiomatically derived a family of urban poverty indices which overcome some drawbacks of the concentrated poverty index. In particular, these indices produce evaluations of urban poverty that take into account (and make explicit the relation between) aspects of incidence of poverty in the city, the distribution of poverty across high-poverty neighborhoods, and the extent of segregation of poor and non-poor residents between high- and low-poverty neighborhoods.

This paper studies the drivers of levels and changes in urban poverty across American metropolitan statistical areas (MSAs) between 1980 and 2014. To do so, we consider measuring urban poverty with selected indices belonging to the family characterized in Andreoli et al. (2021). Two indices within this family are the adjusted concentrated poverty index and the urban poverty index, which are calculated by setting a poverty incidence threshold for identifying high-poverty (or extreme-poverty) neighborhoods. A further index belonging to this family of urban poverty measures is the Gini index of inequality in neighborhood poverty incidence, which is obtained by considering all neighborhoods in a city, including medium- and low-poverty neighborhoods (i.e., setting a tolerance level to poverty incidence equal to \(0\%\) instead of \(20\%\) or \(40\%\)). Since the Gini index can be broken down into a neighborhood and a non-neighborhood component Rey and Smith (2013), the degree of spatial clustering of poverty across neighborhoods can be assessed directly within the urban poverty measurement framework. Furthermore, the change in the Gini index can be split into different components measuring the convergence and re-ranking of neighborhoods in terms of poverty incidence Andreoli et al. (2021), adding information to the analysis of trends in urban poverty.

The aforementioned indices constitute the measurement apparatus that we use to assess urban poverty for a panel of MSAs over the 1980–2014 period, built by exploiting rich data from the Census and the American Community Survey (hereafter, ACS). We produce a comparative analysis of the potential drivers of concentrated poverty and urban poverty. The partial effects of poverty drivers are obtained from pooled OLS regressions of the various indices of urban poverty concentration on a set of explanatory variables controlling for year, state, and region fixed effects. We also examine the roles of the same explanatory variables in driving the changes in urban poverty concentration by running OLS regressions on the pooled period-to-period changes in the indices of concentrated poverty and urban poverty. A covariate describing gentrification at the census tract level is added in the regression analysis, as gentrification may cause changes in the composition of the population living in historically high-poverty neighborhoods of a city Christafore and Leguizamon (2019).

We further analyze the contribution of changes in the distribution of explanatory variables on urban poverty, net of the effect of changes in the correlation between these variables and urban poverty. To do so, we resort to an Oaxaca–Blinder decomposition (Blinder 1973; Oaxaca 1973), which separates the difference between the estimated average levels of urban poverty concentration in 1980 and 2014 into three different components. We find that American MSAs display substantial heterogeneity in urban poverty patterns and that both the re-ranking and convergence components of urban poverty changes are substantial across MSAs. The spatial component of urban poverty is negligible for the large majority of MSAs but very significant in the largest MSAs, where the clustering of high-poverty census tracts seems to be an issue. Demographic variables, income distribution, and the distribution of housing values within an MSA are major drivers of urban poverty concentration in American MSAs over the period considered.

The remainder of the paper is organized as follows: Sect. 2 outlines relevant literature on consequences and drivers of neighborhood poverty and concentrated poverty, highlights critics of the latter measure, and presents the urban poverty index. Section 3 describes data and the covariates entered in regressions. Results are shown and discussed in Sect. 4. Section 5 concludes.

2 Urban poverty and concentrated poverty

2.1 Poverty in the city: relevant literature

The empirical literature has brought about evidence that growing up or living in poor neighborhoods is detrimental to a wide set of individuals’ lifetime outcomes. From a short-term perspective, place matters for the cognitive development of children, as children living in high-poverty neighborhoods tend to have worse performances in math and reading test scores (Pearman 2019; Sharkey and Elwert 2011; Vinopal and Morrissey 2020; Wolf et al. 2017) and lower verbal abilities Sampson et al. (2008) than their peer from other neighborhoods. When the focus is on the long-term outcomes, Chetty et al. (2016) and Chetty and Hendren (2018) show that longer exposure to high-poverty neighborhoods during childhood has negative casual consequences on the economic opportunities of future generations, as it reduces college attendance and earnings and increases single parenthood rates, whereas Conley and Topa (2002) and Ludwig et al. (2012) find an adverse effect on job opportunities and well-being, respectively. The estimated effects can be explained by the fact that residents in poor neighborhoods tend to be more isolated from the middle-class environment, having a bad connection to the labor market and access to low-quality schools and other public amenities (Jargowsky 2013; Wang et al. 2018). In addition, there is also evidence that living in poor places is associated with poor heatlh outcomes (Ludwig et al. 2011, 2013; Thierry 2020) and increased drug use (Boardman et al. 2001; Nandi et al. 2010). Residents in poor neighborhoods, indeed, are exposed to stressful circumstances which have a negative impact on some biomarkers of biological aging (Lei et al. 2018; Smith et al. 2017; Thierry 2020) as well as to environmental pollution due to the high density of industrial facilities which reduces air quality Ard and Smiley (2021). Besides the deleterious impact on this wide range of individual outcomes, poor places act as a barrier reducing the likelihood of moving toward better neighborhoods and hence toward more opportunities for residents and their offspring (Alvarado and Cooperstock 2021; Huang et al. 2021), contributing to the perpetuation of poverty and inequality of opportunities across generations.

The aforementioned literature suggests that cities that differ in how poverty is concentrated across neighborhoods may differ as well in terms of long-run well-being. Assessing the extent to which poverty is distributed within a city is hence key for performing poverty comparisons across cities. In order to be able to perform meaningful comparisons, two fundamental issues need to be addressed: (i) identify high- or extreme-poor neighborhoods and aggregate information about the distribution of poverty in those places, and (ii) uncover the drivers and determinants of the implied level of urban poverty.

Comparisons of poverty distribution across cities can be carried over by aggregating evaluations of the distribution of poverty across neighborhoods of a city through an index, which is a function mapping information (specific to a city and period) about the distribution of poor and non-poor people across the city neighborhoods into a number, regarded to as the level of urban poverty of that city. When we use the term “city,” we refer to a specific MSA, as defined by the American Census Bureau, observed in a given year. Each city is partitioned into n non-overlapping census tracts, the smallest available statistical units for which a broad set of characteristics are observable from available census data. It is standard to use census tracts partition to define neighborhoods. For every census tract \(i\in \{1,\ldots ,n\}\), we observe the demographic size of the tract, denoted by \(N_i\in {\mathbb {R}}_+\), with \(N=\sum _{i=1}^n{N_i}\) being the overall population in the MSA, and the size of the group of individuals that are poor and reside therein, denoted by \(P_i\), with \(P=\sum _{i=1}^n{P_i}\) being the total number of poor in the MSA.

We represent a city by the corresponding urban poverty configuration, denoted by \({\mathcal {A}}\), which is a collection of counts of poor and non-poor individuals distributed across tracts of the MSA, so that \({\mathcal {A}}=\{P_i^{\mathcal {A}},N_i^{\mathcal {A}}\}_{i=1}^n\). In what follows, we use superscripts to indicate a specific urban poverty configuration only when disambiguation is needed. The analysis of urban poverty that we make is conditional exclusively on the distribution of poor and non-poor individuals in space that is provided by a urban poverty configuration. The ratios \(\frac{P_i}{N_i}\) and \(\frac{P}{N}\) measure the incidence of poverty in tract i in the MSA, respectively. In this paper, the poor are always exogenously identified (for instance by a federal poverty line for equivalent household income) and the focus is on how poor and non-poor individuals are distributed across census tracts. A urban poverty line \(\zeta \in [0,1)\) can be used to identify tracts where poverty is over-concentrated. A tract i is a high-poverty tract when \(\frac{P_i}{N_i}\ge \zeta \). According to the Census Bureau, for instance, \(\zeta =0.2\) and \(\zeta =0.4\) identify high-poverty and extreme-poverty census tracts (ghettos), respectively, i.e., places where poor individuals count for above 20% or 40% of the resident populations. For a given urban poverty line \(\zeta \), there are \(z\ge 1\) tracts where poverty is highly concentrated and \(n-z\ge 0\) where poverty is not concentrated. Assume that tracts are ordered by decreasing magnitude of poverty incidence, so that \(\frac{P_i}{N_i}\ge \frac{P_{i+1}}{N_{i+1}}\), we can hence denote \({\overline{P}}_z=\sum _{i=1}^z{P_i}\) and \({\overline{N}}_z=\sum _{i=1}^z{N_i}\).

The most widely used measure for urban poverty analysis is the concentrated poverty index (hereafter, CP) Wilson (1987); Jargowsky and Bane (1991); Iceland and Hernandez (2017). It measures the proportion of poor people who live in high-poverty census tracts as identified by urban poverty line \(\zeta \). Formally:

$$\begin{aligned} CP({\mathcal {A}},\zeta ) = \frac{{\overline{P}}_z}{P}. \end{aligned}$$

The index produces an aggregate evaluation of poverty that is meant to correlate with the incidence of the "double burden" of poverty, arising from the fact that poverty tends to concentrate in neighborhoods where poverty incidence is high, thus producing the external effects highlighted above. Understanding how concentrated poverty evolves and the determinants of these changes is a fundamental concern even from the policy-maker’s perspective.

Several studies analyzing recent trends of concentrated poverty across MSAs document that concentrated poverty displays substantial heterogeneity across cities and over time, reflecting the socio-economic transformation that occurred in the USA over the last decades. More specifically, after a growth of the share of the poor living in high-poverty neighborhoods occurred during the 1980s, the next decade has been characterized by a sharp decline in concentrated poverty. However, this trend has again reversed in the 2000s, when the Great Recession completely wiped the progress of the previous decade (Jargowsky 1997, 2015; Kneebone et al. 2011; Kneebone 2014), the trend being similar in rural and metro areas Thiede et al. (2018). Different drivers are found to be responsible for these trends. Massey et al. (1991), Massey and Denton (1993), and Massey et al. (1994) find that residential segregation is the main driver of spatial concentration of poverty in American urban areas. On the same line, Quillian (2012) investigates changes in concentrated poverty in American metro areas by developing a model which incorporates variations in residential segregation, yielding similar effects. Racial segregation is found to explain only a small share of variations in trends of concentrated poverty across American cities Iceland and Hernandez (2017), whereas income segregationFootnote 1 and the extent of segregation of the poor is found to be positively and robustly associated with concentrated poverty Dwyer (2012); Bischoff and Reardon (2014); Wilson (1987). Overall, the worsening of the economic circumstances, exacerbated by the Great Recession, substantially contributed to the increase in the incidence of poverty, which is one of the drivers of the recent re-emergence of concentrated poverty in Iceland and Hernandez (2017).

Poverty incidence and the segregation of the poor are important dimensions of urban poverty that fail to be addressed by (but are correlated with) the concentrated poverty index. Andreoli et al. (2021) have highlighted some additional drawbacks of the concentrated poverty index, which have to do with the normative justification of the index. In particular, counterexamples can be constructed in which a movement of poor people from a low-poverty neighborhood toward a high-poverty neighborhood, i.e., a shift which unambiguously increases poverty concentration, can, in fact, reduce concentrated poverty as measured by CP. Andreoli et al. (2021) characterize an urban poverty index consistent with the implications of such transfer. We describe the index and its decomposition properties in the following section. The empirical analysis highlights trends and drivers of levels and changes in the urban poverty index across MSAs between 1980 and 2014.

2.2 Urban poverty measurement

The urban poverty index (hereafter, UP) combines aspects of incidence and distribution of poverty, and it is defined as follows Andreoli et al. (2021):

$$\begin{aligned} UP({\mathcal {A}};\zeta ):= & {} \beta \left( \frac{{\overline{P}}_z-\zeta {\overline{N}}_z}{P}\right) +\gamma \left( \frac{{\overline{N}}_z}{N}\right) \left( \frac{{\overline{P}}_z}{P}\right) G({\mathcal {A}},\zeta )\nonumber \\&+\gamma \left( \frac{N-{\overline{N}}_z}{N}\right) \left( \frac{{\overline{P}}_z-\zeta {\overline{N}}_z}{P}\right) , \end{aligned}$$
(1)

where \(\zeta \in [0,1)\), \(\beta ,\gamma \ge 0\) and \(z\ge 1\). The index value is bounded below by zero, since when \(z=0\) then \(UP({\mathcal {A}},\zeta )=0\). The level of urban poverty measured by UP depends on its parametrization. The parameters \(\beta \) and \(\gamma \) have a normative interpretation. The parameter \(\gamma \) is the weight of the distributional component of urban poverty, which compounds information about the distribution of poverty across high-poverty neighborhoods \(i=1,\dots ,z\) measured by the Gini index \(G({\mathcal {A}},\zeta )\), alongside information about the distribution of poverty across high- and low-poverty incidence neighborhoods. The parameter \(\beta \) is, instead, the weight of poverty incidence. A convenient weighting scheme assigns equal weight to both components, implying \(\beta =\gamma =\frac{1}{2}\). Conversely, by setting \(\gamma =0\) and \(\beta =1\), we identify a specific measure of urban poverty, denoted the adjusted concentrated poverty index \(CP^*\) (hereafter \(CP^*\)). It is defined as follows:

$$\begin{aligned} CP^*(\mathcal {A},\zeta ):=\frac{{\overline{P}}_z-\zeta {\overline{N}}_z}{P} = CP(\mathcal {A},\zeta )-\zeta \left( \frac{{\overline{N}}_z}{P}\right) . \end{aligned}$$
(2)

The index provides an adjustment of CP by a counterfactual level of poverty concentration in high-poverty tracts that can be tolerated according to the normative view expressed by the urban poverty line \(\zeta \).

Lastly, the UP index also depends on the threshold \(\zeta \). An interesting case is when \(\zeta =0\), indicating that distributional concerns about poverty are extended to all neighborhoods of the city. By setting \(\gamma =1\) and \(\beta =0\), and noticing that when \(\zeta =0\) then \(z=n\) and \({\overline{P}}_z=P\), it can be shown that the relevant urban poverty index converges to the Gini index of poverty incidence at the tract level, which can be written as

$$\begin{aligned} UP\left( {\mathcal {A}},0\right) =G({\mathcal {A}}):=\frac{1}{2P/N}\sum _{i=1}^{n}\sum _{j=1}^{n}\left[ \left( \frac{N_{i}N_{j}}{N^{2}}\right) \left| \frac{P_{i}}{N_{i}}-\frac{P_{j}}{N_{j}}\right| \right] . \end{aligned}$$
(3)

The index has interesting decomposition properties which turn out to be useful for the analysis of urban poverty distribution. As shown in Rey and Smith (2013), the Gini index can be additively decomposed into a neighborhood and a non-neighborhood component. This spatial decomposition of the Gini index relies on the specification of a binary spatial weights matrix quantifying the spatial relationship between any two census tracts i and j within an MSA. More specifically, the ij-th element of the binary spatial weights matrix is equal to 1 if tracts i and j are considered close according to a given criterion,Footnote 2 and to 0 otherwise. Once a binary spatial weights matrix has been created, the Gini index can be split into a neighborhood component, which measures the inequality among census tracts that are in spatial proximity, and a non-neighborhood component measuring the inequality among non-neighboring census tracts. MSAs characterized by a positive spatial autocorrelation in poverty incidence tend to display little heterogeneity among neighboring tracts, and hence a neighborhood component which is relatively small compared to the non-neighborhood component of urban poverty. Conversely, a negative spatial autocorrelation in poverty incidence implies a large heterogeneity in neighboring tracts, hence a higher level of the neighborhood component (relative to the non-neighborhood component).

A further advantage of the index in Eq. 3 is that the year-to-year change in urban poverty from \({\mathcal {A}}_t\) to \({\mathcal {A}}_{t+1}\) can be broken down into components measuring different contributions. The quantity \(\Delta G= G({\mathcal {A}}_{t+1})-G({\mathcal {A}}_t)\) measures the change in urban poverty from time t to \(t+1\). However, some aspects that are relevant for assessing the change in urban poverty may be neglected just by observing the variation in the index. Changes in the distribution of poor and non-poor people across census tracts may modify not only the relative difference in poverty incidence between tracts i and j in Eq. 3 but also the relative weight of that difference, which depends on tract population sizes.Footnote 3 To separate the roles of changes in tract population sizes and changes in tract poverty incidences when assessing urban poverty variations, Andreoli et al. (2021) broke down the change in urban poverty into three components, \(\Delta G:=W + R + D\). In the decomposition, W is the demographic component measuring the change in urban poverty due to changes in tract population sizes, while tract poverty incidences are kept fixed at time \(t+1\). A positive (negative) value of W means that the relative weights of more unequal pairs of tracts have increased (decreased) from t to \(t+1\). Component D measures the change in urban poverty owing to the changes in relative disparities between tract poverty incidences from t to \(t+1\), while the ranking of tracts by poverty incidence is kept fixed at time t. A negative (positive) value of D indicates that the relative disparities in poverty incidence between initially high-poverty and low-poverty tracts have become smaller (greater) in \(t+1\). As D measures the extent to which the poverty incidences of initially high-poverty and low-poverty tracts have changed disproportionately, this component can be seen as a measure of convergence (\(D<0\)) or divergence (\(D>0\)) in poverty incidence across tracts.Footnote 4 When tract poverty incidences vary disproportionately, a re-ranking of tracts can occur as the poverty incidence of an initially lower-poverty tract can become greater than that of an initially higher-poverty tract. Component R measures the change in urban poverty due to the re-ranking of tracts from t to \(t+1\). Since the re-ranking component is always non-negative (\(R\ge 0\)) Jenkins and Van Kerm (2016), the re-ranking component contrasts the effect of convergence (\(D<0\)) whereas reinforces the divergence effect (\(D>0\)) O’Neill and Van Kerm (2008).

Exploiting the linear aggregation features of the Gini index, Andreoli et al. (2021) showed that the three-term decomposition of the change in urban poverty can be further decomposed to separate the contribution of a change in poverty incidence at the MSA level, which otherwise remains incorporated in D.Footnote 5 Component D can be factored as the product of two terms, \(D=C\cdot E\). Component E is a measure of convergence (or divergence) in poverty incidence between tracts, just like component D, with the difference that E is calculated by assuming that the poverty incidence of the whole MSA is unchanged from t to \(t+1\). Component E is, therefore, a “pure” measure of convergence in poverty incidence across tracts. C is a scaling factor, expressed as a function of the relative variation in the poverty incidence of the whole MSA from t to \(t+1\).Footnote 6 A value of C smaller (greater) than 1 indicates that overall poverty incidence has increased (decreased) over time. The decomposition of the change in urban poverty becomes

$$\begin{aligned} \Delta G:=W + R + C\cdot E. \end{aligned}$$
(4)

As shown in Andreoli et al. (2021), \(\Delta G\) and each of its components, except C being a scaling factor, can be split into a neighborhood and a non-neighborhood component:

$$\begin{aligned} \Delta G_{N}+\Delta G_{nN}:=W_{N}+W_{nN} + R_{N}+R_{nN} + C\cdot \left( E_{N}+E_{nN}\right) . \end{aligned}$$
(5)

In Eq. 5, \(\Delta G_{N}\) is the change in the neighborhood component of the index and is given by the sum of the neighborhood components of W, R and E (scaled by C), \(\Delta G_{N}:=W_{N}+R_{N}+ C\cdot E_{N}\); the same holds for the change in the non-neighborhood component of the index, \(\Delta G_{nN}:=W_{nN}+R_{nN}+ C\cdot E_{nN}\).

In the rest of the paper, we study the metro-level drivers of the indices \(CP(.,\zeta )\), \(CP^*(.,\zeta )\), \(UP(.,\zeta )\), by setting the urban poverty threshold \(\zeta =0.2\) or \(\zeta =0.4\), and of G, corresponding to the case in which \(\zeta =0\).

3 Data

We use data produced by the US Census Bureau. Data for 1980, 1990, and 2000 are from the decennial census Summary Tape File 3A. Due to anonymization issues, the STF 3A data are given in the form of statistical tables representative at the census tract level. After 2000, the STF 3A files have been replaced with survey-based estimates of the income tables from the ACS, which runs annually since 2001 on representative samples of the US resident population. We focus on three waves of the 5-year module of ACS (estimates based on about 2% of the resident population): 2006–2010, 2010–2014, and 2012,–2016. We interpret estimates from the ACS modules as representative for the mid-interval year, i.e., 2008, 2012, and 2014, respectively. These years roughly correspond to the onset, the striking, and the early aftermath of the Great Recession period (Jenkins et al. 2013; Thompson and Smeeding 2013).

The census and ACS report, consistently across years, information about poverty incidence at the census tract level. Poverty incidence is measured by the number of individuals in families with total income below the poverty threshold, which varies by family size, number of children, and age of the family householder or unrelated individual. Poverty status is determined for all families (each family member is assigned with the same status). Poverty status is also determined for persons not in families, except for inmates of institutions, Armed Forces members living in barracks, college students living in dormitories, and unrelated individuals under 15 years of age.Footnote 7 The census reports poverty counts at the census tract level for various poverty thresholds. In this paper, we consider the households with income below the 100% federal poverty line as poor.

Poverty counts are estimated separately for each census tract in America. Following Andreoli and Peluso (2018), we consider the 2016 Census Bureau definition of MSA to aggregate census tracts into cities. The number and geographic size of the census tracts vary substantially over time within the same MSA. Some census tracts increase in population and are split into smaller tracts. Some other tracts may be consolidated to account for demographic shifts. While raw data allow estimating urban poverty at the MSA level, they cannot be used to perform the decomposition exercise, insofar the definition of neighborhood is not constant over time. We resort to the Longitudinal Tract Data Base (LTDB), which provides crosswalk files to create estimates of census tables based on the 2010 tract boundaries for any tract-level data that are available for prior years as well as in ACS following years Logan et al. (2014). These files make use of re-weighting methods to assign the population in each census and ACS year to the exact census tract boundary defined in the 2010 census. In this way, we can construct a balanced longitudinal database of census tracts for 395 MSAs (those with at least 10 census tracts according to the 2010 census) for 1980, 1990, 2000, 2008, 2012, and 2014. We calculate poverty incidence in each census tract/year and then construct measures of urban poverty and concentrated poverty in high (i.e., where poverty incidence is above 20% of the resident population) and extreme (i.e., where poverty incidence is above 40% of the resident population) poverty tracts.

The incidence of poverty at the MSA level is always below 16% on average in the sample under consideration. More than 93% of these MSAs display at least one census tract with a poverty incidence greater than 20%. The average number of census tracts by MSA that display more than 20% (40%) poverty incidence has more than doubled over 35 years, from 21.6 (5.4) in 1980 to 45.2 (10.8) in 2014. The balanced panel allows to further decompose changes in urban poverty in its underlying components and to study convergence/divergence in urban poverty incidence at the tract level. Census tracts are also geolocalized, implying that measures of the proximity of these tracts can be further produced to separate the neighborhood and non-neighborhood components of urban poverty.

3.1 Poverty

Table 1 Summary statistics of changes in urban poverty concentration, all 395 American MSA

In this study, the dependent variables are the measures of concentrated poverty (CP), urban poverty (\(CP^*\), UP and G), and the components of the change in the urban poverty Gini index (G). These measures are constructed from the observed poverty incidence \(P_i/N_i\) in census tract i, which is calculated by making use of the count of equivalent individuals that are poor in the tract, relative to the demographic size of the tract. When calculating CP and UP, the reference urban poverty thresholds provided by the Census Bureau are used to define high-poverty tracts (if the local poverty incidence is greater than 20%) and extreme-poverty tracts (if the local poverty incidence is greater than 40%). The Gini index of tract poverty incidence is obtained by setting the poverty threshold equal to \(\zeta =0\), implying that all census tracts of an MSA are considered when measuring urban poverty in the MSA. Table 1 provides descriptive statistics (mean, 25th, percentile, and 75th percentile) of the distributions of the changes in CP, UP, and G in five sub-periods of the 1980–2014 period. We also report the components of the change in G in Table 1. While \(\Delta CP\), \(\Delta CP^*\), \(\Delta UP\) and \(\Delta G\) are overall measures of the change in urban poverty concentration, the various components of \(\Delta G\) (C, D, E, R, and W) measure the effects of specific distributional changes in the distribution of poverty within a MSA.

We also consider a measure of convergence alternative to D and E. This measure is obtained by regressing the year-to-year log-change in poverty incidence \(P_i/N_i\) of each tract i of a given MSA on the log-level of tract poverty incidence in the initial period, getting a year-MSA specific measure of \(\beta \)-convergence. \(\beta \)-convergence occurs when the partial correlation between the log-change in tract poverty incidence and the initial log-level of tract poverty incidence is negative Barro and Sala-i-Martin (1992). We estimate the partial correlation coefficient via OLS for each MSA and year-to-year change. The descriptive statistics for the distribution of the partial correlation coefficients estimated for the 395 MSAs in each sub-period are reported in the last row of Table 1, denoted by \(\beta \)(log-log).Footnote 8 These descriptive statistics are negative for each sub-period, suggesting that poverty incidence grew less in census tracts where poverty was already highly concentrated in the initial period.

3.2 Covariates

Explanatory variables are also drawn from STF3A Census files and ACS. We identify two categories of covariates. For non-monetary characteristics, Census and ACS report information about the number of individuals reporting one specific attribute and living in a given census tract. We aggregate information at the MSA level and then standardize population counts by the appropriate population, so that all variables can be interpreted as population shares ranging between 0 and 1. For monetary variables, the Census and ACS report information about the total aggregate value in current dollars of that variable at the census tract level. We aggregate measures at the MSA level and compute per capita or per census tract values. Monetary variables always appear in logs after being actualized at the 2010 prices by using the CPI seasonally adjusted estimates for all US urban consumers (obtained from the Bureau of Labor Statistics).

Table 2 Descriptive statistics of regressors

Census and ACS data come in the form of tabulations by census tract level. We extrapolate information from these tables and aggregate it at the level of the MSA to produce relevant control variables. We construct a dataset of census tracts characteristics for 395 MSAs (those with at least 10 census tracts according to the 2010 census) for the years considered in this study. The sample of MSAs we consider is grouped by region: Northeast (12.66%), Midwest (27.34%), South (39.75%), and West (20.25%). Table 2 reports unweighted means and standard deviations of the variables observed in the sample of MSAs that we use to perform regression analysis.

The covariates can be grouped into five dimensions: demographics, housing, education, employment, and distributive aspects. Demographics (A) includes the total size of the population (expressed in log) and its composition in terms of both racial/ethnic, age, and origin groups (Foreign captures the proportion of non-US citizens and Moved from outside of state the proportion of those who declared to have moved from another US State to the MSA in previous years), which are expressed in terms of shares with respect to the entire population of the MSA.

The second group of control variables gathers housing characteristics (B) of the MSAs. We consider the shares of new and old houses aged less than 10 years (New Houses (10 less yrs old)) or more than 20 years (Old houses (20 plus yrs old)), respectively. These variables capture the aggregate quality of the MSA housing market. We further distinguish houses according to the occupant subject by considering the share of rented (Rented) or vacant (Vacant) houses with respect to the total number of houses. The variable Owner occupied refers instead to the share of houses that are occupied by the owner. The tenure status of the houses is a strong predictor of housing opportunities for low-income, renting households. Lastly, we include variables for the value of owner occupied houses and for the value of rents that are averaged across households (Avg. value house (ln) and Avg. rent (ln)). We also consider the distributions of owner-occupied housing values and rents across neighborhoods. This information allows distinguishing the situations in which low-rent/low-value houses are equally represented across all neighborhoods of the city (in which case the median rent by census tracts would coincide across census tracts) from the situations where the rents/values are highly heterogeneous across neighborhoods (in which case we would expect large variance in median values and rents by census tracts, with some census tracts being more affordable than others). Starting from the observation of the median value/rent at the census tract level, we aggregate distributional features of median housing values/rents across census tracts into median (Median value house by CT (ln) and Median rent by CT (ln)), first quartile of the housing value and rent distribution (p25% value house by CT (ln) and p25% rent by CT (ln)) and dispersion (S.d. value house by CT (ln) and S.d. rent by CT (ln)). All values are expressed in log terms.

The third group of covariates that we examine reports information about education and human capital (C). We separately consider three dimensions of education. First, we consider the proportion of the resident population aged 25 or above in a given MSA that has low education (Less than high school), some qualification at high school level (With high school) and tertiary education or above (With college). These variables measure the human capital composition of an MSA, reflecting both historical trends and residential choices of low and high educated people based on specific characteristics of the labor market and the supply of services and amenities produced at the census tract and MSA level. Second, we consider the share of the population that is actually enrolled in any form of education (Enrollment (any)), to measure of the demand for consumption of education services in the MSA. Third, we introduce indicators for whether the MSA is a college town and a student town. The former (College Town) identifies MSAs where most selective American colleges are located. The selectivity level is measured according to the college tier description used by the Department of Education (DOE) IPEDS database. We consider as college town those MSAs hosting colleges of tier levels equal to 1 or 2, which are associated respectively with Ivy League colleges plus Stanford, Chicago, Duke, MIT alongside other elite schools (both public and private) with a Barron’s 2009 selectivity index of 1. The second indicator (Student Town) identifies the top 20 MSA with the highest number of students enrolled in any college. The number of students refers to the number of IPEDS enrollment (full time and part time) in the Fall 2013 semester.Footnote 9

The employment structure (D) of the MSA is described by the share of workers occupied with managerial positions (Managerial Position) and by the share of workers less than half an hour away from the workplace (Timework). Both shares are computed with respect to the total population.

Lastly, to take into account the distributive aspects (E) of income, poverty, and ethnicity within MSA, we control for average household income in the city (Avg hh income (ln)) as an objective measure of well-being. The distribution of income across census tracts signals the quality of the tracts and their affordability. We use measures in the census and ACS about median income in the census tract and compute measures of the distribution of incomes across census tracts considering the median affluence of the tracts (Median hh income by CT (ln)), the household income for the poorest 25% of the census tracts (p25% hh income by CT (ln)) and a measure of the dispersion of income across census tracts (S.d. hh income by CT (ln)). We also consider information about the poverty incidence in the MSA as a whole (Fraction of poor) and the way poor and non-poor people (according to the 100% federal poverty line) are unevenly represented across the census tracts (Dissimilarity poor).Footnote 10 Finally, we measure the ethnic dimension of segregation across the MSA census tracts by using standard measures of segregation (dissimilarity index) for the Whites, Blacks, Hispanics, and Asians with respect to the overall population, as well as traditional measures of black and white segregation (Dissimilarity white-black).

Table 3 Descriptive statistics of gentrification

A related aspect to the quality and the affordability of tracts is the gentrification, which appears as both the cause and the consequence of the level of urban poverty. Gentrification, induced by the inflow of young, middle-class cohorts into the most affordable and historically high-poverty tracts located in inner cities, displaces poor residents toward traditionally middle-class and low-poverty neighborhoods that become more mixed. As a consequence of such a displacement, most non-gentrifying tracts tend to attract disproportionately more poor than other tracts (rising poverty concentration), although overall, the redistribution of poverty from inner cities toward more marginal neighborhoods makes the distribution of poverty more widespread across neighborhoods. In this paper, we follow Baum-Snow and Marion (2009) and consider the housing value variations at the census tract level to measure gentrification.Footnote 11 More specifically, for each census tract, we compute the variation in the average housing value between years t and \(t+1\) and consider the distribution of these variations at the MSA level. Then, in each period and each MSA, gentrified census tracts are located in the top tercile of the distribution of housing value appreciation. The middle and bottom tercile tracts are defined as stable and declining tracts, respectively. Table 3 reports the gentrification indicators for the 1980–2014 period.

4 Results

4.1 Trends in urban poverty

Table 4 Correlations of \(\beta \)-convergence (log-log specification) and measures of urban poverty concentration, by year-to-year changes

In the last decades, there has been evidence of convergence in poverty across American MSA neighborhoods, with poverty growing everywhere in cities after the Great Recession, but less so in high-poverty neighborhoods. Poverty has been increasingly concentrating into historically middle-class, low-poverty neighborhoods. The set of measures of urban poverty described in previous sections are capable of capturing some relevant features of this secular trend. More specifically, we consider the trends in CP and UP for urban poverty lines at 20% (i.e., \(\zeta =0.2\)) and 40% (i.e., \(\zeta =0.4\)), and the trends in \(CP^*\) and G, which are obtained in correspondence of a specific choice of the parameters \(\beta \), \(\gamma \) and \(\zeta \) in the general formula of UP (i.e., \(\beta =1\) and \(\gamma =0\) for \(CP^*\), and \(\beta =0\), \(\gamma =1\) and \(\zeta =0\) for G). We then calculate the correlations between the year-to-year changes in these measures and the MSA-specific measure of \(\beta \)-convergence in poverty incidence across tracts (see Table 1).

Table 4 shows that correlations are all statistically significant in the last two sub-periods considered, suggesting that all measures of urban poverty agree on the convergence of tracts within MSAs in terms of poverty incidence. However, these correlations are different in terms of magnitude, with mild levels for the general measures of urban poverty concentration (CP, UP and \(CP^*\)) and higher levels for the components of the change in Gini index, especially E and consequently \(D=C\cdot E\). Component E is highly and positively correlated with \(\beta \)-convergence estimates, indicating that MSAs where poverty growth is clustered in low-poverty tracts on average (higher levels of \(\beta \)-convergence), also display relatively low urban poverty. Component R is negatively and significantly related to the extent of \(\beta \)-convergence. MSAs where poverty growth is clustered in low-poverty tracts also display major changes in the map of poverty, with poverty growing proportionally much less, or even decreasing, in high-poverty tracts compared to the growth observed in low-poverty ones. This combination of changes induces a substantial re-ranking of tracts.

Fig. 1
figure 1

Neighborhood and non-neighborhood components of urban poverty levels and year-to-year changes. Note: Inequality changes and its components (R, E and \(D=C\cdot E\)), 1980-2014. Data for 395 MSAs

An advantage of using the Gini index to measure urban poverty is that each component of the change in urban poverty can be split into neighborhood and non-neighborhood components.Footnote 12 Figure 1 shows the box-plots of the distributions of the spatial components of \(\Delta G\), R, D and E. From Fig. 1, we see that convergence in tract poverty incidence occurs both among neighboring census tracts and among non-neighboring census tracts, as the values of \(E_{N}\) and \(E_{nN}\) are mainly negative. However, the re-ranking effect mitigates the convergence process both among neighboring census tracts and among non-neighboring census tracts.

4.2 Drivers of concentrated poverty and urban poverty: Levels

Table 5 Drivers of concentrated poverty and urban poverty levels, pooled sample

We produce comparative evidence about potential drivers of concentrated poverty and urban poverty and study the partial associations of these variables with the levels and changes in urban poverty across American MSAs. Table 5 reports estimates of the effects of the relevant drivers. Values of these indicators are obtained for each MSA and year. The partial correlations of urban poverty with the relevant drivers are obtained from pooled OLS regressions controlling for year, state, and region fixed effects and including a binary indicator for the Great Recession (years 2008 to 2012).

Regression results show that the demographic composition in terms of origin group is correlated with \(CP^{*}\) and UP calculated for both high (20%) and extreme (40%) poverty thresholds. The population distribution by age is correlated with \(CP^{*}\) and UP when the focus is on extreme poverty. Lower levels of urban poverty are associated with a larger share of population aged 25 or above. This may be explained by the fact that people aged 25–64 are more likely to be employed while those aged 65 or above are mainly retired. The education and employment composition, strongly associated with opportunities offered by the labor market, correlate with CP, \(CP^{*}\) and UP for both of the poverty thresholds that we consider. That is, MSAs with higher poverty concentration are characterized by lower shares of high educated population and living in proximity of the workplace. MSAs with higher shares of workers holding a managerial position tend to have higher levels of urban poverty, except when \(CP\left( .,0.2\right) \) is considered.

The distribution of income across census tracts, as well as the features of the housing market, have important implications for UP(., 0.2), \(CP^{*}(.,0.2)\) and G. This evidence can be reconciled with the implications of neighborhood affordability on the geography of poverty. MSAs with a higher median income across census tracts (holding average household income as fixed) display more income mix at the tract level and less inequality across tracts (as the median converges to the average, held fixed). This pattern of income sorting may indicate more widespread access to urban amenities and localized public goods, and hence lower incentives for high- and low-income families to sort unevenly across census tracts.

All estimated models agree that the poverty incidence in the whole MSA and the degree of dissimilarity in the distribution of poor within the MSA are important drivers of urban poverty. While the dissimilarity in the distribution of poor is strongly positively correlated with every measure of poverty concentration, poverty incidence has a positive impact on all measures except G. This is a consequence of the fact that the index G focuses on the inequality in poverty incidence across tracts and such inequality is less emphasized in MSAs where a large fraction of the population is poor. While the impacts of dissimilarity in the distribution of poor and poverty incidence in the MSA on urban poverty are not unexpected, they may be relevant to anti-poverty policy making when the effects of the remaining explanatory variables are controlled. Regression results suggest that the tendency of poor people to distribute unevenly across census tracts in an MSA is influenced only to a small extent by the explanatory variables that can be controlled, like the employment composition or the quality of the housing market.

Table 6 Drivers of concentrated poverty and urban poverty levels, FE estimator

Table 6 reports estimates of the marginal effects of interest based on a FE model. The incidence of poverty at the MSA level plays a major role in explaining urban poverty concentration, irrespective of the measure we use. Variables linked to income distribution mainly affect UP(., 0.2) and G but seem uncorrelated with other measures of urban poverty concentration. Overall, the FE estimates support the findings of the cross-sectional models.

4.3 Drivers of concentrated poverty and urban poverty: Changes

Table 7 Drivers of changes in concentrated poverty and urban poverty

We examine the effects of the same covariates considered in Sect. 4.2 on the pooled period-to-period changes in the measures of concentrated poverty and urban poverty. We expand the model by including controls for gentrification at the census tract level. Table 7 presents the effects of demographics, education, housing, labor market, and income distribution on the relevant urban poverty changes. Overall, we find that demographics, education, and housing are the most relevant (and significant) drivers of changes in concentrated and urban poverty. An increment in the proportion of non-US citizens increases \(CP^{*}(.,0.2)\) and UP(., 0.2), while the effect is statistically insignificant for the other indicators. Meanwhile, a high mobility of citizens from other US states is associated with a lower change in urban poverty. These findings suggest an asymmetry in natives and immigrants’ residential choice who move into an MSA. Ethnic density and segregation play a pivotal role in the changes of the indices. The segregation of Blacks and Hispanics is positively associated with the variations in the indices. Conversely, this finding does not apply to the segregation of Asians, which is negatively associated with changes in CP(., 0.2), \(CP^{*}(.,0.2)\) and UP(., 0.2).

An increment of housing value tends to increase urban and concentrated poverty. Higher housing costs might force poor individuals to concentrate in some areas, creating distinct rich and poor census tracts. Similarly, the higher the average rent at the MSA level, the higher the concentrated and urban poverty indices, while an increase in median rent by CT implies a decrease in the indices. Finally, the population size, the incidence of poverty, and the fact that a significant share of the population has managerial positions reduce concentrated and urban poverty indices.

Andreoli et al. (2021) tested the null hypothesis of spatial independence in the distribution of poverty incidence across census tracts of American MSAs in 1980 and 2014, rejecting the null hypothesis for the large majority of MSAs with a population larger than 300,000 residents for both years. The presence of positive spatial autocorrelation in poverty incidence across tracts, however, is not informative about its implications in terms of uneven distribution of poor people across the tracts. The Gini index decomposition into neighborhood and non-neighborhood components allows separating the contributions of neighboring and non-neighboring tracts to overall urban poverty.

The effects of the drivers of urban poverty on the levels and changes of the two spatial components of the Gini index are in Table 8, models (1)–(4). Few covariates have a significant and stable effect on the levels and changes of the neighborhood and non-neighborhood components of the Gini index. The segregation of the poor has a positive effect on the levels of both the spatial components of the Gini index. This means that a greater degree of segregation of the poor does not generally yield a greater tendency of tracts with similar levels of poverty incidence to cluster. The incidence of poor people is negatively associated with the neighborhood component, suggesting that high-poverty tracts have a greater tendency to cluster in MSAs with higher levels of poverty incidence. Demographic variables, like population composition by origin group and total population size, are positively correlated with the change in the non-neighborhood component, suggesting that a more heterogeneous population composition tends to increase disparities in poverty incidence between non-neighboring tracts within larger MSAs. Other controls, including housing and education, never yield significant effects.

Table 8 Drivers of levels and changes in G: different components

Columns (5) and (6) of Table 8 investigate the re-ranking and convergence components of the changes in urban poverty based on a pooled regression with year fixed effects. When considering the re-ranking component R (see column (5)), the pooled regression model shows that the incidence and segregation of poverty has a negative impact on R, suggesting that a re-ranking of tracts is less likely to occur in the MSAs with higher levels of incidence and segregation of poverty. Besides, we find that the features of the housing stock correlate with the re-ranking component. While a larger proportion of old dwellings is associated with a lower re-ranking component, the proportions of owner-occupied and vacant houses have a positive impact on that component. Finally, demographic variables do not seem to play a significant role in the re-ranking of tracts.

Column (6) of Table 8 shows the drivers of component D, measuring convergence in tract poverty incidence. Our estimates reveal that the explanatory variables are less informative about the convergence component than the re-ranking one. The pooled regression model shows that the incidence and segregation of poverty respectively affect positively and negatively the convergence component. Demographic, housing, education, and labor market variables (except the share of vacant dwellings and the commuting time) do not affect the convergence component. Only the average income level and the income dispersion across tracts are positively correlated with the convergence component.

4.4 Decomposition results

The previous sections illustrate the contributions of different drives of poverty concentration on the trends in concentrated poverty and urban poverty indices. However, this analysis cannot distinguish whether such effects are mainly due to changes of the drivers across MSAs or to changes in the magnitude of association between the drivers and the concentration of poverty over the time interval considered. To answer this question, we use the Oaxaca–Blinder (hereafter, O-B) threefold decomposition technique Blinder (1973); Oaxaca (1973).

The O-B decomposition breaks down the difference between the means of an outcome variable Y calculated for two different groups, say A and B, into three components. To obtain such components, consider the linear model \(Y=\mathbf{X }'\varvec{\beta }+\epsilon \) where X is a set of predictors observed for each group. The O-B decomposition divides the difference in the expected value of Y between the two groups into the sum of three terms Jann (2008), so that \(E\left[ Y_A\right] -E\left[ Y_B\right] =\Delta _E+\Delta _C+\Delta _I\). Component \(\Delta _E=\left\{ E\left[ \mathbf{X }_A\right] -E\left[ \mathbf{X }_B\right] \right\} ' \varvec{\beta }_B\) measures the contribution of group differences in the means of predictors (the so-called endowments), corresponding to the endowments effect. Component \(\Delta _C=E\left[ \mathbf{X }_B\right] '\left( \varvec{\beta }_A-\varvec{\beta }_B\right) \) measures the contribution of differences in the coefficients, i.e., the coefficients effect. Component \(\Delta _I=\left\{ E\left[ \mathbf{X }_A\right] -E\left[ \mathbf{X }_B\right] \right\} '\left( \varvec{\beta }_A-\varvec{\beta }_B\right) \) is the interaction effect of differences in endowments and coefficients between the two groups. As in the study by Iceland and Hernandez (2017), the two groups compared are the selected 395 MSAs with their endowments in the starting year of the considered period (1980) and the same MSAs with their endowments in the ending year (2014). We use the available measures of urban poverty as outcomes. The O-B decomposition is obtained from the perspective of MSAs in 2014, which then play the role of group B in the above-described formulation of the O-B decomposition. The results of the decomposition are shown in Table 9, where also the contribution of each group of explanatory variables (e.g., demographics) to each of the three components (endowments, coefficients, and interaction) is shown.

The endowments effect is the contribution of the differences in the explanatory variables between 1980 and 2014. It expresses the expected difference in predicted 2014 mean outcome (the urban poverty level) if the distribution of endowments across MSAs were that of 1980. The second component, the coefficients effect, measures the contribution of differences in period-to-period regression coefficients. This component indicates that the explanatory variables also differ in the influence they have across periods. The third component, the interaction effect, takes into account that the differences in terms of endowments and coefficients co-occur between 1980 and 2014. Ultimately, this decomposition method allows us to single out the extent to which the MSAs characteristics explain the differences between 1980 and 2014 (the endowment component) while holding as fixed the coefficients component and their interaction with endowments.

Table 9 Oaxaca–Blinder decomposition

We consider the same drivers as in the previous tables, but we group them by categories of interest to simplify the reading of the decomposition results. Unlike the model structure in the previous regressions, we do not include fixed effects for regions and years to avoid zero-variance cases. However, we do include fixed effects for states.Footnote 13

In column (1) of Table 9, concentrated poverty at a 20% poverty line is estimated at \(CP=0.317\) in 1980 and \(CP=0.503\) in 2014. These estimates result in a significant difference of \(-0.186\), which is additively split into the contributions of the three components described above. While the contributions of the endowments and interaction terms are positive, the difference due to the coefficients is negative and large, thus driving the sign of the overall difference observed between the two periods. The contribution of endowments, equal to 0.321, suggests that the overall difference could have been even more considerable if the MSAs had been similarly endowed between the two periods.

Each component is further broken down by the contribution of each group of explanatory variables in the following rows of Table 9. The detailed decomposition makes it possible to study the contribution of each group of variables to the decomposition. First, it can be observed that the difference due to endowments originates mainly from the housing-related variables. However, this difference is not significant. Moreover, each group’s sign, magnitude, and significance vary considerably in terms of coefficient and interaction components. The fixed effects (not shown) almost cancel out this effect with an opposite sign. Overall, the coefficients effect is negative, implying that the influence of the explanatory variables on concentrated poverty changed between 1980 and 2014, contributing to increasing CP.

The effects estimated in the remaining columns have a similar interpretation. As in column (1), the indicators are systematically higher in 2014 than in 1980, resulting in significant negative differences. Endowments account for most of the difference in CP(., 0.4), \(CP^{*}(.,0.2)\), UP(., 0.2), and G(.). Conversely, the coefficients are the main contributors of the changes in CP(., 0.2), \(CP^{*}(.,0.4)\), and UP(., 0.4). In all cases except G(.), the elements explaining most of the difference are significant. The contributions of the different components are not uniform across the various models considered. Endowments’ contribution is negative in all columns except for CP(., 0.2). This result indicates that endowments in 2014 are distributed differently from 1980 and amplify the two periods’ overall difference. Conversely, the coefficients seem to have a positive counterbalancing effect but are sparsely significant.

If we further decompose each component, it appears that only a few groups of variables are significant. When looking at the endowments, several variables contribute negatively except demographics, because the demographics group includes several variables that do not necessarily act in the same direction, creating a sizeable intra-group variability. Moreover, the distribution variables are the sole endowments that are consistently significant and negative across most concentrated and urban poverty measures except CP(., 0.2), playing a major role in endowments’ contribution.

Conversely, several variables are sparsely significant among the sub-components of the coefficients. The impact of housing on the coefficients is negative but not significant. Employment contributes positively to the difference due to the coefficients for CP(., 0.2) but negatively to the difference due to the other indices’ coefficients while being significant in both cases. A large part of the variation due to the coefficients is indeed attributed to fixed effects. This finding indicates a sizeable unexplained gap due to differences at the state level between 1980 and 2014.

Overall, Table 9 shows that the effects of the relevant groups of covariates on urban poverty changes are relatively similar across the urban poverty indices belonging to the family of urban poverty measures. The only exception is the first column, which uses CP(., 0.2) as a dependent variable. We explain these discrepancies in results between CP(., 0.2), often considered the golden rule in the study of urban poverty, and the other indices by the fact that CP disregards the information about the incidence and distribution of poverty across low-poverty neighborhoods.

Conversely, CP(., 0.4) does not show different patterns than UP and \(CP^*\), unlike CP(., 0.2). This pattern is caused by the tolerance level increase from 20 to 40%, therefore addressing a particular subset of census tracts where poverty is highly concentrated. While the group of people included in CP(., 0.2) is relatively large, CP(., 0.4) involves a tiny group of people in extreme poverty. This small group is likely to have specific characteristics that differ significantly from the rest of the population, resulting in endowments substantially determining incidence patterns. On the opposite, CP(., 0.2) is likely to include a larger group of people with less specific characteristics, sharing several similarities with non-poor people. Therefore, it is relevant to examine what the rest of the population experiences to understand urban poverty.

Finally, it is also interesting to compare the results obtained for CP(., 0.4) in column (2) with the results obtained by Iceland and Hernandez (2017). The whole difference is negative over the period, with a strong negative effect of endowments, a positive contribution of coefficients, and a positive interaction term. Our estimated effects for CP largely coincide with those estimated by Iceland and Hernandez (2017), thus validating their analysis.

5 Conclusions

The contribution of this article is twofold. First, we compare the effects of the main drivers of urban poverty concentration on alternative indices of urban poverty concentration. More specifically, the comparison deals with the widely-used concentrated poverty index, CP, and some indices belonging to the family of urban poverty measures, which Andreoli et al. (2021) have axiomatically derived to overcome some drawbacks of CP. We run pooled-regressions to examine the impacts of several variables on the levels of concentrated poverty and urban poverty over the 1980–2014 period. Regression results show that demographic variables and those related to income distribution and housing affect the level of urban poverty concentration; however, their impact changes based on the index we use and the fixed threshold of poverty incidence. Overall, we find that CP, calculated for a \(20\%\) poverty incidence threshold, tends to behave differently from other indices, including CP when the poverty incidence threshold is equal to \(40\%\). The share of poor residents and the degree of segregation of poor people are strongly positively correlated with every measure of urban poverty concentration, suggesting that the impact of such variables is consistent irrespective of the index considered.

Second, we examine the regression results on the pooled period-to-period changes in the measures of concentrated poverty and urban poverty, finding that demographics, education, and housing are the main drivers of changes in concentrated and urban poverty. However, these explanatory variables are less informative for the changes than for the levels of concentrated poverty and urban poverty. To further explore the analysis of changes in concentrated and urban poverty, the Oaxaca–Blinder threefold decomposition is used. Such a decomposition confirms that the choices concerning the index for measuring poverty concentration and the poverty incidence threshold have a role in explaining variations in urban poverty concentration. Indeed, while CP and all the indices belonging to the family of urban poverty measures, for both a 20% and a 40% poverty incidence threshold, indicate an increase in poverty concentration from 1980 to 2014, the effects of the endowments, coefficients, and interaction components differ based on the index and poverty incidence threshold considered. When setting a poverty incidence threshold equal to \(20\%\), the contribution of endowments is statistically significant for \(CP^*\) and UP but not for CP. This difference may be explained by the fact that CP completely neglects what happens to non-poor people, whereas \(CP^*\) and UP also consider some information on the composition of poverty in the whole population. When the poverty incidence threshold increases to \(40\%\), CP behaves similarly to the family of urban poverty measures, thus underlying the role of the fixed tolerance level to poverty in measuring poverty concentration. This apparently unexpected result may depend on the different sub-populations involved in calculating CP at \(40\%\). This index involves a relatively small group of people subject to extreme poverty (i.e., \(40\%\)), which is likely to have specific characteristics that differ significantly from those of the remaining population, resulting in endowments which substantially affect urban poverty trends.