Introduction

There is a longstanding debate regarding the drivers of tax revenues and tax effort (Gupta 2007). Tax effort can be defined as an index of the ratio between the share of actual tax collection in gross domestic product and taxable capacity—with tax capacity being “the predicted tax-to-gross domestic product ratio that can be estimated empirically, taking into account a country’s specific macroeconomic, demographic, and institutional features, which all change through time” (Le et al. 2012). Countries worldwide have been concerned with tax revenues and fiscal constraints (Arellano and Bai 2017). On the one hand, strong pressure exists on government spending, both for social expenditure on services such as education, health, and pensions (Jimenez 2017) and for public investment, with particular emphasis on addressing infrastructure gaps (Bacchiocchi et al. 2011; Sarmento and Renneboog 2016). On the other hand, increasing attention is being paid to the sustainability of public finances, particularly in contexts of low economic growth (Afonso and Jalles 2012). All this has led to governments attaching increased importance to raising revenues. Nevertheless, a high fiscal burden can be perceived as an obstacle to economic development and growth (Afonso and Jalles 2014). Accordingly, increasing tax revenues without a high level of tax effort is considered an essential element of fiscal policy. As a result, understanding the drivers of tax effort is of great relevance, particularly for policy makers.

The literature on the drivers of tax effort has focused mainly on the economic and demographic aspects. However, other blocks of drivers should also be given due weight. The issue of the determinants of a country’s tax effort are not well-understood in the extant literature—in particular, which determinants prevail in a large panel covering numerous countries over an extended period. Most studies have concentrated on the economic determinants (Frank 1959; Bird 1964; Lotz and Morss 1967; Balh 1971; Chelliah et al. 1975; Berry and Fording 1997; Piancastelli 2001; Rivero 2001; Gupta 2007; Bird et al. 2008; Fenochietto and Pessino 2013) while giving only scant attention to the role of other blocks of determinants. Our specific aim is to clarify these issues and close the knowledge gap. As in previous studies, we use a comprehensive list of determinants of tax efforts (Balh 1971; Chelliah et al. 1975; Mertens 2003; Balh 2004; Gupta 2007; Grigorian and Davoodi 2007; Bird et al. 2008; Pessino and Fenochietto 2010; Fenochietto and Pessino 2013; Akitoby et al. 2020). Unlike prior studies, we allow the five blocks of drivers to operate simultaneously in explaining tax effort, and our sample setting is unique; in that we cover a large panel of 122 countries over a 38-year period.

From an economic point of view, the main drivers of tax effort are GDP growth and GDP per capita (Frank 1959; Bird 1964; Rivero et al. 2001; Gupta 2007), although gross domestic income is also perceived as relevant (Lortz and Morss 1967; Chelliah et al. 1975; Piancastelli 2001). In both the economic and demographic cases, countries with a higher level of development tend to have higher taxation and tax effort levels. Furthermore, a higher contribution of agriculture to GDP tends to reduce the tax capacity (Balh 1971; Chelliah et al. 1975; Piancastelli 2001). Economies with a higher level of openness—that is, with more trade—tend to have lower levels of tax effort (Gupta 2007; Pessino and Fenochietto 2010; Fenochietto and Pessino 2013). In addition, a better institutional environment, such as low corruption and adherence to the rule of law, tends to improve tax performance and reduce the tax burden (Bird and Martinez-Vasquez 2008).

Our data covers a large panel of 122 countries from 1980 to 2017 and is sourced from the OECD database, the World Economic Forum, the IMF World Economic Outlook (WEO), and the World Bank’s World Development Indicators (WDI). Unlike other studies (Bahl 1971; Piancastelli 2001; Mertens 2003; Gupta 2007), a key feature of our research is the inclusion of two alternative measures of tax effort (the Frank and Bird indices) that correct for purchasing power and disposable income, for the largest available number of countries (122) and years (38). Our focus is on five blocks of determinants—namely economic, fiscal, openness, structural, and political. We constructed our baseline specification by using the standard fixed-effects model for each block of determinants from which we later dropped any insignificant variables from the full specification of determinants. This parsimonious process set our benchmark specification, which was narrowed down to geographical region (Latin America, Africa, Europe, and Asia), income group (advanced economies, and developing economies), and time period (1980 to 1998, and 1999 to 2017).

Overall, tax effort is influenced by all groups of determinants, although the results differ per income group. Tax effort in advanced economies is driven by all blocks of drivers, with the exception of political variables, which are not relevant. However, the three blocks of openness, structural, and political issues prevail in developing economies. Consistency is also absent from the drivers of tax effort across the four regions. Conversely, and in contrast to all other regions, tax effort in Africa is not associated with the economic, fiscal, and openness drivers. Over the period from 1980 to 1998, the level of tax effort is mainly associated with higher levels of a country’s tax revenues and the contribution of the agricultural sector to GDP. In the period after that the determinants of tax effort are mainly driven by left-wing governments and by the economic and fiscal blocks of determinants. The results are robust to a large number of robustness checks.

The remainder of this paper is organized as follows. Section 2 presents the literature review. Section 3 develops the conceptual framework underlying the empirical model. Section 4 discusses the data and presents some stylized facts, and Sect. 5 discusses the econometric results. The last section concludes and earmarks a number of critical policy implications.

Literature Review

The OECD defines tax revenues as “the revenues collected from taxes on income and profits, social security contributions, taxes levied on goods and services, payroll taxes, taxes on the ownership and transfer of property, and other taxes. Total tax revenue as a percentage of GDP indicates the share of a country’s output that is collected by the government through taxes.” The tax burden is measured by considering total tax revenues received as a percentage of GDP (Sarmento 2018). However, countries have different tax capacities that vary with their level of GDP, income, openness to trade, and institutional quality (Gaspar et al. 2016). Tax capacity is measured as the predicted tax-to-gross GDP ratio that a country can support, considering its specific macroeconomic, demographic, and institutional characteristics (Le et al. 2012). We can assess tax effort as a derivative of tax revenues and tax capacity. Tax effort is, therefore, the ratio between tax revenues as a percentage of GDP and the tax capacity. It is important to note that a simple comparison of tax revenues as a share of GDP can be misleading because it ignores differences in tax capacity across the countries in question (Mertens 2003). That is to say, the ratio does not take into account the effort required to produce the income captured in GDP (Bird 1964). Other ratios adjust for differences in population and personal income (Frank 1959) and differences in disposable income (Bird 1964).

The main studies on tax effort are summarized in Table in the Appendix. Studies have evolved into more complex and comprehensive studies based on the seminal works of Frank (1959) and Bird (1964)—they were the first to define tax capacity and tax effort. Most studies use a data sample of several countries for an extended period. For example, Lotz and Morss (1967) used 72 countries to assess tax capacity. For the period from 1963 to 1965, Tanzi (1968) used data from the USA, Canada, and European countries. Balh (1971) used a similar period (1966–1968) but concentrated on 49 low-income studies. Chelliah et al. (1975) also used 47 low-income countries drawing data from the 1969 to 1971 period. More recent studies include Rivero et al. (2001) that used 14 European countries with data ranging from 1967 to 1995, and Mertens (2003) that analyzed the ten central and eastern European countries from 1992 to 2000. Worldwide country analyses (using both developed and less developed countries) have been provided by Piancastelli (2001), Gupta (2007), Bird et al. (2008), Pessino and Fenochietto (2010), and Fenochietto and Pessino (2013).

Studies on tax effort are mainly divided into two large groups: (1) those that compare and evaluate tax effort across several countries, most of which use a cross section of data, ignoring time variation although some use a sample period (Pessino and Fenochietto 2010; Fenochietto and Pessino 2013); and (2) those that analyze the potential drivers of tax efforts. Of these, we can divide the main drivers of tax effort into tax, income, economic structure, population, and institutional environment. Table in Appendix summarizes the main drivers of tax effort, authors, expected effect on tax effort, and the main findings related to each driver. Table A2 in Appendix shows that tax effort determinants can be divided into five large blocks—namely economic, fiscal, open economy, structural, and political.

GDP and GDP per capita are used to measure the economic drivers of tax effort, with some authors (Lotz and Morss 1967; Chelliah et al. 1975) using GDI (Gross Domestic Income) and GDP per capita instead. In addition, Tanzi (1968) used personal income (total and per capita). More developed countries—measured as countries with a high GDP or GDI per capita—tend to have a higher level of tax revenues, primarily due to a higher level of expenditure (mainly social expenditure on pensions, health, and education). Nevertheless, tax effort accounts for purchasing power and disposable income, depending on the measure implemented.

The fiscal determinants are mainly per capita taxes, taxes as a percentage of GDP, and the total amount of tax revenues. Most authors (Frank 1959; Bird 1964; Tanzi 1968; Rivero et al. 2001) found evidence that a higher level of tax revenues and tax collection increases the pressure on taxpayers, leading to a higher tax effort. Higher tax revenues increase the numerator of the leading tax effort measure (Bird 1964), while decreasing the denominator irrespective of whether GDP remains the same. However, as more developed countries tend to have a large tax capacity due to their strong and richer economies as evidenced by a larger GDP, this effect can reduce the tax effort.

Openness of the economy is also a relevant block in tax effort. A higher level of trade tends to reduce the tax effort. This is due to several factors: (1) a larger volume of exports generates higher profits and creates more employment, which in turn lead to greater tax revenues; (2) in most cases, imports are taxed more; (3) trade liberalization leads to an improvement in customs procedures and also greater tax revenues (Keen and Simone 2004; Agbeyegbe et al. 2006). Nevertheless, Baunsgaard and Keen (2010) found a weaker relationship in low-income countries, even though they discovered a positive and significant relationship between trade and revenue in high-and middle-income countries.

The economic structure of a country is also related to its capacity to collect taxes and consequently influence the tax effort (Piancastelli 2001). Studies show that countries with a greater participation of agriculture in GDP tend to have a higher tax effort (Balh 1971; Chelliah et al. 1975; Mertens 2003; Gupta 2007). Agriculture tends to be rudimentary and has a predominance of small farmers—especially in less developed countries (Fenochietto and Pessino 2013). This means that the majority of the economic agents involved are less prone to pay their fair share of taxes and that the tax authority finds it more difficult to collect these revenues (Rajaraman 2004). As a result, the tax burden is shared by other sectors to a large degree (Rajaraman 2004). In less developed countries, the agriculture share of tax revenues is lower than the percentage of GDP and, consequently, tax performance is weaker (Akitoby et al. 2020). On the contrary, mining and industry tend to reduce the tax effort (Pessino and Fenochietto 2010; Fenochietto and Pessino 2013). In the case of mining, the high level of revenues—especially during a commodity price boom—tends to be highly taxed, which generates revenues. This, in turn, enables tax authorities to reduce the tax burden on other sectors.

Population also has a role in the level of tax effort. Balh (2004) found evidence that countries with a faster-growing population tend to have a low level of tax effort, and Bird et al. (2008) also found evidence of such an effect. Nevertheless, it is important to appreciate that countries with a growing population tend to be low-income countries and, therefore, the low level of tax effort could be more related to this factor.

Finally, the institutional quality of the country is a crucial aspect in determining the level of tax effort. Countries with a better institutional framework—which is reflected in low corruption, a stronger commitment to the rule of law, and higher levels of government efficiency—tend to be characterized by less tax effort (Grigorian and Davoodi 2007; Gupta 2007; Bird et al. 2008; Pessino and Fenochietto 2010; Fenochietto and Pessino 2013). On the whole, these authors argue that a higher level of informal economy and tax evasion leads to a higher tax effort. The ability of a large proportion of taxpayers to evade their tax obligations naturally leads to the tax burden being shared by a reduced number of agents. Authors such as Bird et al. (2008) argue that in the case of low-income countries, improving government institutions is the best route to improving their tax collection because such an improvement provides a better level of development than natural resources. The authors also claim that high-income countries have greater potential to enhance their tax performance because they have better institutions. Despite this, some studies in the literature place emphasis on the repercussions that result from the lack of administrative capacity to enforce taxation in developing countries (Bird 1989, 2004).

Conceptual Framework and Empirical Strategy

Conceptual Framework

To implement our analysis, we compute two measures of tax effort based on the seminal works of Frank (1959) and Bird (1964). The two measures are still relevant today, despite recent attempts to define more comprehensive indices by also including economic development and the degree of openness (Lotz and Morss 1967), foreign trade (Bahl 1971), the intensity in the use of specific taxes (Bahl 1972; ACIR 1988), and frontier production possibilities (Aigner et al. 1977). Frank (1959) proposed a “tax sacrifice” measure, which captures the effects of differences in population and personal income. In Eq. (1), the measure of tax effort begins with the tax burden in the numerator and then accounts for the ability to pay taxes:

$${\text{Frank}}_{{{\text{it}}}} = \left[ {\frac{{\left( \frac{T}{Y} \right)}}{{\left( \frac{Y}{P} \right)}}} \right] \times 100$$
(1)

where T is tax revenues, Y is the gross national product, and Y/P scales the gross national product by population (P).

The first studies on the topic computed the “tax burden” solely as T/Y without taking account of the level of economic development. Frank (1959) aimed to overcome this limitation and introduced a “tax burden” measure that adjusted for per capita purchasing power. Later, Bird (1964) asserted the need to adjust the measure to improve international comparisons of the tax burden, adding that the numerator in Frank’s measure failed to consider the effort required to produce the income. In addition, Bird (1964) challenged Frank’s inclusion of gross national product rather than gross domestic product—which better assesses performance in open economies. Nevertheless, the formulation of Bird’s index changed only the numerator part. The measure was first labeled as “tax sacrifice.” Since then, it has evolved into the “tax effort” measure that we highlight in this study (Reddy 1975; Ahmad and Stern 1989; Bird et al. 2008). The index proposed in this research uses disposal income to compute tax effort:

$${\text{Bird}}_{{{\text{it}}}} = \left[ { \left( {\frac{T}{Y - T}} \right) \div \left( \frac{Y}{P} \right)} \right] \times 100$$
(2)

Empirical Strategy

We empirically investigate the impact of different blocks of drivers on two alternative measures of tax effort computed for the largest available number of countries and years. This means that our unbalanced panel sample includes xx countries from 1980 to 2017. The analysis is further narrowed by geographical region (Latin America, Africa, Europe, and Asia), income group (AE—advanced economies, and DEV—developing economies, including EME—emerging market economies, and LICS—low-income countries), and time period. The following reduced-form equation is estimated using ordinary least squares (OLS) initially:

$$y_{{{\text{it}}}} = \beta_{1} + \beta_{2} y_{{{\text{it}} - 1}} + \beta_{3} {\text{eco}}_{{{\text{it}}}} + \beta_{4} {\text{fis}}_{{{\text{it}}}} + \beta_{5} {\text{open}}_{{{\text{i}},{\text{t}}}} + \beta_{6} {\text{stru}}_{{{\text{i}},{\text{t}}}} + \beta_{7} {\text{pol}}_{{{\text{i}},{\text{t}}}} + \eta_{{\text{i}}} + \mu_{{\text{t}}} + \varepsilon_{{{\text{i}},{\text{t}}}}$$
(3)

where the \(y_{{{\text{it}}}}\) dependent variable denotes tax effort (from either the Frank Index or the Bird Index) in the country i and time t, and \(y_{{{\text{it}} - 1}}\) is the lagged dependent variable included in the dynamic model later on (cf. robustness section). \({\text{eco}}_{{{\text{it}}}}\), \({\text{fis}}_{{{\text{it}}}}\), \({\text{open}}_{{{\text{it}}}}\) \({\text{stru}}_{{{\text{it}}}}\), and \({\text{pol}}_{{{\text{it}}}}\) are the blocks of drivers that correspond respectively to the economic, fiscal, open economy, structural, and political economy determinants of the tax burden. Building on previous research (Frank 1959; Rivero 2001; Tanzi 1968; Bird 1964); Balh 1971; Gupta 2007; Bird et al. 2008; Pessino and Fenochietto 2010; Fenochietto and Pessino 2013), \({\text{eco}}_{{{\text{it}}}}\) specifically includes the following variables: log of real GDP per capita; growth in real GDP; and the output gap (measured as the gap between potential and real GDP). In turn, \({\text{fis}}_{{{\text{it}}}}\) includes tax revenues and the country’s fiscal balance, both as a percentage of GDP. The openness of economies (\({\text{open}}_{{{\text{it}}}}\)) is captured by trade openness (measured as exports plus imports as a percentage of GDP), fiscal openness (measured by the World Bank as the commitments and implementation of fiscal transparency actions), and exchange rate stability. The block of structural drivers (\({\text{stru}}_{{{\text{it}}}}\)) includes the log of population density, the share of the agricultural sector in the economy, and the Gini index. To conclude, \({\text{pol}}_{{{\text{it}}}}\) includes a variable to capture the political orientation of the government (Left), the political timing (Horizon), the degree of political fragmentation (Cohesion), and government accountability—see below for details. Most of the variables that comprise the different blocks are derived and adapted from those used in previous analyses. They are described in both Section 2 and in Annex Table A1.

These control variables enter with a one-year lag in order to minimize reverse causation issues. The \(\eta_{{\text{i}}}\) and \(\mu_{{\text{t}}}\) coefficients denote the country-specific effects to capture time-invariant unobserved factors and the time effects controlling for common shocks (such as the global business cycle) that could affect fiscal conditions across all countries in a given year. \(\varepsilon_{{{\text{i}},{\text{t}}}}\) is an idiosyncratic error term that satisfies the standard assumptions of zero mean and constant variance. To account for possible heteroskedasticity, robust standard errors are clustered at the country level.

The Data

Tax Burden Indices

Figures 1a, b illustrate the level of tax effort worldwide, ranked from 0 to 30. While most advanced economics lie within the 0 to 1 range, Europe shows greater variability, especially when looking at the Bird Index. This measure captures the effort to produce income, and it shows that of the southern European countries (Belgium, France, Italy, Portugal, and Spain), Belgium and France exhibit above average values. The Baltic countries and some countries in the Balkans underperform. Asia shows no relevant variability in tax effort levels.

Fig. 1
figure 1

a Tax Burden across the world, 2017 (using the Frank Index) b Tax Burden across the world, 2017 (using the Bird Index)

Figure 2 illustrates the evolution of both indexes since 1980. The gap has decreased consistently over the years, although a visual inspection highlights a considerable drop in the interquartile range gap. However, the gap between the advanced and the developing economies was not similar (see Fig. A4 in Appendix). Figures A1, A2 and A3 in Appendix show the evolution of the Frank and Bird indices per country. Interestingly, Japan presents an inversion of the tax effort trend within the group of advanced economies. When looking at emerging market economies (EME) and low-income countries (LCIS), more distinct patterns can be observed. In Fig. 3, we report the distribution of the Frank and Bird indices, which shows that the dispersion is significantly greater for developing economics.

Fig. 2
figure 2

Evolution of the Frank and Bird indices over time, all countries Note: Interquartile range, plotting the median and first and third quartile of the respective distributions

Fig. 3
figure 3

Distribution of the Frank and Bird indices by income group, 2017

Political and Institutional Drivers

To test the role of political and institutional variables in driving the tax burden, as discussed in the previous sub-section, we propose an approach that relies on a principal component analysis (PCA), with variables grouped around three political dimensions—namely ideology, electoral proximity, and political strength. Data on political economy variables were retrieved from the Database of Political Institutions (DPI) (Cruz et al. 2018).

Ideology

This dimension captures whether a ruling government is left wing or not. The DPI original “chief executive party orientation (execrlc)” value takes three discrete values: 1 for right-wing parties, 2 for centrist parties, and 3 for left-wing parties. We define “left” as taking the value 1 if “execrlc” takes the value 3, and 0 otherwise.

Electoral Proximity

This dimension takes into account the time available to policy makers before forthcoming elections. Politicians facing elections can have higher or lower incentives to implement certain tax reforms, depending on the tax area, vested interests, and constituency voting patterns. We use three variables to compute the proximity to elections PCA. A higher electoral proximity is associated with a longer length of time in office for the party of the chief executive, a greater number of years in office of the chief executive, and a higher number of elapsed years from the current term. The first principal component is retained because this factor explains 60% of the variance in the standardized data (see Table A3).

Political Strength

This represents a dimension that captures the number of political actors participating in fiscal decisions, which typically tend to exhibit conflicting demands. These actors could be parties in government (or in opposition), interest groups, or, more generally, veto players. Strong governments are those that operate in less fragmented political environments. We use four variables to compute the strength of PCA, where greater political strength is associated swith a large parliamentary majority, executive control of all houses, and a weak opposition. This strength is measured by a larger number of parliamentary seats and a superior share of the votes held by the ruling government. Only the first principal component is retained because it explains more than 54% of the variance in the standardized data (see Table A3).

Political Accountability

This is the dimension that considers the institutional context in which fiscal policy decisions are made. Politicians tend to be more responsive to citizens’ demands and more accountable to voters for the promises they make when they operate in contexts typified by greater transparency, better governance, and a larger number of mechanisms designed to monitor their activities objectively. In such contexts, politicians operating in institutional contexts with greater accountability are associated with greater fiscal discipline and lower promise gaps. We use four variables to compute the accountability of PCA. A higher accountability index is associated with a stronger voice and greater accountability, with superior regulatory quality, more government effectiveness, and less corruption. Only the first principal component is retained (see Table A3).

Table A3 in Appendix lists the corresponding factor loadings and uniqueness of the PCAs, which can be interpreted as follows in this example: in the case of political strength, the resulting factor appears to mostly describe the margin of the majority and control of all houses, as indicated by their lower uniqueness.

Given that PCA is based on the classical covariance matrix, which is sensitive to outliers, we take a preliminary step by basing it on a robust estimation of the covariance (correlation) matrix. A well-suited method is the minimum covariance determinant (MCD), which considers all subsets containing h% of the observations and estimates the variance of the mean on the data of the subset associated with the smallest covariance matrix determinant—for which we implement Rousseeuw and Van Driessen’s (1999) algorithm. After re-computing the same indices with the MCD version, we broadly obtain similar results, which 5that outliers are not driving our factor analysis.

Empirical Results

Baseline

As a baseline, we estimate Equation (1) using the standard fixed-effects model and start with a specification that includes only macroeconomic variables in Column (1) of Table 1 as a reference point. We then present parsimonious specifications with alternative blocks of drivers organized by topic and enter them independently into Specifications (2) to (6). Specification (7) includes all blocks of drivers placed together in a single regression. While these results demonstrate a relatively consistent picture, we consider the model in Column (8)—which was obtained from dropping insignificant variables from Specification (7) one at a time—as our benchmark specification.

Table 1 Baseline: Frank Index—OLS

The dependent variables in Tables 1 and 2 are the Frank index and the Bird index, respectively. The results are very similar for the two proxies of tax effort, although with one exception. The Gini index is associated with higher tax effort for the Frank index but is not relevant for the Bird index. In fact, the latter differs in the way it accounts for the effort to produce income, thus making the measure of income inequality not relevant in explaining tax effort. We observe that countries with higher tax as a percentage of GDP and a higher fiscal balance experience higher tax effort. This result is consistent for both measures of tax effort and with the role of tax effort in achieving fiscal consolidation.

Table 2 Baseline: Bird Index—OLS

The results suggest the existence of higher tax effort for more open economies, which are more sensitive to exchange rate fluctuations. The works of Lotz and Morss (1967), Balh (1971), Chelliah et al. (1975), Mertens (2003), and Bird et al. (2008) all contended that openness exerts a negative effect on tax effort, although some other studies suggested the opposite effect, which is arguably due to import taxes (Gupta 2007; Pessino and Fenochietto 2010; Fenochietto and Pessino 2013). Consistent with Bird et al. (2008), tax effort increases population density, although the level of income can shape this relationship, as discussed below.

Increasing real GDP is associated with higher tax effort. However, contrary to most empirical evidence (Frank 1959; Gupta 2007; Pessino and Fenochietto 2010; Fenochietto and Pessino 2013), real GDP per capita appears to drive down countries’ tax efforts. Bird et al. (2008) asserted that more developed countries have a greater tax capacity. Accordingly, this supports our decision to narrow the analysis to geographical region and income group in the section that follows.

Collectively, results suggest a more prominent role in explaining a country’s tax effort for economic, fiscal, and political drivers. A country’s tax effort decreases as the real GDP per capita increases, which is expected considering the nature of the tax effort measure. Indeed, an increase in the prosperity of an economy based on its economic growth is likely to lead to increased tax revenue (Castro and Camarillo 2014), although at a more reduced pace. Regarding the fiscal block of determinants, our results demonstrate that those fiscal balances are triggers for a greater effort in collecting taxes, in order to fund government spending from the revenues collected (Tanzi 1989). The positive effect on tax effort aligns with the extant literature (Frank 1959; Bird 1964; Tanzi 1968 or Rivero et al. 2001). From an economic point of view, the relevance of the political driver on a country’s tax effort is surprising. Left-wing governments are keener to implement redistributive policies (Piketty 1995) that are funded with a higher tax burden. Moreover, taxation is likely to be more progressive in the majority of countries that have left-wing parties in government (Andersson 2022).

Sensitivity

We performed sensitivity analyses by income group (Table 3), geographical location (Table 4), and time period (Table 5). The results of the sensitivity analysis by income group show that there are only two consistent determinants of tax effort: real GDP per capita and population density. The results for other economic and fiscal drivers of the tax effort remain similar to the base case for advanced economies. The relevance of the agricultural sector in developing economies increases the tax effort. However, this association is negative for advanced economics, which is explained by the role of the agricultural sector on output per capita. Overall, the tax effort in advanced economies is driven by all the blocks of drivers, except for political variables. However, in developing economies, the level of tax effort is influenced by the openness, structural, and political blocks of drivers.

Table 3 Sensitivity: Frank and Bird Indices—OLS by income group
Table 4 Sensitivity: Frank and Bird Indices—OLS by geographical region
Table 5 Sensitivity: Frank and Bird Indices—OLS by time period

The sample is further narrowed in Table 4 to geographical location—namely Latin America, Africa, Europe, and Asia. Overall, there is no consistency in the drivers of tax effort across the four regions. Tax effort in African countries is not associated with economic, fiscal, and openness drivers. African countries exhibit lower tax efforts with higher density populations in the case of countries governed by left-wing parties and also for countries with a prominent agriculture sector in the economy. Latin American countries share similarities with African countries, although the Gini index and the economic and fiscal blocks of drivers need to be added to those cited above for African countries. The Gini index plays a role in the tax efforts of regions where there is a concentration of countries with greater inequality. In fact, the Gini index is only relevant to the Latin American region. The inequality index is highly concentrated above 40 for most Latin American countries, whereas it is not uniform throughout the African region and is relatively low in most countries of all other regions.

The Europe and Latin America regions share most economic drivers, although with opposing associations—namely increase in GDP per capita, lower real GDP growth, and smaller output gap are all associated with lower tax efforts in Europe, although they are higher in Latin America. The block of fiscal variables is relevant in explaining variability in tax efforts in the Asia region, together with financial openness and the contribution of the agriculture sector.

Table 5 shows the results for two time periods—1980 to 1998 and 1999 to 2017. The decision to formulate two periods was based on two considerations. First, it allowed us to split the sample into two periods of equal length—18 years each. In addition, these periods represent dissimilar economic growth. Second, in 1999, the eurozone was formed from 19 countries, which later led to the creation of the euro currency. The single currency market has yielded certain benefits: improved price stability; economic stability and growth; a more significant influence on the global economy; and enhanced firm competitiveness (Georgieva et al. 2014; Hutson and O’Driscoll 2010; Beetsma and Giuliodori 2010; Ferrando et al. 2019). During the first period, the tax effort is mainly associated with a higher level of tax revenues per country and the role of the agricultural sector in the composition of GDP. Turning to the next two decades, drivers differ significantly as the economic and social development of countries evolve. In this new situation, determinants of tax effort are now mainly driven by the economic and fiscal blocks of variables. The growth in GDP is associated with higher tax effort, although it is offset by the prosperity of economies measured by per capita GDP. The relevance of left-wing governments emerged in the second period—which acts as a determinant of decreasing tax effort.

Robustness

We began the empirical analysis with the standard fixed-effects model, which provided consistent and robust results. However, the model represented by Equation (1) is the reduced-form version, which therefore renders it impossible to make causal statements, or even quantify the clean effects of certain drivers on the tax burden. Adding covariates partly corrects for these biases. However, endogeneity can still arise from other omitted variables (unobserved heterogeneity and selection effects), measurement errors in variables, and reverse causality (simultaneity). Since causality can run in both directions, some of the right-hand-side regressors can be correlated with the error term. However, due to the potential for the existence of endogeneity and the persistence of tax burden indices, we checked the sensitivity of our baseline results by estimating the static model with the two-stage least squares (2SLS) estimator. We used lagged regressors as instruments (up to two lags), which are validated by the Kleibergen-Paap and Hansen statistics.Footnote 1 Furthermore, we used the system generalized method of moments (GMM) approach developed by Arellano and Bover (1995) and Blundell and Bond (1998) to estimate the dynamic version of our model—even though the GMM system approach is a very demanding estimator, especially when using a limited number of unbalanced observations. The system GMM approach involves constructing two sets of equations—one with first differences of the endogenous and pre-determined variables instrumented by suitable lags in their own levels, and the other with the levels of the endogenous and pre-determined variables instrumented with suitable lags of their own first differences. We apply the one-step version of the GMM system estimator to ensure the robustness of the results, as the standard errors from the two-step variant of the GMM system method are known to be downward biased in small samples.

The use of all available lagged levels of the variables in the GMM estimation leads to a proliferation in the number of instruments, which reduces the efficiency of the estimator in finite samples and can potentially lead to over-fitting. A further issue is that the use of many instruments significantly weakens the Hansen J test of over-identifying restrictions and, therefore, the detection of over-identification is harder when it is most needed. Conversely, however, restricting the instrument set too much results in a loss of information that, in turn, leads to imprecisely estimated coefficients. Accordingly, the estimation of such models involves a delicate balance between maximizing the information extracted from the data, on the one hand, and guarding against over-identification, on the other. To this end, we follow the strategy suggested by Roodman (2009) to deal with the problem of weak and excessively numerous instruments. We also validate the GMM system identification assumptions by applying a second-order serial correlation test for both the residuals and the Hansen J test for the over-identifying restrictions. The values reported for AR(1) and AR(2) in the respective tables are the p-values for the first-and second-order autocorrelated disturbances in the first-differenced equation. As expected, we find evidence of high first-order autocorrelation but no evidence of significant second-order autocorrelation. Similarly, the Hansen J test result indicates the validity of internal instruments used in the dynamic model estimated using the GMM system approach.

Finally, we consider the mean group (MG) estimator (Pesaran and Smith 1995) and the pooled mean group (PMG) estimator (Pesaran et al. 1999). Both the MG and PMG are appropriate for analyzing dynamic panels that have both large time and cross section dimensions. Furthermore, they have the advantage of accommodating both the long-run equilibrium and the possibly heterogeneous dynamic adjustment process. These estimators allow us to correct for the potential bias that could result from estimating tax buoyancy coefficients when using standard fixed-effects models in the presence of non-stationary error terms. Imposing parameter homogeneity would introduce bias into the estimating equation.

Results for the robustness analysis using the entire sample are given in Table 6, which contrasts with the results in Tables 1 and 2. The role of real GDP per capita in shaping countries’ tax efforts is reinforced in all estimations as documented by Bird et al. (2008), although the results are contrary to other studies (Frank 1959; Gupta 2007; Pessino and Fenochietto 2010; Pessino and Fenochietto 2013). We also observe that fiscal balances increase countries’ tax efforts, as in the baseline estimation. The battery of robustness tests yields better clarification on the influence of the Gini index in explaining tax efforts, although it is now robust across the two dependent variables used in our study.

Table 6 Robustness: Frank and Bird Indices—alternative estimators, all countries

Conclusion and Policy Implications

In this paper, we evaluated the determinants of tax effort in a large sample of 122 countries over 38 years from 1980 to 2017. In contrast to other studies (Bahl 1971; Piancastelli 2001; Mertens 2003; Gupta 2007), a key feature of our research is the inclusion of two alternative measures of tax effort (the Frank and Bird indices) that correct for purchasing power and disposable income. The focus of our analysis is on five groups of determinants—namely economic, fiscal, openness, structural, and political. We began our analysis by looking at the entire sample. Next, we narrowed the analysis by geographical region (Latin America, Africa, Europe, and Asia), income group (advanced economies, and developing economies), and time period (1980 to 1998, and 1999 to 2017).

Our main findings are summarized as follows. Tax effort is influenced by all groups of determinants, although the results differ per income group. All blocks of groups of determinants explain the tax effort in advanced economies—except the block of political variables—while, in developing economies, the level of tax effort is mainly influenced by the open structural and political blocks of drivers. Overall, there is no consistency in the drivers of tax effort across the four regions. Economic and fiscal determinants are relevant for Europe and Latin America, although in opposite directions. Conversely, none of the economic, fiscal, and openness drivers determines tax effort in Africa. The relevant determinants for African regions are the density of population, the weight of the agricultural sector, and political drivers.

Determinants of tax effort also differ according to each time period. On the one hand, tax effort is mainly associated with higher levels of countries’ tax revenues and the role of the agricultural sector in the GDP during the first period (1980–1998). However, on the other hand, in the second period, the relevance of left-wing governments emerged as a determinant of decreasing tax effort, together with the relevance of the economic and fiscal blocks of variables.

Our findings have four main policy implications. First, tax effort is a relevant measure of fiscal policy, but it has limitations. It is easy to collect and provide a simple overview of tax trends over countries. However, countries with different levels of development and income should be considered with caution. The best solution is to consider the impact of each block of determinants mainly in the same income group. Second, in the case of high-level income countries, as all blocks of determinants are relevant, countries should have a holistic view of their tax system. In addition, high-income countries have the potential to improve their tax efficiency by improving their institutions. Third, if countries outside the high-level income group want to take full advantage of their tax revenues, then they need to consider changes in their openness and in the structural aspects of their economy. Fourth, political and institutional drivers are becoming increasingly more relevant.

Future research should consider a finer decomposition of the tax burden, using the Frank index approach by tax category (personal, corporate, goods, and services). Another avenue of research that could usefully be pursued is the economic (and distributional) consequences of big increases in the tax burden—defined as a binary variable that could take the value 1 if the annual change in the index is larger than each country’s time-series average plus one standard deviation. This would identify years of major country-specific tax shocks.