Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Tax evasion has always been a dear issue to policy makers, but in times of crisis when Governments fall short of resources, it easily becomes one of the favorite pieces in everybody’s political programme. It is clear that tax evasion dynamics should be investigated taking into account the joint action of many heterogeneous taxpayers and of the Government (or, equivalently, of the agency that collects taxes). However, starting with the seminal paper [1], taxpayers’ behavior has been at the centre of the focus, using models with fully rational agents who decide how much of their actual income to declare to the tax authority, in an expected utility framework à la von Neumann and Morgenstern. The conduct of the Government was instead simply mimicked using a constant fine rate and assuming that entirely random auditing is performed with some probability. This basic model well captures the deterrent effects of increased audit probability and fines, but typically predicts much higher evasion than it is observed and suggests that hikes in the tax rate would increase compliance, whereas intuition would lead to the opposite outcome. An impressive number of alternative models tried to overcome these shortcomings, resorting, say, to the tendency of overweighting small probabilities, [7], introducing a psychological cost of evasion (“shame”) in the utility function and developing more general paradigms in which the tax morale of a community plays a role, see [10]. The interaction between agents and Government can also be framed in terms of power and trust, see [8], where the former increases the enforced compliance and the latter can boost voluntary compliance.

In this paper, we exploit the flexibility and heterogeneity of an agent-based approach to show that the inclusion of more realistic features notably improves the descriptive accuracy of the standard model. The agent-based model enables us to depict a system of many heterogeneous agents, whose interaction might give rise to emergent phenomena that are not analytically derivable, in particular when the model exhibits potential network effects, nonlinear behavior, learning or adaptation. These features characterize a multitude of social phenomena, and certainly also the tax evasion framework we examine: in such a contest, we use agent-based simulations to grasp the effects that different combinations of societal configurations and audit policies might have.

First, we allow the tax agency to evolve its own control scheme, using minimal information about taxpayers’ income to improve the efficacy of the audits. The weight of these pieces of information can, in fact, be calibrated to optimize who is to be inspected.

Second, our agents are restricted in their evasion decisions and, in particular, we assume that some fraction of income cannot be concealed. Income can be categorized as traceable or non-traceable: the first includes salary or wage as well as self-employed income that can be matched by somebody else’s tax-report, whereas the second is formed by all those earnings’ components that are hardly matchable and easily hidden. Therefore, we emphasize in this work the distinction of matchable versus non-matchable income, as done in [5] where it is empirically shown that the non-matchable component notably increased in recent years for US taxpayers, with visible repercussions on aggregate compliance. Take, for instance, two taxpayers with the same gross income: the first earns 90 % of his revenues from matchable wage, whereas the second gets 90 % of income from non-matchable sources. Clearly, it is materially possible for the latter to strategically conceal a vast amount of her income and devise profitable tax evasion. Fresh evidence from a study conducted in Denmark has indeed corroborated Francis Bacon’s saying that opportunity makes a thief: tax evasion is found to be substantially higher for individuals who self-report their income compared to those who have most part of their resources reported by a third party to the tax agency, regardless of audits probabilities, [9].

Third, we allow population’s risk aversion to be correlated with income and with the portion of matchable wealth. This allows for the customization of the audit policy to different states of the economy, as in booming phases the correlation of risk aversion and income in the whole population is likely to be negative, whereas the link may be weaker in gloomy periods.

Fourth, we test two ways to sense and adapt the perceived probability of being audited. Taxpayers can either estimate this probability using a sample average based on random matches with other peers or using their own history of past audits. We call these schemes geographical and temporal adaptation, respectively. It turns out that the efficacy of audit schemes critically depends on how agents subjectively estimate their audit probability in the next period.

On the one hand, taxpayers may, in this setup, evade less than theoretically predicted just because they cannot do otherwise. On the other hand, this model fully incorporates in the game a tax agency who is able to use some (minimal) pieces of information about the agents to maximize the revenues from its audit policy: this looks realistic, as more sophisticated schemes than purely random inspections are clearly within reach for a tax agency. Other works investigated some simple endogenous audit schemes [3, 6], finding significantly lower levels of underreporting, but letting the score depend on the degree of matchable income appears, to the best of our knowledge, novel and promising.

This paper is organized as follows. Section 2 provides details on the ways taxpayers and the tax agency are modelled. The following section discusses our simulation results and show how alternative audit policies interact with the distribution of matchable income, tax morale and agents’ beliefs regarding the probability of verification. Section 4 provides some conclusive remarks and relates our findings with the existing literature, discussing a few policy suggestions.

2 The Model

We model a society made of a tax agency (TA) who collects taxes from N individuals. The next two subsections will provide a detailed description of the agents and of the society in which they are embedded. Such a society, as it is often the case in agent-based models, depends on some features of the tax system as well as on the meta-parameters of the distributions individual traits are drawn from.

2.1 Agents

Agents have exogenously given parameters, \(I_{j},\rho _{j},\kappa _{j},\beta _{j}\), j = 1, , N, held constant across time, which denote privately known income, risk aversion, tax morale and the fraction of matchable income, respectively. Individual parameters are sampled only once at the beginning of the simulation from a distribution described in the next subsection. Taxpayers decide what portion of their earnings d jt (\(0 \leq d_{\mathit{jt}} \leq 1\)) to disclose at time t = 1, , T in order to maximize their expected utility. The declared income, d jt I j cannot fall short of their own matchable income β j I j (0 ≤ β j  ≤ 1), which can proxy wage from (legal) employment, third-party reported income or earnings that cannot otherwise be concealed. Agents assume that they will be audited at time t with some probability p jt , whose true value is unknown and must be estimated in one of two ways. The first method, called geographical, assumes that each taxpayer meets k “neighbors” per periodFootnote 1 and learns the number m t of those who were audited. The previously held p j, t − 1 is updated as follows:

$$\displaystyle\begin{array}{rcl} p_{\mathit{jt}} = \mathit{wp}_{j,t-1} + (1 - w)\frac{m_{\mathit{jt}} + A_{\mathit{jt}}} {k + 1},& & {}\\ \end{array}$$

where w = 0. 5 and A jt is 1 if the j-th agent was audited at time t and 0 otherwise. In other words, the probability of experiencing an audit is the average of the past estimate, a fresh guess derived from the k encounters and the knowledge of whether she was audited herself. Alternatively, using temporal updating, each agent uses a time-average and computes, in any t

$$\displaystyle\begin{array}{rcl} p_{\mathit{jt}} = \frac{1} {t}\sum _{i=1}^{t}A_{\mathit{ ji}}.& & {}\\ \end{array}$$

While geographical updating almost exclusively depends on current audits on other taxpayers, temporal adjustment focuses only on the history of one agent, uses past information and is more accurate if a longer span of time is available.

The utility function of taxpayers is

$$\displaystyle\begin{array}{rcl} U(d_{\mathit{jt}}) = {(1 + d_{\mathit{jt}})}^{\kappa _{j}}W_{\mathit{jt}}^{(1-\rho _{j})}& & {}\\ \end{array}$$

where W jt is the wealth after taxes and fines (if any) and κ j (0 ≤ κ j  ≤ 1) represents the tax morale of agent j: the stronger the ethical sense and the more income the agent will report, the higher utility she will perceive. A stronger tax morale corresponds to a marked sensitivity of utility to changes in the choice of how much to conceal: indeed a less moral taxpayer, endowed with a lower κ j , will feel less ashamed in not paying taxes than an upright one.

The fraction d jt of income to be disclosed is optimally selected by each agent solving the problem

$$\displaystyle\begin{array}{rcl} \max _{d_{\mathit{jt}}\geq \beta _{j}}\mathit{EU}(d_{\mathit{jt}}) = p_{j,t-1}U(X_{\mathit{jt}}) + (1 - p_{j,t-1})U(Y _{\mathit{jt}}),& & {}\\ \end{array}$$

where \(X_{\mathit{jt}} = (1 -\tau d_{\mathit{jt}})I_{j} - f\tau (1 - d_{\mathit{jt}})I_{j}\) is the income if audited and \(Y _{\mathit{jt}} = (1 -\tau d_{\mathit{jt}})I_{j}\) is the income in the absence of an audit. In the previous equation, τ and f denotes the tax and penalty rates, respectively. Both parameters characterize the society in which agents live, which is described in the next subsection.

2.2 Society

We define “society” as the set of arrangements and parameters that describe the tax collection process and the distribution of individual traits previously detailed.

The TA collects taxes from agents setting the tax rate τ they must pay on their income and the fine rate f to be levied on hidden income, when and if evasion is discovered. The TA can audit a fixed number qN (0 ≤ q ≤ 1) of agents, according to some policy. We assume that this is done by assigning a score to each taxpayer and picking qN agents with probability proportional to the score. We consider four audit schemes: random auditing simply gives the same score to everyone in every period; strict cutoff sets the score to 1 for the qN agents whose declared income \(d_{\mathit{jt}}I_{j}\) is the smallest and 0 otherwise; in the (mild) cutoff rule, auditing is performed proportionally to the rank of the declared income; finally, by enhanced auditing we refer to a scoring system that is developed by the TA in the attempt of maximizing the revenues from enforcement.

Using strict cutoff, the TA will audit in each period the qN individuals who declared the least. Agents who report low income are likely to be inspected also under cutoff auditing but there is much more variability with respect to strict cutoff and many more individuals experience one or more audits along time.

Enhanced auditing is based on the score S jt , which is a function of matchable and declared income:

$$\displaystyle{S_{\mathit{jt}} = p_{1}{(\beta _{j}I_{j})}^{p_{2} } + {(d_{\mathit{jt}}I_{j})}^{p_{3} },}$$

where \(p_{1},p_{2}\) and p 3 are constants selected by the TA to approximately maximize the revenues from audits.Footnote 2 While random auditing is a special case of the enhanced scoring system (set \(p_{1} = 1,p_{2} = 0,p_{3} = 0\) to obtain constant scores), only a part of all possible functions of β I and dI is explored with this parametrization. Clearly, neither the strict nor the mild cutoff can be exactly replicated but the relative size of p 2 as compared to p 3 and the scaling factor p 1 can provide some guidance in singling which users are more likely to be audited using the enhanced scheme.

The description of the society is completed by the definition of the distributions used to draw the individual parameters, see Table 1. We assume population income to follow a log-normal distribution, with mean income being 30, 000 and standard deviation equal to 23, 500 (these figures vaguely reflect Italian ones); the risk aversion parameter is uniformly distributed in [0, 1]. For the sake of simplicity, we suppose that tax morale is uniformly distributed among the taxpayers and let the support of the density change symmetrically around the mean, thus effectively considering mean preserving “spreads” of the same distribution. This way we can easily represent different societies with distinct moral attitudes: on average tax morale has the same value in every country, but we can find societies with more extreme values – in both directions – than others and increments in \(\kappa _{\mathit{low}}\) increase aggregate compliance. The fraction of matchable income β j for the j-th agent follows a beta(a, a) distribution and, in particular, we focus on three specific values of a = 0. 5, 1, 2 describing density functions that are U-shaped, uniform and bell shaped, respectively. When \(\beta \sim \mathrm{ beta}(0.5,0.5)\) most of the agents have either high or low matchable income, whereas there is low density for middle ways; on the contrary, a beta(2, 2) distribution corresponds to a society where taxpayers mostly have a mix of matchable and non-matchable income, with few extreme cases. A country like Italy, where self-employment often leaves many opportunities for income disguising, can be representative of the first scenario (a = 0. 5), whereas a nordic country, say Norway, where usually payments are completed by traceable means could better be approximated by the second situation (a = 2). The case relative to a = 1 stands in between.

Table 1 Description of the societal individual parameters’ distributions

Finally, we capture important second order effects in the distribution of citizens’s individual traits of one society allowing for nontrivial correlation of parameters. Hence, while Table 1 reports the marginal distributions of parameters, we assume that \(\mathit{Cor}(\rho _{j},\beta _{j}) = r\) and \(\mathit{Cor}(\rho _{j},I_{j}) = -r\) across the population. Picking, say, r > 0 is tantamount to suppose that more risk-averse agents have on average a smaller income. At the same time, they are likely to have a larger matchable income. In most of the countries we can think of, this positive correlation appears to be reasonable to account for the self-selection process that generally leads risk-averse agents to seek employed job and risk-prone taxpayers to self-employ in more profitable activities that also leave more room for evasion.Footnote 3

Since the TA does not make q public, taxpayers are not fully informed about the true intensity of audits and have a perceived probability of auditing p jt that is updated according to one of the two possible ways explained before. Furthermore, taxpayers do not know how the tax agency actually runs the inspections and which algorithm is followed when selecting the tax files to audit.

3 Results

The model presented in the previous section generalizes the classic framework in several ways: agents are restricted in their compliance decision and must declare at least as much as their matchable income; they can update the perceived probability of audit using geographical or temporal adjustments, while the correlation among parameters and the level of tax morale of the population varies. We simulate a grid of beta distributions, where a ∈ { 0. 5, 0. 75, , 2} (7 values), and tax morale levels κ low  ∈ { 0. 025, 0. 050, , 0. 250} (11 values). Each grid was then replicated four times, to account for two possible values if correlations among parameters, r = 0 and r = 0. 5 (two values), and the two updating schemes, denominated geographical and temporal (two elements). For each set of values for the meta-parameters a, κ low , r and one updating method, we simulated 30 periods with N = 1, 000 agents, for a total of 7 ×11 ×2 ×2 = 308 societies, where τ = 27 % and f = 1 are constant. To avoid any dependence on the initial values of the probability p j0 at time 0, we discard the first 29 periods and report the results of the last one, which can be thought of as one fiscal year.

As customarily in agent-based models, the richness of the data is both a curse and a blessing and we especially focus in what follows on the audit policies, contrasting the random, enhanced, strict cutoff and mild cutoff audit systems. Figure 1 depicts the revenues from fines of the four policies, when agents geographically update their own subjective probabilities, r = 0. 5 and a = 0. 5.

Fig. 1
figure 1

Revenues obtained by strict cutoff, enhanced and mild cutoff selection methods (from top to bottom, normalized by the revenues of the random audit scheme), relative to geographical updating, r = 0. 5, a = 0. 5. The vertical dashed segments depict one standard deviation above and below the mean of revenues for the strict cutoff policy

The solid lines are relative to the gains of strict cutoff, enhanced and mild cutoff scoring, normalized by the revenues of the random scheme. For instance, with minimal tax morale (κ low  = 0), the strict cutoff and enhanced auditing produce revenues that exceed those of the random scheme by over 500 % and about 350 %, respectively, as can be seen in the left part of the picture. When tax morale is low, it is clear that audits based on the strict cutoff rule are more lucrative on average than any other scheme. Notice that this finding points out that the standard model, where only random auditing is performed and tax morale is identically null, is unstable in that the TA would have a strong incentive to change its auditing method. There is substantial variability in the outcomes, as displayed by the dashed lines showing one s.d. of revenues away from the mean: even though the average value is significantly higher under strict cutoff, there is a moderate and definitely non-null probability that, say, enhanced auditing will raise more fines in a single period (i.e., in a random batch of qN audits).

In the framework depicted in the figure, despite the effort made by the TA to “maximize” the revenues, the enhanced scheme is not the most profitable. This is due to the high level of noise present in the stochastic objective, to the limited resources allocated for the task (the search stops after 30 evaluations, see Footnote 2) and to specific features of the society.

The high average revenues generated by the strict cutoff rule derive from a somewhat extreme combination of effects: due to positive r, more wealthy taxpayers are likely to have lower risk aversion and are more inclined to evasion; in the cases where their tax morale and matchable income happen to be low, such agents simply conceal virtually all their income and declare d j  ≈ 0. The strict cutoff rule often samples repeatedly wealthy total evaders and, therefore, revenues are boosted. However, this is possible only when agents geographically update the probability of being audited and “do not realize” that they will probed much more frequently if they declare low incomes. For increasing levels of κ low the difference in performance among the scoring rules markedly decreases, thus showing that revenues from tax enforcement are relatively insensitive to the audit policy in societies with high tax morale (besides obviously being much lower).

Table 2 shows a summary of our results when agents use geographical updating of the probabilities. The data in the table show how the relative revenues (with respect to random auditing) of the three considered policies vary as r, the tax morale, and the distribution of matchable income change in the society. We denote by LTM (HTM) low (high) tax morale societies in which κ low  = 0. 025 (κ low  = 0. 225).

Table 2 Tax agency revenues from fines, when taxpayers geographically update p jt . Values are normalized by the revenues of the random audit scheme

The best performing audit policy (highlighted with  ∗ ) is quite often the (rather brutal) strict cutoff selecting system. This always holds when tax morale is low, regardless of the distribution of the matchable income and for any value of r. The relative efficacy of the strict cutoff policy decreases moving from a U-shaped (a = 0. 5) to a bell-shaped distribution (a = 2). This result is due to the reduction, as a moves from 0.5 to 2, of the number of “extreme” taxpayers who have, at the same time, low risk aversion, low individual tax morale κ j and small matchable income. Hence, the strict cutoff rule increasingly audits real poors as opposed to aggressively non-compliant taxpayers.

The inspection of the columns labelled HTM, relative to high levels of tax morale, reveals that the enhanced policy obtains better results in the case of bell-shaped distribution of matchable income. The intuition here is straightforward: high tax morale and few individual with extremely low β requires a more nuanced audit policy, which does not concentrate itself on the smallest declaration but takes a more sophisticated approach in which both \(d_{j}I_{j}\) and \(\beta _{j}I_{j}\) have a role in the scoring function. This is confirmed by the values of \(p_{1},p_{2}\) and p 3 that determine the enhanced policy: for increasing a, p 2 and p 3 have the tendency to increase and to cluster around zero, respectively, and this translates into a larger weight given to the β I term as compared to the dI one.Footnote 4

Table 3 presents the simulation results when agents temporally update their subjective assessment of the auditing probabilities. Recall that in this case, taxpayers focus exclusively on their own audits’ history, averaging over time the relative frequency of the inspections they have undergone. On the one hand, this disregards the information on the unconditional intensity of control q, which could be estimated by sampling information on the neighbors; on the other hand, however, any given taxpayer is likely to be able to estimate with much greater accuracy the probability of being audited conditional on his behaviour, which is ultimately what she should care about.

Table 3 Tax agency revenues from fines, when taxpayers temporally update p jt . Values are normalized by the revenues of the random audit scheme

The insights that can be obtained by Table 3 remarkably differ from the ones descending from Table 2. We stress that this is only due to the different method in updating the subjective audit probabilities. Overall, as several values are smaller than 1, there are numerous cases in which random auditing over-performs at least one competing scoring scheme. When r = 0 and a = 2, surprisingly, random auditing appears to be the best policy. Generally, the difference in performance across policies is reduced under temporal updating and there is no clearly dominant strategy: strict cutoff is the best option in three cases (only), whereas enhanced scoring achieves the best results in two cases (for positive r and bell-shaped matchable income). Mild cutoff scoring secures the highest revenues in five cases, mostly related to societies with high tax morale. Evidently, as temporal updating allows taxpayer to partially anticipate the decisions of the TA, a good deal of randomness in the choice of the taxpayers to audit has favorable payoffs. Mild cutoff, indeed, can be interpreted as a random scheme endowed with some bias that increases audit probabilities for low declarations; indeed, as noticed before, this audit policy keeps sampling a variety of agents and does not get trapped in repeated audits of the same agents.

A careful scrutiny of the rich set of data available at the micro level shows an interesting dynamics going on between evasion-prone agents and the TA. A taxpayer subject to audit at time t will, under temporal updating, revise upward his belief about the probability p j, t + 1 of experiencing an audit in the following period. Ceteris paribus, this will increase the amount d j, t + 1 disclosed to the TA at t + 1 and reduce the chance of an audit. As a consequence, p j, t + 2 would decrease on average, pushing the agents to conceal more income and – consequently – increase their likelihood of being audited at t + 2 and so on. Such a hide and seek game is only possible when temporal updating is used by taxpayers and, in this setup, one of the best options for the TA would be, game theory docet, a good deal of randomization with an eye to low levels of dI, which is exactly what may be achieved by an enforcement policy guided by the mild cutoff scoring.

4 Conclusions

The agent-based model we have studied in this paper extends the standard model of tax evasion, allowing for heterogeneous taxpayers, consideration of matchable income, the adoption of several alternatives to the common random audit schemes, and two plausible ways to assess and update the subjective probability of being audited. Some (but not all) versions of the previous features were studied in previous works, but our model provides a comprehensive picture of the complex interactions occurring between the TA and the taxpayers. Such a detailed representation is precluded to most analytically solvable models, that have a more limited scope and must be based on simplifying (and often heroic) assumptions.

As a first general remark, the model shows that effective audit policies are dependent on the context. We confirm that the tax morale is an important factor in explaining tax evasion. Indeed, high values of tax morale, nearly always correspond to lower revenues for the TA (due to lower evasion rate). At the same time, the way agents perceive and update the probability of audit is also extremely relevant: if taxpayers take compliance decisions mainly based on audits experienced by others, there is scope for the TA to adopt the lucrative strict cutoff rule that simply audits those who declare the least. On the contrary, a seemingly minor modification in the way probabilities are perceived and updated, that is taking into account the history of each taxpayer, results in a different outcome where most enforcing policies are somewhat similar and mild cutoff, a biased version of random auditing, appears to be the best option to maximize revenues. Realistically, taxpayers will use a mix of the two stylized updating methods we considered but, even in this case, the TA should modify its actions depending on judgements or guesses about both the way agents behave and their tax morale. The temporal updating method is definitely more precise than the geographical one in determining the likelihood of being audited – when this is done not randomly – and, as a matter of fact, when agents become more acute, the TA needs to use more subtle selection methods. As people use geographical updating (and react too little to their own experience), the strict cutoff outperforms almost always the other methods. An exception is the case of beta(2,2) and high levels of tax morale, possibly because agents are willing to declare more (they are more moral) and there are less people with plenty of non-matchable income. The TA needs to be more sophisticated: metaphorically, when you go fishing in a sea with both big and small fish, a wide-mesh net can be used; but if you go to the fishing pond, where fish are not as big and maybe swim deep, then more ingenious means are needed.

Other structural features of the society, such as the distribution of the matchable income or the correlations among individual parameters, have relevant effects. A U-shaped configuration, where many agents have plenty of material opportunity for evasion, should be tackled by targeting low declarations (still keeping robust doses of randomness in the choice). In contrast, bell-shaped distribution of matchable income requires different audit policies and more nuanced approaches, giving more weight to the matchable component and considering the level of tax morale. There is even one case (see the bottom right panel of Table 3) in which the best policy is simply random auditing.

According to the simulation, the efforts exerted by the TA to develop enhanced auditing rule are, with the minimal information set at disposal here, of relative effectiveness in a dynamic setup. A bit paradoxically, audit schemes are working well if they are “deceptive”: when taxpayers update their personal belief in a geographical way, implicitly assuming that the whole population is audited at random and each individual is equally likely to be picked, then the TA should optimally proceed with targeted audits, precisely because agents do not realize why they are chosen. Conversely, temporal update implies that taxpayers acknowledge that each individual is audited in way that reflects her individual features. In this situation, the introduction of some randomness on the part of the TA would shake the certainty of those who did not expected an audit and increase on average the efficacy of enforcement.