Abstract
In most countries, alcohol consumption distributions have been shown to possess universal features. Their unimodal right-skewed shape is usually modeled in terms of the Lognormal distribution, which is easy to fit, test, and modify. However, empirical distributions often deviate considerably from the Lognormal model, and both Gamma and Weibull distributions appear to better describe the survey data. In this paper we explain the appearance of these distributions by means of classical methods of kinetic theory of multi-agent systems. The microscopic variation of alcohol consumption of agents around a universal social accepted value of consumption, is built up introducing as main criterion for consumption a suitable value function in the spirit of the prospect theory of Kahneman and Twersky. The mathematical properties of the value function then determine the unique macroscopic equilibrium which results to be a generalized Gamma distribution. The modeling of the microscopic kinetic interaction allows to clarify the meaning of the various parameters characterizing the generalized Gamma equilibrium.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Alcohol consumption is a component cause for a number of diseases, which makes studies on it very relevant from both medical and psychological point of views. Information about this social phenomenon requires to know at best its statistical distribution, to analyze the effect of the resulting alcohol consumption distribution on the estimation of the alcohol Population-Attributable Fractions, and to characterize the chosen alcohol consumption distribution by exploring if there is a global relationship within the distributions [43, 59].
Despite the increasing number of papers devoted to a precise fitting of the statistical distribution of the phenomenon of alcohol consumption and to the social motivations about it, the precise connections leading from the human behavior of alcohol consumers to the observed statistical distributions have been left largely unexplored, and a satisfactory mathematical modeling is still missing.
We started to care of this problem after the reading of two interesting research papers recently published into the journal Population Health Metrics [43, 59], whose conclusions were subsequently considered as a starting point for further statistical studies (cf. [57] and the references therein). In these papers the analysis of the fitting of real data about alcohol consumption in a huge number of countries, led the authors to the conclusion that, among various probability distributions often used in this context, Gamma and Weibull distributions appeared to furnish a better fitting with respect to the Lognormal distribution, usually used in this context, starting from the work of Ledermann [44], who proposed it as a reasonable model for the consumption problem.
Concerning the mathematical modeling of alcohol consumption by resorting to the methods of statistical physics, the results of the analysis presented in [43, 59] introduce into the problem of characterizing its distribution a number of challenging questions. Given a system of agents characterized in time by their level of alcohol consumption, and given a certain observed statistical distribution of their consumption, can we describe at the level of the single agents the stylized facts which lead at a macroscopic level to this distribution? And moreover, can we characterize at the level of single agents the differences which produce a variety of steady probability distributions for the same problem?
As far as the Lognormal distribution is concerned, the kinetic modeling of phenomena concerned with interacting multi-agent systems and leading to this distribution has been recently proposed in [39], starting from the results of [38].
Among probability distributions on \(\mathbb {R}_+\), Lognormal distribution has a leading role [1, 26]. The list of phenomena which fit Lognormal distribution in natural sciences is quite long, as documented the exhaustive review paper by Limpert et al. [48]. Moreover, the success of Lognormal distribution in the study of random variations that occur in the data from these phenomena that show more or less skewed probability distributions, is favored by its features, which render it easy to fit, test and modify.
In [39], it was pointed out that a relevant number of phenomena involving measurable quantities of a population and fitting Lognormal distribution comes from social sciences and economics, areas where it can be reasonably assumed that the appearance of this distribution is a consequence of a certain human behavior.
Among others, a good fitting has been observed while looking at the distribution of body weight [18], at women’s age at first marriage [58], at drivers behavior [40], or, from the economic world, when looking at consumption in a western society [5], at the size of cities [6], and at call-center service times [17].
These phenomena were modelled in [39] by resorting to the methodology of classical kinetic theory, and the Lognormal distribution (the macroscopic equilibrium of the multi-agent system) appears as a direct consequence of the details of the microscopic interactions of agents. This idea is fully coherent with the classical kinetic theory of rarefied gases, where the formation of a Maxwellian equilibrium in the spatially uniform Boltzmann equation is closely related to the conservation laws of the elastic microscopic binary collisions between molecules [19, 37].
The analysis of [38, 39] suggests a way to modify the microscopic interaction of the variation of the alcohol consumption of agents, that leads to the class of distributions fitted in [43, 59]. As in [38, 39], the elementary single change will be expressed in terms of a suitable value function, that meets the same properties as the one leading to the Lognormal distribution, built up in the spirit of the prospect theory of Kahneman and Twersky [41, 42]. As usual in the kinetic setting [30], the microscopic interaction generates a linear kinetic equation for the density of agents, that will be subsequently studied in the asymptotic regime of grazing interactions [66]. The grazing regime describes the situation in which a single interaction produces only a very small change of the consumption variable. In this regime, the variation of density of the alcohol consumption of the agent’s system is driven by a partial differential equation of Fokker–Planck type [30].
The precise mathematical model is the following. Let us denote by \(g = g(w,t)\) the density of agents which have a (positive) level of consumption equal to w at time \(t\ge 0\). Then, the density g is the solution of a linear Fokker–Planck equation with variable coefficients of diffusion and drift, given by
In (1.1) \(\bar{w}_L\) represents the critical (maximal) level of individual alcohol consumption in the chosen time unity, while \(1-\delta \), with \(0\le \delta \le 1\) is linked to the level of psychophysical addiction induced by alcohol . Moreover, \(\lambda \) and \(\mu \) are positive constants closely related to the randomness and intensity of the phenomenon under study, satisfying further bounds imposed by the physical conditions of the microscopic interaction. In particular, if \(\gamma = \mu /\lambda \), \( \gamma >\delta \), and the equilibrium density of the Fokker–Planck equation (1.1) is given by the (non normalized) generalized Gamma density [47, 62]
Moreover, for any given initial distribution \(g_0(w)\) of agents, convergence to equilibrium can be shown to hold exponentially fast in time with explicit rate [55]. Note that the steady distribution (1.2) includes as particular cases the Gamma distribution, and in this case \(\delta =1\), as well as the Weibull distribution, and in this other case \(\gamma = \delta (1+\delta )\). Moreover, a (non normalized) Lognormal distribution
is included into (1.2) and it is obtained from (1.2) by taking the limit \(\delta \rightarrow 0\). Hence, one can conjecture that the best fitting of the alcohol consumption distribution is achieved by suitably choosing the three parameters characterizing the generalized Gamma distribution, namely \(\gamma , \delta \) and \(\bar{w}_L\).
It is remarkable that the fitting analysis of [43, 59], was limited to the choice of particular cases of the generalized Gamma distribution (1.2), obtained by fixing precise values of the parameter \(\delta \). Hence, in [43, 59] the fitted distributions are characterized by the presence of only two parameters.
In the kinetic description leading to the steady state (1.2), one of these parameters is given by \(\bar{w}_L\), representing the critical (maximal) level of individual alcohol consumption. Following Ledermann [44], this parameter can be fixed a priori, thus reducing the two-parameter distributions considered in [43, 59] to one-parameter distributions. This generates a new way of fitting that follows along the lines indicated by Ledermann. In the case of the Lognormal distribution the description of the variation of the Lognormal distribution with respect to the unique unknown parameter is given by the so-called Ledermann curve [44], which describes the percentage of consumers drinking above a fixed level. Analogous results can be derived in the other cases.
The mathematical methodology of this paper takes essential advantage from previous attempts to model social and economic phenomena in a multi-agent system. Starting from economics, where the genesis of the formation of Pareto curves in wealth distribution of western countries was studied in various aspects [20,21,22, 25, 27, 35, 60], this field of research moved to social sciences, where the investigation of opinion formation played a leading rule [10,11,12,13,14,15,16, 23, 28, 31,32,33,34, 63, 64].
Recently, some of these investigations introduced into the microscopic description of the interactions specific behavioral aspects of agents. To our knowledge, this idea has been first proposed by Pareschi and Maldarella [51]. There the authors, to investigate the price formation of a good in a multi-agent market, introduced a kinetic model for a multi-agent system consisting of two different trader populations, playing different rules of trading, and possibly changing their point of view. The kinetic description was inspired by the microscopic Lux–Marchesi model [49, 50] (cf. also [45, 46]). In [51] the trading rules of agents were assumed to depend on the opinion of traders through a kinetic model of opinion formation recently introduced in [64]. Also, psychological and behavioral components of the agents, like the way they interact with each other and perceive risks, were taken into account. This has been done by resorting, in agreement with the prospect theory by Kahneman and Twersky [41, 42], to interactions containing a suitable value function.
The analysis of [51] enlightened in a kinetic description the role of human behaviors [7,8,9], pioneered in [68]. These aspects have been recently considered in [38], where the introduction of a suitable value function justified at a microscopic level the mechanism of formation of the service time distribution in a call center.
The leading idea in [38] is based on a general principle which can be easily verified in a number of social activities of agents in which one identifies the possibility of a certain addiction.
The paper is organized as follows. In Sect. 2, we shall introduce a linear kinetic model for a multi-agent system, in which agents are characterized in terms of their level of alcohol consumption that can be measured in terms of a fixed unit of measure, and subject to microscopic interactions which describe the microscopic rate of change of the value of their consumption, according to the previous general principle. The relevant mechanism of the microscopic interaction is based on a suitable value function, in the spirit of the analysis of Kahneman and Twersky [41, 42], which reproduces at best the human behavior in this situation.
In Sect. 3 we will show that in a suitable asymptotic procedure (hereafter called grazing limit) the solution to the kinetic model tends towards the solution of the Fokker-Planck type equation (1.1). Similar asymptotic analysis was performed in [24, 29] for a kinetic model for the distribution of wealth in a simple market economy subject to microscopic binary trades in presence of risk, showing formation of steady states with Pareto tails, in [65] on kinetic equations for price formation, and in [64] in the context of opinion formation in presence of self-thinking. A general view about this asymptotic passage from kinetic equations based on general interactions towards Fokker–Planck type equations can be found in [30]. Other relationships of this asymptotic procedure with the classical problem of the grazing collision limit of the Boltzmann equation in kinetic theory of rarefied gases have been recently enlightened in [37].
Once the Fokker–Planck equation (1.1) has been derived, in Sect. 4 we will investigate at a numerical level the relationships between the solutions of the linear kinetic model of Boltzmann type and its grazing asymptotics in the form of the Fokker–Planck equation, at different values of the vanishing \(\epsilon \)-parameter leading to the asymptotic itself. The numerical computations will confirm the perfect agreement between the stationary solutions of the two models, even for values of the \(\epsilon \)-parameter of the order \(10^{-1}\). This justifies, at least numerically, the role of the chosen value functions in the asymptotic description of the alcohol consumption phenomenon.
2 A linear Boltzmann Equation
2.1 Modeling Assumptions
Following the well-consolidated approach furnished by the kinetic theory [30, 54, 56], the statistical description of alcohol consumption in a society will be described by resorting to a linear Boltzmann-type equation in which the unknown is the probability distribution of alcohol consumption of the agent system. Since we want to fit the results obtained by this approach with the various dataset coming from real situations, the kinetic model is built up to respect some basic hypothesis we enumerate below.
First of all, given a population of agents, the population needs to be considered homogeneous with respect to alcohol consumption. In this case, to have a homogenous population, it is essential to restrict it with respect to some characteristics, like age, sex and social class [44, 61]. Once the homogeneity assumption is satisfied, agents in the system are considered indistinguishable [56]. This means that an agent’s state at any instant of time \(t\ge 0\) is completely characterized by the level \(w \ge 0\) of its alcohol consumption. This level can be easily and uniformly measured with respect to some unit. For example, w indicates the mean value of the daily consumption of alcohol in grams of agents. The unknown is the density (or distribution function) \(f = f(w, t)\), where \(w\in \mathbb {R}_+\) and the time \(t\ge 0\), and the target is to study its time evolution towards a certain equilibrium.
The precise meaning of the density f is the following. Given the system of agents to study, and given an interval or a more complex sub-domain \(D \subseteq \mathbb {R}_+\), the integral
represents the number of individuals with are characterized by a level \(w \in D\) of consumption at time \(t > 0\). It is assumed that the density function is normalized to one, that is
The change in time of the density is due to the fact that agents continuously upgrade the level w of their alcohol consumption in time by some action. To maintain the connection with classical kinetic theory of rarefied gases, we will always refer to a single upgrade of the level measure as a microscopic interaction.
2.2 The Value Function in Alcohol Consumption
Similarly to the problems treated in [38, 39], alcohol consumption deeply depends on both addiction and social attitudes, that can be easily translated by saying that, at least over a certain level, agents likely tend to increase the value w by interactions, while manifest a certain resistance to decrease it.
For the situation under study we can identify (and fix) for the system of agents two universal values. The first value, denoted by \(\bar{w}\), is the social barrier, and identifies the mean amount of alcohol consumption relative to the homogenous class, which is normally accepted by the social rules of the underlying society. The second value, denoted by \(\bar{w}_L\), with \(\bar{w}_L > \bar{w}\), is the threshold value that it would be better not to exceed to enter into addiction. Consequently, the human tendency to increase the value w by interactions has to be coupled with the existence of this limit value \(\bar{w}_L\) which serious health reasons suggest not to cross. Similar behaviors have been recently modeled in [38, 39] by resorting to the prospect theory introduced by Kahneman and Twersky in their pioneering paper [41], devoted to describe decision under risk.
Following [38, 39], we will describe the microscopic variation of alcohol consumption of agents in the form
According to (2.1), in a single interaction the value w of alcohol consumption can vary by two different motivations. The first one determines a variation of a quantity proportional to w in which the coefficient \(\varPhi (\cdot )\), which can assume assume both positive and negative values, is a function characterizing the predictable behavior of agents. The second one, still proportional to w, takes into account a certain amount of human unpredictability. The random variable \(\eta \) describes the unpredictable variations, which in the mean are assumed negligible, and in any case are not so significant to produce a sensible change of the value w. We will assume \(\eta \) of mean value zero and variance \(\lambda \). Hence, the consumption of alcohol can be both increasing and decreasing, and the mean intensity of this variation is fully determined by the function \(\varPhi \).
The function \(\varPhi \) plays the role of the value function in the prospect theory of Kahneman and Twersky [41], and contains the mathematical translation of the expected human behavior in the phenomenon under consideration.
In their pioneering theory [41, 42], Kahneman and Twersky, by resorting to various situation concerned with decision theory, identified the main properties characterizing a value function. This function has to be positive and concave above the reference value 1 (\(w > \bar{w}_L\)), while negative and convex below (\(w < \bar{w}_L\)). Hence, in terms of the variable \(s = w/\bar{w}_L \), provided that the value function is enough regular in s
Thus, as far as financial transactions are considered, the value function is generally concave for gains and commonly convex for losses. Also, by noticing that the aversiveness of symmetric fair bets generally increases with the size of the stack, they assumed that the value function satisfies the further properties
and
Hence, properties of value functions are well defined for deviations from the reference point (\(s=1\) in our case), and the value function for losses is steeper than the value function for gains.
In [38] we adapted the shape of the value function given in [41] to the problem of the characterization of the distribution of the length of a service time in a call center. A careful reading of [38] allows to recognize that the properties of the value function do not have a universal validity, since its precise form depends on the main aspects of the human behavior relative to the phenomenon under study. At difference with the choice of Kahneman and Twersky, in [38] a suitable value function was identified in the increasing concave function
In (2.5) \(0<\mu < 1\) and \(0< \epsilon < 1\) are suitable constants characterizing the agents behavior. In particular, the value \(\mu \) denotes the maximal amount of variation of service time that agents will be able to obtain in a single interaction. Note indeed that the value function \(\varPhi _0(s)\) is such that
The function in (2.5) satisfies properties (2.3) and (2.4), but in consequence of its uniform concavity does not satisfy assumption (2.2). Note that in the service time problem studied in [38], property (2.3) characterizes in a proper way the behavior of agents which have to do a service by respecting a certain time. Indeed, given two agents starting at the same distance from the limit value \(\bar{w}_L\) from below and above, it will be easier from the agent starting below to move closer to the optimal time, than for the agent starting above.
Going back to the form of the microscopic interaction (2.1), the minus sign in front of the value function \(\varPhi \) reflects the tendency of agents to increase their value w of alcohol consumption when \(w < \bar{w}_L\), while they would like to decrease consumption if \(w >\bar{w}_L\).
As shown in [38, 39], the choice of the value function (2.5) leads, as explained in details in the forthcoming Sect. 3, to a stationary distribution in the form of a Lognormal distribution. Hence, this choice would be in full agreement with the data fitting of alcohol consumption proposed by Ledermann in 1956 [44], choice which is still used in present times (cf. the recent paper [53] and the references therein).
On the other hand, as briefly discussed in the introduction, the recent fitting analysis of alcohol consumption in [43, 59] seems to corroborate the hypothesis that, with respect to the fitting provided through a Lognormal distribution, a better fitting can be obtained by resorting to Gamma and Weibull distributions. These results pose a question about the correctness of the shape of the value function (2.5), which could fail to represent at best the human behavior relative to alcohol consumption.
One of the points in which the shape of the value function \(\varPhi _0^\epsilon (s)\), defined in (2.5), could be not so well-adapted to the description of alcohol consumers, is its behavior for small values. Indeed, the first derivative of \(\varPhi _0(s)\) in \(s=0\) is blowing up. Hence, agents with a very low level of alcohol consumption are pushed very easily towards a higher consumption, action that does not agree with the common behavior of low drinkers. Indeed, agents which are characterized by a level \(w \ll w_L\), usually change their alcohol habits very slowly. Further, the lower bound in (2.6) does not depend on the value of the critical consumption value \(\bar{w}_L\). Hence, in terms of \(\varPhi _0(s)\) the maximal allowed increase in alcohol consumption in a single interaction does not depend on the distance of the point s from the reference point, which does not appear to be supported from previous studies [61].
For these reasons, for the description of alcohol consumption, the value function
seems more adapted. Similarly to (2.5), in (2.7) \(0<\mu < 1\) and \(0< \epsilon < 1\) are suitable constants characterizing the intensity of interactions of agents. At difference with the value function (2.5), the function (2.7) satisfies all properties originally proposed by Kahneman and Twersky in [41], including the property to be positive and concave above the reference value (\(s >1 \)), while negative and convex below (\(s <1\)). Moreover, at difference with (2.5), while the upper bound does not depend on \(\epsilon \), the lower bound depends on it, and
This property implies that, for small values of the parameter \(\epsilon \), the maximal amount of increase of the level of alcohol consumption is of the order of \(\epsilon \).
Together with (2.7) and (2.5) we consider the class of value functions
characterized by the presence of a parameter \(0< \delta <1\), that interpolate between \(\varPhi _0^\epsilon (s)\) and \(\varPhi _1^\epsilon (s)\). The meaning of the \(\delta \) parameter can be obtained by noticing that, for fixed values of s and \(\epsilon \), the value function (2.9), considered as function of \(\delta \), is increasing. Hence, for values of \(s<1\), which are such that \( \varPhi _\delta ^\epsilon (s) <0\), the function \(|\varPhi _\delta ^\epsilon (s)|\) decreases with \(\delta \). For example, at the point \(s=0\), given a fixed value \(\bar{\epsilon }\) of \(\epsilon \), we obtains
where the function \(D(\delta )\) is decreasing from the value \(\mu \), assumed for \(\delta \rightarrow 0\). Looking at the action of the value function in the elementary interaction (2.1), it appears evident that the interval in which the value function plays a major role is the interval \(0\le s \le 1\), where we are taking into account agents that are increasing their consume of alcohol, starting from \(w=0\). Hence, the value of the parameter \(\delta \) is essentially linked to the greater or lesser degree of dependence exerted by the consumption of alcohol relative to agents characterized by a medium-low consumption. The previous discussion clarifies that small values of the parameter \(\delta \) correspond to high values of increasing consumption and viceversa. In other words, values of \(\delta \) close to zero represent the situation in which it is easier to increase the value of alcohol consumption starting from a low level.
In Fig. 1 we report the shapes of \( \varPhi _\delta ^\epsilon (s) \) for different choices of the parameter \(\delta \). As proven in the appendix, these functions satisfy properties (2.3) and (2.4), which render them suitable as value functions.
2.3 The Kinetic Model
Given the elementary interaction (2.1), and fixed a value function \(\varPhi \), the study of the time-evolution of the distribution of the level of alcohol consumption follows by resorting to kinetic collision-like models [19, 56]. The variation of the density f(w, t) obeys to a linear Boltzmann-like equation, fruitfully written in weak form. The weak form corresponds to say that the solution f(w, t) satisfies, for all smooth functions \(\varphi (w)\) (the observable quantities) the integro-differential equation
In (2.11) the expectation \(\langle \cdot \rangle \) takes into account the presence of the random parameter \(\eta \) in (2.1). The positive constant \(\tau \) measures the interaction frequency.
The right-hand side of equation (2.11) represents the variation in density between agents that modify their value from w to \(w_* \) (loss term with negative sign) and agents that change their value from \(w_*\) to w (gain term with positive sign).
In reason of the nonlinearity (in the variable w) of the interaction (2.1), one easily verifies that equation (2.11) possesses only one conserved quantity, which is obtained by setting \(\varphi = 1\). This conservation law implies that the solution to (2.11) departing from a probability density function remains a probability density for all subsequent times \(t >0\). Note that, provided the initial density has a certain number of moments bounded, their evolution is difficult to follow analytically. However, as shown in [38], one can obtain time-dependent upper bounds for the mean value, and for higher moments. These bounds depend of the analogous bounds for the value function, as given by (2.6). These bounds do not change by changing the value of the parameter \(\delta \).
3 Grazing Limit and the Fokker-Planck Equation
3.1 The Grazing Limit
The linear kinetic equation (2.11) describes the evolution of the density consequent to interactions of type (2.1), and it is valid for any choice of the parameters \(\delta , \mu \) and \(\lambda \). In real situations, however, it happens that a single interaction determines only an extremely small change of the value w. This situation is well-known in kinetic theory of rarefied gases, where interactions of this type are called grazing collisions [56, 66]. In the value functions (2.9) the smallness assumption requires to fix \(\epsilon \ll 1\). At the same time [30], the balance of this smallness with the random part is achieved by setting
In this way the scaling assumptions allow to retain the effect of all parameters in (2.1) in the limit procedure. An exhaustive discussion on these scaling assumptions can be found in [30] (cf. also [39] for analogous computations in the case of the Lognormal distribution). For these reasons, we address the interested reader to these review papers for details. In what follows we only outline the differences in the asymptotic procedure induced by the new value functions (2.9). Using (3.1), we write the evolution of the mean value
in the form
It follows from Lagrange theorem that, if \(s \ge 1\)
and, for \(s \le 1\)
These bounds are identical to the ones obtained in [38] for \(\varPhi _0^\epsilon \). Hence, provided that the second moment of the initial density is bounded, the scaling (3.1) is such that, for any given fixed time \(t >0\), the consequent variation of the mean value m(t) is small with \(\epsilon \) small. To observe an evolution of the average value independent of \(\epsilon \), it is enough to resort to a scaling of the frequency \(\tau \). If we set
and \(f_\epsilon (w, t) \) will denote the density corresponding to the scaled interaction and frequency, then the evolution of the average value for \(f_\epsilon (w, t)\) satisfies
namely a computable evolution law for the average value of f, which remains bounded even in the limit \(\epsilon \rightarrow 0\), since pointwise
As explained in [30], the motivation leading to the scaling of the frequency of interactions is clear. Since for \(\epsilon \ll 1\) the microscopic interactions produce a very small change of the value w, a finite variation of the mean density can be observed only if agents in the system undergo a huge number of interactions in a fixed period of time. By using the same scaling one can easily observe a well-defined evolution equation for the second moment of \(f_\epsilon (w,t)\), which will be well-defined also in the limit \(\epsilon \rightarrow 0\) (cf. the analysis in [30]).
The next step is to clarify why the linear kinetic model (2.11) modifies with \(\epsilon \) to produce in the limit \(\epsilon \rightarrow 0\) the Fokker–Planck type equation (1.1) [30]. Given a smooth function \(\varphi (w)\), and a collision of type (2.1) that produces a small variation of the difference \(w_*-w\), using (3.1) and (3.2) one obtains
Therefore, equating powers of \(\epsilon \), it holds
where the remainder term \(R_\epsilon (w)\), for a suitable \(0\le \theta \le 1\) is such that
as \(\epsilon \rightarrow 0\). Therefore, the time variation of the (smooth) observable quantity \(\varphi (w)\) satisfies
where \(\mathcal R_\epsilon (t)\) denotes the integral remainder term
Letting \(\epsilon \rightarrow 0\), shows that in consequence of the scalings (3.1) and (3.2) the weak form of the kinetic model (2.11) is well approximated by the weak form of a linear Fokker–Planck equation (with variable coefficients)
In (3.4) the density function g(w, t) coincides with the limit, as \(\epsilon \rightarrow 0\), of the density \(f_\epsilon (w,t)\) [30]. Provided the boundary terms produced by the integration by parts vanish, equation (3.4) coincides with the weak form of the Fokker–Planck equation
Equation (3.5) describes the evolution of the distribution density g(w, t) of the level of alcohol consumption \(w \in \mathbb {R}_+\), in the limit of the grazing interactions. As often happens with Fokker-Planck type equations, the steady state density can be explicitly evaluated, and it results to be a generalized Gamma density, with parameters linked to the details of the microscopic interaction (2.1).
3.2 Steady States are Generalized Gamma Densities
Let us set \(\gamma = \mu /\lambda \), and suppose that \(\gamma >\delta \). In this case the stationary distribution of the Fokker–Planck equation (3.5) is an integrable function which can be found explicitly by solving the first order differential equation
Indeed, solving (3.6) with respect to \(h(w)= w^2 g(w)\) by separation of variables allows to conclude that the unique solutions to (3.6) are the functions
If we fix the mass of the steady state (3.7) equal to one, the consequent probability density is a generalized Gamma. The generalized Gamma is characterized in terms of a shape \(\kappa >0\), a scale parameter \(\theta >0\), and an exponent \(\delta >0\), and reads [47, 62]
The shape and the scale parameter of the equilibrium state (3.7) are given by
The limit \(\delta \rightarrow 0\) in the Fokker–Planck equation (3.5) corresponds to the drift term induced by the value function (2.5). In this case, the equilibrium distribution (3.7) takes the form of a Lognormal density [38]
where
Note that for all values \(\delta >0\) the moments are expressed in terms of the parameters \(\bar{w}_L\), \( \delta \), \(\lambda \), and \(\mu \), denoting respectively the limit level of alcohol consumption, the variance \(\lambda \) of the random effects and the values \( \delta \) and \(\mu \) characterizing the value function \(\phi _\delta ^\epsilon \) defined in (2.9). The condition \(\gamma >\delta \), that guarantees the existence of a steady state density, is achieved for small values of \(\delta \), or for sufficiently large values of \(\gamma \). Since \(\gamma = \mu / \lambda \), the quotient between the maximal amount of level of change of alcohol consumption and the variance of the random change, the condition is satisfied when the random effects are very small.
It is interesting to remark that, in the case of the gamma distribution, the mean value of the equilibrium density (3.7) is always less than \(\bar{w}_L\). In this case in fact
Remark 1
The analysis of the possible variety of steady states densities generated by the value functions (2.9) allows to conclude that the best fitting of the level of alcohol consumption could be obtained by a generalized Gamma distribution. Indeed, the three cases treated in [43, 59] are included as particular cases of generalized Gamma distributions. The data in [43] enlighten the fact that, in the Weibull case, the characteristic exponent \(\delta \ge 1/2\). This helps to understand why the Lognormal distribution, which is obtained by pushing \(\delta \) to zero, probably fails to give a good fitting of the level of alcohol consumption. It is remarkable that in [57], by noticing that the generalized Gamma nests the Gamma, Weibull and Lognormal distributions as special cases, it has been proposed as useful test distribution for the problem under investigation, since, if all other distributions are rejected, the generalized Gamma results minimally restrictive and less susceptible to misspecification bias [52].
Remark 2
Once the shape of the alcohol consumption distribution has been characterized, the next step would be to find the parameters of the generalized Gamma that provide the best possible fit of real datasets. However, the parameter estimation of the generalized Gamma distribution is a very difficult problem. This topic was dealt with in many papers (cf. the results in [2, 36, 52, 67] and the list of references therein) but since almost all of the proposed methods are quite complex, this topic is still an open area, that needs to be better studied.
4 Numerical Experiments
The main interest related to the Fokker–Planck equation (3.5) is the analytical description of the equilibrium, that in the present case is given by the generalized Gamma distribution (3.7). Except in some simple cases [3, 4], kinetic models of Boltzmann type do not allow similar results. On the other hand, the kinetic equation (2.11) is the right-one to study, since it contains all the microscopic details that are apparently lost in the grazing limit procedure. For this reason, we report in this Section several numerical experiments that describe convergence to the steady state (identified by the generalized Gamma distribution in the limit \(\epsilon \rightarrow 0\) in the Boltzmann dynamics (2.1) for various values of the parameter \(\epsilon \). In details, our study proceeds as follows. We consider two different values for the two parameters \(\gamma \) and \(\bar{w}_L\) of the model. The first parameter fixes the ratio between the intensity of the value function \(\varPhi _\delta ^\epsilon \) and the variance of a random variation in the behavior of the agents. In the Boltzmann dynamics, we consider a uniform random variable with zero mean and variance 1 / 12. The second parameter \(\bar{w}_L\) measures the threshold value that it would be better not to exceed to enter into alcohol addiction. The values considered in the experiments are \([\gamma ,\bar{w}_L]=[1,6]\) and \([\gamma ,\bar{w}_L]=[2,3]\).
For these two set of parameters, we study the behavior of the microscopic dynamics detailed in (2.1) for different values of the parameter \(\delta \) in the value function. This parameter is of paramount importance, since it modifies the shape of the steady states in the Fokker–Planck description. Depending on its value, from the generalized Gamma distribution we obtain a Gamma density (\(\delta =1\)), a Weibull density (\(\delta (1+\delta )=\gamma \)) or a Lognormal distribution (\(\delta \rightarrow 0\)) as equilibrium states.
In order to show that the Fokker–Planck dynamics can be seen as a particular case of the Boltzmann one, we place ourselves in the same scaling in which the Fokker–Planck equation (3.5) has been analytically derived. For this reason, we consider in the value function (2.9) a decreasing sequence of values for the scaling factor \(\epsilon \) ranging from 0.9 to \(10^{-3}\). Correspondingly we scale the effects of the random part, according to (3.1), and the interaction frequency according to (3.2) to have an \(\epsilon \)-free evolution of the mean. In Fig. 2 we show the different equilibrium states obtained with the Boltzmann dynamics for the different values of the scaling factor \(\epsilon \). Figure 3 reports the same results of Fig. 2 with magnification around the origin to better appreciate the differences between the solutions. Figures 4 and 5 show the same dynamics where \(\gamma \) and \(\bar{w}_L\) have been changed to respectively 2 and 3, namely by diminishing the threshold and increasing the variability.
The curves have been obtained by using \(10^3\) agents and by averaging the steady state solution over \(10^5\) realizations. In each image it is also reported the steady state of the Fokker-Planck model (3.7) and the initial data, which are the same for both models. The initial value has been considered uniformly distributed in [0.5, 2.5] for all different situations analyzed. From top left to bottom right the shape factor \(\delta \) takes the values 0.1, 0.3, 0.5 and 0.8 which causes the equilibrium state to completely change its shape. For all tested situations an extremely good agreement between the Boltzmann and the Fokker–Planck models is obtained from approximately \(\epsilon =0.001\). However, very similar steady state profiles are found from \(\epsilon = 0.1\), which fully describes a kinetic evolution of Boltzmann type. This interesting behavior can be observed also from the results collected in the Tables 1 and 2, which report the expected value and the variance obtained with the kinetic Boltzmann model for the different values of the parameter \(\epsilon \) and the shape factor \(\delta \).
5 Conclusions
We introduced and discussed in the present paper kinetic models able to describe the statistical distribution of alcohol consumption in a multi-agent society. We succeeded to explain the emerging of a generalized Gamma distribution by resorting to the value function theory pioneered by Kahneman and Twersky [41, 42]. In all cases, this macroscopic behavior is a consequence of the choice made at the microscopic level, choice that takes into account the essential features of the human behavior related to the phenomenon of alcohol consumption. The kinetic modeling is similar to the one introduced in [38], subsequently generalized in [39], in which the human behavior was responsible of the formation of a macroscopic equilibrium in the form of a Lognormal distribution. Hence, the present results can be considered as an extension of the kinetic description of [39], which allows to classify, at a microscopic level, the main differences in the elementary interaction which produce a whole class of generalized Gamma distributions, ranging from the classical Gamma density to the Lognormal one. Well-known arguments of kinetic theory allow to model these phenomena by means of a Fokker–Planck equation with variable coefficients of diffusion and drift. Numerical experiments show the perfect agreement between the steady states of the kinetic model of Boltzmann type, and its continuous counterpart, represented by a Fokker–Planck type equation.
We hope that the present analysis about the formation of generalized Gamma distribution in alcohol consumption can be of help to previous studies, like the ones presented in [43, 57, 59], whose analysis was limited to Lognormal, Gamma and Weibull distributions. As remarked in [43], the accurate modeling of alcohol consumption as an upshifted distribution will provide public health decision-makers with accurate data to assess the impact of alcohol consumption within and across countries and will aid in determining public health priorities and where to allocate resources.
References
Aitchison, J., Brown, J.A.C.: The Log-Normal Distribution. Cambridge University Press, Cambridge (1957)
Basak, I., Balakrishnan, N.: Estimation for the three-parameter gamma distribution based on progressively censored data. Stat. Methodol. 9(3), 305–319 (2012)
Bassetti, F., Toscani, G.: Explicit equilibria in a kinetic model of gambling. Phys. Rev. E 81, 066115 (2010)
Bassetti, F., Toscani, G.: Explicit equilibria in bilinear kinetic models for socio-economic interactions. ESAIM: Proc. Surv. 47, 1–16 (2014)
Battistin, E., Blundell, R., Lewbel, A.: Why is consumption more log normal than income? Gibrat’s law revisited. J. Polit. Econ. 117(6), 1140–1154 (2009)
Bee, M., Riccaboni, M., Schiavo, S.: The size distribution of US cities: not Pareto, even in the tail. Econ. Lett. 120, 232–237 (2013)
Bellomo, N., Herrero, M.A., Tosin, A.: On the dynamics of social conflicts looking for the Black Swan. Kinet. Relat. Models 6, 459–479 (2013)
Bellomo, N., Knopoff, D., Soler, J.: On the difficult interplay between life, complexity, and mathematical sciences. Math. Models Methods Appl. Sci. 23, 1861–1913 (2013)
Bellomo, N., Colasuonno, F., Knopoff, D., Soler, J.: From a systems theory of sociology to modeling the onset and evolution of criminality. Netw. Heterog. Media 10, 421–441 (2015)
Ben-Naim, E., Krapivski, P.L., Redner, S.: Bifurcations and patterns in compromise processes. Physica D 183, 190–204 (2003)
Ben-Naim, E., Krapivski, P.L., Vazquez, R., Redner, S.: Unity and discord in opinion dynamics. Physica A 330, 99–106 (2003)
Ben-Naim, E.: Opinion dynamics: rise and fall of political parties. Europhys. Lett. 69, 671–677 (2005)
Bertotti, M.L., Delitala, M.: On a discrete generalized kinetic approach for modelling persuader’s influence in opinion formation processes. Math. Comp. Model. 48, 1107–1121 (2008)
Boudin, L., Salvarani, F.: The quasi-invariant limit for a kinetic model of sociological collective behavior. Kinet. Relat. Models 2, 433–449 (2009)
Boudin, L., Salvarani, F.: A kinetic approach to the study of opinion formation, ESAIM: Math. Model. Num. Anal. 43, 507–522 (2009)
Boudin, L., Mercier, A., Salvarani, F.: Conciliatory and contradictory dynamics in opinion formation. Physica A 391, 5672–5684 (2012)
Brown, L., Gans, N., Mandelbaum, A., Sakov, A., Shen, H., Zeltyn, S., Zhao, L.: Statistical analysis of a telephone call center: a queueing-science perspective. J. Am. Stat. Assoc. 100(469), 36–50 (2005)
Burmaster, D.E., Crouch, E.A.: Lognormal distributions for body weight as a function of age for males and females in the United States, 1976–1980. Risk Anal. 17(4), 499–505 (1997)
Cercignani, C.: The Boltzmann Equation and its Applications. Springer Series in Applied Mathematical Sciences, vol. 67. Springer-Verlag, New York (1988)
Chakraborti, A., Chakrabarti, B.K.: Statistical mechanics of money: effects of saving propensity. Eur. Phys. J. B 17, 167–170 (2000)
Chatterjee, A., Chakrabarti, B.K., Manna, S.S.: Pareto law in a kinetic model of market with random saving propensity. Physica A 335, 155–163 (2004)
Chatterjee, A., Chakrabarti, B.K., Stinchcombe, R.B.: Master equation for a kinetic model of trading market and its analytic solution. Phys. Rev. E 72, 026126 (2005)
Comincioli, V., Della Croce, L., Toscani, G.: A Boltzmann-like equation for choice formation. Kinet. Relat. Mod. 2, 135–149 (2009)
Cordier, S., Pareschi, L., Piatecki, C.: Mesoscopic modelling of financial markets. J. Stat. Phys. 134(1), 161–184 (2009)
Cordier, S., Pareschi, L., Toscani, G.: On a kinetic model for a simple market economy. J. Stat. Phys. 120, 253–277 (2005)
Crow, E.L., Shimizu, K. (eds.): Log-Normal Distributions: Theory and Application. Marcel Dekker, New York (1988)
Drǎgulescu, A., Yakovenko, V.M.: Statistical mechanics of money. Eur. Phys. J. B 17, 723–729 (2000)
Düring, B., Markowich, P.A., Pietschmann, J.-F., Wolfram, M.-T.: Boltzmann and Fokker-Planck equations modelling opinion formation in the presence of strong leaders. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 465, 3687–3708 (2009)
Düring, B., Matthes, D., Toscani, G.: Kinetic equations modelling wealth redistribution: a comparison of approaches. Phys. Rev. E 78, 056103 (2008)
Furioli, G., Pulvirenti, A., Terraneo, E., Toscani, G.: Fokker-Planck equations in the modelling of socio-economic phenomena. Math. Models Methods Appl. Sci. 27(1), 115–158 (2017)
Galam, S., Gefen, Y., Shapir, Y.: Sociophysics: a new approach of sociological collective behavior. I. Mean-behaviour description of a strike. J. Math. Sociol. 9, 1–13 (1982)
Galam, S., Moscovici, S.: Towards a theory of collective phenomena: consensus and attitude changes in groups. Euro. J. Soc. Psychol. 21, 49–74 (1991)
Galam, S.: Rational group decision making: a random field Ising model at \(T= 0\). Physica A 238, 66–80 (1997)
Galam, S., Zucker, J.D.: From individual choice to group decision-making. Physica A 287, 644–659 (2000)
Garibaldi, U., Scalas, E., Viarengo, P.: Statistical equilibrium in simple exchange games II. The redistribution game. Eur. Phys. Jour. B 60(2), 241–246 (2007)
Gomes, O., Combes, C., Dussauchoy, A.: Parameter estimation of the generalized Gamma distribution. Math. Comput. Simul. 79, 955–963 (2008)
Gualandi, S., Toscani, G.: Pareto tails in socio-economic phenomena: a kinetic description. Economics 12, 1–17 (2018)
Gualandi, S., Toscani, G.: Call center service times are lognormal. A Fokker–Planck description. Math. Models Methods Appl. Sci. 28(08), 1513–1527 (2018)
Gualandi, S., Toscani, G.: Human behavior and lognormal distribution. A kinetic description. Math. Models Methods Appl. Sci. 29(4), 717–753 (2019)
Jin, X., Zhang, Y., Wang, F., Li, L., Yao, D., Su, Y., Wei, Z.: Departure headways at signalized intersections: a log-normal distribution model approach. Transp. Res. Part C 17, 318–327 (2009)
Kahneman, D., Tversky, A.: Prospect theory: an analysis of decision under risk. Econometrica 47(2), 263–292 (1979)
Kahneman, D., Tversky, A.: Choices, Values, and Frames. Cambridge University Press, Cambridge (2000)
Kehoe, T., Gmel, G., Shield, K.D., Gmel, Gerhard, Rehm, J.: Determining the best population-level alcohol consumption model and its impact on estimates of alcohol-attributable harms. Popul. Health Metr. 10, 6 (2012)
Ledermann, S.: Alcool, Alcoolisme, Alcoolisation, vol. I. Presses Universitaires de France, Paris (1956)
Levy, M., Levy, H., Solomon, S.: A microscopic model of the stock market: cycles, booms and crashes. Econ. Lett. 45, 103–111 (1994)
Levy, M., Levy, H., Solomon, S.: Microscopic Simulation of Financial Markets: From Investor Behaviour to Market Phenomena. Academic Press, San Diego (2000)
Lienhard, J.H., Meyer, P.L.: A physical basis for the generalized gamma distribution. Q. Appl. Math. 25(3), 330–334 (1967)
Limpert, E., Stahel, W.A., Abbt, M.: Log-normal distributions across the sciences: keys and clues. BioScience 51(5), 341–352 (2001)
Lux, T., Marchesi, M.: Volatility clustering in financial markets: a microscopic simulation of interacting agents. Int. J. Theor. Appl. Financ. 3, 675–702 (2000)
Lux, T., Marchesi, M.: Scaling and criticality in a stocastich multi-agent model of a financial market. Nature 397(11), 498–500 (1999)
Maldarella, D., Pareschi, L.: Kinetic models for socio-economic dynamics of speculative markets. Physica A 391, 715–730 (2012)
Manning, W.G., Basu, A., Mullahy, J.: Generalized modeling approaches to risk adjustment of skewed outcomes data. J. Health Econ. 24, 465–488 (2005)
Mielecka-Kubien, Z.: On the estimation of the distribution of alcohol consumption. Math. Popul. Stud. 25(1), 1–19 (2018)
Naldi, G., Pareschi, L., Toscani, G. (eds.): Mathematical Modeling of Collective Behavior in Socio-economic and Life Sciences. Birkhauser, Boston (2010)
Otto, F., Villani, C.: Generalization of an inequality by Talagrand and links with the logarithmic Sobolev inequality. J. Funct. Anal. 173, 361–400 (2000)
Pareschi, L., Toscani, G.: Interacting Multiagent Systems: Kinetic Equations and Monte Carlo Methods. Oxford University Press, Oxford (2014)
Parish, W.J., Aldridge, A., Allaire, B., Ekwueme, D.U., Poehler, D., Guy Jr., G.P., Thomas, C.C., Trogdon, J.G.: A new methodological approach to adjust alcohol exposure distributions to improve the estimation of alcohol-attributable fractions. Addiction 112, 2053–2063 (2017)
Preston, F.W.: Pseudo-lognormal distributions. Ecology 62(2), 355–364 (1981)
Rehm, J., Kehoe, T., Gmel, Gerrit, Stinson, F., Grant, B., Gmel, Gerhard: Statistical modeling of volume of alcohol exposure for epidemiological studies of population health: the US example. Popul. Health Metr. 8, 3 (2010)
Scalas, E., Garibaldi, U., Donadio, S.: Statistical equilibrium in the simple exchange games I. Methods of solution and application to the Bennati–Dragulescu–Yakovenko (BDY) game. Eur. Phys. J. B 53, 267–272 (2006)
Skog, O.-J.: The collectivity of drinking cultures: a theory of the distribution of alcohol consumption. Br. J. Addict. 80, 83–89 (1985)
Stacy, E.W.: A generalization of the gamma distribution. Ann. Math. Stat. 33, 1187–1192 (1962)
Sznajd-Weron, K., Sznajd, J.: Opinion evolution in closed community. Int. J. Model Phys. C 11, 1157–1165 (2000)
Toscani, G.: Kinetic models of opinion formation. Commun. Math. Sci. 4, 481–496 (2006)
Toscani, G., Brugna, C., Demichelis, S.: Kinetic models for the trading of goods. J. Stat. Phys 151, 549–566 (2013)
Villani, C.: Contribution à l’étude mathématique des équations de Boltzmann et de Landau en théorie cinétique des gaz et des plasmas. PhD thesis, Univ. Paris-Dauphine (1998)
Yilmaz, H., Sazak, H.S.: Double-looped maximum likelihood estimation for the parameters of the generalized gamma distribution. Math. Comput. Simul. 98, 18–30 (2014)
Zipf, G.K.: Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology. Addison-Wesley, Reading (1949)
Acknowledgements
This work has been written within the activities of GNFM group of INdAM (National Institute of High Mathematics), and partially supported by MIUR project “Optimal mass transportation, geometrical and functional inequalities with applications”. The research was partially supported by the Italian Ministry of Education, University and Research (MIUR): Dipartimenti di Eccellenza Program (2018–2022) - Dept. of Mathematics “F. Casorati”, University of Pavia.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Eric A. Carlen.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
In this short appendix we verify that, for any given constant \(0< \delta <1\) and \(\epsilon >0\), the value function \(\varPhi _\delta ^\epsilon (s)\) satisfies, for any given s such that \(0< s <1\) properties (2.3) and (2.4). Indeed, substituting into (2.4) the value function (2.9) shows that property holds provided
Inequality (6.1) holds since, for any given \(0<s<1\), the function
is a convex function of \(\delta \) that is equal to zero at the boundaries of the interval \(0\le \delta \le 1\), and this clearly implies \(Z_s(\delta ) \le 0\) on this interval, independently of the value of s.
Property (2.3) holds since the value function \(\varPhi _\delta ^\epsilon (s)\) is concave with respect to \(s \ge 0\). Direct computations allow to conclude that the sign of the second derivative of the value function is non positive if
Inequality (6.2) holds considering that, for \(x \ge 0\), the function
is a convex function, positive in the interval \(0 < x \le (1-\delta )/\epsilon \).
Rights and permissions
About this article
Cite this article
Dimarco, G., Toscani, G. Kinetic Modeling of Alcohol Consumption. J Stat Phys 177, 1022–1042 (2019). https://doi.org/10.1007/s10955-019-02406-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10955-019-02406-0
Keywords
- Alcohol consumption
- Empirical distribution
- Generalized Gamma distribution
- Log-Normal distribution
- Weibull distribution
- Kinetic models
- Fokker–Planck equations