1 Introduction

Over the past decade, humanity has witnessed an increase in mortality and morbidity caused by environmental pollution. Air pollution adverdly affects the development of the natural world, threatening the safety of human life (Currie et al. 2008; Hansen & Selte 2000; Yazdi & Khanalizadeh 2017). The pressure of improving income per capita boosts economic activities, which intensifies the destruction of environmental quality (Hassan et al. 2019; Moutinho et al. 2017; Xu 2018). Environmental pollution and climate change trigger health problems, resulting in huge health costs. The impact of air pollution on individual health is obvious, but who pays health costs is an essential policy issue. In most emerging countries, the public health care service systems are overloaded, whereas government budgets spent on health insurance are not sufficient to meet the requirements of residents.

The nexus of economic growth, air pollution and health expenditure has been discussed in the United States and the United Kingdoms. in the early 1950s, when the American Institute of Public Health published a report of the impact of air pollution on health in 1957. Nowadays, a great number of scientists have found a positive impact of air pollution and economic growth on health spending. The pioneering work of Newhouse (1977) investigating the influence of economic growth on health care expenditure in 13 Organisation for Economic Co-operation and Development (OECD) countries revealed a positive effect of income on personal health budget. Consistent with this view, Murthy and Okunade (2016) examined the effects of the macroeconomic factors on American health spending from 1960 to 2012 by applying the ARDL approach. Real income, growth rate of the population over 65 years, and high medical technology are shown to be positively correlated to health spending. Analyzing a sample of 15 OECD countries for 1995–2011, Doğan et al. (2014) revealed that the largest and smallest effects on health care expenditure are exerted by public health expenses and labor force respectively, while the influence of income is positive. However, Hansen and King (1996) found no relationship between economic growth and health care expenses for 20 OECD countries. This conclusion was reconfirmed by McCoskey and Selden (1998). Via the fixed effect and random effect models on the data of 29 OECD countries for 1995–2014, Fernandez et al. (2019) also discovered that per capita income has a insignificantly positive effect on health spending. To explain the conflicting conclusions on the mentioned connection, Newhouse (1977), Blazquez-Fernandez et al. (2019), Liang and Tussing (2019) argued that the relationship depends on the “Norm” hypothesis, according to which physicians, medical facilities, and hospitals provide health care services by formal norms. That means health care services are offered more and better if people are involved in many types of health insurance, or participate at a high level of cost.

Like economic growth, CO\(_2\) emissions tend to positively affect health expenditure. Chaabouni et al. (2016) analyzing a sample of 51 countries for 1995–2013 using the GMM approach showed that there is bi-directional causality between CO\(_2\) emissions and economic growth and between health spending and economic growth for a global sample, but uni-directional causality running from CO\(_2\) emissions to health spending, except for low-income countries. Employing the Wavelet method for the U.S, Alola and Kirikkaleli (2018) indicated that \(CO_2\) emissions are related to healthcare. Okunade (2005) applying the ordinary least square (OLS) regression on a sample of 26 African countries found inequality and income per capita to raise health care expenditure. Mehrara et al. (2012) utilized the vector error correction model (VECM) to clarify the links between economic growth and health expenditure in 13 Middle East and North Africa (MENA) countries during 1995–2005. A bounds test confirms cointegration among the examined variables. Specifically, health care expenses have a positive impact on income growth, but with the rate of contribution declining. Using the PM10 emissions (PM10 is a particulate matter 10 \(\upmu \)m or less in diameter, which can be drawn deep into the lungs or blood) as a proxy for environmental pollution, Yazdi and Khanalizadeh (2017) reconfirmed the positive effect of air pollution and economic growth on health care expenditure in MENA countries. Likewise, using VECM for the Sub-Saharan African countries from 1990 to 2015, Zaidi and Saidi (2018) affirmed that economic growth positively affects health care expenses, while the effect of air pollution is negative. By using the autoregressive distributed lag (ARDL) approach to Turkey during 1975–2007, Yavuz et al. (2013) discovered that income has no effect on health care expenditure in the long-run. Furthermore, provincial output and environmental pollution have a positive impact on public health expenditure for China during 1997–2014 (Yu et al., 2016).

Recently, investigating 20 cities in China, Yang et al. (2019) found that PM2.5 emissions are related to mortality, morbidity and tend to cause economic loss. In analysis of the effects of macroeconomic factors on health care expenses in 18 Arab world countries during 1995–2015, Barkat et al. (2019) specified the same model for three groups: high-, upper-middle- and lower-middle-income countries. The empirical results from the pooled mean group (PMG) and the common correlated effects (CCE) suggest that income is not the only driver of health expenditure in the Arabian countries in the long-run. Similarly, Raeissi et al. (2018) reported that a 1% increase in carbon dioxide leads to an increase of 3.32% and 1.16% in public and private health expenditures in Iran. However, Zaidi and Saidi (2018) revealed that a 1% increase in CO\(_2\) and NO\(_2\) emissions leads to a 0.066% and 0.577% decrease in health care expenses in the Sub-Saharan African countries. Analyzing the Greece case for 2008–2015 through a probit 2SLS regression, Kyriopoulos et al. (2019) revealed that household health care expenses respond to permanent income changes more strongly than the ones arising from current income shocks.

Similar to CO\(_2\) emissions, urbanization could positively contribute to healthcare. People tend to move to cities because of many opportunities for them to find a job with a good income. Still, the environmental pollution situation in big cities is getting worse and worse, and life costs there are also much higher than in villages. Rapidly developing urban areas have intensified CO\(_2\) emissions (Hashmi et al. 2021). Although the relationship between air pollution, economic growth and health spending is not fully exhausted by our selective review, our paper affirms that conclusions drawn from the considered works rest mainly on data-driven frequentist inference as an obsolete estimator (Kalli & Griffin 2018; Kim 2002; Norets 2015). If the coefficients of independent variables are not yet significant, the suggestion of a conclusion or implication is difficult and even impossible (Hansen and King 1996, McCoskey and Selden 1998). The issue could be solved within the more balanced and more reliable Bayesian setting, but no research employed these methods to give substantial insights into the relationships between economic growth, air pollution, urbanization and private health care expenditure in the association of southeast Asian nations (ASEAN). This is the research gap which the current study wants to address. Hence, the contributions of the investigation are summarized as follows: first, to the best of our knowledge, ours is one of the first works accessing the impacts of air pollution, economic growth, and urbanization on private health care expenditure in ASEAN, and hence provides new empirical evidence; second, the majority of earlier studies on health care expenses utilized outdated frequence-based techniques, where coefficient parameters are fixed point estimates. More importantly, examining the links between all interested variables is impossible as non-significant coefficients are dropped out of analysis. We are the first to apply the Bayesian approach through the integrated Markov chain Monte-Carlo (MCMC) sampler to give probabilistic interpretations of model uncertainty and varying effects of air pollution, economic growth, and urbanization on private health care expenditure. With the accomplishments achieved, the research methodologically and empirically contributes to the health care expenditure field.

2 Methodology

In the context of an increasing crisis in standard frequentist statistics, due to the surprisingly rapid advancements of powerful computational tools, the Bayesian approach has been becoming a more and more commonly used methodology in behavioral and social sciences over the past 30 years (Lemoine 2019; Ngoc & Awan 2021). The research employs a Bayesian two-level mixed-effects method to capture the variability of initial health care expenditure across 10 ASEAN countries, due to which the precision of the estimates increases. Mixed-effects models or multilevel models are featured as incorporating both fixed effects and random effects. The former comparable to frequentist regression coefficients are estimated directly. On the contrary, the latter is summarized according to their estimated variances and covariances. Random effects may be either random intercepts or random coefficients, while the grouping structure of the data may consist of multiple levels of nested groups. We focus on random intercepts assuming that initial health care expenditure varies across the studied countries, while the effects of income, air pollution, and urbanization on private health care expenditure are the same.

We specify a random-effects model as follows:

$$\begin{aligned} lnH{E_{it}} = {\beta _0} + {\beta _1}.lnGD{P_{it}} + {\beta _2}.lnC{O_{2,it}} + {\beta _3}.U{B_{it}} + {u_i} + {\varepsilon _{it}} \end{aligned}$$

where HE is private health care expenditure per capita (unit: US dollar), GDP is income per capita (unit: US dollar in constant 2010 prices), CO\(_2\) is CO\(_2\) emissions per capita (unit: metric ton), UB is urbanization rate (unit: percentage), u is random intercepts, \(\varepsilon \) is random error, i is country, and t is year. Annual data is collected from the World Bank database from 2000 to 2016.

Because our data sample size is sufficiently large, different prior specifications do not influence posterior results and in this situation noninformative priors are enough for modeling. For comparison purposes, informative priors for the model parameters are specified too. Accordingly, five posterior simulations are made. A sensitivity analysis to prior choice will be performed through a Bayes factor test and a model test. We assume to have models \({M_k}\) parameterized by vectors \({\theta _k},k = 1,2,..,r\). Through the use of Bayes’s theorem, we calculate the posterior model probabilities:

$$\rho \left( {{M_k}\left| x \right. } \right) = \frac{{\rho (x\left| {{M_k})*\rho ({M_k})} \right. }}{{\rho (x)}}$$

Since it is challenging to calculate \(\rho (x)\), a popular practice is to compare two models, for example, \({M_k}\) and \({M_l}\) via posterior odds ratio:

$$PO{R_{kl}} = \frac{{\rho ({M_k}\left| x \right. )}}{{\rho ({M_l}\left| x \right. )}} = \frac{{\rho (x\left| {{M_k})*p({M_k})} \right. }}{{\rho (x\left| {{M_l})*p({M_l})} \right. }}$$

In the case of all equally plausible models, the posterior odds ratio is transformed into the Bayes factor:

$$B{F_{kl}} = \frac{{p(x\left| {{M_k})} \right. }}{{p(x\left| {{M_l})} \right. }}$$

Information criteria such as Akaike information criterion (AIC), Bayesian information criterion (BIC), and the deviance information criterion (DIC) are utilized to determine the most suitable model among candidate models that fits the data best. The drawback of all these criteria, however, is that they either ignore prior distributions or suppose only noninformative prior distributions. They are therefore not appropriate for Bayesian sensitivity analysis, when comparison models have the same parameters but different priors. In accessing the Bayesian framework, Bayes factors are preferred to model-selection criteria as, contrast to BIC, AIC, and DIC, they contain all information about prior distributions. Focus on prior information is crucial for Bayesian sensitivity analysis, when research compares models with the same parameters but various priors. The Bayes factor of two models is just the ratio of their marginal likelihoods calculated on the same data. Bayes factors usage, nevertheless, is often criticized in some venues for a difficulty in calculating marginal likelihoods. Besides, being relative quantities, Bayes factors cannot be adopted to evaluate goodness of fit of a model of interest unless the base model fits the data well. This is another limitation of Bayes factors.

While a Bayes model test compares posterior probabilities of specified Bayesian models to determine which model is more likely among examined models on the same dataset. The present study compares the candidate models with the same parameters but different priors.

3 Bayesian Simulation Outcomes

Model Comparison

This subsection compares five posterior regression models, where the respective Gaussian prior distributions are N(0,1), N(0,10), N(0,100), N(0,1000), and N(0,10000).

Table 1. Bayes factor test
Table 2. Bayesian model test
Table 3. Gelman-Rubin convergence diagnostic

The model comparison results are presented in Tables 1 and 2. In general, the smaller the DIC value, the larger the log(ML) and log(BF) estimate, the better a model fits the data (Table 1). P(My) denotes the posterior model probability. The higher P(My), the better a posterior model (Table 2). Consequently, model 5 is the best.

MCMC Convergence Test

In the application of a MCMC algorithm, convergence checks are needed before proceeding to inference. Once chain convergence is established, the model parameters have converged to equilibrium values. To avoid pseudo convergence, we simulate three MCMC chains and verify whether the results satisfy the convergence rule. This is because pseudo convergence takes place when the chains have seemingly converged, but indeed, they explored only a portion of the domain of a posterior distribution. As demonstrated in Table 3, the maximum Gelman-Rubin statistic Rc of 1.01 is close to 1.1, indicating MCMC convergence.

The model summary reports rate of acceptance and algorithm efficiency as initial indicators of MCMC convergence. The acceptance rate is the number of proposals accepted in the total proposals, whereas algorithm efficiency is the mixing properties of MCMC sampling. Concerning the chosen model 5, the acceptance rate of 0.83 is larger than the minimum level of 0.1, whereas average efficiency is equivalent to 0.19, which is more than the acceptable level of 0.01.

Bayesian Simulation Outcomes

Table 4 exhibits the simulation summaries of model 5. The variables \(lnCO_2\), lnGDP, and UB are of our interest. With a probability of mean between 0.7 and 1, lnGDP and \(lnCO_2\) exert strongly positive effects on lnHE, while the 54% probability denotes that the impact of UB is ambiguous. The 95% credible intervals also point to similar results. The lower Monte-Carlo standard error (MCSE) values, the more accurate posterior mean estimates. For MCMC algorithms, MCSE estimates close to one decimal are acceptable. Moreover, standard deviations for all the parameters are small, indicating the preciseness of parameter estimates. Importantly, because the variations of private health care expenditure between the researched countries are captured in a Bayesian mixed-effects model, its variance estimates decrease in comparison with those from maximum likelihood estimation.

Compared to frequentist statistics, credible Bayesian intervals have direct and intuitive probabilistic interpretation. Thus, we can state that the coefficient for \(lnCO_2\) belongs to the interval \([-0.139, 0.275]\) with a 95% probability. Similar interpretations can be offered for the remaining parameters of the model.

Table 4. Bayesian simulation outcomes

Discussion

Bayesian outcomes report the following findings. Per capita income, air pollution, and urbanization positively affect private health expenditure. However, their impact magnitude is different in terms of probability, with a strongly positive effect from income and a moderate effect from CO\(_2\) emission, but an ambiguous relationship between urbanization and private health care expenditure. Importantly, our estimation results are economically plausible. Compared with the previous studies, our outcome is in line with Yazdi and Khanalizadeh (2017), Zaidi and Saidi (2018), Barkat et al. (2019), reconfirming a positive impact of air pollution on health care expenditure. Regarding the effect of income on health care expenses, our result is consistent with Gerdtham et al. (1992) Hitiris and Posnett (1992), Bhat and Jain (2006), Wang and Rettenmaier (2007). The reasons for the outcomes could be as follows: first, the rise of income allows to obtain access to better health care services at a high-cost level (Rao et al. 2009); second, the consciousness of people on their health positively changes resulting from income improvement.

4 Concluding Remarks

The study estimates the effects of income, air pollution, and urbanization on private health care expenditure employing the Bayesian hierarchical mixed-effects regression through the hydrid Metropolis-Hastings sampler on a panel data of 10 ASEAN countries over the period 2000 to 2016. The Bayesian mix-effects regression allows for variance reduction and so increases the accuracy of the estimates. According to the simulation outcomes, we claim in view of the probability that economic growth strongly and positively affects private health care expenditure, CO\(_2\) emissions have a moderate positive impact, while the effect of urbanization is ambiguous.

Based on the obtained empirical results, two main policy implications are suggested: First, ASEAN countries should plan detailed policies to propagate on the harmful effects of environmental pollution, encouraging residents to use clean energy, enterprises to invest in high and green technologies; Second, medical waste is a factor in destroying environmental quality. Medical garbage should be well controlled by governments and hospitals.

The main limitation of this research is that we did not incorporate random coefficients due to the high dimensionality of multilevel models, which may lead to non-convergence for some model parameters.