Evaluating the performance of AIC and BIC for selecting spatial econometric models

Agiakloglou, Christos; Tsimpanos, Apostolos

doi:10.1007/s43071-022-00030-x

Evaluating the performance of AIC and BIC for selecting spatial econometric models

Original Paper
Open access
Published: 26 December 2022

Volume 4, article number 2, (2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Spatial Econometrics

Evaluating the performance of AIC and BIC for selecting spatial econometric models

Download PDF

2549 Accesses
3 Citations
Explore all metrics

Abstract

This study investigates using a Monte Carlo analysis the performance of the two most important information criteria, such as the Akaike’s Information Criterion and the Bayesian Information Criterion, not only in terms of selecting the true spatial econometric model but also in term of detecting spatial dependence in comparison with the LM tests for the simple two spatial models SLM and SEM. The analysis is also extended by incorporating several other spatial econometric models, such as the SLX, SDM, SARAR and SDEM, along with heteroscedastic and non-normal errors. Simulation results show that under ideal conditions these criteria can assist the analyst to select the true spatial econometric model and detect properly spatial dependence.

Evaluating information criteria for selecting spatial processes

Article 02 January 2021

Spatial smoothing in Bayesian models: a comparison of weights matrix specifications and their impact on inference

Article Open access 16 December 2017

Dirty spatial econometrics

Article 28 November 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Spatial autocorrelation in errors is a very common problem in linear regression models with spatial data, which should be treated with caution, since it violates the assumption of the random sample, leading the analyst to ambiguous results. This behavior typically arises from observations corresponding to geographically proximate locations that are correlated because of their spatial dependence. As Anselin (1988a) has clarified, this spatial dependence along with spatial heterogeneity define the concept of spatial effects. For this purpose, a set of Lagrange Multiplier (LM) tests, known as spatial dependence tests (see Burridge 1980; Anselin 1988b; Anselin et al. 1996), have been developed in the literature to assist the analyst in terms of selecting and estimating the most appropriate spatial econometric model that considers the presence of spatial dependence. Moreover, spatially autocorrelated errors can also appear in spatial regression analysis as a symptom of a false indication of spatial dependence due to spurious behavior, as Finglenton (1999), Mur and Trivez (2003) and Agiakloglou et. al. (2015) have indicated.

Nevertheless, these LM tests that have been widely used in many empirical applications contain two important drawbacks. The first one is related to the restrictive alternate model structure imposed by these tests and the second one is associated with their reliability in selecting the right model. Indeed, as it is known, these tests are applied exclusively to the choice between a simple econometric model and a spatial model with a spatial lag structure either in the dependent variable or in the error, whereas in several cases their application often leads to inconsistent conclusions as to the choice of the right spatial econometric model, a problem that has been addressed by Anselin and Florax (1995). In addition, LeSage and Pace (2009) point out that spatial dependence tests were developed and established in the logic that their statistics are calculated solely from the residuals derived from the estimation of the simple econometric model using least squares estimation without requiring estimating the corresponding spatial econometric model. Clearly, these tests for spatial dependence versus the null hypothesis of no dependence do not require maximum likelihood estimation of the spatial model under the alternative hypothesis. For this reason and given the current availability of software, LeSage and Pace (2009) suggested that the selection of a spatial model should be made in the context of comparing the likelihoods of different models, while the analysis should start from a more general model that nests both the spatial lag model and the spatial error model.

Thus, it will be very interesting to investigate whether the use of any information criterion can help the analyst to select the true spatial econometric model, knowing that these criteria are usually applied to any quantitative analysis and that their performance has been limited explored to spatial econometric analysis. For this purpose, a Monte Carlo analysis is conducted to evaluate the performance of the two most frequently used information criteria, such as the Akaike’s Information Criterion (AIC) and the Bayesian Information Criterion (BIC), using only the three most important spatial econometric models, such as the SIM, SLM and SEM, suitable for the application of the LM tests, not only in terms of detecting spatial dependence but also in terms of selecting the right spatial econometric model as a complementary approach to model selection using the LM tests for these models. Simulation results show that these criteria can assist the analyst to identify the right spatial econometric model and spatial dependence more effectively in some cases than the LM tests. Note that the simulation process is conducted using rook and queen construction matrices along with a real geographical structure, i.e., the spatial structure of Greece, which has a lot of geographical peculiarities resulting in quite asymmetric spatial weights matrices. The research is also expanded, in term of selecting the right spatial econometric model, using the two aforementioned information criteria, by considering more spatial econometric models, namely the SLX, SDM, SARAR, and SDEM, as well as two non-ideal situations, such as heteroscedasticity and non- normality, where the results vary considerably. Hence, the objective of this research is concentrated on selecting the best-fitted spatial econometric model given the weights matrix formation, rather than searching for the most appropriate weights matrix formation knowing the spatial econometric model, as discussed in Zhang and Yu (2018).

The remaining of the paper is organized as follows. Section 2 presents the three simple econometric models, namely the SIM, SLM and SEM, along with the SDM, as a special case of the SEM, the information criteria AIC and BIC and analyses the strategies applied for the LM tests. Section 3 describes the design of the simulation process and discusses the results. Section 4 presents the extension of the Monte Carlo analysis with all the elements that have been added including the additional spatial econometric models, such as the SLX, SARAR, and SDEM, along with heteroscedastic and non-normal errors and discusses the results. The concluding remarks are included in Sect. 5.

2 The LM tests for spatial econometric models and the information criteria

Consider the Spatial Independent Model (SIM), also known as the Non-Spatial Econometric Model (NSEM), defined as:

$${\varvec{y}}={\varvec{X}}{\varvec{\beta}}+{\varvec{\varepsilon}}$$

where y is a ($n \times 1$) vector of observations of the dependent variable, Χ is a [$n \times (k + 1)$] matrix of observations of k independent variables with values of 1 for its first column to include the presence of the constant term, β is the [$(k + 1) \times 1)$] vector of coefficients of the model and ε is the ($n \times 1$) random vector following the standard assumptions of regression, i.e., ε ~ N(0, σ²I).

Spatial econometric models have been introduced in the literature as multidirectional extensions of the time series econometric models on the geographical space, defining in that sense dependence for the values of a variable according to the geographical positions of its values and of the values of all independent variables in the model, including the error term and not according to their chronological dependence. The spatial dependence is incorporated into the model by the presence of the spatial weights matrix W, which defines the spatial interactions between the n neighboring regions and it is used in its row-standardized form.^{Footnote 1}

The two most important spatial econometric models are the Spatial Lag Model (SLM) defined as:

$${\varvec{y}}=\rho {\varvec{W}}{\varvec{y}}+{\varvec{X}}{\varvec{\beta}}+{\varvec{\varepsilon}}$$

and the Spatial Error Model (SEM) defined as:

$${\varvec{y}}={\varvec{X}}{\varvec{\beta}}+{\varvec{\varepsilon}}$$

with

$${\varvec{\varepsilon}}=\lambda {\varvec{W}}{\varvec{\varepsilon}}+{\varvec{u}}$$

where u is a ($n \times 1$) random vector and Wy and Wε are the spatially lagged vectors that incorporate spatial dependence consisting of weighted averages of the values of the variables in the n neighboring regions, while ρ and λ are the spatial lag coefficients of the dependent variable and the errors respectively. In addition, the SΕM model can be expressed as a spatial econometric model with spatial lags for all variables of the model, known as the Spatial Durbin Model (SDM), defined as:

$${\varvec{y}}=\rho {\varvec{W}}{\varvec{y}}+{\varvec{X}}{\varvec{\beta}}+{\varvec{W}}{\varvec{X}}{\varvec{\theta}}+{\varvec{\varepsilon}}$$

where ρ = λ, $-\lambda{\varvec{\beta}}={\varvec{\theta}}$, a condition that can be tested by performing the test presented by Mur and Angulo (2006), known as a common factor test, to investigate whether the model is actually a spatial error model or a more general spatial model, and the WX is the spatial lag matrix of the independent variables. Hence, when the SDM is estimated, while the true generating spatial model is the SEM, one should expect to get the same results.

It also important to mention that the values of the coefficients ρ and λ are not necessarily restricted strictly to the interval (− 1, + 1), as in time series analysis. The estimation is implemented provided that the Jacobian matrix is non-singular, an outcome that is related to the eigenvalues of the spatial weights matrices. More specific, row-standardized spatial weights matrices have always the largest eigenvalue equals to unity, something that ensures that the upper limit of the interval will always be + 1, while the value of the lower limit is unknown, and several times can be smaller than − 1 (see LeSage and Pace 2009). Thus, if the coefficient takes values inside the feasible interval, corresponding to the applied weights matrix, the Jacobian determinant will be positive, comforting that its logarithm exists and therefore the log-likelihood function will be well defined.

The selection of the best fitted model for a given set of spatial data is typically made through the LM tests for spatial dependence. In particular, the LM test for a SEM (LM-ERR), introduced by Burridge (1980), and the LM test for a SLM (LM-LAG), presented by Anselin (1988b), are conducted on a SIM under the null hypothesis, against a SEM and a SLM under the alternative hypothesis, respectively, and the data contains no spatial effects if the null hypothesis is accepted by both tests. Issues arise when both tests reject the null hypothesis, a behavior that tends to appear very often in practice, as indicated by Anselin et al. (1996), where the tests cannot clearly identify the type of the spatial effect, unless one test rejects and the other one accepts the null hypothesis. For this reason, Florax et al. (2003) proposed to select the spatial econometric model for which the LM statistic will have the highest value, hereafter strategy I. Moreover, and as an effort to minimize this problematic behavior, comparable robust LM tests have been constructed by Anselin et. al. (1996), namely the Robust LM-Error test (LM-EL) and the Robust LM-Lag test (LM-LE), showing that these tests have more power in locating the correct spatial model than the simple LM tests. Hence, the presence of spatial dependence is inspected through these new tests, hereafter strategy II, and if the null hypothesis is rejected by both robust LM tests, the spatial econometric model is selected according to the highest value of one of the two robust LM statistics. Lastly, another strategy has been proposed by Florax et al. (2003), known as hybrid strategy, which combines the classical and the robust LM tests, but it turns out that this strategy leads to the same results as the classical approach (strategy I), as indicated by Florax et al. (2003) and proved by Mur and Angulo (2009).

The choice also of the best fitted model for a given set of spatial data can also be conducted based on the values of a pre-selected information criterion, as this technique is typically applied in every quantitate analysis that involves model selection. Hence, the values of the two most often used in practice information criteria, such as the Akaike Information Criterion (AIC), presented by Akaike (1973), and the Bayesian Information Criterion (BIC), developed by Schwarz (1978), as an attempt to improve the performance of AIC, defined respectively as:

$${\text{AIC}} = - 2\ln \hat{L} + 2p$$

and

$${\text{BIC}} = - 2\ln \hat{L} + p\ln n$$

where $\ln \hat{L}$ is the maximized value of the log-likelihood function, p is the number of parameters estimated from the econometric model and n is the sample size used for the estimation of the model, can be computed right after the estimation of any spatial model by maximum likelihood estimation and the best fitted model is selected according to the minimum value of that criterion, a technique that has been used in practice by Chi and Zhu (2008) on their empirical work for demographic data to select the best fitted spatial econometric model.

For this reason, it will be very interesting to study the performance of these criteria for spatial data in lieu of the performance of the LM tests not only in terms of identifying spatial dependence but also in terms of selecting the best spatial econometric model for these limited alternatively spatial models, knowing that each criterion has its own penalty function and therefore different pattern for model selection. Indeed, as it is known, AIC has the tendency to select models with large number of parameters, whereas BIC typically chooses small models, as a true approximation of their unknown population behavior.^{Footnote 2} Note that the behavior of these criteria has been investigated for geostatistical models by Hoeting et al. (2006) and Lee and Ghosh (2009) and for spatial processes by Agiakloglou and Tsimpanos (2021), but not for spatial econometric models.

3 Simulation results

The performance of the two previously presented information criteria, i.e., the AIC and the BIC, is investigated in terms of selecting the true spatial econometric model among the three alternative models, namely, the SIM, the SLM and the SDM, using a Monte Carlo analysis, where the SEM is included as an estimated SDM. The simulation process is conducted by considering only one independent variable derived from a uniform U(0, 10) distribution which is simulated only once and then it remained constant for all iterations. Thus, the matrix ${\varvec{X}}$ has dimensions ($n\times 2$) consisting of one independent variable and a column of ones to estimate the constant term. The vector of coefficients ${\varvec{\beta}}$ with dimension $(2\times 1)$ is assumed to take values of one for both of its elements. The random error vector ${\varvec{\varepsilon}}$ is derived from a $N(0,\boldsymbol{\rm I})$ distribution and it is added to the vector ${\varvec{X}}{\varvec{\beta}}$ to produce the vector of the dependent variable ${\varvec{y}}$ for the non-spatial econometric model.

Spatial dependence is introduced into the models by defining a row-standardized spatial weights matrix ${\varvec{W}}$ with dimensions ($n\times n$) which is constructed using the rook (four neighbors-common edge) and the queen (eight neighbors-common edge and vertex) contiguity definitions, over a squared regular lattice for dimensions 10 × 10 and 20 × 20 providing samples of 100 and 400 observations. Calculation of the eigenvalues for the four spatial matrices shows that the lower value of the spatial coefficient for the rook criterion is − 1 for both sample sizes, meaning that the feasible range of values that the parameter can take is (− 1, 1), whereas the lower value for the queen criterion is − 1.97 for sample size of 100 observations and − 1.921 for sample size of 400 observations, leading to feasible ranges of (− 1.97, 1) and (− 1.921, 1) for sample sizes of 100 and 400 observations respectively (see Bivand et al. 2013 and Agiakloglou and Tsimpanos 2021). In addition, the simulation prosses is extended by including weight matrices derived from a geographical structure of Greece at the local authority districts of Kallikrates Operational Programme consisting of 325 municipalities.^{Footnote 3} The weights matrices are constructed to capture real geographical structure according to the 4-nearest neighbors and the 8-nearest neighbors definitions, based on the geographical coordinates of the centroid for each municipality. Therefore, the formation of the weights matrices will be considered as given for the whole Monte Carlo analysis, although Kelejian and Piras (2011) proposed a J-test for investigating alternative spatial econometric models with different weights matrices under the null hypothesis of a specific spatial econometric model (see also Jin and Lee 2013).

The simulation process is conducted in R using the SPDEP package developed by Bivand (2015) to generate: a) the SIM, b) the SLM, by multiplying the right-hand side of the SIM by the spatial multiplier ${\left(\boldsymbol{\rm I}-\rho {\varvec{W}}\right)}^{-1}$, that is:

$${\varvec{y}}={\left(\boldsymbol{\rm I}-\rho {\varvec{W}}\right)}^{-1}{\varvec{X}}{\varvec{\beta}}+{\left(\boldsymbol{\rm I}-\rho {\varvec{W}}\right)}^{-1}{\varvec{\varepsilon}}$$

c) the SEM as:

$${\varvec{y}}={\varvec{X}}{\varvec{\beta}}+{\left(\boldsymbol{\rm I}-\lambda {\varvec{W}}\right)}^{-1}{\varvec{u}}$$

where the vector error term ${\varvec{u}}$ with ($n\times 1$) dimensions is derived from $N(0,{\varvec{I}})$ distribution and the spatial parameters ρ and λ can take values within the feasible range intervals, as previously mentioned for these spatial weights matrices formulation, and d) the SDM as follows:

$${\varvec{y}}={\left(\boldsymbol{\rm I}-\rho {\varvec{W}}\right)}^{-1}\left({\varvec{X}}{\varvec{\beta}}+{\varvec{W}}{\varvec{X}}\theta +{\varvec{\varepsilon}}\right)$$

while no restrictions are applied for the values of θ. All models are estimated by maximizing the log-likelihood function so that the values of both information criteria can be calculated. The best fitted econometric model is selected according to the minimum value of the pre-defined criterion based on 1000 replications. Note that the SEM is not estimated directly but only indirectly as a SDM and the information criterion should select the SDM when the true generating model is the SEM.

Table 1 presents the percentage selection rates of both criteria when the true generating model is the SIM. As can be seen from this table, the BIC performs very well in terms of selecting the true model with selection rates close to 95% and 98% for samples of 100 and 400 observations respectively, regardless of the spatial weights matrix formation, including the Greek weight matrices. On the other hand, the selection rate of the true model based on both strategies using the LM tests is smaller than the selection rate of BIC, as can be seen from Table 1, although the empirical levels of all LM tests are close to the nominal level of 5% regardless of the sample size and the spatial weights matrix formation, a result that can also be found in Anselin and Florax (1995) with two independent variables used in the regression analysis. Hence, the BIC, unlike the AIC, will lead the analyst to the right model selection with confidence slightly larger than any of the LM tests strategy, especially for large sample sizes. The selection rates for all three econometric models based on both information criteria when the true generating model is the SLM are reported on Table 2. As can be seen from this table, the selection rate of the true model by both criteria is not affected by the value and by the sign of the spatial autoregressive parameter ρ, including the extreme case of a negative value smaller than -1 for the queen formation, as well as by the spatial matrix formation. The performance of both criteria is determined mainly by the sample size, i.e., the AIC selects the true model at a rate of 82% and 84% for sample sizes of 100 and 400 observations respectively, while the BIC selects the true model more accurate at rates of 96% and 99%, respectively. Hence, the BIC outperforms the AIC, as in the previous case, in terms of selecting the right spatial econometric model, for every given value of the spatial autoregressive parameter and sample size reaching levels close to certainty. It is also important to indicate that the SIM is not selected at all by both criteria, except for small values of ρ and small sample size at a very low rate. Furthermore, the spatial dependence as well as the right spatial econometric model are also recognized successfully by the LM tests, since one of the two the LM tests is designed for this alternative model structure. In that sense the LM tests using both strategies select the true model, i.e., the SLM, with more confidence than the BIC, reaching very frequently levels of certainty, as can be seen from Table 3, regardless of the matrix formation.^{Footnote 4}

Table 1 Percentage of selections for all three models based on AIC and BIC as well as on LM strategies based on the 5% nominal level when the true generating model is the SIM using 1000 replications

Evaluating the performance of AIC and BIC for selecting spatial econometric models

Abstract

Similar content being viewed by others

Evaluating information criteria for selecting spatial processes

Spatial smoothing in Bayesian models: a comparison of weights matrix specifications and their impact on inference

Dirty spatial econometrics

1 Introduction

2 The LM tests for spatial econometric models and the information criteria

3 Simulation results

4 Further simulation results

5 Concluding remarks

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 1095 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation