Keywords

Introduction

Nowadays, the residential market is a major component of the overall real estate market. Over time, methods and research related to this field of economics has shifted from classical econometrics to spatial econometrics (Anselin 1988).

The appearance of hedonic price models, derived mostly from Lancaster’s (Lancaster 1966) consumer theory and Rosen’s (Rosen 1974) model, became the milestone in econometric theory related to the real estate market. A hedonic price model decomposes the price of a good into separate components that determine the price. Basically, the hedonic equation is a regression of expenditures (rents or values) on housing characteristics of the unit that determine that rent or value. Other pricing models related to hedonic price indices include repeat-sales models (Wang and Zorn 1997) or hybrid models, which combine the elements of hedonic price and repeat-sales models (Quigley 1995). Meese and Wallace (1997) provide comprehensive research comparing the advantages and limitations of all mentioned models. The main drawback of the conventional hedonic model is that it is not capable of taking into account of spatial effects on housing prices even when locational variables are taken into consideration.

One of most important issues related to observed data in hedonic modelling is spatial autocorrelation. Basu and Thibodeau (1998) outlined two main reasons for spatial autocorrelation of housing prices. The first reason that housing prices are found to be spatially autocorrelated is that most of the dwellings in neighborhoods were built at the same time with similar structural characteristics such as dwelling size, year built, interior and exterior design features, etc. The second reason that housing prices are found to be spatially autocorrelated is a consequence of sharing the same neighborhood amenities such as proximity to public transportation, schools, markets, etc. Since hedonic house price parameters are usually estimated using ordinary least squares procedures – which assume independent observations from residuals that are spatially autocorrelated – the resulting parameter estimates often produce incorrect confidence intervals for estimated parameters and for predicted values. The importance of spatial relationships was recognized in recent hedonic studies by introducing spatial lag and spatial error models (Anselin and Bera 1998).

This problem could also be solved using spatial statistical techniques like GWR or RK that incorporate the observed spatial relationships between sample data. Geostatistics has become an essential tool in diverse environmental studies performed during the last few decades. It is imposed particularly in the field of spatial data analysis and in the prediction of numerous natural phenomena. Spatial econometrics, geostatistics, and spatial statistics share many similarities since these fields all deal spatial autocorrelation and spatial heterogeneity (Anselin 1999). Increased interest in use of geostatistics has resulted in numerous improvements and modifications that are essentially extensions to the fundamental kriging theory. Extended versions of kriging have been adopted to deal with non-normality (lognormal kriging, disjunctive kriging), while others address nonstationarity, i.e. varying trend or drift (universal kriging, kriging with external drift, IRF-k kriging and stratified kriging) (McBratney et al. 2000). The common characteristics of all geostatistical applications are that they were initially used for spatial modelling of diverse natural (i.e. non-anthropogenic) phenomena. Although hedonic regression models (Can and Megbolugbe 1997; Kim et al. 2003; Osland 2010) prevail in real estate appraisal applications, the role of geostatistics has increased in importance recently (Dubin 1998; Yoo and Kyriakidis 2009; Fernández-Avilés et al. 2012).

Most of those spatial statistical techniques are already available in most geographical information systems (GIS) operational environments. The use of GIS technology in spatial econometrics studies started in the mid-1990s. Soon after, the advantages of using GIS applications for hedonic price modelling were recognized in a number of studies (Lake et al. 1998; Anselin 1998; Lovett and Bateman 2001).

In this paper, a geostatistical method regression-kriging (RK) is presented as a method for spatial prediction and mapping of housing prices. Although the RK technique has not been extensively used in hedonic price modeling, there are certain examples where forms of RK were applied under different names. In the literature, the terms used for geostatistical methodologies can be confusing due to the different terms used for the same or very similar techniques (Hengl 2009). Yoo and Kyriakidis (2009) used the term area-to-point kriging with external drift (A2PKED). Chica-Olmo (2007) tested the performance of two co-kriging methods for prediction of housing location prices, in which the authors used a heterotropic version of co-kriging that is very similar to RK.

Usually, most input data for hedonic modelling is acquired from multiple listings databases. In this study, both transaction data and explanatory data are organized as auxiliary maps in a raster format. The majority of required spatial data is already available in GIS formats, thereby minimizing the effort required for pre-processing. The results of spatial prediction are produced as a raster GIS layer in the same resolution as input maps; the resulting map may be of interest to appraisers, real-estate companies and governmental agencies, as it provides an overview of location prices.

The performance of an alternative method, Geographically Weighted Regression (GWR), was also examined in this study; this method is intended for the local analysis of relationships in multivariate data sets. The comparison of the proposed approach using GWR reveals RK as the model of choice for spatial prediction of housing prices in combination with auxiliary data.

The results of spatial prediction are exported as a standard raster GIS layer, or in HTML format enabling simple creation of rich interactive Web-based maps. The user-created maps can assist in facilitating communication between appraisers, real-estate companies, governmental agencies, and other interested parties.

The overall objectives of this study were: (1) to compare two alternative regression techniques (OLS in implicit and error model form and GWR) against the RK geostatistical model for evaluating dwelling prices; (2) to compare the performance of all models using global statistics; and (3) to evaluate the performance of all models in terms of spatial distribution and clustering of residuals using local indicators of spatial association (LISA).

The next section contains a brief description of the theoretical foundations of hedonic price models, geographically weighted regression and regression-kriging methodologies, as well as some details about the R software environment used in this study. Included in this study is a concise representation of real estate market characteristics in the city of Belgrade as well as the description of the transaction and auxiliary data layers used. The important issues related to multicollinearity and attribute selection are discussed in the following section, along with detailed explanatory data analyses and mapping. The final section concludes the study.

Methodology

Hedonic Price Models

The basic hedonic price function can be represented as (Can and Megbolugbe 1997):

$$ \mathrm{Y}=f\left(\mathrm{S}\beta, \mathrm{N}\gamma \right)+\varepsilon $$
(1)

where Y is a vector of observed housing values; S is a matrix of structural characteristics of properties; N is a matrix of neighborhood characteristics, including measures of socioeconomic conditions, environmental amenities and public accommodations for area residents; β and γ are the parameter vectors corresponding to S and N; and ε is vector of random error terms. The basic form of hedonic regression assumes that each parameter is fixed in space, which means that each identified attribute has the same intrinsic contribution throughout the study area.

The given formula can be expressed like a common regression function:

$$ \mathrm{Y}=\mathrm{X}\upbeta +\varepsilon $$
(2)

where Y n×1 represents vector of observed sale prices of n dwellings, X n×k is a vector of k explanatory variables characterizing housing units. β k×1 is a vector of unknown coefficients and ε n × 1 is a vector representing the error term.

By using ordinary least squares (OLS), the unknown coefficients are solved as:

$$ \widehat{\upbeta}={\left({\mathrm{X}}^{\mathrm{T}}\mathrm{X}\right)}^{-1}{\mathrm{X}}^{\mathrm{T}}\mathrm{Y} $$
(3)

The basic assumption for OLS usage is the indepedence of observations, which is often violated due to spatial autocorrelation in data, leading to a biased estimation of standard errors of model parameters and misleading significance tests.

The above given regression formula has particular modified versions that are often used for house price modelling: the spatial-lag model (also known as spatial autoregressive model) and spatial error model (Dubin 1988), (Kim et al. 2003), (Osland 2010):

$$ {\displaystyle \begin{array}{l}\mathrm{Spatial}\ \mathrm{lag}\ \mathrm{model}:\kern2em \mathrm{Y}=\mathrm{X}\upbeta +\phi \mathrm{W}\mathrm{Y}+\varepsilon \\ {}\mathrm{Spatial}\ \mathrm{error}\ \mathrm{model}:\kern1.25em \mathrm{Y}=\mathrm{X}\upbeta +\xi +\lambda \mathrm{W}\varepsilon \end{array}} $$
(4)

where ε is the vector of errors terms and W represents the weights matrix that specifies the assumed spatial structure or connections between the observations. The elements of the weights matrix can be based upon contiguity (i.e., shared borders) or distance. The parameter φ is often referred to as the spatial correlation parameter giving the intensity of the dependence between neighboring prices. The term ξ represents random error and λ is a spatial autoregressive parameter.

If data exhibits a spatial lag process the target variable is affected by the values of the target variables in nearby places. The OLS hedonic model omits φwY, and this becomes part of the error which leads to biased parameters estimates (Anselin 1988). If spatial lag is in true functional form (Anselin and Bera 1998), then parameter estimates should be inefficient as well. Nevertheless, some studies showed that although spatially correlated errors are presented in data, non-spatial hedonic models (implicit OLS) may provide results useful for policy analysis (Mueller and Loomis 2008).

Geographically Weighted Regression

The geographically weighted regression is a relatively new method used in spatial modeling and was developed as an alternative format for spatial analysis that is local rather than global in its analytical design (Fotheringham et al. 2002). Although this method is useful in a wide range of applications, its widest practical application still is in the mass assessment of real property (Crespo et al. 2007; Yrigoyen et al. 2008; Hanink et al. 2010). Increasing application of this technique is made possible by GIS database processing tools and publicly available databases on the Internet.

GWR represents the extension of a conventional multiple regression framework by addressing the issue of non-stationary processes (Fotheringham et al. 2002):

$$ {y}_i={\beta}_0\left({u}_i,{v}_i\right)+\sum_{k=1}^m{\beta}_k\left({u}_i,{v}_i\right){x}_{ik}+{\varepsilon}_i,\kern0.75em i=1,\dots, n $$
(5)

where (u i ,v i ) are the coordinates for i-th point; β k (u i ,v i ) are the realizations of continuous function β k (u,v) at the same location; x i1, x i2,… x im are the explanatory variables at point i; and ε i is the error term.

$$ \widehat{\beta}(i)={\left({\mathrm{X}}^T\mathrm{W}(i)\mathrm{X}\right)}^{\hbox{-} 1}{\mathrm{X}}^T\mathrm{W}(i)\mathrm{y} $$
(6)

W(i) is a matrix of weights for particular location i, such that observations nearer to i are given greater weight than observations further away.

$$ \mathrm{W}(i)=\mathit{\operatorname{\it{diag}}}\left[{w}_{i1},{w}_{i2},\dots, {w}_{in}\right] $$
(7)

w ij is the weight related to data point j for the estimate of the local parameters at location i. Several types of parameterized weight functions may be used. A common choice also used here is the Gaussian distance-decay curve has the form:

$$ {w}_{ij}=\exp \left[-\frac{d_{ij}^2}{2{b}^2}\right] $$
(8)

where d ij is a distance between location i and location j, while parameter b is a range to be determined. The weight’s value would decay gradually with distance to the extent that when d ij  = b, the weight reaches the value of 0.5. In the event that the spatial distribution of sampled variables is spatially homogeneous, this parameter is taken as a constant value. However, a spatially variable (adaptive) parameter of the range should be used in the event that the spatial distribution of the variables is heterogeneous.

To calculate the parameters associated with a weighting function, such as the bandwidth and Nth nearest neighbors considered, the GWR methodology utilizes a calibration process. This calculates the parameter so as to form an appropriate trade-off between bias and standard error in the prediction of the overall model. Commonly used approaches include minimizing the cross-validation scores (CV) or Akaike Information Criterion (AICc) (Fotheringham et al. 2002). The optimal values for b and N reported here were obtained by minimizing the cross-validation scores.

Regression-Kriging

Regression-kriging (RK) is a geostatistical technique that combines the regression of the target variable on explanatory variables with kriging of the regression residuals. In the literature, this interpolation technique is termed as kriging with external drift (KED).

Let the measured values of the target variables be symbolized as Y(s i ), i = 1,…n, where s i represents spatial location and n number of realized measurements. The system of equations from which the estimated values of target variables \( \overset{\wedge }{Y}\left({s}_0\right) \) are obtained is:

$$ {\displaystyle \begin{array}{l}\overset{\wedge }{Y}\left({s}_0\right)=\overset{\wedge }{m}\left({s}_0\right)+\overset{\wedge }{e}\left({s}_0\right)\\ {}\overset{\wedge }{Y}\left({s}_0\right)=\sum \limits_{k=0}^p{\overset{\wedge }{\beta}}_k\cdot {q}_k\left({s}_0\right)+\sum \limits_{i=1}^n{w}_i\left({s}_0\right)\cdot e\left({s}_i\right);\\ {}{q}_0\left({s}_0\right)=1\end{array}} $$
(9)

where \( \overset{\wedge }{m}\left({s}_0\right) \) is the fitted deterministic part; \( \overset{\wedge }{e}\left({s}_0\right) \) is the interpolated residual; \( {\overset{\wedge }{\beta}}_k \) is estimated deterministic model coefficient; and w i represents ordinary kriging weights resolved by the spatial structure of residuals e(s i ) (Hengl 2009). The essential difference between RK and KED is explained as follows: while KED weights are solved within extended matrix taking into consideration trend and residuals at the same time, the RK drift model coefficients are computed separately and the residuals interpolated by ordinary kriging (OK) are summed back to the drift estimates using simple summing of predicted drift and residual surfaces. Despite the differences in the computational steps used, the resulting predictions and prediction variances are the same, given the same point set, auxiliary variables, regression functional form, and regression fitting method (Hengl et al. 2007). Regression coefficients \( {\overset{\wedge }{\beta}}_k \) could be obtained by different fitting methods such as ordinary least squares (OLS) or generalized least squares (GLS) which is more often recommended:

$$ {\overset{\wedge }{\beta}}_{GLS}={\left({\mathrm{q}}^{\mathrm{T}}\cdot {\mathrm{C}}^{\hbox{-} 1}\cdot \mathrm{q}\right)}^{\hbox{-} 1}\cdot {\mathrm{q}}^{\mathrm{T}}\cdot {\mathrm{C}}^{\hbox{-} 1}\cdot \mathrm{Y} $$
(10)

where \( {\overset{\wedge }{\beta}}_{GLS} \) is the vector of estimated regression coefficients; C is the covariance matrix of residuals; q is the matrix of predictors at the sampling location; and Y is the vector of measured values of target variable. The estimated \( {\overset{\wedge }{\beta}}_{GLS} \) coefficients basically present a special case of geographically weighted regression coefficients. The predicted variable value \( \overset{\wedge }{Y}\left({s}_0\right) \) at the location s0, obtained by regression-kriging is commonly written in matrix notation:

$$ {\overset{\wedge }{Y}}_{RK}\left({\mathrm{s}}_0\right)={\mathrm{q}}_0^{\mathrm{T}}\cdot {\overset{\wedge }{\beta}}_{GLS}+{\lambda}_0^{\mathrm{T}}\cdot \left(y-\mathrm{q}\cdot {\overset{\wedge }{\beta}}_{GLS}\right) $$
(11)

where q 0 is the vector of p + 1 predictors and λ0 is the vector of n kriging weights used for interpolation of residuals.

RK explicitly separates trend estimation from residual interpolation, thereby allowing the use of arbitrarily complex forms of regression rather than using the simple linear techniques that can be used with KED. Besides, RK allows the separate interpretation of the two interpolated components, which reinforces the advantage of the RK approach (Hengl et al. 2007).

Spatial Residual Analysis

We used global and local analysis methods (i.e., global and local Moran’s I indices) to evaluate spatial distribution and heterogeneity in model residuals. Spatial autocorrelation of residuals was calculated by Moran’s I statistic (O’Sullivan and Unwin 2003):

$$ MI=\frac{n}{\sum_{i=1}^n{\left({e}_i-\overline{e}\right)}^2}\frac{\sum_{i=1}^n{\sum}_{j=1}^n{w}_{ij}(h)\left({e}_i-\overline{e}\right)\left({e}_j-\overline{e}\right)}{\sum_{i=1}^n{\sum}_{j=1}^n{w}_{ij}} $$
(12)

Where e i and e j are calculated values of the residuals at the locations i and j; the term ē is a mean value; and w ij is the spatial weight within a given distance or bandwidth that was determined as h = 750 m, according to the variogram of the OLS model residuals (Fig. 5). The weighted matrix was row standardized in order to overcome the problem of uneven spatial distribution of observations, which causes up-weighted values for locations with more neighbors. The value of the Moran’s I statistic ranges from near +1 indicating clustering of the e values to near −1 indicating dispersed pattern of the e values. In the Global Moran’s I statistic, the results of the analysis are always interpreted within the context of its null hypothesis, which states that the variable (residuals in our case) being analyzed is randomly distributed among the locations in our study area; or better said, the spatial processes promoting the observed pattern of values is random chance. The results of Moran’s I statistic with significant p-values and positive Z-scores indicates spatially clustered datasets. However, at the same time, significant p-values and negative Z-scores depict that spatial pattern is more spatially dispersed than what would be expected to result from random spatial processes.

Local Indicators of Spatial Association (LISA) (Anselin 1995) that are based on the local Moran’s I test enable the assessment of significant local spatial clustering around an individual location- thereby providing details of (1) the degree of spatial clustering; (2) an estimate of detailed variations of clustering in the locally defined area; and (3) identification of the locations of the spatial clusters. The local version of Moran’s I at location i is given by:

$$ {MI}_i=\frac{\left({e}_i-\overline{e}\right)}{S_i^2}\sum_{j=1,j\ne i}^n{w}_{ij}(h)\left({e}_j-\overline{e}\right) $$
(13)

where w ij is the spatial weight within a given distance or bandwidth (h), as stated in Eq. 12 and S i 2 is calculated as:

$$ {S}_i^2=\frac{\sum_{j=1,j\ne i}^n{w}_{ij}}{n-1}-{\overline{e}}^2 $$
(14)

This local indicator represents a disaggregated measure of autocorrelation that depicts the extent to which the residuals for particular areal locations are similar to, or differ from, neighboring locations. The Local Moran’s I statistic is used to detect possible non-stationarity of the data – i.e., a clustered pattern – among sub-regions. A positive local MI i indicates a cluster of similar residual values of the same sign around i, while a negative MI i indicates that the value of the residual at location i has a sign opposite that of its neighbors. If values for e i and e j are close to ē, then local MI i indicates the absence of spatial autocorrelation.

Case Study: Belgrade Metropolitan Area

Belgrade, the capital of the Republic of Serbia, is situated at the confluence of the Sava and Danube Rivers. The administrative boundary encompasses a 3223 km2 area of nearly 2 million inhabitants. Its territory is divided into 17 municipalities, comprised of 157 settlements. The urban core, an area of 360 km2, includes the 10 urban municipalities which constitute the study area for this research. According to official census data records, there are approximately 470,000 households with 1,300,000 inhabitants in the study area, which represents metropolitan area of the city (Statistical Office of the Republic of Serbia 2011).

Due to political issues, and the fact that most apartments in large cities were under social ownership, the number of individual transactions was negligible prior to privatization. Between 1991 and 2000, most housing units were privatized. The housing market in Serbia started developing rapidly and housing prices spiked after privatization in the 1990s. Housing prices peaked in 2008, just prior to the present financial crisis. As of the time of publication, this market remains depressed and housing prices are falling along with declining construction activity.

According to local property experts, the housing prices are expected to fall by another 20% in the year 2013. Average housing prices in Belgrade vary considerably depending upon apartment location and structural characteristics; in some cases, average prices may be 60% higher in one Serbian municipality or town, as compared to another (Cvijanović 2006). However, housing studies regarding Belgrade and other cities in Serbia are limited because of the short history of the free housing market.

The original dataset used in this study consists of 747 records of apartment transactions referring to real estate sales in the year 2010. The dataset used was provided by several real estate trading companies because a unique database of real estate transactions has not been compiled for the Belgrade area. Selected transaction records include total transaction value (EUR), covered flats size (m2) and addresses. Since additional information regarding internal living space, age of building, and parking amenities was available only for some records, these attributes were not included in the analysis. A geographic information system (GIS) was used to match street addresses of the transactions with the official dataset of building geographic coordinates in order to geocode observations into the study area (Fig. 1).

Fig. 1
figure 1

Locations and values of observed transactions in Belgrade in the year 2010

Target Variable

Housing price (expressed in EUR/m2) is the target variable to be spatially evaluated. A study area map with corresponding housing price ranges is shown in Fig. 1 and the descriptive statistics of housing price are displayed in Fig. 2. The distribution is positively skewed, with a mean of EUR 1633 EUR/m2.

Fig. 2
figure 2

Histogram of observed housing prices

Mapped housing price values (Fig. 1) indicate that the most expensive apartments are located in the city center, while the prices are lower in peripheral areas. The variability of the prices is more pronounced in the central urban core, where the price depends upon the specific location of the building, the quality of interior infrastructure and other amenities (Cvijanović 2006).

Exogenous Variables

In applying the hedonic price model to the real estate market, the determinants of housing prices can be divided into four groups (Lake et al. 1998): (1) structural variables (e.g., age, the number of rooms in each house, etc.); (2) accessibility variables (e.g., the proximity of schools, bus routes, railway stations, shops, parks, and the Central Business District); (3) neighborhood variables (e.g., local unemployment rates); (4) environmental variables (e.g., road noise and visibility impact). The accessibility characteristics primarily consist of site-related factors.

In this study we were confined to predictors that incorporate accessibility, neighborhood and environmental variables (Table 1). The explanatory variables referring to accessibility and environment could be considered as spatial determinants referred to as the distance variables (Koramaz and Dokmeci 2012). They were arranged as input maps/grids of 20 m resolution by using a proximity function within the SAGA (System for Automated Geoscientific Analyses) GIS environment (http://www.saga-gis.org/). The values assigned to grid cells are calculated Euclidean distances between each cell and the input features (roads, schools, parks, etc.); these values are included in each cell in the grid map. Researchers have previously used travel times to the central business district and to highways, shopping centers, schools, and universities, instead of Euclidean distances to these features, in order to compute accessibility measures for hedonic models (Des Rosiers et al. 2000; Adair et al. 2000). Two neighborhood variables (illiteracy and income) are based upon census data (Statistical Office of the Republic of Serbia 2011). The illiteracy layer/map was generated as a factorial variable referenced to each municipality, where each cell represents the proportion of illiterate inhabitants in a particular municipality in regard to the whole city of Belgrade. The income variable represents the average income in each municipality so that every grid cell within each municipality area has the same value.

Table 1 The list of explanatory variables used in study with corresponding VIF (variance inflation factor) values

R Language Environment

The R open source language (Development Core Team 2008) contains the base system that allows statistical computation, linear algebra computation, graphics creation and the like. Most of the computations related to spatial autocorrelation estimation and model testing are utilized through the spdep package (Bivand et al. 2008). It includes a number of features such as tools for the creation of spatial weights matrix, a collection of tests for spatial autocorrelation, including global and local Moran’s I and and Getis/Ord G statistics, and functions for estimating spatial simultaneous autoregressive (SAR) lag and error models. There is also a set of developed R packages that are especially interesting for geoscientists.

All utilized methods were implemented using the open-source R statistical computing environment with gstat and spgwr packages (Bivand et al. 2008) intended for modelling and prediction, as well as the sp. package which provides classes and methods for dealing with spatial data in R (Pebesma 2004). The results obtained in R can easily be converted into any of the standard GIS formats, which enables the manipulation and analysis of the results in commercial GIS packages afterwards.

There are also several open-source and commercially available software packages with associated GWR methods. Unfortunately, GWR is a time-consuming computational procedure, especially in the case of large data sets. However, it is possible to solve this problem by using grid computing (Harris et al. 2010). The spgwr package used in R environment has also been adopted for use on grid based systems.

The recently developed R package plotGoogleMaps (Kilibarda and Bajat 2012 ), designed to automatically create web maps by combining users’ data and Google Maps layers as a base map, map was also used in this study to improve insight into predicted housing price layers.

Model Fitting and Evaluation

Before regression analyses were performed, an indicator of multicollinearity between exogenous variables was examined. The variance inflation factor (VIF) test (Fox 2008) indicates the presence of multicollinearity between predictors (Table 1). Principal components analysis (PCA) is often used with the aim of transforming a dataset with many correlated variables into a dataset consisting of a smaller number of uncorrelated variables, known as principal components (PCs) (Lake et al. 1998). PCA can assist when multicollinearity exists between predictor variables and can also assist in collapsing a large set of variables into a more efficient set of uncorrelated components. However, the main drawback of using PCA is that the newly generated components complicate the interpretation of the influence of the original variables.

After the performing PCA upon the set of explanatory variables, 21 PC predictors were derived. Because the predictors are now known to be independent, we can reduce their number by using step-wise regression (Draper and Smith 1981) based on AIC or t-statistics (Table 2).

Table 2 Results of step-wise regression analysis of predictors

By looking at the specific t-values of coefficients, we can infer which predictors are significant and useful in further analysis. Because the target variable is positively skewed, we used lognormal transformation to improve linearity. The results obtained show that eight predictors are highly significant, seven are marginally significant and six predictors can be removed from the list.

In order to check for the presence of spatial correlation we performed the Lagrange Multiplier test (Anselin 1988; Bivand et al. 2008).This test is often used in spatial econometrics as a supplement to Moran’s I and is designed to determine whether the input data generate a spatial lag (LMlag) or spatial error (LMerr) model (Eq. 4). In addition, the robust LM error (RLMerr) statistics tests for error dependence in the possible presence of a missing lagged dependent variable while its counterpart, the robust RLMlag statistics test, works the other way round. Based on the obtained test results (Table 3), it appears that the spatial correlation problem is more of the “spatial error type” rather than of the “spatial lag type”.

Table 3 Lagrange Multiplier test on OLS residuals

Nevertheless, model fitting statistics indicates that the introduction of the OLS modified „spatial error data “version (OLS.ERR) could not be of benefit to improve the performance of implicit OLS (Table 4). Both models had the same R2 values (0.56), while the error sum of squares (SSE) and the root mean square error (RMSE) values as well as the Fisher statistics for homogeneity of variance indicate better model fitting performance of implicit OLS over the spatial error data OLS model. For that reason we left the implicit OLS model as the benchmark model in further analysis.

Table 4 Model fitting statistics for utilized modeling techniques

Even though quantitative evaluation measures demonstrate the similar performance for RK and GWR methods, the RK method provides slightly better performance. Both methods achieved improvement in performance, this performance was not significantly improved over the results achieved via standard OLS hedonic modeling.

Results and Discussions

The model’s residuals were obtained through the leave one out cross-validation process of all referred spatial models. Table 5 and Fig. 3 show that the model’s residuals have similar characteristics. All models demonstrate that the dataset is characterized by positive skewness and large positive kurtosis. Figure 4 shows strong linear relationship within OLS, RK and GWR residuals.

Table 5 Characteristics of model residuals for utilized modeling techniques
Fig. 3
figure 3

Box plot of the model’s residuals

Fig. 4
figure 4

Matrix plot of residual relationship between referred models

A semivariogram of OLS residuals was created in order to examine the spatial distribution of the model’s residuals. Specifically, the semivariogram determines the range of the residuals, i.e. the distance at which the spatial correlations between observations fall to zero (Fig. 5). Based upon this information, we built a spatial (row standardized) weights matrix W (Eqs. 12 and 13) in which all buildings that are located within 750 m from one another are considered as neighbors. At same time the experimental variogram of GWR residuals exhibits practically no autocorrelation.

Fig. 5
figure 5

Semivariograms of the OLS (left) and GWR (right) residuals

The global Moran’s I (Eq. 12) was computed for all three models (Table 6). The global MI for the OLS residuals is significantly positive (Z.value > 1.96) indicating that model residuals tend to be similar across the space. In contrast, the global MI for RK’s and GWR’s are significantly negative (Z.value < −1.96), which means their residuals are clustered with dissimilar values (positive are surrounded with negative, and vice versa).

Table 6 Global Moran (MI) for utilized modeling techniques

The local indicator of spatial association MI i was used to determine “hot spots” (positive autocorrelation or similarity) and “cold spots” (negative autocorrelation or dissimilarity) of residual values of all utilized models (Anselin 1995). Table 7 indicates that local MI i of RK and GWR residuals have similar characteristics while MI i of OLS residuals are considerably different. Obviously, RK and GWR produced more frequently negative local MI i with stronger linear relationship in contrast to OLS (Fig. 6).

Table 7 Local Moran indexes (MI i ) for utilized modeling techniques
Fig. 6
figure 6

Matrix plot of local Moran’s indexes relationship between referred models

Similar patterns were obtained for calculated Z-values of the local Moran (MI i ) (Table 8).

Table 8 Z-values of the local Moran (MI i ) for utilized modeling techniques

The local Z-values for the local MI i were also evaluated for the significance level 0.05 (Zα=1.96). Table 9 indicates that RK and GWR produced a similar percentage of significant Z-values, where higher percentages among significant values are negative (76% and 71%, respectively). These results indicate the clustering of the model residuals, whereby a large residual tends to be surrounded by smaller neighboring residuals and vice versa when a small residual is surrounded by larger ones. On the other hand the majority of significant positive Z-values indicate opposite behavior of the OLS model (71%) – i.e., it generates more clusters of either positive or negative model residuals. That means that OLS produced sub-areas with underestimated (positive residuals) or overestimated (negative residuals) prediction.

Table 9 Significant Z-values for local Moran’s indexes

The visual inspection of the mapped residuals may give us more detailed insight into the performance of the model in the case of particular observations (Figs. 7, 8 and 9). Spatial patterns of Z-values in terms of size, sign and clustering are apparently similar for GWR and RK.

Fig. 7
figure 7

Mapped local MI values of OLS residuals

Fig. 8
figure 8

Mapped local MI values of GWR residuals

Fig. 9
figure 9

Mapped local MI values RK residuals

Generated local MI Z-values can also be completely mapped by plotGoogleMaps package (Kilibarda and Bajat 2012) in HTML format (available at http://osgl.grf.bg.ac.rs/en/materials/hedonic/), which has become a standard medium for cartographic communication.

Finally, based upon the estimated model parameters, we produced raster maps that depict predictions for dwelling prices over the whole case study area. The raster maps were generated in the same resolution (20 m) as input maps/grids (Figs. 10 and 11). Visual inspection of spatially predicted maps clearly emphasizes the limitation of the GWR method (Fig. 10). At first glance, the artefacts (areas with high price values) generated at the peripheral areas (north-western and northern) are easily distinguished on the GWR map. Those artefacts are caused by the spatial pattern of observed transactions (Fig. 1) because most of the transactions are grouped in the inner part of the city and peripheral parts are apparently lacking data. The peripheral parts of the case study area did not have any data on transactions and therefore the predicted values are a result of extrapolation.

Fig. 10
figure 10

The map of housing prices predicted by GWR (also available in HTML format at: http://osgl.grf.bg.ac.rs/static/materials/bajat/GWR_RK_htm/gwr.htm)

Fig. 11
figure 11

The map of housing prices predicted by RK (also available in HTML format at: http://osgl.grf.bg.ac.rs/static/materials/bajat/GWR_RK_htm/rk.htm)

Conclusions

This chapter demonstrates how GWR and RK techniques- which are already widely used for spatial prediction in various environmental studies – be useful tools for the prediction and mapping of housing prices. The primary objective of this research was to compare the performance of recently introduced spatial modelling methods in the hedonic prediction of residential property values in city of Belgrade, based upon examination of spatial distribution and heterogeneity in model residuals. Spatial models including geographically weighted regression (GWR) and regression-kriging (RK) were estimated in contrast to standard hedonic price modelling based upon an ordinary least squares (OLS) technique. Although similar values for quantitative evaluation measures were obtained for by each model, spatial patterns of residuals for the RK and GWR models were found to be different in contrast to the OLS model. The values erroneously predicted by the GWR model (located in the peripheral part of the city) indicate that this technique cannot correctly handle the samples pattern used in this study.

Coupling this methodology with GIS and the Web 2.0 environment facilitates large-scale housing price appraisal in the framework of collaborative GIS – thereby enabling platforms for evaluation and spatial decision support. The use of a Web-based GIS tools enables authorities to combine the different spatial layers, particularly socioeconomic datasets provided as raster maps, to spatially model distribution of housing values. This methodology provides a reliable view to spatial distribution of housing price and can be useful in hedonic price modelling.

Further research could focus on the application of spatiotemporal geostatistical techniques in hedonic price modelling. By combining the growing number of transaction database records dating from different time periods with the great variety of publically accessible spatial data in GIS formats, housing price models can be developed for spatial and temporal domains. The application of GWR and RK should be reinforced in hedonic price modelling in regard to the latest developments in the field of spatio-temporal modelling (Huang et al. 2010; Cressie and Wikle 2011).