Prediction of municipal solid waste generation using artificial neural network approach enhanced by structural break analysis

Adamović, Vladimir M.; Antanasijević, Davor Z.; Ristić, Mirjana Đ.; Perić-Grujić, Aleksandra A.; Pocajt, Viktor V.

doi:10.1007/s11356-016-7767-x

Research Article
Published: 07 October 2016

Volume 24, pages 299–311, (2017)

Environmental Science and Pollution Research




Abstract

This paper presents the development of a general regression neural network (GRNN) model for the prediction of annual municipal solid waste (MSW) generation at the national level for 44 countries of different size, population and economic development level. Proper modelling of MSW generation is essential for the planning of MSW management systems as well as for the simulation of various environmental impact scenarios. The main objective of this work was to examine the potential influence of an economic crisis (global or local) on the forecast of MSW generation obtained by the GRNN model. The existence of so-called structural breaks that occur because of an economic crisis in the studied period (2000–2012) was determined and confirmed for each country using the Chow test and the Quandt–Andrews test. Two GRNN models were developed: one which did not take into account the influence of the economic crisis (GRNN) and another one which did (SB-GRNN). The novelty of the applied method is that it uses broadly available social, economic and demographic indicators and indicators of sustainability, together with GRNN and structural break testing, for the prediction of MSW generation at the national level. The obtained results demonstrate that the SB-GRNN model provides more accurate predictions than the model which neglected structural breaks, with a mean absolute percentage error (MAPE) of 4.0 % compared to 6.7 % generated by the GRNN model. The proposed model enhanced with structural breaks can be a viable alternative for a more accurate prediction of MSW generation at the national level, especially for developing countries for which a lack of MSW data is notable.


Introduction

Anthropogenic activities are always associated with the production of waste to a greater or lesser extent; hence, appropriate management of municipal solid waste (MSW) is crucial for any community in order to prevent environmental pollution and to reduce the risk for public health. Planning and development of strategies for waste management greatly depend on the ability to accurately predict the amount of MSW (Daskalopoulos et al. 1998; Bandara et al. 2007; Noori et al. 2009).

The main drivers of progressive growth of the quantity of generated waste are the increase in population, economic growth, as well as changes in life style and consumption patterns. The connection between the level of economic development and the quantities of MSW generated has been particularly significant (Breede and Bloom 1995; Khajuria et al. 2011; Hoornweg and Bhada-Tata 2012).

In economic theory, it is well known that significant events such as political or economic crises, natural catastrophes, wars, etc. can cause major changes in the economy that can lead to large differences in various socio-economic parameters, including the amount of generated MSW, between the periods before and after the change (Gujarati and Porter 2009; Greene 2012). These differences in socio-economic parameters, which are referred to as structural breaks (Verbeek 2004), have been identified as a reason for a decrease in the accuracy of models based on economic parameters, e.g. the accuracy of a bankruptcy prediction model dropped from 83.3 to 66.7 % or even lower under crisis conditions (Sung et al. 1999). Although the impact of an economic crisis on the amount of MSW generated has still not been sufficiently studied, it can be assumed that significant changes in the economy are reflected in the generation of MSW (Inglezakis et al. 2012; Watson 2013).

Artificial neural networks (ANNs), among other regression methods, have been widely applied for the prediction of MSW generation (Noori et al. 2010; Ali Abdoli et al. 2011; Batinic et al. 2011; Antanasijević et al. 2013a; Shamshiry et al. 2014).

The modelling of solid waste generation on a weekly basis in Mashhad (Iran) was performed using a back propagation (BP) ANN by Noori et al. (2010). Heuristic techniques and standard numerical optimization techniques were used to optimize the network weights and biases in the BP ANN model. The initial number of input variables (13) was reduced using principal component analysis (7 inputs) and the Gamma test (5 inputs), and the resulting models performed better than the initial ANN model.

Also, the capability of multilayer perceptron (MLP) for modelling of long-term solid waste generation for the period of 2011–2032 in the city of Mashhad (Iran) was studied by Ali Abdoli et al. (2011). MLP is a type of feed forward ANN consisting of layers which are all fully connected and where neurons are represented by perceptrons with non-linear activation function. MLP utilizes back propagation as a supervised learning technique (Rosenblatt 1961; Rumelhart et al. 1986; Chau and Wu 2010). Population, household income and maximum temperature were indicated as significant factors for the generation of solid waste. Comparison between the results of ANN and multivariate regression model indicated that ANN approach had better performance in predicting the municipal solid waste generation.

An ANN model was used to determine the relation between the amount and composition of generated waste on one side with socio-economic indicators in ten municipalities of Serbia on the other (Batinic et al. 2011), with the average income, level of employment, age structure, educational level and housing conditions being used as socio-economic input indicators. Outputs were presented as six waste categories as follows: organic waste, paper, glass, metal, plastic and other waste. The model used projected socio-economic inputs for 2010–2026 to forecast the quantity and composition of waste.

In order to develop an ANN model for the prediction of annual MSW generation in European countries, two different architectures were evaluated: back propagation neural network (BPNN) and general regression neural network (GRNN; Antanasijević et al. 2013a). The gross domestic product (GDP), domestic material consumption (DMC) and resource productivity (RP) were used as input variables and municipal solid waste generation as the output parameter, and data for 26 countries from the Eurostat database were used for the training and validation of the models. With both models (BPNN and GRNN) trained and tested using the same dataset, the model based on the GRNN architecture achieved better results.

The prediction of the amount of solid waste in the tourist area of Langkawi Island, Malaysia during 2004–2009 was carried out using multiple regression analysis (MRA) and ANN (Shamshiry et al. 2014). Weekly data of solid waste generation were used as the output variable, while the fuel consumption, types of trucks and their trips and number of entrances to landfill were used as the input variables. Comparison between the final results showed that ANN has higher accuracy than regression analysis.

This paper describes the development of an ANN model enhanced with structural break analysis for the prediction of MSW generation at the national level. The model was created and tested using data for 44 countries, comprising the members of the Organization for Economic Co-operation and Development (OECD), the 28 member states of the European Union (EU28) and some OECD partner countries, for the period from 2000 to 2012. With regard to the global economic crisis of 2008 and the structural changes it caused, two different ANN models were compared: one in which the structural breaks were neglected and another in which they were taken into account.

Materials and methods

Input and output parameters

In order to provide an ANN-based model with accurate predictions, it is very important to identify the parameters that significantly affect the amount of generated waste (Benítez et al. 2008; Gallardo et al. 2014) and to utilize adequate input data.

Different indicators related to economy, demography, industry and environmental phenomena, as well as to social and consumer habits were used as initial input variables. Most of these parameters were previously used for MSW generation modelling like GDP (Intharathirat et al. 2015; Antanasijević et al. 2013a; Daskalopoulos et al. 1998; Chung 2010; Rimaityte et al. 2012), DMC (Antanasijević et al. 2013a), share of urban population (Bandara et al. 2007; Lebersorger and Beigl 2011; Keser et al. 2012; Intharathirat et al. 2015), population density (Benítez et al. 2008; Lebersorger and Beigl 2011; Keser et al. 2012; Gallardo et al. 2014; Intharathirat et al. 2015), household size (Dyson and Chang 2005; Lebersorger and Beigl 2011; Keser et al. 2012; Intharathirat et al. 2015), unemployment rate (Rimaityte et al. 2012; Keser et al. 2012; Intharathirat et al. 2015), etc. The list of input parameters and their descriptive statistics for the period 2000–2012 are presented in Table 1.

Table 1 Descriptive statistics of inputs selected for modelling of MSW generation

Full size table

Domestic material consumption (DMC), presented in Table 1, measures the total amount of materials directly used in the economy, excluding hidden flows (UN 2003). Value added of industry refers to the contribution of industry to overall GDP. Inbound tourism expenditure includes expenditure of non-resident visitors. The population within the age group 20 to 65 is the share in total population of people in that age group on 1st January of the current year. Unemployment rate is expressed as the share of total working-age population. Alcohol consumption among the population with an age of 15 years and more is expressed by litres per capita and per year. Carbon dioxide (CO₂) emissions from residential buildings and commercial and public services contain all emissions from fuel combustion in households and they are given in Table 1 as the share of total fuel combustion.

This study includes 44 countries: 34 are OECD member countries, 28 are EU members (EU28) and 3 are additional OECD partner countries. Seven of the EU28 countries are not OECD members, four European OECD countries are not members of the European Union and nine OECD countries are non-European (Fig. 1, Table 2). The data for the 44 observed countries were mainly collected from the following databases: OECD Statistics (OECD 2015a), Eurostat, the European Statistical Office (European Commission 2015), the World Bank (World Bank 2015) and the United Nations Department of Economic and Social Affairs (UN 2015).

Table 2 Descriptive statistics of MSW generation (kilograms per capita)

Full size table

About 57 % of the observed population and about 49 % of the surface area of the 44 countries covered by this study belong to the OECD partner countries (Brazil, China and Russia), while about 17 % of the population and nearly 6 % of the total surface area belong to the EU28 countries.

The 44 selected countries differ greatly in terms of the size of their territory, population and economic and industrial development, on the one hand, and in terms of their social and cultural habits, on the other. Additionally, the generation of municipal solid waste can be affected by climatic conditions that vary significantly between the countries covered in this research (Gómez et al. 2009; Keser et al. 2012; Denafas et al. 2014). All of this complicates the creation of a single prediction model for all of the observed countries.

Annual quantities of generated municipal solid waste in kilograms per capita at the national level were used as the single output variable in this research. These data were obtained from OECD Environment Statistics (OECD 2015b), Eurostat, the European Statistical Office (European Environment Agency 2014) and from national databases in cases where the data were not available in the two abovementioned sources. Statistics of municipal solid waste generation at the national level and for the entire MSW dataset are shown in Table 2.

The dataset is organized as a panel data form, which combines characteristics of both time series and cross-section data. Time series is a set of observations on the values that a variable has at different times, while cross-section data is data coming from one or more variables collected at the same point in time. Panel or longitudinal data is a special type of data in which the same cross-sectional unit is surveyed over time. In short, panel data has space as well as time dimensions (Gujarati and Porter 2009).

In addition, this dataset represents a so-called balanced panel, which means that each subject (country) has the same number of observations. The panel contains data for 12 independent variables and 1 dependent variable for 44 countries over a period of 13 years (from 2000 to 2012). The dataset was divided into three subsets: the first two were used for training and validation of the ANN model (in a proportion of 4:1), while the third one (test subset) was used to test the model prediction capability.
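As a rough illustration of the panel layout and split described above, the following Python sketch builds the index of a balanced 44 × 13 panel and divides it into training, validation and test subsets. The country codes, the random seed and the exact subset sizes are placeholders assumed for illustration; the year-based test split (2011–2012) follows the GRNN setup reported in the results section, and the partition actually used by the authors is the one given in Table 4.

```python
import random

# Hypothetical country codes; the study's real country list is not reproduced here.
countries = [f"C{i:02d}" for i in range(1, 45)]
years = list(range(2000, 2013))  # 2000..2012 inclusive, i.e. 13 years

# Balanced panel: every country has exactly one observation per year.
panel = [(country, year) for country in countries for year in years]

# Development data (training + validation) and test data, split by year.
development = [obs for obs in panel if obs[1] <= 2010]
test = [obs for obs in panel if obs[1] >= 2011]

# Random split of the development data at roughly 4:1 (the seed is arbitrary).
rng = random.Random(0)
rng.shuffle(development)
n_validation = len(development) // 5
validation = development[:n_validation]
training = development[n_validation:]

print(len(panel), len(training), len(validation), len(test))
```

Splitting by year rather than at random keeps the test subset strictly later in time than the data used to build the model, which matches the forecasting use case.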

Artificial neural networks

Artificial neural networks (ANNs) are mathematical tools inspired by biological neural networks. The development of ANNs derives from the desire to construct artificial systems capable of sophisticated “smart” calculation, in a similar way as the human brain routinely performs (Freeman and Skapura 1991).

An artificial neuron receives input signals analogous to electrochemical impulses and responds with an appropriate output, which to a certain extent corresponds to the output of a biological neuron. An ANN consists of neurons grouped into layers (input, hidden and output layer), whereby an ANN can have one or more hidden layers. In most cases, one hidden layer is sufficient for an ANN to approximate any nonlinear function (Noori et al. 2010).

In this study, a general regression neural network (GRNN; Specht 1991) was used, since it proved to be superior to back propagation (BP) ANN for the forecasting of municipal waste generation at the national level (Antanasijević et al. 2013a). The GRNN architecture consists of four layers, where the number of neurons in the input layer corresponds to the number of input variables, while the number of neurons in the output layer is equal to the number of output variables. The number of pattern neurons corresponds to the number of data patterns, and the number of neurons in the summation sublayer is always one higher than the number of output neurons. Since there is one output neuron in this case, there are two neurons in the summation sublayer, one being the summation neuron and the other the division neuron (Fig. 2).

A GRNN learning algorithm can be regarded as a type of Nadaraya–Watson kernel regression (Tomandl and Schober 2001). The regression of a dependent variable y, which is usually a vector and represents the system output, on an independent variable x (usually also a vector and the system input) is, in fact, the computation of the most probable value of y. The determination of the y value for a known x value requires the assumption of a functional dependency with unknown parameters. In the GRNN algorithm, this functional dependency is expressed in terms of a probability density function f(x,y), which is estimated from the observed data using Parzen window estimation (Specht 1991).

Considering that the density function f(x,y) is not known, it needs to be calculated using the known values of variables x and y. Within this calculation, the prediction of the unknown value y is actually performed on the basis of a probability whose range (width) depends on a parameter known as the smoothing factor (σ_f), which is determined for every pair of Y and X. X is a particular (e.g. measured) value of the random variable x, and the correlated Y is a particular value of the random variable y (Tomandl and Schober 2001). The final probability is equal to the sum of these individual probabilities.

The smoothing factor represents the width of the Gaussian curve for every individual probability density function, and it is the only unknown parameter in the GRNN algorithm. The smoothing factor is always greater than 0: the larger it is, the smoother the regression surface, whereas as it approaches zero the estimate tends to the output of the nearest training pattern. In this study, a genetic algorithm was used to determine the smoothing factor; more details on this approach can be found in Kim and Kim (2008) and Chen and Chang (2009).

In practice, the GRNN weights the output of each training pattern according to the distance between the input vector and that pattern, using the following equation:

$$ Y(X)=\frac{\sum_{i=1}^{n}{Y}_i\exp \left(-\frac{D_i^2}{2{\sigma}_f^2}\right)}{\sum_{i=1}^{n}\exp \left(-\frac{D_i^2}{2{\sigma}_f^2}\right)} $$

(1)

Y(X) is the value obtained by the GRNN for input X, Y_i is the measured (accurate) output of the i-th of the n training patterns, and D_i is the Euclidean distance between X and the i-th training pattern in N-dimensional space.

The numerator in Eq. 1 is the summation neuron, which computes the weighted sum of the outputs of the pattern layer, while the denominator is the division neuron, which computes the unweighted sum of the outputs of the pattern neurons. In order to get the desired estimate, the output layer divides the output of the summation neuron by the output of the division neuron (Antanasijević et al. 2014).
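A minimal Python sketch of Eq. 1 is given below, assuming a single output variable and a fixed smoothing factor. The function name `grnn_predict`, the toy training patterns and the chosen values of the smoothing factor are illustrative assumptions and are not taken from the study, where the smoothing factor was found with a genetic algorithm.

```python
import math


def grnn_predict(x, train_x, train_y, sigma):
    """Evaluate Eq. 1 for one input vector x and a smoothing factor sigma > 0."""
    numerator = 0.0    # summation neuron: weighted sum of training outputs
    denominator = 0.0  # division neuron: sum of the weights alone
    for x_i, y_i in zip(train_x, train_y):
        # Squared Euclidean distance D_i^2 between x and training pattern i.
        d2 = sum((a - b) ** 2 for a, b in zip(x, x_i))
        weight = math.exp(-d2 / (2.0 * sigma ** 2))
        numerator += y_i * weight
        denominator += weight
    # If every weight underflows to 0.0 (x far from all patterns relative to
    # sigma), this division fails; inputs are normally scaled to avoid that.
    return numerator / denominator


# Toy one-dimensional training patterns (made up for illustration).
train_x = [[0.0], [1.0], [2.0]]
train_y = [10.0, 20.0, 30.0]

pred_centre = grnn_predict([1.0], train_x, train_y, sigma=0.5)
pred_near_first = grnn_predict([0.1], train_x, train_y, sigma=0.05)
print(pred_centre, pred_near_first)
```

With a very small smoothing factor the prediction is dominated by the closest training pattern, while a larger value averages over more patterns.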

Structural breaks

Major events in the financial markets (such as a global or regional financial crisis), in the commodity markets (e.g. large fluctuations in the oil price) or in the field of legislation (e.g. adoption and implementation of some crucial laws) may cause abrupt changes in the economic and social environment (Greene 2012), called structural breaks. In that scenario, the economy can be expected to show different features and performance in the periods before and after the structural break. If a structural break exists, neglecting it can lead to significant errors in the modelling of economic parameters.

One such event is the financial crisis of 2007–2009, which has continued to seriously affect the world economy up to the present day (Dwyer and Lothian 2012). Considering the occurrence of the global economic crisis, the presence of a structural break in the studied period (2000–2012) is therefore to be expected. To test this hypothesis, the panel dataset was divided into 44 individual time series, one for each country.

Tests of structural breaks can generally be classified into two groups: tests based on the assumption that the break date is known and tests that examine the presence of structural break at an unknown place within the sample. Most of the tests estimate whether a structural break is present by using the null hypothesis of no structural change, against the alternative of break at time τ (Perron 2006).

The Chow test is most commonly used to test for the presence of a structural break in a time series when the break date is known (Wooldridge 2013). For a regression model:

$$ {y}_t={\beta}_0+{\beta}_1{x}_t+{u}_t,\quad \text{for all}\ t=1,2,\ldots ,T $$

(2)

the sum of squared residuals is:

$$ SS{R}_R=\sum_{t=1}^T{u}_t^2 $$

(3)

But if there is a structural break at time τ in the regression model (Eq. 2), the sample can be split around the break point into:

$$ {y}_{1t}={\beta}_{10}+{\beta}_{11}{x}_{1t}+{u}_{1t},t=1,2,\ldots \tau $$

(4)

and

$$ {y}_{2t}={\beta}_{20}+{\beta}_{21}{x}_{2t}+{u}_{2t},t=\tau +1,\ldots T $$

(5)

The individual sum of squared residuals is:

$$ SS{R}_1=\sum_{t=1}^{\tau }{u}_{1t}^2\quad \mathrm{and}\quad SS{R}_2=\sum_{t=\tau +1}^T{u}_{2t}^2 $$

(6)

The total sum of squared residuals of the two sub-samples (Eq. 6) is:

$$ SS{R}_{UR}=SS{R}_1+SS{R}_2 $$

(7)

The sum of squared residuals SSR_R (Eq. 3) from the pooled estimation (Eq. 2) is the restricted sum of squared residuals because it is obtained by imposing the restrictions that β₁₀ = β₂₀ and β₁₁ = β₂₁, so there is only one regression model (Eq. 2). On the other hand, the sum of squared residuals SSR_UR (Eq. 7) for the two separately estimated time periods (Eqs. 4 and 5) is the unrestricted sum of squared residuals.

The Chow test is based on the Wald statistic and is given as the F statistic, which compares the restricted and unrestricted sums of squared residuals. For a single breakpoint, it is computed as (Gujarati and Porter 2009):

$$ F=\frac{\left(SS{R}_R-SS{R}_{UR}\right)/k}{\left(SS{R}_{UR}\right)/\left(T-2k\right)} $$

(8)

where T is the total number of observations and k is the number of parameters in the observed equation.

The null hypothesis of the test is that there is no break at the specified breakpoint. In that case, β₁₀ = β₂₀ and β₁₁ = β₂₁, which implies that Eq. 2 can be used. But if F from Eq. 8 is greater than the critical value of the F distribution with k and T − 2k degrees of freedom at the 5 % significance level, the null hypothesis is rejected and the structural change cannot be ignored.
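The calculation in Eqs. 2–8 can be sketched in a few lines of Python for the simple case of one regressor (k = 2). The helper names `ols_ssr` and `chow_f` and the two 13-point series are assumptions made for illustration only; they are not the study's data, and the comparison of F with a critical value is left to the reader's statistical tables or software.

```python
def ols_ssr(x, y):
    """Sum of squared residuals of the least-squares line y = b0 + b1 * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return sum((b - (b0 + b1 * a)) ** 2 for a, b in zip(x, y))


def chow_f(x, y, tau, k=2):
    """Chow F statistic (Eq. 8) for a break after the first tau observations."""
    T = len(x)
    ssr_r = ols_ssr(x, y)  # restricted: one pooled regression (Eq. 3)
    # Unrestricted: separate regressions before and after the break (Eq. 7).
    ssr_ur = ols_ssr(x[:tau], y[:tau]) + ols_ssr(x[tau:], y[tau:])
    return ((ssr_r - ssr_ur) / k) / (ssr_ur / (T - 2 * k))


# Made-up series of 13 annual observations with a small alternating disturbance.
x = list(range(13))
noise = [0.1 if t % 2 == 0 else -0.1 for t in x]
y_break = [(10.0 if t < 8 else 20.0) + e for t, e in zip(x, noise)]  # level shift
y_stable = [10.0 + 0.5 * t + e for t, e in zip(x, noise)]            # no break

f_break = chow_f(x, y_break, tau=8)
f_stable = chow_f(x, y_stable, tau=8)
print(f_break, f_stable)
```

The series with the level shift yields a very large F, whereas the series that follows a single line yields an F well below 1, so only the former would lead to rejection of the null hypothesis.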

Besides the Chow test, the Quandt–Andrews test has also been applied in this study for finding unknown structural breakpoints in the sample. The basic idea of this test is that a single Chow test is performed at every observation over the interval [ξT, (1 − ξ)T], and after that, all of the n test statistics from those tests are summarized and the supremum of the F statistics is calculated (Berger 2011):

$$ \sup F=\sup_{\tau \in \left[\xi T,\left(1-\xi \right)T\right]}F\left(\tau \right) $$

(9)

Two additional test statistics, the average and exponential F statistics, have been developed (Andrews and Ploberger 1994):

$$ Ave\ F=\frac{1}{n}\sum_{\tau =\xi T}^{\left(1-\xi \right)T}F\left(\tau \right) $$

(10)

$$ Exp\ F=\ln \left[\frac{1}{n}\sum_{\tau =\xi T}^{\left(1-\xi \right)T}\exp \left(\frac{1}{2}F\left(\tau \right)\right)\right] $$

(11)

The trimming parameter (ξ) is used because the distributions of the statistics (Eqs. 9–11) become degenerate as the candidate break date approaches the beginning or the end of the sample. Because of that, it is generally suggested that the first ξT and the last ξT observations are not included in the testing procedure, so that break dates are examined only within [ξT, (1 − ξ)T]. As in the Chow test, the null hypothesis of no break is rejected in the Quandt–Andrews test if the maximum of the F statistic is greater than the critical value.
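The procedure behind Eqs. 9–11 can be sketched in Python as a loop of Chow tests over the trimmed range of candidate break dates. The function names, the trimming value of 0.15, the rule of keeping at least k + 1 observations per segment and the example series are all assumptions made for this illustration. The critical values of these statistics are non-standard and are not computed here.

```python
import math


def ols_ssr(x, y):
    """Sum of squared residuals of the least-squares line y = b0 + b1 * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    b1 = sxy / sxx
    b0 = mean_y - b1 * mean_x
    return sum((b - (b0 + b1 * a)) ** 2 for a, b in zip(x, y))


def chow_f(x, y, tau, k=2):
    """Chow F statistic for a break after the first tau observations."""
    ssr_r = ols_ssr(x, y)
    ssr_ur = ols_ssr(x[:tau], y[:tau]) + ols_ssr(x[tau:], y[tau:])
    return ((ssr_r - ssr_ur) / k) / (ssr_ur / (len(x) - 2 * k))


def quandt_andrews(x, y, trim=0.15, k=2):
    """Return (break position, sup F, ave F, exp F) over the trimmed sample."""
    T = len(x)
    # Assumption of this sketch: keep at least k + 1 points in each segment.
    first = max(k + 1, math.ceil(trim * T))
    taus = list(range(first, T - first + 1))
    f_values = [chow_f(x, y, tau, k) for tau in taus]
    sup_f = max(f_values)                         # Eq. 9
    best_tau = taus[f_values.index(sup_f)]
    ave_f = sum(f_values) / len(f_values)         # Eq. 10
    # Eq. 11, evaluated with the log-sum-exp trick to avoid overflow.
    half_max = sup_f / 2.0
    exp_f = half_max + math.log(
        sum(math.exp(f / 2.0 - half_max) for f in f_values) / len(f_values)
    )
    return best_tau, sup_f, ave_f, exp_f


# Made-up series: level shift after the eighth observation plus small noise.
x = list(range(13))
y = [(10.0 if t < 8 else 20.0) + (0.1 if t % 2 == 0 else -0.1) for t in x]

best_tau, sup_f, ave_f, exp_f = quandt_andrews(x, y)
print(best_tau, sup_f, ave_f, exp_f)
```

In this example the largest F statistic is found at the position of the simulated level shift, which is how the test locates a break whose date is not known in advance.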

Performance metrics

The characteristics of models and their ability to provide accurate results in this study were determined using the following criteria:

The root mean squared error (RMSE):

$$ RMSE=\sqrt{\frac{1}{n}\left[{\displaystyle {\sum}_{i=1}^n{\left({P}_i-{O}_i\right)}^2}\right]} $$

(12)

The mean absolute error (MAE):

$$ MAE=\frac{1}{n}{\displaystyle {\sum}_{i=1}^n\left|{P}_i-{O}_i\right|} $$

(13)

The mean absolute percentage error (MAPE):

$$ MAPE=\frac{1}{n}{\displaystyle {\sum}_{i=1}^n\frac{\left|{P}_i-{O}_i\right|}{O_i}\cdot 100} $$

(14)

Percentage of prediction within a factor 1.1 (FA1.1) of observed values:

$$ 0.9<\frac{P_i}{O_i}<1.1 $$

(15)

Modified index of agreement (d ₁):

$$ {d}_1=1-\frac{\sum_{i=1}^n\left|{P}_i-{O}_i\right|}{\sum_{i=1}^n\left(\left|{P}_i-\overline{O}\right|+\left|{O}_i-\overline{O}\right|\right)} $$

(16)

The Nash–Sutcliffe coefficient of efficiency (E _f):

$$ {E}_f=1-\frac{\sum_{i=1}^n{\left({P}_i-{O}_i\right)}^2}{\sum_{i=1}^n{\left({O}_i-\overline{O}\right)}^2} $$

(17)

In Eqs. 12–17, n is the number of predictions, P_i is the predicted and O_i the observed value of MSW generation, and Ō is the mean of the observed values.

FA1.1 shows the proportion of predictions with an error of less than ±10 %, i.e. the percentage of cases in which the ratio between the predicted and observed values is in the range of 0.9 to 1.1. d₁ was introduced by Legates and McCabe Jr (1999), and the advantage of this form of Willmott’s index of agreement is that the errors and differences are given their appropriate weighting. The Nash–Sutcliffe coefficient of efficiency ranges from −∞ to 1, where E_f = 0 means that the model performs no better than simply taking the mean of the observed values as the predicted output (Wang et al. 2015; Duveiller et al. 2016).
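The six criteria can be computed together as in the following Python sketch. The function name `performance_metrics` and the three observed and predicted values are hypothetical and serve only to show the arithmetic. E_f is computed with the standard Nash–Sutcliffe denominator, i.e. the squared deviations of the observed values from their mean.

```python
import math


def performance_metrics(predicted, observed):
    """Return RMSE, MAE, MAPE, FA1.1, d1 and E_f for paired value lists."""
    n = len(observed)
    mean_obs = sum(observed) / n
    errors = [p - o for p, o in zip(predicted, observed)]
    abs_error_sum = sum(abs(e) for e in errors)
    sq_error_sum = sum(e * e for e in errors)

    rmse = math.sqrt(sq_error_sum / n)
    mae = abs_error_sum / n
    mape = 100.0 / n * sum(abs(e) / o for e, o in zip(errors, observed))
    # Share of predictions whose ratio to the observation lies in (0.9, 1.1).
    fa11 = 100.0 / n * sum(
        1 for p, o in zip(predicted, observed) if 0.9 < p / o < 1.1
    )
    d1 = 1.0 - abs_error_sum / sum(
        abs(p - mean_obs) + abs(o - mean_obs)
        for p, o in zip(predicted, observed)
    )
    ef = 1.0 - sq_error_sum / sum((o - mean_obs) ** 2 for o in observed)
    return {"RMSE": rmse, "MAE": mae, "MAPE": mape,
            "FA1.1": fa11, "d1": d1, "Ef": ef}


# Hypothetical MSW values in kg per capita (not taken from the study).
observed = [400.0, 500.0, 600.0]
predicted = [420.0, 500.0, 480.0]

metrics = performance_metrics(predicted, observed)
for name, value in metrics.items():
    print(name, round(value, 3))
```

MAPE and FA1.1 divide by the observed value, so both require strictly positive observations, which holds for per capita waste quantities.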

Results and discussion

Correlation analysis

Besides the selection of representative independent input variables for a particular dependent output variable, the performance of ANN model may be strongly affected by a potential correlation among independent variables. If correlated input data are used, then this can cause confusion to the neural network during the learning process. For this reason, if two variables are highly correlated, one of them can be removed from the dataset without adversely affecting the ANN performance (Walczak and Cerpa 1999) or even with an increase of model performance (Antanasijević et al. 2013b).

In this study, after the data was collected, a correlation analysis was carried out, the primary objective thereof being to measure the strength or degree of linear association between two variables (Gujarati and Porter 2009). To examine the relationship between input variables, Pearson correlation coefficients were used. A correlation coefficient greater than 0.8 is interpreted as high (Hamilton 1990) and variables in that case are highly correlated. For this reason, if there are input variables with the mutual correlation coefficient greater than 0.8, one of these variables can be removed. The conducted correlation analysis showed that there were no variables with a mutual coefficient of correlation higher than 0.8 (Table 3).
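The screening step described above can be sketched in Python as follows. The variable names `x1` to `x3`, their values and the use of the absolute value of the coefficient (so that strong negative correlations are also flagged) are assumptions of this illustration and are not taken from Table 3.

```python
import math
from itertools import combinations


def pearson(a, b):
    """Pearson correlation coefficient of two equally long value lists."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    var_a = sum((u - mean_a) ** 2 for u in a)
    var_b = sum((v - mean_b) ** 2 for v in b)
    return cov / math.sqrt(var_a * var_b)


def highly_correlated_pairs(columns, threshold=0.8):
    """List the pairs of inputs whose |r| exceeds the threshold."""
    flagged = []
    for (name_a, col_a), (name_b, col_b) in combinations(columns.items(), 2):
        r = pearson(col_a, col_b)
        if abs(r) > threshold:
            flagged.append((name_a, name_b, r))
    return flagged


# Hypothetical input variables with five observations each.
columns = {
    "x1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "x2": [2.0, 4.0, 6.0, 8.0, 11.0],  # almost proportional to x1
    "x3": [5.0, 1.0, 4.0, 2.0, 3.0],
}

pairs = highly_correlated_pairs(columns)
print(pairs)
```

For each flagged pair, one of the two variables would be a candidate for removal before training.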

Table 3 Correlation analysis results

Full size table

Testing structural breaks

In this study, it was necessary to examine whether and when there was an occurrence of structural changes in the observed countries. To test the presence of structural breaks for each country, the Quandt–Andrews test was conducted first. After that, for every single year after the year when a structural break occurred, the Chow test was applied to check whether there were any additional structural changes after the appearance of the dominant structural break.

By applying the Chow test and the Quandt–Andrews test to each of the individual countries, it can be concluded that for most of the countries there was a statistically significant structural break which occurred as a consequence of the 2007–2009 global financial crisis (Fig. 3).

As can be seen from Fig. 3, statistically significant structural breaks as a result of the global financial crisis (period from 2007 to 2010) occurred in 68.2 % of all observed countries. In the previous years (2002–2006), structural changes occurred in 20.4 % of the sample. The reasons for the occurrence of structural breaks in that period may differ, for instance, a recession in 2002 in Germany (Dustmann et al. 2014) or the change in the scope of MSW to include only household waste in 2004 in Norway (ETC/SCP 2013a). There were no significant structural changes in five countries, which account for 11.4 % of all observed countries.

It should be noted that structural changes in economy were not always accompanied by simultaneous changes in waste generation. For this reason, structural breaks were lacking in some countries (Cyprus, Austria, Slovakia, Turkey and Israel) in the observed period.

The prediction of MSW generation

Considering the results of structural break testing, two models were created: a GRNN model, in which structural breaks were neglected, and an SB-GRNN model, which took structural breaks into account. In the GRNN model, the data from the years 2000–2010 were used for training and validation (484 data patterns), while the data from the years 2011–2012 were used to test the model (Table 4). The data from 2000 to 2010 were randomly divided into training and validation subsets at a ratio of about 4:1, respectively. In the SB-GRNN model, only the data from the years after the structural breaks (if they occurred) until 2011 were used for training and validation, while the data from 2012 were used to test the SB-GRNN model. It can be observed that the SB-GRNN model was tested with 50 % fewer data points, and this reduction was needed in order to maintain the ratio between the data used for model development and the data used for evaluation above 4 (Table 4).
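The data selection for the SB-GRNN model can be illustrated with the short Python sketch below. The three countries and their break years are hypothetical, and the sketch assumes that "after the structural break" means strictly later than the break year; the study's actual break dates are those shown in Fig. 3.

```python
def sb_development_years(break_year, first_year=2000, last_year=2011):
    """Years used for training and validation of the SB-GRNN for one country.

    Assumption: only years strictly after the break year are kept; a country
    without a detected break (break_year is None) keeps the whole period.
    """
    start = first_year if break_year is None else break_year + 1
    return list(range(start, last_year + 1))


# Hypothetical break years for three placeholder countries.
break_years = {"A": 2008, "B": 2005, "C": None}

development = {
    country: sb_development_years(year) for country, year in break_years.items()
}
n_development = sum(len(years) for years in development.values())
n_test = len(break_years)  # one test pattern per country (year 2012)

print(development["A"], n_development, n_test, n_development / n_test)
```

Discarding the pre-break years shortens the development set, which is why the ratio between development and test patterns has to be checked after the selection.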

Table 4 Number of data patterns per dataset and corresponding subsets

Full size table

Although both models (Table 5) demonstrated good performance, better predictions were achieved with the SB-GRNN model, which had MAPE = 4.0 % and FA1.1 = 86.4 %. Since the GRNN model had MAPE = 6.7 % and FA1.1 = 80.7 %, the selection of data based on SB analysis apparently resulted in an improvement of the MSW model.

Table 5 Performance metrics of created MSW models (test dataset)

Full size table

The results obtained for the test data using the GRNN and SB-GRNN models are presented in Fig. 4. It can be seen that SB-GRNN has enhanced prediction capability (R² = 0.96, d₁ = 0.925, E_f = 0.962) in comparison with the GRNN model (R² = 0.91, d₁ = 0.871, E_f = 0.909). Further performance analysis can be made by assessing the discrepancy ratio (Fig. 4): the GRNN model has about 80 and 95 % of predictions within the error margins of ±10 and ±20 %, respectively, while SB-GRNN achieves 86 and 100 % within the same error margins. Moreover, the share of MSW predictions with a relative error of up to ±5 % increased significantly, from 54 to 75 %, as a result of taking SB into account, which yields an SB-GRNN MAPE value of only 4.0 %.

As can be observed in Table 5 and Fig. 4, the superior performance of SB-GRNN over the simple GRNN model is even more obvious, if only the results for 2012 are analysed.

Comparisons of the relative errors for the test data are presented in Fig. 5. Relatively higher errors (≥15 %) for the SB-GRNN model can be observed only for Romania, Latvia and Slovenia, but it appears that the deviation between the actual and predicted values for those countries can be attributed to the uncertainty of MSW data used for the training and testing of the model.

The highest relative error was obtained for Romania (≈20 %), for which the actual MSW values are estimated, not measured (Eurostat 2015), because waste collection services did not cover the entire population, favouring illegal dumping (Mihail 2013). Therefore, those values have increased uncertainty; it can be noticed that both models provided MSW predictions for Romania with similarly high relative errors. In the case of Latvia, the SB-GRNN model overestimated the MSW quantity, which can be related to the fact that significant amounts of municipal waste (e.g. metals and glass packaging) are exported for recovery in other countries, and therefore, this waste has not been included in the amounts that Latvia has reported to Eurostat as MSW (ETC/ECP 2013; Kara 2014). For Slovenia, the Statistical Office of the Republic of Slovenia used a methodology for collecting data on generated MSW which includes waste imported for recycling, but excludes exported waste. Also, packaging waste was not always reported as MSW, which can be another source of uncertainty (ETC/SCP 2013b).

Conclusion

This paper describes the development of a new model for forecasting the generation of MSW at the national level. The model, based on general regression neural networks, was applied to 44 countries of differing size, population, level of economic development, social patterns, climate and other factors.
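
For readers unfamiliar with the technique, the prediction step of a general regression neural network (Specht 1991) can be sketched as a Gaussian-kernel weighted average of the training targets. The sketch below is a minimal illustration, not the implementation used in this study: the function name `grnn_predict`, the smoothing factor and the toy data are assumptions, and the inputs are assumed to be already normalised.

```python
import math
from typing import Sequence


def grnn_predict(train_x: Sequence[Sequence[float]],
                 train_y: Sequence[float],
                 query: Sequence[float],
                 sigma: float) -> float:
    """Predict one target as a kernel-weighted average of training targets.

    Each training pattern contributes with weight exp(-D^2 / (2 * sigma^2)),
    where D is its Euclidean distance to the query and sigma is the
    smoothing factor.
    """
    weights = []
    for x in train_x:
        d2 = sum((a - b) ** 2 for a, b in zip(x, query))
        weights.append(math.exp(-d2 / (2.0 * sigma ** 2)))
    total = sum(weights)
    if total == 0.0:
        # All weights underflowed: the query is far from every pattern
        # relative to sigma, so the weighted average is undefined.
        raise ValueError("sigma is too small for this query")
    return sum(w * y for w, y in zip(weights, train_y)) / total


# Hypothetical, already normalised inputs (two indicators per pattern)
# and hypothetical MSW targets; none of these values come from the study.
train_x = [[0.0, 0.0], [1.0, 1.0]]
train_y = [10.0, 20.0]

# A query halfway between the two patterns receives equal weights.
print(grnn_predict(train_x, train_y, [0.5, 0.5], sigma=0.3))
```

A small smoothing factor makes the prediction follow the nearest training pattern closely, while a large one pulls it towards the mean of all targets, which is why the smoothing factor is the key tuning parameter in this kind of model.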

An additional objective of this research was to examine the potential influence of structural breaks (SBs), i.e. abrupt changes in the economy and society, on MSW generation, especially bearing in mind the financial crisis between 2007 and 2009, which still has consequences for the global economy. Two models were created for that purpose: the first model, GRNN, neglected the existence of SBs, whilst the second model, SB-GRNN, took into account potential SBs for each individual country, so that only input variables from the years after the occurrence of an SB were used for modelling. The input dataset comprised 12 different variables obtained from official databases.
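
The idea of testing for a break and then modelling only the post-break years can be illustrated with a Chow test at a known candidate break date. The Python sketch below is a simplified illustration under stated assumptions: it fits a linear time trend (k = 2 parameters) to a single hypothetical MSW series, whereas the regression specification actually used for the tests is not described in this section. The function names, the candidate break year and all numbers are invented for the example.

```python
from typing import Sequence


def ols_rss(x: Sequence[float], y: Sequence[float]) -> float:
    """Residual sum of squares of a simple linear regression y = a + b*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))


def chow_f(x: Sequence[float], y: Sequence[float],
           break_index: int, k: int = 2) -> float:
    """Chow F statistic for a break before position `break_index`.

    Compares one pooled regression with two separate regressions fitted
    to the sub-samples before and after the candidate break.
    """
    rss_pooled = ols_rss(x, y)
    rss_split = (ols_rss(x[:break_index], y[:break_index])
                 + ols_rss(x[break_index:], y[break_index:]))
    n = len(x)
    return ((rss_pooled - rss_split) / k) / (rss_split / (n - 2 * k))


# Hypothetical national MSW series (kg per capita), 2000-2012, with a
# downturn starting in 2008. These are not data from the study.
years = list(range(2000, 2013))
msw = [500, 506, 509, 516, 520, 524, 531, 535, 510, 505, 503, 498, 495]

break_index = years.index(2008)          # assumed candidate break year
f_stat = chow_f(years, msw, break_index)
print("Chow F statistic:", f_stat)

# If the break is accepted, keep only the post-break observations,
# mirroring the data selection described for the SB-GRNN model.
post_break = list(zip(years[break_index:], msw[break_index:]))
print("Post-break sample:", post_break)
```

The statistic is compared with the critical value of the F distribution with k and n − 2k degrees of freedom. When the break date is unknown, a test of the Quandt–Andrews type evaluates such statistics over a range of candidate dates instead of a single one.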

While both models achieved good results, the SB-GRNN model demonstrated superior performance, with relative errors in the range of ±10 % (FA1.1) for 86 % of countries and in the range of ±20 % (FA1.2) for 100 % of countries, a mean absolute percentage error (MAPE) of 4.0 % and R² = 0.96, in comparison with the GRNN model results (FA1.1 = 81 %, MAPE = 6.7 % and R² = 0.91).

Based on the presented results, it can be concluded that the analysis of SBs can further enhance the forecasting capabilities of general regression neural network models and that the enhanced model has the potential to provide accurate predictions of MSW generation for a wide spectrum of countries that differ in size, population, level of industrial and economic development, as well as social and climatic factors. In addition, since the model uses widely available statistical parameters as inputs, its application may help to overcome the lack of data on municipal solid waste generation, which is frequently a challenge in developing countries. Further research should demonstrate whether the techniques applied in this study, with appropriate adjustments, can provide satisfactory results when applied to the prediction of healthcare waste, waste composition and/or types of waste treatment.

References

Ali Abdoli M, Falah Nezhad M, Salehi Sede R, Behboudian S (2011) Longterm forecasting of solid waste generation by the artificial neural networks. Environ Prog Sustain Energy 31:628–636. doi:10.1002/ep.10591
Andrews DWK, Ploberger W (1994) Optimal tests when a nuisance parameter is present only under the alternative. Econometrica 62:1383–1414
Antanasijević D, Pocajt V, Popović I et al (2013a) The forecasting of municipal waste generation using artificial neural networks and sustainability indicators. Sustain Sci 8:37–46. doi:10.1007/s11625-012-0161-9
Antanasijević DZ, Ristić MĐ, Perić-Grujić AA, Pocajt VV (2013b) Forecasting human exposure to PM10 at the national level using an artificial neural network approach. J Chemom 27:170–177. doi:10.1002/cem.2505
Antanasijević D, Pocajt V, Perić A, Ristić M (2014) Modelling of dissolved oxygen in the Danube River using artificial neural networks and Monte Carlo simulation uncertainty analysis. J Hydrol 519:1–26. doi:10.1016/j.jhydrol.2014.10.009
Bandara NJGJ, Hettiaratchi JPA, Wirasinghe SC, Pilapitiya S (2007) Relation of waste generation and composition to socio-economic factors: a case study. Environ Monit Assess 135:31–39. doi:10.1007/s10661-007-9705-3
Batinic B, Vukmirovic S, Vujic G et al (2011) Using ANN model to determine future waste characteristics in order to achieve specific waste management targets -case study of Serbia. J Sci Ind Res (India) 70:513–518
Benítez SO, Lozano-Olvera G, Morelos RA, de Vega CA (2008) Mathematical modeling to predict residential solid waste generation. Waste Manag 28(Suppl 1):S7–S13. doi:10.1016/j.wasman.2008.03.020
Berger T (2011) Estimating Europe’s natural rates. Empir Econ 40:521–536. doi:10.1007/s00181-010-0342-2
Breede D, Bloom D (1995) Economics of the generation and management of municipal solid waste. New York
Chau KW, Wu CL (2010) A hybrid model coupled with singular spectrum analysis for daily rainfall prediction. J Hydroinformatics 12:458–473. doi:10.2166/hydro.2010.032
Chen Y, Chang FJ (2009) Evolutionary artificial neural networks for hydrological systems forecasting. J Hydrol 367:125–137. doi:10.1016/j.jhydrol.2009.01.009
Chung SS (2010) Projecting municipal solid waste: the case of Hong Kong SAR. Resour Conserv Recycl 54:759–768. doi:10.1016/j.resconrec.2009.11.012
Daskalopoulos E, Badr O, Probert SD (1998) Municipal solid waste: a prediction methodology for the generation rate and composition in the European Union countries and the United States of America. Resour Conserv Recycl 24:155–166. doi:10.1016/S0921-3449(98)00032-9
Denafas G, Ruzgas T, Martuzevičius D et al (2014) Seasonal variation of municipal solid waste generation and composition in four east European cities. Resour Conserv Recycl 89:22–30. doi:10.1016/j.resconrec.2014.06.001
Dustmann C, Fitzenberger B, Schönberg U, Spitz-Oener A (2014) From sick man of Europe to economic superstar: Germany's resurgent economy. J Econ Perspect 28:167–188
Duveiller G, Fasbender D, Meroni M (2016) Revisiting the concept of a symmetric index of agreement for continuous datasets. Sci Rep. doi:10.1038/srep19401
Dwyer GP, Lothian JR (2012) International and historical dimensions of the financial crisis of 2007 and 2008. J Int Money Financ 31:1–9. doi:10.1016/j.jimonfin.2011.11.006
Dyson B, Chang N (2005) Forecasting municipal solid waste generation in a fast-growing urban region with system dynamics modeling. Waste Manag 25:669–679. doi:10.1016/j.wasman.2004.10.005
ETC/ECP (2013) Municipal waste management in Latvia. European Environment Agency
ETC/SCP (2013a) Municipal waste management in Norway. European Environment Agency (EEA)
ETC/SCP (2013b) Municipal waste management in Slovenia. European Environment Agency
European Commission (2015) Eurostat. http://ec.europa.eu/eurostat. Accessed 27 Aug 2015
European Environment Agency (2014) MSW generation and treatment, by type of treatment method. http://ec.europa.eu/eurostat/tgm/table.do?tab=table&init=1&language=en&pcode=tsdpc240&plugin=1. Accessed 4 Mar 2015
Eurostat (2015) Municipal Solid Waste generation. http://ec.europa.eu/eurostat/tgm/table.do?tab=table&init=1&language=en&pcode=tsdpc240&plugin=1. Accessed 15 Jul 2016
Freeman JA, Skapura DM (1991) Neural networks: algorithms, applications and programming techniques. Addison-Wesley Publishing Company, Houston, Texas, USA
Gallardo A, Carlos M, Peris M, Colomer FJ (2014) Methodology to design a municipal solid waste generation and composition map: a case study. Waste Manag 34:1920–1931. doi:10.1016/j.wasman.2014.05.014
Gómez G, Meneses M, Ballinas L, Castells F (2009) Seasonal characterization of municipal solid waste (MSW) in the city of Chihuahua, Mexico. Waste Manag 29:2018–2024. doi:10.1016/j.wasman.2009.02.006
Greene WH (2012) Econometric analysis, 7th edn. Pearson Education Limited, Harlow, Essex, England, UK
Gujarati DN, Porter DC (2009) Basic econometrics, 5th edn. McGraw-Hill Irwin, New York
Hamilton LC (1990) Modern data analysis: a first course in applied statistics. Brooks/Cole Pub. Co., Pacific Grove, CA, USA
Hoornweg D, Bhada-Tata P (2012) What a waste—a global review of solid waste management. Washington, DC 20433 USA
Inglezakis V, Zorpas A, Venetis C et al (2012) Municipal solid waste generation and economic growth analysis for the years 2000-2013 in Romania, Bulgaria, Slovenia and Greece. Fresenius Environ Bull 21:2362–2367
Intharathirat R, Abdul Salam P, Kumar S, Untong A (2015) Forecasting of municipal solid waste quantity in a developing country using multivariate grey models. Waste Manag. doi:10.1016/j.wasman.2015.01.026
Kara P (2014) Recycling of glass wastes in Latvia—its application as cement substitute in self-compacting concrete. J Sustain Archit Civ Eng. doi:10.5755/j01.sace.6.1.6127
Keser S, Duzgun S, Aksoy A (2012) Application of spatial and non-spatial data analysis in determination of the factors that impact municipal solid waste generation rates in Turkey. Waste Manag 32:359–371. doi:10.1016/j.wasman.2011.10.017
Khajuria A, Matsui T, Machimura T (2011) Economic growth decoupling municipal solid waste loads in terms of environmental Kuznets curve: symptom of the decoupling in India. J Sustain Dev 4:51–58. doi:10.5539/jsd.v4n3p51
Kim S, Kim HS (2008) Neural networks and genetic algorithm approach for nonlinear evaporation and evapotranspiration modeling. J Hydrol 351:299–317. doi:10.1016/j.jhydrol.2007.12.014
Lebersorger S, Beigl P (2011) Municipal solid waste generation in municipalities: quantifying impacts of household structure, commercial waste and domestic fuel. Waste Manag 31:1907–1915. doi:10.1016/j.wasman.2011.05.016
Legates DR, McCabe GJ Jr (1999) Evaluating the use of “goodness of fit” measures in hydrologic and hydroclimatic model validation. Water Resour Res 35:233–241. doi:10.1029/1998WR900018
Mihail F-C (2013) Development of MSW collection services on regional scale: spatial analysis and urban disparities in north-east region, Romania. Acta Geogr Debrecina Landsc Environ Ser 7:13–18
Noori R, Abdoli M, Ghazizade MJ, Samieifard R (2009) Comparison of neural network and principal component-regression analysis to predict the solid waste generation in Tehran. Iran J Publ Heal 38:74–84
Noori R, Karbassi A, Salman Sabahi M (2010) Evaluation of PCA and gamma test techniques on ANN operation for weekly solid waste prediction. J Environ Manag 91:767–771. doi:10.1016/j.jenvman.2009.10.007
OECD (2015a) OECD Statistics. http://stats.oecd.org/. Accessed 27 Aug 2015
OECD (2015b) Municipal waste—OECD Environment Statistics-OECD iLibrary. http://www.oecd-ilibrary.org/environment/data/oecd-environment-statistics/municipal-waste_data-00601-en. Accessed 4 Mar 2015
Perron P (2006) Dealing with structural breaks. In: Palgrave handbook of econometrics. pp 278–352
Rimaityte I, Ruzgas T, Denafas G et al (2012) Application and evaluation of forecasting methods for municipal solid waste generation in an eastern-European city. Waste Manag Res 30:89–98. doi:10.1177/0734242X10396754
Rosenblatt F (1961) Principles of neurodynamics: perceptrons and the theory of brain mechanisms. Cornell Aeronautical Laboratory, Inc., New York, USA
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing: explorations in the microstructure of cognition. MIT Press, Cambridge, MA, pp 318–362
Shamshiry E, Mokhtar MB, Abdulai A (2014) Comparison of artificial neural network (ANN) and multiple regression analysis for predicting the amount of solid waste generation in a tourist and tropical area: Langkawi Island. In: International Conference on Biological, Civil and Environmental Engineering (BCEE-2014). Dubai (UAE), pp 161–166
Specht DF (1991) A general regression neural network. IEEE Trans Neural Netw 2:568–576
Sung TK, Chang N, Lee G (1999) Dynamics of modeling in data mining: interpretive approach to bankruptcy prediction. J Manag Inf Syst 16:63–85
Tomandl D, Schober A (2001) A modified general regression neural network (MGRNN) with new, efficient training algorithms as a robust “black box”-tool for data analysis. Neural Netw 14:1023–1034. doi:10.1016/S0893-6080(01)00051-X
UN (2003) Handbook of National Accounting: Integrated environmental and economic accounting 2003. United Nations, European Commission, International Monetary Fund, Organisation for Economic Co-operation and Development, World Bank
UN (2015) World population prospects—population Division—United Nations. http://esa.un.org/unpd/wpp/. Accessed 27 Aug 2015
Verbeek M (2004) A guide to modern econometrics, 2nd edn. Wiley, Rotterdam
Walczak S, Cerpa N (1999) Heuristic principles for the design of artificial neural networks. Inf Softw Technol 41:107–117. doi:10.1016/S0950-5849(98)00116-5
Wang W, Chau K, Xu D, Chen X-Y (2015) Improving forecasting accuracy of annual runoff time series using ARIMA based on EEMD decomposition. Water Resour Manag 29:2655–2675. doi:10.1007/s11269-015-0962-6
Watson D (2013) Municipal Waste Management in Ireland. Eur Environ Agency :1–23
Wooldridge JM (2013) Introductory econometrics: a modern approach, 5th edn. South-Western Cengage Learning, Mason, OH, USA
World Bank (2015) Data | The World Bank. http://data.worldbank.org/. Accessed 27 Aug 2015

Acknowledgments

The authors are grateful to the Ministry of Education, Science and Technological Development of the Republic of Serbia, Project No. 172007 for financial support.

Author information

Authors and Affiliations

Institute for Technology of Nuclear and other Mineral Raw Materials, Bulevar Franš d’Eperea 86, Belgrade, 11000, Serbia
Vladimir M. Adamović
Innovation Center of the Faculty of Technology and Metallurgy, Karnegijeva 4, Belgrade, 11120, Serbia
Davor Z. Antanasijević
Faculty of Technology and Metallurgy, University of Belgrade, Karnegijeva 4, Belgrade, 11120, Serbia
Mirjana Đ. Ristić, Aleksandra A. Perić-Grujić & Viktor V. Pocajt

Corresponding author

Correspondence to Davor Z. Antanasijević.

Ethics declarations

Funding

This study was funded by the Ministry of Education, Science and Technological Development of the Republic of Serbia (Project No. 172007).

Conflict of interest

The authors declare that they have no conflict of interest.

Statement of human rights and statement on the welfare of animals

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Responsible editor: Marcus Schulz

About this article

Cite this article

Adamović, V.M., Antanasijević, D.Z., Ristić, M.Đ. et al. Prediction of municipal solid waste generation using artificial neural network approach enhanced by structural break analysis. Environ Sci Pollut Res 24, 299–311 (2017). https://doi.org/10.1007/s11356-016-7767-x

Received: 20 July 2016
Accepted: 22 September 2016
Published: 07 October 2016
Issue Date: January 2017
DOI: https://doi.org/10.1007/s11356-016-7767-x

Introduction