Exploring social determinants of municipal solid waste management: survey processing with fuzzy logic and self-organized maps

Kokkinos, Konstantinos; Karayannis, Vayos; Lakioti, Evangelia; Moustakas, Konstantinos

doi:10.1007/s11356-019-05506-2

Exploring social determinants of municipal solid waste management: survey processing with fuzzy logic and self-organized maps

Advances & Prospects in the field of Waste Management
Published: 28 May 2019

Volume 26, pages 35288–35304, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Environmental Science and Pollution Research Aims and scope Submit manuscript

Exploring social determinants of municipal solid waste management: survey processing with fuzzy logic and self-organized maps

Download PDF

Konstantinos Kokkinos¹,
Vayos Karayannis²,
Evangelia Lakioti² &
…
Konstantinos Moustakas³

721 Accesses
18 Citations
Explore all metrics

Abstract

In the present study, the establishment of decision-making criteria and a novel and robust interdisciplinary approach for systematically characterizing effects of uncertainties in social determinants of municipal solid waste management using an important fuzzy logic methodology is demonstrated. The primary goal is to highlight the social benefits of this waste management option such as job creation, hygiene and health protection, and working safety as well as to indicate certain side effects occurring during waste processing (odor and leachate production, social trust). The current research is based on a social survey in an agro-industrial region, Thessaly, Greece, and indicates a set of diversified key factors that are related to public acceptance of municipal waste management schemes. These features are input to Kohonen Self-Organized Maps (a special type of Artificial Neural Networks) for clustering residents according to their social perception and attitudes in terms of solid waste collecting and recycling. Both analyses highlight the environmental concern, social perception, hygiene and health, economic status, and lifestyle as the primary social determinants in affecting the public attitudes towards recycling. In both cases, these soft computing techniques seem to outperform the classical statistical and logical regression methodologies and become very promising in accurately predicting waste management practice and possibly other environmental behaviors.

Modeling construction and demolition waste quantities in Tanta City, Egypt: a synergistic approach of remote sensing, geographic information system, and hybrid fuzzy neural networks

Article Open access 20 September 2023

Designing a fuzzy decision support framework for assessing solid waste management in the South European region

Article 29 January 2022

An ideal plastic waste management system based on an enhanced MCDM technique

Article Open access 19 March 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Due to the implementation of relevant environmental legislation, the amount of municipal solid waste (MSW) sent to landfills has been decreasing in the last years, the collecting of recyclables keeps increasing, and also household composting has lately started to take place. However, despite the improvement of recovery performance and quality of recycled materials, sustainable MSW management still remains a relatively complicated procedure involving multiple environmental and socio-economic criteria at the top of the agenda (Moustakas and Loizidou 2018; Turcott Cervantes et al. 2018).

Surveys appear to be helpful tools regarding the role of social stakeholders’ behavior and awareness for effective decision-making in a sustainable environmental management process. For instance, perception and attitudes of residents in townships with various climate change vulnerabilities were found to affect their behavioral intentions with regard to climate change adaptation and disaster risk (Lee et al. 2018). In order to contribute to greenhouse gas emissions reductions, various social research tools can provide an understanding of several factors influencing and shaping public support for the implementation of carbon capture and storage technologies, which represents an increasingly urgent priority (Braun et al. 2018; Karayannis et al. 2014). Moreover, for improving the public’s awareness of urban air pollution and promote the establishment of more efficient policy for pollution control and climate change mitigation, employees’ perceptions of air pollution and their willingness to pay taxes for improved quality were investigated (Liu et al. 2016).

Particularly, several questionnaire-based surveys focus on the level of knowledge and attitudes of people towards various waste management issues. Residents showed relatively high environmental awareness and willingness to pay for solid waste management (Song et al. 2016). A more specific aim can be to distinguish the key motivational drivers for encouraging households’ participation in a “no plastic campaign” (Afroz et al. 2017). Also, social behavior was identified to be the most significant reason for pharmaceutical disposal leading to the generation of pharmaceutical wastes (Zorpas et al. 2018). People’s attitudes and behavior regarding the problem of massive illegal dumping of construction waste are also important in order to gain awareness towards construction waste minimization (Liu et al. 2018). Even in managing contaminated sites, the role of understanding the different needs and attitudes of social stakeholders is critical to indicate effective remediation policies (Stezar et al. 2014). Generally, public support appears to be a key factor for the implementation of waste management schemes, along with technological advancement and ecological impact in a holistic approach (Lakioti et al. 2017). Conclusions and suggestions derived from social surveys may be used to improve people awareness for waste minimization and recycling behavior and promote enhanced public education and environmental protection campaigns in the framework of sustainable environmental management policies.

There is not a crisp measured tool set to use in order to obtain precise measurements of social determinants in solid waste processing and recycling. This is due to the fact that the most important impact evaluation techniques require the existence of diverse variables for estimation, which must be converted to a unified scale that measures the method quality. Furthermore, sometimes, there is not a descriptive technique to include many of these impact variables as key performance indicators, which also makes these variables to introduce a high degree of uncertainty and inaccuracy. For that reason, the use of soft computing techniques is preferable in producing adequate solutions in a multi-criteria analysis approach. Even though these methodologies are mostly applied in computer science problems, they have shown very promising performance in various interdisciplinary problems. Among this variety of tools, neural networks, fuzzy logic (FL), self-organized (Kohonen) maps (SOM), and probabilistic (Bayesian) neural networks are the most promising. In the current work, the important issue of social determinants estimation on urban waste management is quantitatively addressed by illustrating the computational ability of FL and SOMs methodologies.

FL was introduced by Zadeh (1965, 2008) as a methodology based on “degrees of truth” rather than the usual “true or false” (1 or 0) Boolean logic on which the modern computer is based. FL aims to represent expert knowledge in a rule-based style in order to build a standard control law that faithfully reflects this knowledge (Mamdani and Assilian 1975; Yen and Langeri 1999). According to (Zadeh 2008), FL’s dynamics are twofold: (a) FL is able to provide rational decision-making, even under uncertainty, partiality of truth and value imprecision, and (b) FL performs adequately in multidisciplinary tasks that do not need crisp measurements and exact value computations hence similarly to multi-criteria analysis (Munda 2004; Munda 2008).

Due to its intrinsic characteristics, FL has already been employed in many applications, especially in control engineering, modeling, and systems analysis, but also in biomedical and hydrological studies among others (Nadiri et al. 2019; Ostadi et al. 2019; Akintola et al. 2018; Singh et al. 2006). Especially for engineering problems, FL proves to be the most promising alternative due to its distinct property to deal with imprecise input. Although research efforts using FL theory to deal with interdisciplinary problems of fuzzy input span among many fields, however, there is very limited research reported in literature relative to the qualitative evaluation of the social impacts regarding a specific environmental process (Kokkinos et al. 2018). Moreover, for social determinants’ evaluation on municipal solid waste management practice, the research literature becomes far more limited.

In the present work, a FL-based decision support tool is proposed, which is able to set up all the measurable parameters affecting the process of waste collecting and recycling while at the same time focusing on the social attributes along with this methodology. The research is based on the results of a residents’ survey in an agro-industrial region, Thessaly region, Greece, involving a set of diversified key factors that are related to public perception, attitudes, and acceptance of municipal waste management schemes. These attributes are input to Kohonen SOMs that are able to cluster residents according to their behavior on solid waste management, using the neural network–based pattern analysis. After the de-fuzzification process (see following chapters), this method succeeds to determine the overall social impact with a quantitative characterization: i.e., it is able to predict the future adoption of waste recycling by quantitatively and indirectly calculating the residents’ perception of the topic. In both aforementioned analyses, the scope of the study is to highlight the principal components of social perception that are responsible in boosting attitudes towards waste recycling and at the same time to provide a quantitative order of these determinants indicating the ones with the highest impact.

The novelty introduced in this study is trifold, as it incorporates in important municipal solid waste management issues a social science methodology and a set of artificial intelligence methods that are able to provide adequate results when pure statistical or numerical theories cannot meet this challenge.

Materials and methods

Basics of fuzzy inference systems

In the description of a problem using FL, the variables may be linguistically described by a fuzzy set. However, linguistic variables are occasionally unable to fully express a whole fuzzy scale. For this reason, adjectives or adverbs are added to modify their values (very, low, medium, etc.). Fuzzification operations that map the above values into fuzzy membership functions (inclusions or exclusions from a fuzzy set) are used. Exactly on the inverse procedure, de-fuzzification operations map fuzzy membership functions into “crisp” output using fuzzy-type if–then rules. These rules create a robust knowledge representation scheme where the output is a consensus of all of the inputs making the FL system to adequately respond even when input values are not available or not trustworthy. The importance of each rule in the rule base can dynamically change, so that the degree to which each rule affects the output values can be controlled. FL processes elements of a fuzzy set, which are taken from a universe of discourse according to the fuzzy set theory using membership functions (Çolak and Kaya 2017).

Given a finite set of objects, E described as E = {e₁, e₂, e₃, …e_n} assumes a membership function of e_i to be denoted as mf_i. A fuzzy set A is defined as a linear combination of the e_i and mf_i terms: A = {mf₁(e₁), mf₂(e₂), mf₃(e₃), … , mf_n(e_n)}. An important component of a FL system apart from the fuzzy set is the fuzzy rule base that consists of the “if–then”-type rules. Rule learning is achieved from the perspective of finding a favorable balance between the accuracy of the system, the speed required to learn the rules, and, finally, the interpretability of the rule bases obtained (Duţu et al. 2018). FL systems accept inputs that are initially given to the fuzzification inference unit. With the involvement of the knowledge base, the input variables are first compared with the membership functions on the antecedent part to obtain the membership values of each linguistic label. At the next step, the qualified consequents or each rule depending on the firing strength above are generated and, finally, the qualified consequents are aggregated to produce a crisp output (see Fig. 1).

Membership functions

For a fuzzy set A on the universe of discourse X, a membership function is defined as μ_A:X → [0,1], where each element of X is mapped to a value between 0 and 1 (Lee 1996). This value is called the degree of membership and quantitatively describes the membership of the element of X into the fuzzy set A. Membership functions can be easily depicted in a planar graph with x-axis to be the space of discourse and y-axis the membership degree within the interval [0, 1]. The method of producing three of the most popular membership functions and their corresponding graphical illustrations are shown in Figs. 2 and 3.

Basic operations

The fuzzy logic operations, which are analogous to the classical logic ones, are the union, intersection, and complement:

Union: assuming μ_A and μ_B are any two membership functions relating to fuzzy sets A and B respectively, the union of A and B is a fuzzy set defined by the membership function:
$$ {\mu}_{\mathrm{A}\cup \mathrm{B}}(x)=\operatorname{MAX}\left({\mu}_{\mathrm{A}}(x),{\mu}_{\mathrm{B}}(x)\right). $$
Intersection: assuming μ_A and μ_B are any two membership functions relating to fuzzy sets A and B respectively, the intersection of A and B is a fuzzy set defined by the membership function:
$$ {\mu}_{\mathrm{A}\cap \mathrm{B}}(x)=\operatorname{MIN}\left({\mu}_{\mathrm{A}}(x),{\mu}_{\mathrm{B}}(x)\right). $$
Complement: assuming μ_A is a membership function relating to fuzzy set A, the complement of A is a fuzzy set defined by the membership function:
$$ {\mu}_{{\mathrm{A}}^c}(x)=1-{\mu}_{\mathrm{A}}(x). $$

Basics of SOM

SOMs are a special type of artificial neural networks (ANN) that use the methodology of unsupervised learning for training in order to produce a two-dimensional space topographic map of the input space, preserving the following properties: (a) the spatial location and distance of the output neurons directly correspond to a specific characteristic drawn from the input space and (b) incoming information from the input space is organized in such a way to preserve its proper context/neighborhood. Thus, the difference of SOMs to traditional ANN is that they use this method of the closeness or the neighborhood function to show the input space features and not the method of step-by-step fine-tuning or correction of the error using for example backpropagation.

The primary goal when creating a SOM is firstly to identify each of the corresponding signal patterns (features) of arbitrary dimension and convert them by reducing their dimensions into a one- or two-dimensional topographically ordered map. Each one of the nodes in a SOM represents a weighted vector and also the pair of (x, y) coordinates of the node within the two-dimensional lattice. Note that the input features are also represented as a vector of the same dimensions. In this way, what node corresponds to the closest-match can be computed based on a distance measure, using the aforementioned vector weights. Once the best/closest matching node is calculated, the area surrounding the “best” node is optimized, so that this region now represents the cluster in which the input vector belongs. This process is repeated many times until equilibrium or rather a stabilization of the clusters is reached. The steps to be taken for the learning process of SOMs are the following (Kaya et al. 2016; Hong et al. 2003; Kohonen 1998, 2001, 2013, 2014; SOM in Matlab (2014):

The nodes’ weights are randomly initialized.
Out of the training data, a vector to be presented to the SOM is randomly selected.
The best matching node, i.e., the node with its weights to be closest to the input vector, is calculated.
The region surrounding the best matching node is reorganized shrinking each time to fine-tune the system.
The weights of all nodes are adjusted inside in each cluster and especially those closer to the best matching node.
Finally, the whole process is repeated up until stabilization of the system is reached.

More specifically, the aforementioned learning process is rigorously illustrated as follows (Lin and Chen 2005): let X = [x₁, x₂, … , x_M]^T be the input layer to the SOM as a vector of M-neurons, with the output neurons to be a set of N items (u_j, j = 1 … N) organized as a 2D-lattice, as shown in Fig. 4.

Let also W = {w_{i, j} |i = 1 … M, j = 1 … N} be the weights assigned from input layer neuron to the output layer neuron. After all weights have been initialized to small random numbers, then the network is triggered to calculate the distance between the input vector X and the weight vector W_j, and this must be calculated for each neuron u_j. This distance is defined to be the Euclidean distance and it is used as the similarity measurement between the weight vector and the input vector:

$$ {d}_j=\left|\left|X-{W}_j\right|\right|=\sqrt{\sum \limits_{i=1}^M{\left({x}_i-{w}_{ij}\right)}^2} $$

The application of this algorithm on the SOM creates a topological map where dark areas illustrate the subset of neurons that belong to the same cluster according to the abovementioned similarity criteria (see Fig. 5). In order to take advantage of a SOM when data is fed to this neural network, we need to group together all items that belong to the same cluster. The two methodologies that have been proven to provide near optimal clustering are the hierarchical agglomerative clustering and the partitive clustering using k-means. Using the first technique, a hierarchy of clusters is built using a “bottom-up” approach where each data observation starts in its own cluster, and as we move up, the hierarchy clusters are merged into bigger ones. On the opposite side, partitive clustering is a prototype-based model, where prototypes are separating boundaries according to a concise description of a characteristic of the original data set, and this leads to the construction of each cluster. Many prototypes can be applied (hyperplanes, hyperspheres, etc.), forming a variety of clustering algorithms. Agglomerative clustering is obtained from clustering trees (otherwise called dendrograms), where K-means constructs a spherical cluster, as denoted by:

$$ S=\sum \limits_{i=1}^C\sum \limits_{x\in {Q}_i}{\left\Vert x-{c}_i\right\Vert}^2 $$

where C is the set of clusters and c_i is the centroid of the cluster Q_i.

MSW case study and analysis using FL and SOM

Case study and survey

In 2018, a survey was conducted to identify the social perceptions of collecting recyclables and composting biowastes as major procedures for practicing MSW management in an agro-industrial region, Larissa city, Thessaly, Greece. After a pilot study was carried out to verify the validity of the questions, a survey was conducted into the greater residential body of the city. The survey targeted primarily the homeowners and old-time residents, and also university students on a secondary preference. Note that such surveys start with the bias that a “pro-green” methodology and behavior is mentioned, and therefore it is likely to be considered highly acceptable by most of the people regardless of their broader environmental concerns. The survey collected 327 anonymous responses, with 23 responses to have incomplete or outlier answers, thus leaving 314 valid data items. The questionnaire was designed to cover the issues of general behavior, social acceptance, hygiene and health protection, attributes of behavior towards waste production, and lifestyle apart from several general demographic attributes that relate to age, gender, educational level, and economic status. A compendious view of the distributed questionnaire is provided in Table 1.

Table 1 Condensed version of questionnaire

Full size table

All answers on the questions Q1 to Q22 were framed on a five-point Likert scale acting in accordance with a generalized data treatment for the fuzzification process of the responses and allowing a unified methodology in analyzing inference for categorical and numerical data using chi-square tests. Initial findings from the responses (see Fig. 6a–f) showed 240 responders that have adopted recycling in every aspect of their life, while the rest of 74 responders have answered negatively to the corresponding question.

It must be noted here that the design of the questionnaire was based on questions similar to those used in the questionnaires of De Fao (2014) and Milutinovic et al. (2016), trying to achieve results from our processing methodology that will be compatible with the aforementioned. Furthermore, in our analysis, the predetermined fact is included that the measurement of the social acceptance impacted by the composting process cannot be assessed by one survey question or even a cluster of those. The reason for this determination is the fact that people usually agree for the proposed “green” technologies but, when they are asked to implement them, they are rather reluctant. The questions Q20–Q22 are the most significant questions of the survey, as, via them, responders are classified into two groups of (i) adopters and (ii) non-adopters of the recycling behavior. It is of high importance that there is a significant difference in the way that the three perception variables imposed by Q20–Q22 affect the two subgroups of participants as Fig. 6 a–f illustrate.

Application of FL to residents’ social acceptance and adaptation to recycling

The fundamental aspects in the development of a FL model consist of the individual construction of the following modules: (a) identification of the fuzzy sets involved, i.e., sets with smooth boundaries and to what these set correspond; (b) inclusion of appropriate linguistic variables, i.e., variables whose values have their corresponding representative fuzzy set that describes their quantity and quality characteristics; (c) variable value and possibility distribution, i.e., the inclusion of all the repressions, constraints, and liberties that a linguistic variable can take when a fuzzy set is assigned to it; and (d) the set of fuzzy inference rules in the form of (if–then), which succeed to describe the functional mapping of a logic formula, as the vague attributes and labels of the linguistic variable are transformed into standard mathematical values. This model has been proven to demonstrate substantial improvements in data analyses achieving accurate deductions.

For the case study described in the “Case study and survey” subsection and relatively to the identification of the fuzzy set of value involved in the questionnaire responses, all questions from Q1–Q22 are set up to include answers in Likert scale where (1 = Very Low, 2 = Low, 3 = Medium, 4 = High, and 5 = Very High) are the corresponding equivalent fuzzy values on the linguistic variables imposed by the questions.

For the fuzzy model we try to build here, we concentrate on the questions Q20–Q22 that define how the adaptation of recycling emanates from the influence of (a) environmental concern and general behavior, (b) social perception hygiene and health, and (c) economic status and lifestyle. Note that we chose the above three perception linguistic variables, because they summarize all the attributes and determinants of all the questions in the survey, as these questions have been classified according to these perceptions. Furthermore, out of the six possible categories of determinants, as they have been separated in the questionnaire, we choose to pair up perceptions and finally end up with three basic perception linguistic variables. The reason we choose to do so it that we significantly decrease the number of linguistic input variables in the fuzzy model, keeping the construction of if–then rules a rather small process. On the other hand, the inclusion of six input variables on a FL system makes the process of creation of inference rules a rather complicated procedure due to its exponential nature. Additionally, the increase of the input variables prohibits the deep analysis of the system due to the exponential combinations of membership function types involved in a thorough analysis.

Our primary target is to evaluate the residents/responders social acceptance and adopting of recycling, by constructing a FL system and using the responses we received from the survey. We denote the linguistic variables described above as follows:

ECGB = environmental concern and general behavior
SPHH = social perception hygiene and health
ESLF = economic status and lifestyle

We furthermore distinguish as ECGB_a, ECGB_n, SPHH_a, SPHH_n, ESLF_a, and ESLF_n the above linguistic variables among the recycling adopter and non-adopter participants. The inputs into the FL system will be the combinations of the above six, while the output will be the adaptation of recycling technology or not, but in fuzzy terms of the abovementioned Likert scale spanning from Very Low to Very High. Figure 7 shows the method used: first a combination of input variables are fed into the FL system, which responds back with the output of affirmative, medium, or negative adoptability of recycling. In the case of positive output, we note the combination of the input values as well as the combination of the membership functions used for them. Otherwise (output is negative or of low confidence), membership functions and inference rules are changed. This fine-tuning of the system is continued until better results are achieved.

For both the distinct cases of adopters and non-adopters, we formulate a fuzzy set that is defined by a function that maps the corresponding objects in the survey (i.e., the beliefs B_a, ECGB_n, SPHH_a, SPHH_n, ESLF_a, and ESLF_n) to a domain of concern to their membership value in the set. As mentioned before, this is the membership function μ_A. The selection of the membership function for each of the inputs directly affects the outcome of the FIS even though the most popular functions include the triangular, the trapezoidal, the bell, and the Gaussian. Regardless of the membership function selection, the if–then logic inference rules are the ones that primarily define the reaction of the FIS to any combination of inputs having any combination of fuzzy values. Any rule that is added in the FIS carries: (a) a description, (b) an antecedent, (c) a consequent, (d) a weight, and (e) a connection in the network. For example “IF ECGB is low and SPHH is high and ESLF is high THEN Recycling_Adaptation (RA) is medium” is a rule that drives the behavior of FIS when the combination of fuzzy input is as noted. The construction of the rule set is based on the assumption that if ECGB, SPHH, and ESLF are scored high in the Likert Scale (1 = Very Low, 2 = Low, 3 = Medium, 4 = High, and 5 = Very High), then the acceptance of this methodology is considered to be high logically. Note that the five fuzzy values for each of the three inputs create a rule set of cardinality 5 × 5 × 5 = 125 rules. The outcome of each of the rules is based on the mean of the equivalent of the input values in the Likert scale and the rounding to the closest integer either up or down in the scale in the de-fuzzification process. The first part of the rule set (first 25 rules) for the case of (ECGB = Very Low) is given in Table 2. The rest of the rules are produced using the same methodology.

Table 2 Part of the FIS if–then rule set

Full size table

Three FIS were developed for the set of ECGB_a, SPHH_a, and ESLF_a variables and another three for the set of ECGB_n, SPHH_n, and ESLF_n variables corresponding to the use of the triangular, trapezoidal, and generalized bell functions. For each of the inputs and for each of the used membership functions, their parameters were set up accordingly. The same was held for the output variable for which the same configuration was used. In the following set of figures (Fig. 8), screenshots of the membership functions for each case and the 3D surface of the FIS outcome for the configuration of the triangular membership function are provided.

Segmentation and visualization of public opinions towards MSW processing, using SOM

In this subsection, SOMs are used to segment the public opinions towards MSW management processing. This segmentation can be done according to a variety of criteria (age, educational level, economic status, etc.). To discover distinct public orientations towards MSW management procedures, our methodology is consisted of three stages: (a) to create a spatial model of the responses aggregated by the survey; (b) to select a specific number of fragments on the map, which correspond to representative attributes of the social acceptance characteristics referred in the questionnaire; and (c) to create profiles for these opinions/attributes. Taking the above steps in order, the creation of the SOM is basically the representation of respondent clusters into regions of a two-dimensional rectilinear space illustrating the similarity of opinions. SOMs can achieve nonlinear mappings from multi-dimensional data to a low-dimensional space (preferably two dimensional) while preserving all similarity relationships of the original data. The operation of the network is based on unsupervised learning to cluster the input vectors into regions that are defined depending on their Euclidean distance to already mapped vectors. This technique is best known by the name of ordered vector quantization. More specifically, for each questionnaire-answer sheet, a vector of all the answers (in Likert scale) is created. Following, these vectors are transformed creating ratios of opinions based on the average of answers A_mean using the aforementioned quantization. This organization allows ordering the resulted clusters of opinions forming reduced abstractions of complex data. In Table 3, the performance of FIS for both the adopter and the non-adopter case are included. In any case, the values in Table 3 represent averages either over the total adopter body (240) or the non-adopter body (74).

Table 3 Characteristic first level clustering from the survey

Full size table

In order to determine the number of clusters, the questionnaire responses were grouped using the MATLAB nnstart software. The system has been run with a variety of training setups (epochs) and also a variety of values on the size of the two-dimensional map. For all cases, there were no clearly separated regions. A typical run is shown in Fig. 9a giving an unclear view of well-separated regions in the 6 × 6 map. To overcome this problem, k-nn clustering was additionally applied, which succeeded to reduce the number of initial neighborhoods into 5 as shown in Fig. 9b. The regions in Fig. 9b correspond to the clustering emanating from the input vectors that contain all the features described in Table 4. Note that some of the regions are bigger than others, while the number of hits in each region does not necessarily correspond in an analogous way. The hit count for each region is as follows: 1➔89, 2➔10, 3➔112, 4➔54, and 5➔49. However, we cannot be sure that the above clustering corresponds to crisp classes as these have been implied by the five-level Likert scale responses. We cannot induce for example that these five classes correspond to the five different educational levels in demographics for the survey.

Table 4 Features used in the input vectors of SOM

Full size table

To further deepen our intuition into the clustering information coming from this approach, it is useful to study the component planes that resulted from the SOM construction. These give more detailed information as to how data that contain some specific feature are distributed over the map, which in consequence affects the overall clustering process. In Fig. 10, an array of component planes is produced for each of the features as they have been determined in Table 4. Each one of these features indicates the contribution of the GB, SP, HH, BD, and LF categories of responses into the overall responses. They show, for example, that responders with low average on the responses overall GB_mean/A_mean (i.e., people that score higher than the average in the section of the General Behavior) are mapped on the top and right outer edges of the map, while participants that score on average less on the General Behavior features than the overall average of responses are mapped to the left and lower section of the map. Similar inferences can be done for each one of the five different features in Fig. 10, having always in mind that the darker the color of the SOM map is, the higher the value of the features is. These component planes need to be analyzed in parallel with the SOM map to identify the clusters as these are depicted in Fig. 9a and b.

Discussion of results

Performance of the FIS models

The various cases taken in the development of the FIS included both the adopters and the non-adopters of the recycling behavior and focused on how the environmental concern, the general behavior, the social perception, the hygiene, the health, the economic status, and the lifestyle of the responders affect this behavior. For this reason, similar responses were clustered and aggregated into three groups for both non-adopters and adopters, which were denoted as ECGB_a, SPHH_a, and ESLF_a, and ECGB_n, SPHH_n, and ESLF_n accordingly. For both classes of responders, the same rule set for the FL system was used. Furthermore, a variety of membership functions was experimented with, spanning from triangular to trapezoidal and generalized bell. In all cases, the outcome of all FIS shows that fuzzy input is mapped through the membership functions close to 5 (i.e., equivalent to responses in Likert scale). Figure 8d illustrates this trend for two out of the three inputs, and the output again tends to reach the high values for high scoring in responses in the three classes. On the other hand, the processing of all questionnaires summarized in Table 3 shows the attitude of respondents towards recycling.

To ensure the scientific validity of the FIS methodology in predicting resident intentions relating to adopting recycling, all analyzed scenario outcomes have been clustered into 6 groups that recorded the ECGB_a, SPHH_a, and ESLF_a, and ECGB_n, SPHH_n, and ESLF_n cases into separate spreadsheets. Based on these cases, several statistical tests were performed, by concentrating on the statistical significance of our simulation outcomes (Borror 2009), having as the main target to access the prediction of the FIS models depicted above. Thus, we deal with the statistical significance that is a valid estimate as to how close the outcome of the FIS models lie around the true statistics coming from the survey within a confidence interval. Common levels of reliability for newly introduced models are around the 95% confidence, also written as p = 0.05, called the p-confidence level, even though there are research methodologies that signify their importance with a p = 0.001 confidence level.

Given the set of 314 distinct questionnaire responses, the means of 2.908 and 2.506 were calculated for the adopter and the non-adopter case and also the corresponding variances. However, what we focus on in parallel is the true mean (μ) of all the simulation tests. For this study, let us assume that the produced FIS models score Recycling_Adaptation (RA) outcomes distributed according to the normal distribution. Using this assumption, it is therefore implied that there is not any significant dependency between any RA output and all other RA outputs from previous FIS models. The fuzzy output of the FIS model cannot produce the true mean μ and the variance σ², thus, the distribution of fuzzy outcome cannot be modeled with the normal distribution. Instead, the Student’s t-distribution (Gosset 1908) can be used, which similarly tries to approximate the normal distribution when the input is of large quantities (Fig. 11). As Krzywinski and Altman (2013) as well as Fay and Proschan (2010) indicate, the functional mapping between the proposed 95% confidence interval [x – d, x + d] and the probability q can be obtained by performing an integration over the approximate normal distribution. Nevertheless, in our case, the data is not significantly large, thus the solution does not exist in a closed form, and instead we can use numerical methods.

In Appendix Table 5, the pair correlations are shown for SURVEY statistics taken from the questionnaires and the FIS statistics taken over the adopter case and non-adopter case simulations. The Pearson’s, Kendall’s tau_b, and Spearman’s_rho correlation tests in the screenshots taken from SPSS (SPSS, © ver 20, Chicago, USA) are provided. Note that Pearson’s test gives a significant correlation of p = 0.05 where the other two reach also a significant correlation of p = 0.01.

This allowed also performing non-parametric pairwise sample t tests between the SURVEY statistics and the FIS model statistics that span from 95 to 99% confidence intervals justifying the statistical significance of our model. Appendix Table 6 shows the mean, the standard deviation, and the standard error mean indicating that the upper and lower bounds remain the same only fluctuating the t values.

Performance of the SOM in clustering

Mean absolute error (MAE) and root mean squared error (RMSE) are the most popular metrics used to measure accuracy and apparently performance of SOM models even though MAE and RMSE are primarily used for continuous variables. The definition of MAE and RMSE is given by the following formulas:

$$ \mathrm{MAE}=\frac{1}{n}\sum \limits_{i=1}^n\mid {x}_i-{\overline{x}}_i\mid \mathrm{and}\ \mathrm{RMSE}=\sqrt{\frac{1}{n}\sum \limits_{i=1}^n{\left({x}_i-{\overline{x}}_i\right)}^2} $$

where the quantities x_i and $ {\overline{x}}_i $ correspond to the predicted value of the model and the actual value respectively in relation to the entity x. RMSE has the benefit of more penalizing large errors, and therefore it is used more than MAE. Since we forced the system to produce five classes using the k-nn classifier after the initial SOM categorization, we cannot compute the two aforementioned criteria for adopters and non-adopters separately.

For this reason, the average of the two average quantities is used for each one of the answer groups as they have been calculated in Table 3. We denote the average of the two general behavior averages in Table 3 as $ \overline{GB} $ and similar notation is used for the rest of the answer groups. Hence, the RMSE and MAE are computed for $ \overline{GB},\overline{SP},\overline{HH},\overline{BD} $, and $ \overline{\mathrm{LF}} $. Results show the values of MAE = 1.987 and RMSE = 5.239. These values are the averages of the above five classes and indicate the misplacements of responders to classes 1 to 5 in Fig. 9b due to their preference in their questionnaire answers. The results of RMSE and MAE are typical in values for soft computing applications that contain fuzzy information.

Social determinants affecting recycling

Many variables affect residents’ decision-making process of changing attitude towards adopting recycling as every practice. The resident attitudinal survey showed that the three most important questions (the ones that summarized the categories of social issues) have been ordered differently to the residents that are adopters of the recycling practice than the non-adopters. More non-adopters believe that the findings are the most critical and show that the drivers of environmental concern and the social perception of hygiene and health are more influential than the economic status and the lifestyle of the respondents. For this reason, we used these variables as the perception variables. The FL system developed having the previous statistical findings in mind to develop the rule set for inference. The “Performance of the FIS models” subsection proves how close the inference system can perform to achieve similar findings as the statistical ones proving the statistical significance of our test in the range of 0.01. On the other hand, the use of SOM neural networks uncovered the profile of the five categories of questions of social issues involved according to the survey.

One important caveat in the study is that the data is biased. When a resident is asked questions relating to a social issue, everybody tries to present better behavior even though this is not always the case. Thus, to validate the model and provide policy direction, a larger sample is required for further survey study.

Conclusions

The evaluation and establishment of social determinants that affect specific public attitudes such as recycling behavior are difficult tasks, due to the fuzziness in measuring attributes, perceptions, and beliefs of adopters and non-adapters of the methodology. The uncertainty introduced in drawing conclusions from attitudinal surveys by just only calculating statistics makes FL a suitable tool to deal with ambiguous data or data that hide complex parametric associations.

Based on this assumption, we developed a FL system that uses three aggregated fuzzy input variables incorporating the attributes of environmental concern, general behavior, social perception, hygiene, health, economic status, and lifestyle. For this system, we also developed a rule set as a means to de-fuzzify the FL system outcome. In order to experimentally verify the correctness of this system, we also provided the results from statistical methods that estimate the statistical significance as to how close the outcome of the FIS models lies around the true statistics coming from the survey within a confidence interval. Results showed close statistical significance in the range of p = 0.05 using Pearson’s correlation test and p = 0.01 using Kendall’s and Spearman’s pairwise correlation tests.

We also introduced the application of SOM as a novel approach to segment and cluster public opinion related to recycling by using a variety of clustering parameters. What has proved to work closely to the real statistical output (RMSE = 5.239) was the creation of a spatial model based on characteristics to be the ratios of answer groups to the overall answer mean using the Likert scale.

Future challenges

The novelty suggested in this paper is due to the fact that established social science methods and a set of artificial intelligence methodologies (instead of pure statistical or numerical methodologies) are incorporated in solid waste management practices, thus providing adequate results. However, there are some other challenges that such systems must overcome: (a) mostly the combination of the fuzzy logic system and the SOM system should be refined to be able to predict the correct attitude of a future resident in relation to recycling behavior, taken into account specific characteristics of his/her social perception; (b) the system must be further rectified to discover and highlight the higher importance of specific behavioral characteristics as opposed to others, as at this point all attitudes from all answer groups equally participate in the findings of corresponding averages; and (c) since waste treatment and recycling provide a measure of sustainability, it will be useful to make a specific study to analyze and gain deeper knowledge relative to the percentage of this affect as opposed to the overall impact towards the recycling behavior (in other words to prove the significance of the social affect in relation to the other type of impact towards recycling). This study has explored the applicability of soft computing model to social acceptance issues and discussed potential policy implications. To make concrete policy suggestions, more work is needed in the future, both with regard to data (larger sample) and methodology (more delicate models).

Abbreviations

Notation:: Definition
FL:: fuzzy logic
MSW:: municipal solid waste
SOM:: self-organized maps
ANN:: artificial neural network
ECGB:: environmental concern and general behavior
SPHH:: social perception hygiene and health
ESLF:: economic status and lifestyle
GB:: general behavior
SP:: social perception attitudes and beliefs
HH:: hygiene and health protection
BD:: behavior determinants towards waste production
LF:: lifestyle
A_mean :: mean answer of all available data
GB_mean/A_mean :: ratio of mean-GB over A_mean
SP_mean/A_mean :: ratio of mean-SP over A_mean
HH_mean/A_mean :: ratio of mean-HH over A_mean
BD_mean/A_mean :: ratio of mean-BD over A_mean
LF_mean/A_mean :: ratio of mean-LF over A_mean

References

Afroz R, Rahman A, Masud MM, Akhtar R (2017) The knowledge, awareness, attitude and motivational analysis of plastic waste and household perspective in Malaysia. Environ Sci Pollut Res 24(3):2304–2315
Article Google Scholar
Akintola OA, Sangodoyin AY, Agunbiade FO (2018) Anthropogenic activities impact on atmospheric environmental quality in a gas-flaring community: application of fuzzy logic modelling concept. Environ Sci Pollut Res 25(22):21915–21926
Article CAS Google Scholar
Borror CM (2009) Statistical decision making. In: Borror CM (ed) The certified quality engineer handbook, 3rd edn. ASQ Quality Press, Milwaukee, pp 418–472
Google Scholar
Braun C, Merk C, Pönitzsch G, Rehdanz K, Schmidt U (2018) Public perception of climate engineering and carbon capture and storage in Germany: survey evidence. Clim Pol 18(4):471–484
Article Google Scholar
Çolak M, Kaya İ (2017) Prioritization of renewable energy alternatives by using an integrated fuzzy MCDM model: a real case application for Turkey. Renew Sust Energ Rev 80:840–853
Article Google Scholar
De Fao G (2014) Sociological survey in a municipality with a high level separate collection programme in an area of historic unpopularity. Waste Manag J 34:1369–1380
Article Google Scholar
Duţu LC, Mauris G, Bolon P (2018) A fast and accurate rule-base generation method for Mamdani fuzzy systems. IEEE Trans Fuzzy Syst 26(2):715–733
Article Google Scholar
Fay MP, Proschan MA (2010) Wilcoxon-Mann-Whitney or t-test? On assumptions for hypothesis tests and multiple interpretations of decision rules. Stat Surv 4:1–39
Article Google Scholar
Gosset WS (1908) The probable error of a mean. Biometrika 6:1–25
Article Google Scholar
Hong Y-ST, Rosen MR, Bhamidimarri R (2003) Analysis of a municipal wastewater treatment plant using a neural network-based pattern analysis. Water Res 37(7):1608–1618
Article CAS Google Scholar
Karayannis V, Charalampides G, Lakioti E (2014) Socio-economic aspects of CCS. Proced Econ Financ 14:295–302
Article Google Scholar
Kaya M, Conley S, Varol A (2016) Visualization of the social bot’s fingerprints. 4th International Symposium on Digital Forensics and Security (ISDFS’16), Little Rocl, AR., USA
Kohonen T (1998) The self-organizing map. Neurocomputing 21(1):1–6
Article Google Scholar
Kohonen T (2001) Self-organizing maps. Springer Series in Information Sciences, vol. 30
Book Google Scholar
Kohonen T (2013) Essentials of the self-organizing map. J Neural Netw 37:52–65
Article Google Scholar
Kohonen T (2014) MATLAB implementations and applications of the self-organizing map. Unigrafia Oy, Helsinki
Kokkinos K, Lakioti E, Papageorgiou E, Moustakas K, Karayannis V (2018) Fuzzy cognitive map-based modeling of social acceptance to overcome uncertainties in establishing waste biorefinery facilities. Front Energy Res 6(112):1–17
Google Scholar
Krzywinski M, Altman N (2013) Points of significance: significance, P values and t-tests. Nat Methods 10:1041–1042
Article CAS Google Scholar
Lakioti E, Moustakas K, Komilis D, Domopoulou A, Karayannis V (2017) Sustainable solid waste management: socio-economic considerations. Chem Eng Trans 56:661–666
Google Scholar
Lee HM (1996) Applying fuzzy set theory to evaluate the rate of aggregative risk in software development. Fuzzy Sets Syst 79:323–336
Article Google Scholar
Lee Y-J, Tung C-M, Lin S-C (2018) Attitudes to climate change, perceptions of disaster risk, and mitigation and adaptation behavior in Yunlin County, Taiwan. Environ Sci Pollut Res. Article in Press:1–11
Lin GF, Chen LH (2005) Identification of homogenous regions for regional frequency analysis using the self-organizing map. J Hydrol 324:1–9
Article Google Scholar
Liu X, Wu Y, Hu Y, Liu D, Zhang J, Chen C, Yuan Z, Lu Y (2016) Government employees’ perception of urban air pollution and willingness to pay for improved quality: a cross-sectional survey study in Nanchang, China. Environ Sci Pollut Res 23(21):22183–22189
Article Google Scholar
Liu J, Gong E, Wang D, Lai X, Zhu J (2018) Attitudes and behaviour towards construction waste minimisation: a comparative analysis between China and the USA. Environmental science and pollution research, Article in Press
Mamdani E, Assilian S (1975) An experiment in linguistic synthesis with a fuzzy logic controller. Int J Man Mach Stud 7:1–13
Article Google Scholar
Milutinovic B, Stefanovic G, Milutinovic S, Cojbasic Z (2016) Application of fuzzy logic for evaluation of the level of social acceptance of waste treatment. Clean Techn Environ Policy 18:863–1875
Article Google Scholar
Moustakas K, Loizidou M (2018) Sustainable waste management. Environ Sci Pollut Res 25(36):35761–35763
Article Google Scholar
Munda G (2004) Social multi-criteria evaluation: methodological foundations and operational consequences. Eur J Oper Res 158:662–677
Article Google Scholar
Munda G (2008) Social multi criteria evaluation for a sustainable economy. Springer, Berlin
Book Google Scholar
Nadiri AA, Naderi K, Khatibi R, Gharekhani M (2019) Modelling groundwater level variations by learning from multiple models using fuzzy logic. Hydrological Sciences Journal, Article in Press
Ostadi B, Mokhtarian Daloie R, Sepehri MM (2019) A combined modelling of fuzzy logic and time-driven activity-based costing (TDABC) for hospital services costing under uncertainty. J Biomed Inform 89:11–28
Article Google Scholar
Singh J, Singh N, Sharma JK (2006) Fuzzy modeling and control of HVAC systems – a review. Int J Sci Indust Res 65:470–476
Google Scholar
SOM in Matlab (2014) (last accessed Jan. 17th, 2019) http://www.mathworks.com/help/nnet/gs/cluster-data-with-a-self-organizing-map.html
Song Q, Wang Z, Li J (2016) Exploring residents’ attitudes and willingness to pay for solid waste management in Macau. Environ Sci Pollut Res 23(16):16456–16462
Article Google Scholar
Stezar IC, Ozunu A, Barry DL (2014) The role of stakeholder attitudes in managing contaminated sites: survey of Romanian stakeholder awareness. Environ Sci Pollut Res 21(1):787–800
Article CAS Google Scholar
Turcott Cervantes DE, López Martínez A, Cuartas Hernández M, Lobo García de Cortázar A (2018) Using indicators as a tool to evaluate municipal solid waste management: A critical review. Waste Manag 80:51–63
Article Google Scholar
Yen J, Langeri R (1999) Fuzzy logic: intelligence, control, and information. Prentice Hall, Englewood Cliffs, New Jersey
Google Scholar
Zadeh LA (1965) Fuzzy sets. Inf Control J 8(3):338–353
Article Google Scholar
Zadeh LA (2008) Is there a need for fuzzy logic? Inf Sci J 178:2751–2779
Article Google Scholar
Zorpas AA, Dimitriou M, Voukkali I (2018) Disposal of household pharmaceuticals in insular communities: social attitude, behaviour evaluation and prevention activities. Environ Sci Pollut Res 25(27):26725–26735
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Telecommunications, University of Thessaly, Lamia, Greece
Konstantinos Kokkinos
Faculty of Engineering, University of Western Macedonia, Kozani, Greece
Vayos Karayannis & Evangelia Lakioti
School of Chemical Engineering, National Technical University of Athens, Athens, Greece
Konstantinos Moustakas

Authors

Konstantinos Kokkinos
View author publications
You can also search for this author in PubMed Google Scholar
Vayos Karayannis
View author publications
You can also search for this author in PubMed Google Scholar
Evangelia Lakioti
View author publications
You can also search for this author in PubMed Google Scholar
Konstantinos Moustakas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Konstantinos Moustakas.

Additional information

Responsible editor: Philippe Garrigues

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Table 5 Pair correlations between the SURVEY statistics and the FIS model statistics

Full size table

Table 6 Pairwise sample Statistics

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kokkinos, K., Karayannis, V., Lakioti, E. et al. Exploring social determinants of municipal solid waste management: survey processing with fuzzy logic and self-organized maps. Environ Sci Pollut Res 26, 35288–35304 (2019). https://doi.org/10.1007/s11356-019-05506-2

Download citation

Received: 25 February 2019
Accepted: 16 May 2019
Published: 28 May 2019
Issue Date: December 2019
DOI: https://doi.org/10.1007/s11356-019-05506-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Exploring social determinants of municipal solid waste management: survey processing with fuzzy logic and self-organized maps

Abstract

Similar content being viewed by others

Modeling construction and demolition waste quantities in Tanta City, Egypt: a synergistic approach of remote sensing, geographic information system, and hybrid fuzzy neural networks

Designing a fuzzy decision support framework for assessing solid waste management in the South European region

An ideal plastic waste management system based on an enhanced MCDM technique

Introduction