Comparative study of predicting hospital solid waste generation using multiple linear regression and artificial intelligence

Golbaz, Somayeh; Nabizadeh, Ramin; Sajadi, Haniye Sadat

doi:10.1007/s40201-018-00324-z

Comparative study of predicting hospital solid waste generation using multiple linear regression and artificial intelligence

Research Article
Published: 26 February 2019

Volume 17, pages 41–51, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Environmental Health Science and Engineering Aims and scope Submit manuscript

Comparative study of predicting hospital solid waste generation using multiple linear regression and artificial intelligence

Download PDF

Somayeh Golbaz¹,
Ramin Nabizadeh¹ &
Haniye Sadat Sajadi²

1154 Accesses
61 Citations
Explore all metrics

Abstract

Purpose

A successful hospital solid waste (HSW) management needs an accurate estimation of waste generation rates. The conventional regression methods upon increasing the number of input variables hardly can predict the HSW generation rate and require more complex modeling. In return, application of machine learning methods seems to be able to increase the power of predicting the produced wastes.

Methods

To predict the HSW, Multiple Linear Regression(MLR) and several Neuron- and Kernel-based machine learning methods were employed to analyze data from hospitals of Karaj metropolis. The number of wards, active and occupied beds, staffs and inpatients, and ownership type and activity years of hospital were defined as the model inputs. In addition, proposed models performance was evaluated based on coefficient of determination (R²) and Mean-Square Error (MSE).

Results

The performance of Neuron- and Kernel-based machine learning methods indicated that both models were satisfactory in predicting HSW. However, the better results of 0.82–0.86 for average R² value and 0.003–0.008 for average MSE value, indicated relative superiority of Kernel-based models compared to Neuron based (average R² = 0.68–0.74, average MSE = 0.009–0.023) and MLR models. Number of staffs and hospital ownership type were the most influential model variables in predicting the HSW generation rate.

Conclusions

The machine learning methods could interpret the relationship between waste generation rate and model inputs, appropriately. Thus, they may play an effective role in developing cost-effective methods for suitable HSW management.

Estimating Municipal Solid Waste Generation: From Traditional Methods to Artificial Neural Networks

Use of Machine Learning to Investigate Factors Affecting Waste Generation and Processing Processes in Russia

Machine learning-based prediction of construction and demolition waste generation in developing countries: a case study

Article 29 July 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Hospital solid waste (HSW) is a critical public health issue in all societies due to the presence of different pathogens, hazardous and chemical anticancer agents and radioactive wastes therein which include all kinds of perilous wastes. Moreover, cutting and sharp materials are available in those places which are extremely dangerous for the people who are in contact with them. Poor management system for such materials causes environmental pollution and endangers the human health [1, 2]. Thus, a well-established management system is required. Establishing such management system is very difficult because of the complexity and heterogeneous nature of hospital waste productions. One of the crucial factors in the start-up of such a complex system is the extent of accurate estimation of waste generation rates that may be either short-term or long-term. The short-term estimation of HSW generation rates is necessary for better design and management of storage, collection, and transfer systems [3, 4] and long-term estimation is required for selecting appropriate waste treatment technologies or selecting landfill sites or understanding the impacts of new policies and initiatives [4]. HSW generation rates can be measured by direct sampling; but in many cases, the hospitals do not have enough resources to create a complete database of HSW quantity [4, 5]. Several methods including data mining, sample surveys, and models based on knowledge of effective factors have been used for prediction of waste generation rate. These models include statistical models or conventional method that mostly focus on deterministic methods or trend analysis regardless of the dynamic properties of municipal solid waste (MSW) generation [4].

Therefore, data modeling of complex systems requires advanced methods, which have acceptable performance in the prediction of the behavior of the dynamic systems, to establish a nonlinear relationship between inputs and outputs. In recent years, machine learning methods such as Artificial Neural Networks (ANN), Fuzzy Logic - Artificial Neural Networks (ANFIS), and Support Vector Regression (SVM) have emerged and are becoming popular because of their high flexibility and proven prediction abilities. The ANN model was shown to be able to predict industrial solid waste generation by Tiwari et al. [6]. Wieland et al., (2002) pointed to algorithms for deriving qualitative rules from ANN models and reported this could be developed [7]. ANFIS is one of these developed algorithms. The review of studies shows that only a small number of 106 published articles were associated with advanced and non-conventional methods [4] and unfortunately the use of artificial intelligence models as a tool for the planning, operation and optimization of healthcare waste management system is not widespread as in other fields of environmental engineering. In addition, most of the articles focused on the municipal solid waste issue [8,9,10] and there is limited evidence about the prediction of hospital solid waste generation as well as an optimal model for this purpose.

Accordingly, this study was aimed to determine the variables that affect the HSW generation by using different methods such as feature selection. Then various data mining methods was examined in order to achieve a more accurate prediction. Therefore, Multiple Linear Regression (MLR) along with several machine learning methods including Artificial Neural Networks (ANN), Fuzzy Logic - Artificial Neural Networks (ANFIS), Support Vector Regression (SVM), Least Squares Support Vector Regression (LSSVM), Fuzzy Logic - Support Vector Regression (FSVM) have been employed in this study to introduce an appropriate model in prediction of HSW generation rate.

Materials and methods

Dataset

The data of eight single-specialty hospitals in Karaj metropolis (35°48′45″N, 51°0′30″E, Iran) in 2016 was used in this study. The hospitals included four university hospitals (H₁ to H₄), three private hospitals (H₅ to H₇), and one social security hospital (H₈).

Model variables

In this study, hospital waste was divided into three groups of infectious (IHSW), general (GHSW), and total (THSW) waste. Their values were obtained by sampling and weighing of waste for four months according to the procedures described by Farzadkia et al. [11]. These dependent variables were considered as model outputs.

According to an overview of effective parameters in HSW generation rate [11,12,13,14,15,16], interviews with academics and hospital administrators, medical waste management checklist of Ministry of Health and Medical Education of Iran, and feature selection method (Relief-F for regression (RRelief-F)), seven independent variables were selected as input features in HSW generation rate prediction, including: number of active beds (NAB): total hospital beds which are regularly maintained and staffed and immediately available for the care of admitted patients; number of the hospital’s wards (NHW): total wards within the hospital for the care of numerous patients having the same condition, e.g., a maternity ward; number of hospital’s staff (NHS); hospital ownership type (HOT) that was encoded governmental = 1, private = 2, and social hospital = 3; number of occupied beds (NOB): total beds that are licensed, physically available, staffed, and occupied by a patient; number of inpatients (NIP): total patients who come to the hospital for diagnosis or treatment that requires an overnight stay; number of hospital’s activity years (NAY). Table S1 shows the ranks and weights of mentioned predictors for each of response vector (IHSW, GHSW, and THSW) using Relief-F for regression (RRelief-F). The weights (a range from −1 to 1) and ranks were the indexes of the most important predictors. It should be noted that the multicollinearity test showed that there is no similarity between the independent variables in the model.

In this study, inputs data was a 105 × 7 matrix, representing static data: 105 samples of 7 elements. Also, target was a 105 × 1 matrix, representing static data: 105 samples of 1 element.

Table 1 presents the average values of model input and output variables in HSW generation rate prediction.

Table 1 Mean and standard deviation of the model variables

Full size table

Note that three targets i.e., IHSW, GHSW, and THSW were modeled separately.

Models

MLR model

Since HSW generation forecasting depends on several factors, Multiple Linear Regression (MLR) method is commonly used. In fact, a MLR model states relationship between the independent variables and the dependent variable according to the following equation:

$$ \mathrm{HSW}={\upbeta}_0+{\upbeta}_1\mathrm{NAB}+{\upbeta}_2\mathrm{NHW}+{\upbeta}_3\mathrm{NHS}+{\upbeta}_4\mathrm{HOT}+{\upbeta}_5\mathrm{NOB}+{\upbeta}_6\mathrm{NIP}+{\upbeta}_7\mathrm{NAY}+\mathrm{e}, $$

(1)

where the dependent variable HSW represents the response variables (IHSW, GHSW, and THSW); and NAB, NHW, NHS, HOT, NOB, NIP and NAY are input variables with the coefficients β₀ to β₇ to be estimated from the data.

Multiple linear regression model was calculated using Entry method (using SPSS software version 16). The standard method is simultaneous; all independent variables were entered into the equation at the same time. It is an appropriate analysis when dealing with a small set of predictors and when the researcher does not know which independent variables will create the best prediction equation. In addition, the MLR model was used with the forward and backward stepwise method as a selection method. Also, the Normal probability plot was used for testing normality of the dependent variables. Since the variable HOT is categorical we used dummy variables HOT₁ (Governmental), and HOT₂ (Private) represents the binary independent variables (Table 2).

Table 2 Dummy coding for a type of hospital variable

Full size table

Machine learning methods

Different types of machine learning methods exist, but they are typically classified in two major groups: a) Neuron based methods i.e., ANN and ANFIS, and b) Kernel-based methods i.e., SVM, LSSVM, and FSVM [17].

In order to compare the performance of the models, feature scaling was used to standardize the range of input and output variable between 0 and 1 [18].

ANN model

One of the machine learning methods that have acceptable performance in the prediction of nonlinear and time series regression problems, is ANN method. This method is formed based on the nodes derived from a simplified model of nervous system Neurons [19]. This method usually has three layers including input, learner (hidden) and output layers. The nodes in the learning layer learn the relationships between inputs and outputs as some of the optimized sigmoid functions [20]. These sigmoid functions are introduced with bias (b) and width (w) parameters. During the training process, the parameters of sigmoid functions change to the extent that results in the lowest prediction error. After optimizing the nodes functions, the output variables are obtained based on a linear composition of optimizing sigmoid [21]. The ANN architecture used in this study is shown in Fig. 1.

In this study, Levenberg–Marquardt back-propagation algorithm has been used for optimization of nodes’ learning functions. Since seven input parameters have been used for prediction of HSW in this study, the network architecture is 7 × n × 1. The number of Neurons in the hidden layer changed to determine the most appropriate number of nodes in the hidden layer for prediction of output parameters of the system.

ANFIS model

One of the machine learning methods is ANFIS, therein nodes learning is based on the fuzzy rules. In this method, before learning the training samples by learning layer nodes, the input data is fuzzified using fuzzy membership functions [22]. The membership functions are designed based on the linguistic variables that can map the values of features from mathematical space to the human logic [23].

The fuzzy rules are propounded as {if-then} rules, defining the relationships between input and output membership functions. For instance, a fuzzy rule may be explained as {if the number of active beds is increased, then the waste produced in the wards is increased}. Using the membership functions of input and output features, the target is calculated as fuzzy values. The last step in this method is the conversion of the fuzzy value of the target to the mathematical value which is called Defuzzification. In this research, two membership functions {low, high} have been considered for each of problem variables, and HSW generation rate was predicted using this network for different levels of input parameters (Fig. 2).

SVM model

In recent years, SVM method has been used widely for prediction of the nonlinear behavior of dynamic and complex systems. This method has been used in the classification and regression problems as well [24]. Learning these machines is based on finding a hyperplane in the features space for data modeling. The specimens that are located within the epsilon distance to this plane, are assumed to have similar behaviors, and the behavior of other specimens are determined based on the distance from this plane (ξ) (Fig. 3). The position of this plane is specified based on points which are called support vector.

This plane may be explained by different equations (Kernels) such as linear (f = ɣxx₀), polynomial (f = (ɣxx₀) ⁿ), radial basis (f = e^(−ɣ(x-x₀⁾²), and sigmoid (f = tanh(ɣxx₀)) [25]. In these relations, ɣ denotes the Kernel parameter.

LSSVM model

In SVM method, hyperplane position is optimized based on the Kernel margin, but in LSSVM method, hyperplane position is optimized based on minimizing the total square of the prediction error of training data [26]. In this method, radial basis function and sigmoid Kernels are usually used. One of the advantages of this method is a better prediction of the behavior of data closer to the hyperplane in comparison to the SVM method.

FSVM model

One of the ideas in designing machine learning is the use of fuzzy logic in learning the training data using support vector machine. In this method, the behavior of samples is not calculated linearly based on their distance to the hyperplane [27]. Figure 4 shows a triangular membership function that its center lies on the data having similar behavior near the hyperplane. Whatever the sample’s distance from hyperplane is increased, its membership degree to the values more different than the values lying on the plane is increased and vice versa. In Fig. 4, the sample M that has a distance from the plane equal to b, its membership degree to the K is lower than the samples near the center.

Training and test procedure of machine learning models

In this study, a code written in MATLAB programming environment was used for implementation of machine learning methods. In the experiments, five-fold cross-validation was employed which four folds were used for training and the last fold was used for the test. Since a validation process is required in ANN method, 70, 15, and 15% of data was considered for training, validation, and test, respectively. LibSVM 3.1 and LSSVM v 1.8 with default Kernels and parameter values were used for Kernel-based methods.

Performance criteria

The performance of the methods was evaluated by comparing their predicted outputs with observed data. In this study, Mean-Square Error (MSE), and coefficient of determination (R²) were considered for performance evaluation (eqs. 2 and 3).

$$ MSE(t)=\left(\frac{1}{n}\right)\sum \limits_{i=1}^n{\left( HS{W}_p(t)- HS{W}_A(t)\right)}^2 $$

(2)

$$ {R}^2=1-\frac{\sum \limits_{i=1}^n{\left( HS{W}_p(t)- HS{W}_A(t)\right)}^2}{\sum \limits_{i=1}^n{\left( HS{W}_A(t)-{\overline{HSW}}_A(t)\right)}^2} $$

(3)

where HSW_p(t), HSW_A(t), $ {\overline{HSW}}_A(t) $ and n are predicted, actual, and average value of HSW and the number of samples, respectively.