Keywords

1 Introduction

Internet of things (IoT) is known as daily object interconnection. By linking an increasing number of objects, it enhances the quality of human life. According to projections, IoT will have 50 billion connected devices by 2020. And we are deploying IoT sensors such as CO2 and NO2 in this project to collect the sensor values that can be collected and stored in the server for data analysis. Big data is used for data processing, the Hadoop distributed file system and recurrent neural networks are used for prediction. Apache Hadoop is an open-source software platform for distributed big data storage and distributed data processing. The Hadoop Distributed file system (HDFS) partitions files into file size-based blocks and distributes the blocks among the cluster nodes.

Recurrent neural networks are fairly primitive electronic neuron networks, based on the brain’s neuronal structure. The method records one at a time and learns by comparing their (mostly arbitrary) record prediction to the known actual record. The errors from the first record’s initial forecast are fed back to the network and used to change the network’s algorithm for the second iteration.

2 Literature Survey

Examined that the neural network models are an efficient method for controlling air quality in big cities due to a low level of error estimation by exploring the possibilities of creating an air quality control prognostic method in big cities [1, 2]. Just 1.9% in test series and 1.4% in training series were in the optimally built false prognosis models only 1.9% in testing series and 1.4% in training series. Proposed a model for the estimation of concentrations of inorganic airborne pollutants: H2S–SO2, NO–NO2–NOx, CO (CO2) and PM10 (aerodynamic particulate matter of 10ìm or less) from a risk area (two industrial area-IA) and urban area (UA) from Constanta [3]. A small cumulative absolute error of 0.42 was predicted from simulation result with the actual value in the urban area. It shows the efficacy and validity of the proposed approach in determining the different concentrations of airborne contaminants.

Studied the model evaluation shows that the degree of success in CO concentration forecasting is promising by evaluating a model for predicting CO concentration from 2002 to 2004 [4, 5]. Analyzed Internet traffic data over IP networks through the creation of an artificial neural network (ANN) model based on multi-layer perceptron (MLP) [6]. Levenberg Marquardt Algorithm (LM) and Resilient Backpropagation Algorithm (RP) models were developed for effecive analysis of internet traffic over the network in the time series manner.

Found that for all contaminants using the CNN model, the association between expected and observed data was greater (0.54–0.87) [7]. The CNN model estimated concentrations of SO2 that were higher than those of PM10. Another interesting finding was that all contaminants were expected better in winter concentrations than in summer concentrations [8]. Experiments showed that accurate predictions of missing concentrations of air pollutants can be made using the new approach found in the CNN model by predicting daily concentrations of air pollutants (PM10 and SO2) from the Istanbul air pollution measurement stations in Yenibosna and Umraniye for periods when pollution data was not reported [9, 10]. Historical Ozone component (O3) observations were used to predict air pollution concentrations of NO2 and SO2 in urban areas by using Adaptive Neuro Fuzzy techniques [11]. Proposed algorithm focused primarily on pollution caused by vehicle smoke in urban areas and implemented various fuzzy rules to demonstrate the extent of air pollution induced by vehicle smoke and dust in nearby Jabalpur areas examined the ability of artificial neural network technique in air temperature prediction for daily and monthly ambient [12]. In feed-forward network and Elman network, the mean, minimum, and maximum ambient air temperature was used as the input parameter during the years 1961–2004 [13, 14]. The values of R, MSE and MAE variables in both networks showed that the ANN method is a good model in the prediction of ambient air temperature, whereas the estimates of the mean temperature one day ahead and the maximum temperature one month ahead are more reliable using the Elman network [15].

Computer based models play a vital importance in research and analysis and for decision support systems [16, 17]. For both training and testing stages, the efficiency of the proposed solution is calculated by the mean squared error. Comparison between the functionality of the hybrid ICA-NN and the listed MLP network provides the evidence that the ICA-NN was superior with an reasonable accuracy for figure in terms of reliable performance [18, 19]. ANN approach was applied in problem solving techniques and decision making to improve the environmental factors [20]. Peninsular Malaysia centered on the use of main component analysis (PCA) and artificial neural network (ANN) to predict air pollutant index (API) in the seven selected Malaysian air monitoring stations in the seven-year database in the southern region (2005–2011) [21, 22].

3 Motivation

It is concluded that the ANN approach can be used effectively as decision-making and problem-solving strategies for better environmental management [23, 24]. Peninsular Malaysia based on the use of main component analysis (PCA) and artificial neural network (ANN) to predict air pollutant index (API) in the seven-year database selected Malaysian air monitoring stations in the southern region (2005–2011).

4 Proposed System

Effective recurrent neural network methods are deployed and an IoT-based hardware is connected to air quality sensors which allow us to collect real-time air quality prediction data. The key idea here is to forecast the air quality values in relation to the location values of the previous year and to build awareness about air pollution. In the proposed system, the air pollution data from the Raspberry pi sensor was collected to the system. In several locations, the data was collected, and finally, the data stored in the MYSQL server. The detailed flow diagram for the proposed systems is shown in Fig. 1.

Fig. 1
figure 1

Flow diagram of air quality monitoring using IoT

4.1 PIC Board

PIC microcontrollers are electronic circuits that can be configured to perform a wide range of tasks. They can be programmed for being timers or managing a production line, and more. This is used for linking the device to the circuit microcontroller as shown in Fig. 2.

Fig. 2
figure 2

PC bord

4.2 Nitrogen Dioxide Sensor

In urban settings, the main source of nitrogen dioxide gas is the burning of fossil fuels. This is most generally related to emissions from motor vehicles in urban areas. Areas with high-density road networks near to large populations like cities and towns are at highest risk of over-exposure. The manufacturing sites would also contain high nitrogen dioxide concentrations. Those include any industry that uses processes such as power plants, electrical utilities and industrial boilers for combustion. Long-term exposure to nitrogen dioxide gas can lead to respiratory problems such as bronchitis and asthma. Aeroqual provides a fixed or portable NO2 sensor for calculating the nitrogen dioxide concentrations for studies of human exposure as well as health and safety at industrial and urban sites. Here, this NO2 sensor is used to measure the current position concentration of NO2 gases and then it collects the parameters of NO2 gas in microgram/cubic meter.

4.3 Sulfur Dioxide Sensor

Sulfur dioxide is a colorless gas with a strong odor. It is not combustible but an highly poisonous substance. Sulfur dioxide when mixed with water is highly corrosive sulfuric acid and causes chemical burns. There are two key reasons why control of sulfur dioxide levels is important: because industrial buildings such as power plants emit significant quantities of sulfur dioxide and sulfur dioxide is a highly reactive gas. Sulfur dioxide is heavier than air, so it is measured by PemTech using electromechanical technology. OSHA has set the PEL for sulfur dioxide to be just 5 ppm, as it can be highly harmful to be exposed to only small levels for a short time. Here, this SO2 sensor is used to measure the current position concentration of SO2 gasses and then it collects the SO2 gas data in microgram/cubic meter.

5 Implementation

The user interface architecture is implemented using Java coding for the user to pick the data input and location for data analysis. Using this module, all data is analyzed using the big data and recurrent neural network to predict the location’s air pollution and then the IOT sensor is interfaced with the Java coding application.

5.1 Recurrent Neural Network

The recurrent neural network is based on the concept of saving a layer’s output and feeding this back to input to help predict the year’s outcome. Here, the first layer is generated with the product of the sum of the weights and the features similar to the feed-forward neural network. If this is determined, the recurrent neural network cycle begins; it means that each neuron would recall any knowledge that it had in the previous time phase from one-time phase to the next. This makes each neuron function in the output of computations as a memory cell. We need to let the neural network operate on front propagation in this step and remember what information it needs for later use. IoT sensors such as SO2 and NO2 are interfaced with the microcontroller for real-time data collection and storage on a centralized server and then all data is collected from the sensors and the generated data set is stored for analysis in a centralized database using a recurrent neural network to extract the useful data generated for data analysis. All data is stored in a centralized database called MYSQL and then all data is collected from the sensors and the data set for data analysis is generated. We deploy big data-HDFS for the process of data processing and prediction. For data analysis, all sensor values are collected and stored in the server and then all data is processed and analyzed using a neural network to capture the data and predict air pollution at a given location and time. The values are compared with the approved data collection, and the values of the sensors are also processed. Alerted to monitor emissions and safeguard the atmosphere until the data is analyzed.

5.2 Comparative Analysis

Many of them control the air quality in current systems and these systems can not predict the air quality of the gases. And it depends on the quality of the air which affects how you live and breathe. Therefore, it is important to take precautionary measures to forecast air pollution parameters such as sulfur dioxide and nitrogen dioxide. Such parameters can have a direct impact on degree of air pollution and are measured using sensing instruments. In proposed research, IOT-based hardware and sensor interface to detect the existing runtime air pollution of various contaminants in the air and thus deploy the principles of big data and recurrent neural networks help us predict the air quality of gases with values of previous years concerning location.

6 Result and Discussion

In this section of results, briefly review the table below to find out the concentration of gasses such as SO2 and NO2 in microgram/cubic meter parameters in some areas of Chennai city. The air pollution concentrations such as SO2 and NO2 were collection for the month of Feb 2018–2019. The real time sensor data of Feb 2020 were compared with the collected concentrations. Table 1 highlighted the standard index about air quality.

Table 1 Air quality index

The air quality concentrations of SO2 and NO2 from various parts of the city (Anna Nagar, Adyar, Nungambakkam, etc.) were calibrated with the IoT sensor and the data stored in the MYSQL database. Figure 3 shows the sensor calibrated value of NO2 and SO2.

Fig. 3
figure 3

NO2 and SO2. concentrations from IoT sensor

The recurrent neural network is a time series prediction method. In this study, the concentration values of NO2 and SO2 were analyzed as a time series manner and the data was interpreted in MATLAB for training and testing purpose. Finally, Figure 4 shows the comparative analysis of sensor data with the RNN.

Fig. 4
figure 4

Comparative analysis of IoT-SO2 with RNN

7 Conclusion

In this study, along with recurrent neural network, we deploy the principle of IOT and big data for successful analysis and prediction of air pollution. Thus, we are forecasting the air quality of various air pollutants for years to come in this phase.