Abstract
Air pollution is one of the critical health problems affecting the quality of life, especially in city centers. The air quality index (AQI) is the primary parameter used to measure air pollution. This parameter is constantly measured in city centers with measuring devices that contain various sensors. Due to the high purchasing costs and the need for periodic calibration, there is a need to develop more economical technologies to calculate AQI values. This study aims to predict air pollution with minimum sensors based on machine learning. In the study, daily-based five-year air quality measurement data from India, taken from the Kaggle database, were used. Air quality data includes \(PM_{2.5}\), \(PM_{10}\), \(O_{3}\), \(NO_{2}\), \(SO_{2}\), \(CO_{}\) values obtained from sensors in the measuring devices, and AQI calculated with these values. A feature selection algorithm is used to reduce the sensor cost. Then, artificial intelligence-based AQI was calculated with minimum sensor data. According to the findings, artificial intelligence-based AQI calculation model performances r and RMSE were determined as 0.93 and 20.57, respectively. It has been evaluated that AQI data can be calculated based on artificial intelligence with a minimum of sensors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Yolsal, H.: Estimation of the air quality trends in Istanbul. İktisadi İdari Bilimler Dergisi 38, 375 (2016)
Khuda, K.-E.: Causes of air pollution in Bangladesh’s capital city and its impacts on public health. Nat. Environ. Pollut. Technol. 19, 1483–1490 (2020)
Yilmaz, F.: Hava kirliliği, bileşenleri ve sağlık. FSM İlmî Araştırmalar İnsan ve Toplum Bilimleri Dergisi 231–250 (2021)
Pope, C.A., Dockery, D.W.: Health effects of fine particulate air pollution: lines that connect. J. Air Waste Manag. Assoc. 56, 709–742 (2006)
Who air quality guidelines for particulate matter, ozone, nitrogen dioxide and sulfur dioxide (2006)
Maji, S., Ahmed, S., Ghosh, S., Garg, S.K.: Evaluation of air quality index for air quality data interpretation in Delhi, India (2020)
History of air pollution \(|\) US EPA
AQI basics \(|\) airnow.gov
Kuznetsova, I.N., Tkacheva, Yu.V., Shalygina, I.Yu., Lezina, E.A.: Calculation of air quality index and assessment of its informativeness for Russia based on monitoring data for Moscow. Russ. Meteorol. Hydrol. 46, 530–538 (2021)
Plaia, A., Ruggieri, M.: Air quality indices: a review. Rev. Environ. Sci. Biotechnol. 10, 165–179 (2011)
How is the AQI calculated? \(|\) US EPA
Wood, D.A.: Local integrated air quality predictions from meteorology (2015 to 2020) with machine and deep learning assisted by data mining. Sustain. Anal. Model. 2, 100002 (2022)
Du, W., et al.: Deciphering urban traffic impacts on air quality by deep learning and emission inventory. J. Environ. Sci. 124, 745–757 (2023)
Lightstone, S.D., Moshary, F., Gross, B.: Comparing CMAQ forecasts with a neural network forecast model for PM2.5 in New York. Atmosphere 8(161), 2017 (2017)
Cabaneros, S.M., Calautit, J.K., Hughes, B.R.: A review of artificial neural network models for ambient air pollution prediction. Environ. Modell. Softw. 119, 285–304 (2019)
Liu, H., Li, Q., Yu, D., Gu, Y.: Air quality index and air pollutant concentration prediction based on machine learning algorithms. Appl. Sci. 9, 4069 (2019)
Cabaneros, S.M.S., Calautit, J.K.S., Hughes, B.R.: Hybrid artificial neural network models for effective prediction and mitigation of urban roadside NO2 pollution. Energy Procedia 142, 3524–3530 (2017)
Kekulanadara, K.M.O.V.K., Kumara, B.T.G.S., Kuhaneswaran, B.: Machine learning approach for predicting air quality index. In: 2021 International Conference on Decision Aid Sciences and Application, DASA 2021, pp. 622–626 (2021)
CPCB \(|\) central pollution control board
Mishra, S., Datta-Gupta, A.: Applied statistical modeling and data analytics: a practical guide for the petroleum geosciences (2018)
Bebis, G., Georgiopoulos, M.: Feed-forward neural networks. IEEE Potent. 13, 27–31 (1994)
Vogt, M., Kecman, V.: Active-set methods for support vector machines. In: Wang, L. (ed.) Support Vector Machines: Theory and Applications. Studies in Fuzziness and Soft Computing, vol. 177, pp. 133–158. Springer, Heidelberg (2005). https://doi.org/10.1007/10984697_6
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees. Classif. Regression Trees 1–358 (2017)
Abiri, O., Twala, B.: Modelling the flow stress of alloy 316l using a multi-layered feed forward neural network with Bayesian regularization. In: IOP Conference Series: Materials Science and Engineering, vol. 225, p. 9 (2017)
Uçar, M.K., Uçar, Z., Köksal, F., Daldal, N.: Estimation of body fat percentage using hybrid machine learning algorithms. Meas.: J. Int. Meas. Confederation, 167 (2021)
Tanabe, K.: Pareto’s 80/20 rule and the gaussian distribution. Phys. A Stat. Mech. Appl. 510, 635–640 (2018)
Flake, G.W., Lawrence, S.: Efficient SVM regression training with SMO. Mach. Learn. 46, 271–290 (2002)
Cherkassky, V., Ma, Y.: Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw. 17, 113–126 (2004)
Polat, K., Koc, K.O.: Detection of skin diseases from dermoscopy image using the combination of convolutional neural network and one-versus-all (2020)
On the mean squared error of an estimator \(|\) introduction to probability \(|\) supplemental resources \(|\) mit opencourseware
Root mean square error (RMSE) \(|\) cros
Standard error \(|\) what it is, why it matters, and how to calculate
Rahimpour, A., Amanollahi, J., Tzanis, C.G.: Air quality data series estimation based on machine learning approaches for urban environments. Air Qual. Atmos. Health 14, 191–201 (2021)
Ausati, S., Amanollahi, J.: Assessing the accuracy of ANFIS, EEMD-GRNN, PCR, and MLR models in predicting PM2.5. Atmos. Environ. 142, 465–474 (2016)
Zhu, S., Lian, X., Liu, H., Hu, J., Wang, Y., Che, J.: Daily air quality index forecasting with hybrid models: a case in China. Environ. Pollut. 231, 1232–1244 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
Conflict of Interests
There is no conflict of interest between the authors.
Financial Support
No support was received for this research.
Ethics Committee Approval
Ethics committee approval is not required as the open source data set was used in the study.
Data Collection and Use Permission
.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sari, F.A., Haşıloğlu, M.A., Uçar, M.K., Güler, H. (2023). Determining Air Pollution Level with Machine Learning Algorithms: The Case of India. In: Hemanth, D.J., Yigit, T., Kose, U., Guvenc, U. (eds) 4th International Conference on Artificial Intelligence and Applied Mathematics in Engineering. ICAIAME 2022. Engineering Cyber-Physical Systems and Critical Infrastructures, vol 7. Springer, Cham. https://doi.org/10.1007/978-3-031-31956-3_48
Download citation
DOI: https://doi.org/10.1007/978-3-031-31956-3_48
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-31955-6
Online ISBN: 978-3-031-31956-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)