Prediction of Air Quality Index Using Ensemble Models

  • Theresia Herlina Rochadiani Pradita University

Abstract

The impact of air pollution on health is measured by the Air Quality Index (AQI). Accurate AQI prediction is essential for pollution reduction and public health recommendations. Traditional methods of monitoring air quality are inaccurate and time-consuming. This study uses IoT-based air quality data from Kampung Kalipaten, Tangerang to build an AQI prediction model with machine learning, specifically an ensemble model. Ensemble techniques such as bagging and boosting, which increase the reliability of predictions by reducing model bias and inconsistency, improve AQI prediction. Four ensemble models used in this study, they are Random Forest Regressor, Gradient Boosting Regressor, Adaboosting Regressor, and Bagging Regressor. As the evaluation, RMSE and R2 metrics used. Random Forest Regressor perform the best with RMSE value of 0.6054 and R2 value of 0.6271, although no significant differences of RMSE and R2 value of the rest models.

References

[1] T. Madan, S. Sagar, and D. Virmani, “Air Quality Prediction using Machine Learning Algorithms –A Review,” in 2020 2nd International Conference on Advances in Computing, Communication Control and Networking (ICACCCN), Dec. 2020, vol. 840, pp. 140–145, doi: 10.1109/ICACCCN51052.2020.9362912.
[2] N. S. Gupta, Y. Mohta, K. Heda, R. Armaan, B. Valarmathi, and G. Arulkumaran, “Prediction of Air Quality Index Using Machine Learning Techniques: A Comparative Analysis,” J. Environ. Public Health, vol. 2023, pp. 1–26, Jan. 2023, doi: 10.1155/2023/4916267.
[3] U. S. E. P. Agency and I. Division, “Air Quality Index (AQI),” in Encyclopedia of Quality of Life and Well-Being Research, no. February, Dordrecht: Springer Netherlands, 2014, pp. 120–120.
[4] T. H. Rochadiani et al., “Design of Air Quality Monitoring Using LoRaWAN In Human Settlement,” in 2022 IEEE 14th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM), Dec. 2022, pp. 1–6, doi: 10.1109/HNICEM57413.2022.10109605.
[5] C. Wu, J. Wang, X. Chen, P. Du, and W. Yang, “A novel hybrid system based on multi-objective optimization for wind speed forecasting,” Renew. Energy, vol. 146, pp. 149–165, Feb. 2020, doi: 10.1016/j.renene.2019.04.157.
[6] A. Mosavi, F. Sajedi Hosseini, B. Choubin, M. Goodarzi, A. A. Dineva, and E. Rafiei Sardooi, “Ensemble Boosting and Bagging Based Machine Learning Models for Groundwater Potential Prediction,” Water Resour. Manag., vol. 35, no. 1, pp. 23–37, Jan. 2021, doi: 10.1007/s11269-020-02704-3.
[7] Y. Li and Y. Pan, “A novel ensemble deep learning model for stock prediction based on stock prices and news,” Int. J. Data Sci. Anal., vol. 13, no. 2, pp. 139–149, Mar. 2022, doi: 10.1007/s41060-021-00279-9.
[8] B. Zhang, M. Duan, Y. Sun, Y. Lyu, Y. Hou, and T. Tan, “Air Quality Index Prediction in Six Major Chinese Urban Agglomerations: A Comparative Study of Single Machine Learning Model, Ensemble Model, and Hybrid Model,” Atmosphere (Basel)., vol. 14, no. 10, 2023, doi: 10.3390/atmos14101478.
[9] Y.-C. Liang, Y. Maimury, A. H.-L. Chen, and J. R. C. Juarez, “Machine Learning-Based Prediction of Air Quality,” Appl. Sci., vol. 10, no. 24, p. 9151, Dec. 2020, doi: 10.3390/app10249151.
[10] C. Li, Y. Li, and Y. Bao, “Research on Air Quality Prediction Based on Machine Learning,” in 2021 2nd International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), Nov. 2021, pp. 77–81, doi: 10.1109/ICHCI54629.2021.00022.
[11] J. Zheng, D. Xin, Q. Cheng, M. Tian, and L. Yang, “The Random Forest Model for analyzing and Forecasting the US Stock Market under the background of smart finance,” 2024, pp. 82–90.
[12] S. Park, S. Jung, J. Lee, and J. Hur, “A Short-Term Forecasting of Wind Power Outputs Based on Gradient Boosting Regression Tree Algorithms,” Energies, vol. 16, no. 3, p. 1132, Jan. 2023, doi: 10.3390/en16031132.
[13] K. Mukherjee, S. S. Ahmed, M. Aasif, S. Kundu, and S. Ghosh, “House Rent Prediction Using Ensemble-Based Regression With Real-Time Data,” in Novel Research and Development Approaches in Heterogeneous Systems and Algorithms, no. March 2023, 2023, pp. 258–271.
[14] J. T R, N. S. Reddy, and U. D. Acharya, “Modeling Daily Reference Evapotranspiration from Climate Variables: Assessment of Bagging and Boosting Regression Approaches,” Water Resour. Manag., vol. 37, no. 3, pp. 1013–1032, Feb. 2023, doi: 10.1007/s11269-022-03399-4.
[15] D. Iskandaryan, F. Ramos, and S. Trilles, “Air Quality Prediction in Smart Cities Using Machine Learning Technologies Based on Sensor Data: A Review,” Appl. Sci., vol. 10, no. 7, p. 2401, Apr. 2020, doi: 10.3390/app10072401.
[16] M. M. Londhe, “Data Mining and Machine Learning Approach for Air Quality Index Prediction,” Int. J. Eng. Appl. Phys., vol. 1, no. 2, pp. 136–153, 2021, [Online]. Available: https://ijeap.org/.
Published
2024-11-12
How to Cite
ROCHADIANI, Theresia Herlina. Prediction of Air Quality Index Using Ensemble Models. JOURNAL OF APPLIED INFORMATICS AND COMPUTING, [S.l.], v. 8, n. 2, p. 384-389, nov. 2024. ISSN 2548-6861. Available at: <http://704209.wb34atkl.asia/index.php/JAIC/article/view/8532>. Date accessed: 28 nov. 2024. doi: https://doi.org/10.30871/jaic.v8i2.8532.
Section
Articles

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.