
(2) Aji Supriyanto

*Corresponding author
AbstractThis study aims to compare the performance of automatic and manual rain gauges in the Central Highlands of Central Java using a machine learning approach based on Extreme Gradient Boosting (XGBoost) and Random Forest Regression (RFR) algorithms. Daily rainfall data were collected from five regencies Banyumas, Banjarnegara, Wonosobo, Temanggung, and Pemalang between 2021 and 2024. Preprocessing involved merging data from two types of instruments (Automatic Rain Gauge/AWS and manual ombrometer), correcting anomalies, and standardizing date-time formats. The models were developed using feature engineering techniques, including multi-lag and moving averages, and evaluated using MAE, RMSE, and R-squared (R²) metrics. The results show that the XGBoost model with automatic data achieved the best performance, with a Mean Absolute Error (MAE) of 17.3632 mm, Root Mean Squared Error (RMSE) of 27.0282 mm, and R² of 0.5050. In comparison, the Random Forest model with automatically generated data produced an MAE of 16.6307 mm, an RMSE of 28.5286 mm, and an R² of 0.4485. Models with manual data showed lower performance, with R² values below 0.30. These findings indicate that automatic measurement data are more stable and effective for building predictive rainfall models using machine learning. This supports the use of automatic instruments as the primary data source in rainfall forecasting and hydrometeorological disaster mitigation systems.
KeywordsAutomatic Gauge; Predictive Modeling; Rainfall; Random Forest; XGBoost
|
DOIhttps://doi.org/10.33122/ejeset.v6i1.955 |
Article metricsAbstract views : 344 | PDF views : 278 |
Cite |
Full Text Download
|
References
Abdullah, M., & Said, S. (2024). Performance Evaluation of Machine Learning Regression Models for Rainfall Prediction. Iranian Journal of Science and Technology, Transactions of Civil Engineering, December. https://doi.org/10.1007/s40996-024-01691-4
Darmawan, H., Yuliana, M., & Hadi, M. Z. S. (2023). GRU and XGBoost Performance with Hyperparameter Tuning Using GridSearchCV and Bayesian Optimization on an IoT-Based Weather Prediction System. International Journal on Advanced Science, Engineering and Information Technology, 13(3), 848–859. https://doi.org/10.18517/ijaseit.13.3.18377
Droździoł, R., & Absalon, D. (2023). Evaluation of Selected Amateur Rain Gauges with Hellmann Rain Gauge Measurements. Climate, 11(5). https://doi.org/10.3390/cli11050107
Dunkerley, D. (2025). Limited Time Resolution of Event Data Loggers Can Bias Intensity Measurements from Tipping-Bucket Rain Gauges Limited Time Resolution of Event Data Loggers Can Bias Intensity Measurements from Tipping-Bucket Rain Gauges. 0–11. https://doi.org/10.20944/preprints202407.0322.v2
Fadil Danu Rahman, Mulki, M. I. Z., & Taryana, A. (2024). Clustering Dan Klasifikasi Data Cuaca Cilacap Dengan Menggunakan Metode K-Means Dan Random Forest. Jurnal SINTA: Sistem Informasi Dan Teknologi Komputasi, 1(2), 90–97. https://doi.org/10.61124/sinta.v1i2.15
Hasnain, K. N. (2025). Integrating Machine Learning for Real - Time Energy Load Forecasting in US Smart Grids : A Multi - Model Comparative Approach Abstract : 1–19.
Islam, K. I., Elias, E., Carroll, K. C., & Brown, C. (2023). Exploring Random Forest Machine Learning and Remote Sensing Data for Streamflow Prediction: An Alternative Approach to a Process-Based Hydrologic Modeling in a Snowmelt-Driven Watershed. Remote Sensing, 15(16). https://doi.org/10.3390/rs15163999
Jamal, N. A., Mohamed Noor, N., Jafri, I. A. M., & Wibowo, G. J. (2025). Assessment of Moving Average (MA) Method for Rainfall Prediction in Yogyakarta, Indonesia. 5. https://doi.org/10.3390/eesp2025033005
Kumar, V., Kedam, N., Sharma, K. V., Khedher, K. M., & Alluqmani, A. E. (2023). A Comparison of Machine Learning Models for Predicting Rainfall in Urban Metropolitan Cities. Sustainability (Switzerland), 15(18). https://doi.org/10.3390/su151813724
No, V., Syahreza, A., Ningrum, N. K., & Syahrazy, M. A. (2024). Edumatic : Jurnal Pendidikan Informatika Perbandingan Kinerja Model Prediksi Cuaca : Random Forest , Support Vector Regression , dan XGBoost. 8(2), 526–534. https://doi.org/10.29408/edumatic.v8i2.27640
Papacharalampous, G., Tyralis, H., Doulamis, A., & Doulamis, N. (2023). Comparison of Tree-Based Ensemble Algorithms for Merging Satellite and Earth-Observed Precipitation Data at the Daily Time Scale. Hydrology, 10(2). https://doi.org/10.3390/hydrology10020050
Pratiwi, A., & Nurhuda, W. (2021). Analisis Perbandingan Data Pengukuran Curah Hujan Metode Otomatis (Automatic Rain Gauge) dan Metode Observasi. Prosiding Seminar Nasional Fisika, 116–128.
Quayesam, D., Akubire, J., & Darkwah, O. (2024). A Comparison of Machine Learning Algorithms for Predicting Sea Surface Temperature in the Great Barrier Reef Region. http://arxiv.org/abs/2411.15202
Rahman, A. U., Abbas, S., Gollapalli, M., Ahmed, R., Aftab, S., Ahmad, M., Khan, M. A., & Mosavi, A. (2022). Rainfall Prediction System Using Machine Learning Fusion for Smart Cities. Sensors, 22(9). https://doi.org/10.3390/s22093504
Rizky, M. H., Faisal, M. R., Budiman, I., Kartini, D., & Abadi, F. (2024). Effect of Hyperparameter Tuning Using Random Search on Tree-Based Classification Algorithm for Software Defect Prediction. IJCCS (Indonesian Journal of Computing and Cybernetics Systems), 18(1), 95. https://doi.org/10.22146/ijccs.90437
Sangaji, D., & Sutabri, T. (2025). Analisis XGBoost dan Random Forest untuk Prediksi Curah Hujan dalam Mendukung Mitigasi Karhutla. Jurnal Pustaka AI (Pusat Akses Kajian Teknologi Artificial Intelligence), 5(1), 13–18. https://doi.org/10.55382/jurnalpustakaai.v5i1.905
Segovia-Cardozo, D. A., Bernal-Basurco, C., & Rodríguez-Sinobas, L. (2023). Tipping Bucket Rain Gauges in Hydrological Research: Summary on Measurement Uncertainties, Calibration, and Error Reduction Strategies. Sensors, 23(12). https://doi.org/10.3390/s23125385
Shankar, B., Soni, A., Chandrasekar, H., Raghavan, S., & Shrivastava, M. (2019). First Observations on the Trap-Induced Avalanche Instability and Safe Operating Area Concerns in AlGaN/GaN HEMTs. IEEE Transactions on Electron Devices, 66(8), 3433–3440. https://doi.org/10.1109/TED.2019.2919491
Tricha, A., & Moussaid, L. (2024). Evaluating machine learning models for precipitation prediction in Casablanca City. June, 1325–1332. https://doi.org/10.11591/ijeecs.v35.i2.pp1325-1332
Yan, X., & Duan, G. (2022). The Real‐Time Prediction of Product Quality Based on the Equipment Parameters in a Smart Factory. Processes, 10(5). https://doi.org/10.3390/pr10050967
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Indri Budiarto*, Aji Supriyanto

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

























Download 