Skip to main content
Log in

Improving the performance of random forest for estimating monthly reservoir inflow via complete ensemble empirical mode decomposition and wavelet analysis

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Estimation of reservoir inflow is of particular importance in optimal planning and management of water resources, proper allocation of water to consumption sectors, hydrological studies, etc. This study aimed to estimate monthly inflow (Q) to the Maroon Dam reservoir located in Iran utilizing climatic data such as minimum, maximum, and mean air temperatures (Tmin, Tmax, T), reservoir evaporation (E), and rainfall (R). The impact of any of the mentioned variables was analyzed by the entropy-based pre-processing technique. The results of the pre-processing showed that the rainfall is the most important parameter affecting the reservoir inflow. Therefore, three types of input patterns were taken into consideration consisting the antecedent Q-based, antecedent R-based, and combined antecedent Q and R-based input combinations. To estimate the monthly reservoir inflow, a random forest (RF) was firstly employed as the standalone model. Then, two different types of hybrid models were proposed via coupling the RF on complete ensemble empirical mode decomposition (CEEMD) and wavelet analysis (W) in order to implement the coupled CEEMD-RF and W-RF models. It is worthwhile to mentioning that six mother wavelets were used in developing the hybrid W-RF models. Four error metrics including root mean square error (RMSE), mean absolute error (MAE), Kling-Gupta efficiency (KGE), and Willmott index (WI) were used to assess the accuracy of implemented models. The attained results indicated the superiority of proposed hybrid models over the classic RF for estimating the monthly reservoir inflow. The most precise model during the test phase was W-RF(3) utilizing the Sym(2) as the mother wavelet under a lagged Q-based pattern with error measures of RMSE = 15.011 m3/s, MAE = 10.439 m3/s, KGE = 0.832, WI = 0.773.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Adnan RM, Petroselli A, Heddam S, Santos CAG, Kisi O (2021a) Short term rainfall-runoff modelling using several machine learning methods and a conceptual event-based model. Stoch Environ Res Risk Assess 35(3):597–616

    Google Scholar 

  • Adnan RM, Liang Z, Parmar KS, Soni K, Kisi O (2021b) Modeling monthly streamflow in mountainous basin by MARS, GMDH-NN and DENFIS using hydroclimatic data. Neural Comput Applic 33(7):2853–2871

    Google Scholar 

  • Afan HA, Allawi MF, El-Shafie A, Yaseen ZM, Ahmed AN, Malek MA, El-Shafie A (2020) Input attributes optimization using the feasibility of genetic nature inspired algorithm: application of river flow forecasting. Sci Reports 10(1):1–15

    Google Scholar 

  • Ahmadi F, Mehdizadeh S, Mohammadi B, Pham QB, Doan TNC, Vo ND (2021a) Application of an artificial intelligence technique enhanced with intelligent water drops for monthly reference evapotranspiration estimation. Agric Water Manage 244:106622

  • Ahmadi F, Mehdizadeh S, Mohammadi B (2021b) Development of bio-inspired- and wavelet-based hybrid models for reconnaissance drought index modeling. Water Resour Manage 35(12):4127–4147

    Google Scholar 

  • Ahmadi F, Nazeri Tahroudi M, Mirabbasi R, Khalili K, Jhajharia D (2018) Spatiotemporal trend and abrupt change analysis of temperature in Iran. Meteorol Appl 25(2):314–321

    Google Scholar 

  • Ali M, Prasad R, Xiang Y, Yaseen ZM (2020) Complete ensemble empirical mode decomposition hybridized with random forest and kernel ridge regression model for monthly rainfall forecasts. J Hydrol 584:124647

  • Apaydin H, Feizi H, Sattari MT, Colak MS, Shamshirband S, Chau KW (2020) Comparative analysis of recurrent neural network architectures for reservoir inflow forecasting. Water 12(5):1500

    Google Scholar 

  • Bednarik M, Magulová B, Matys M, Marschalko M (2010) Landslide susceptibility assessment of the Kraľovany-Liptovský Mikuláš railway case study. Phys Chem Earth Parts a/b/c 35(3–5):162–171

    Google Scholar 

  • Booker DJ, Snelder TH (2012) Comparing methods for estimating flow duration curves at ungauged sites. J Hydrol 434:78–94

    Google Scholar 

  • Breiman L (2001) Random Forests. Mach Learn 45(1):5–32

    Google Scholar 

  • Chen BH, Wang XZ, Yang SH, McGreavy C (1999) Application of wavelets and neural networks to diagnostic system development, 1, feature extraction. Comput Chem Eng 23(7):899–906

    CAS  Google Scholar 

  • Chen S, Dong S (2020) A sequential structure for water inflow forecasting in coal mines integrating feature selection and multi-objective optimization. IEEE Access 8:183619–183632

    Google Scholar 

  • Chu TY, Huang WC (2020) Application of empirical mode decomposition method to synthesize flow data: A case study of Hushan Reservoir in Taiwan. Water 12(4):927

    Google Scholar 

  • Darbandsari P, Coulibaly P (2020) Introducing entropy-based Bayesian model averaging for streamflow forecast. J Hydrol 591:125577

  • Fathian F, Mehdizadeh S, Sales AK, Safari MJS (2019) Hybrid models to improve the monthly river flow prediction: Integrating artificial intelligence and non-linear time series models. J Hydrol 575:1200–1213

    Google Scholar 

  • Friedman J, Hastie T, Tibshirani R (2001) The elements of statistical learning (Vol. 1, No. 10). Springer series in statisti, New York

  • Ghorbani MA, Deo RC, Kim S, Kashani MH, Karimi V, Izadkhah M (2020) Development and evaluation of the cascade correlation neural network and the random forest models for river stage and river flow prediction in Australia. Soft Comput 24:12079–12090

    Google Scholar 

  • Herath HMVV, Chadalawada J, Babovic V (2020) Hydrologically informed machine learning for rainfall-runoff modelling: Towards distributed modelling. Hydrol Earth Syst Sci Discussions, pp 1–42

  • Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen NC, Tong CC, Liu H (1998) The empirical mode decomposition and Hilbert spectrum for nonlinear and nonstationary time series analysis. Procee Royal Soci A 545(1971):903–995

    Google Scholar 

  • Hussain D, Khan AA (2020) Machine learning techniques for monthly river flow forecasting of Hunza River, Pakistan. Earth Sci Inform 13:939–949

    Google Scholar 

  • Khalili K, Nazeri Tahoudi M, Mirabbasi R, Ahmadi F (2016) Investigation of spatial and temporal variability of precipitation in Iran over the last half century. Stoch Environ Res Risk Assess 30(4):1205–1221

    Google Scholar 

  • Lee D, Kim H, Jung I, Yoon J (202) Monthly reservoir inflow forecasting for dry period using teleconnection indices: a statistical ensemble approach. Appl Sci 10(10):3470.

  • Mallat SG (2009) A theory for multiresolution signal decomposition: the wavelet representation. In Fundamental Papers in Wavelet Theory (pp. 494–513). Princeton University Press.

  • Mehdizadeh S (2020) Using AR, MA, and ARMA time series models to improve the performance of MARS and KNN approaches in monthly precipitation modeling under limited climatic data. Water Resour Manage 34(1):263–282

    Google Scholar 

  • Mehdizadeh S, Ahmadi F, Mehr AD, Safari MJS (2020a) Drought modeling using classic time series and hybrid wavelet-gene expression programming models. J Hydrol 587:125017

    Google Scholar 

  • Mehdizadeh S, Ahmadi F, Sales AK (2020b) Modelling daily soil temperature at different depths via the classical and hybrid models. Meteorol Appl 27(4):e1941

    Google Scholar 

  • Mehdizadeh S, Sales AK (2018) A comparative study of autoregressive, autoregressive moving average, gene expression programming and Bayesian networks for estimating monthly streamflow. Water Resour Manage 32(9):3001–3022

    Google Scholar 

  • Mehr AD, Kahya E, Bagheri F, Deliktas E (2014) Successive-station monthly streamflow prediction using neuro-wavelet technique. Earth Sci Inform 7(4):217–229

    Google Scholar 

  • Mehr AD, Nourani V, Hrnjica B, Molajou A (2017) A binary genetic programing model for teleconnection identification between global sea surface temperature and local maximum monthly rainfall events. J Hydrol 555:397–406

    Google Scholar 

  • Misiti M, Misiti Y, Oppenheim G, Poggi JM (1996) Wavelet Toolbox for Use with Matlab. The Mathworks Inc, Natick, Massachusetts, USA

    Google Scholar 

  • Mohammadi B, Ahmadi F, Mehdizadeh S, Guan Y, Pham QB, Linh NTT, Tri DQ (2020) Developing novel robust models to improve the accuracy of daily streamflow modeling. Water Resour Manage 34(10):3387–3409

    Google Scholar 

  • Molajou A, Nourani V, Afshar A, Khosravi M, Brysiewicz A (2021) Optimal design and feature selection by genetic algorithm for emotional artificial neural network (EANN) in rainfall-runoff modeling. Water Resour Manage. https://doi.org/10.1007/s11269-021-02818-2

    Article  Google Scholar 

  • Nayak PC, Sudheer KP, Rangan DM, Ramasastri KS (2004) A neuro-fuzzy computing technique for modeling hydrological time series. J Hydrol 291(1–2):52–66

    Google Scholar 

  • Nazir HM, Hussain I, Faisal M, Shoukry AM, Sharkawy MAW, Al-Deek FF, Ismail M (2020) Dependence structure analysis of multisite river inflow data using vine copula-CEEMDAN based hybrid model. PeerJ 8:e10285

  • Nourani V, Baghanam AH, Adamowski J, Kisi O (2014) Applications of hybrid wavelet–artificial intelligence models in hydrology: a review. J Hydrol 514:358–377

    Google Scholar 

  • Nourani V, Molajou A, Uzelaltinbulat S, Sadikoglu F (2019) Emotional artificial neural networks (EANNs) for multi-step ahead prediction of monthly precipitation; case study: northern Cyprus. Theor Appl Climatol 138:1419–1434

    Google Scholar 

  • Ouarda TB, Charron C, Mahdi S, Yousef LA (2021) Climate teleconnections, interannual variability, and evolution of the rainfall regime in a tropical Caribbean island: case study of Barbados. Theor Appl Climatol. https://doi.org/10.1007/s00704-021-03653-6

    Article  Google Scholar 

  • Pei-Yue L, Hui Q, Jian-Hua W (2010) Groundwater quality assessment based on improved water quality index in Pengyang County, Ningxia. Northwest China J Chem 7(S1):209–216

    Google Scholar 

  • Pham LT, Luo L, Finley AO (2020) Evaluation of random forest for short-term daily streamflow forecast in rainfall and snowmelt driven watersheds. Hydrol Earth Syst Sci Discussions, pp 1–33

  • Polikar R (1999) Fundamental concepts and overview of the wavelet theory: the wavelet tutorial–part I.

  • Pour SH, Abd Wahab AK, Shahid S (2020) Spatiotemporal changes in precipitation indicators related to bioclimate in Iran. Theor Appl Climatol 141(1):99–115

    Google Scholar 

  • Rahmani-Rezaeieh A, Mohammadi M, Mehr AD (2020) Ensemble gene expression programming: a new approach for evolution of parsimonious streamflow forecasting model. Theor Appl Climatol 139(1–2):549–564

    Google Scholar 

  • Ray SN, Chattopadhyay S (2021) Analyzing surface air temperature and rainfall in univariate framework, quantifying uncertainty through Shannon entropy and prediction through artificial neural network. Earth Sci Inform 14(1):485–503

    Google Scholar 

  • Roy DK (2021) Long short-term memory networks to predict one-step ahead reference evapotranspiration in a subtropical climatic zone. Environ Proc 8(2):911–941

    Google Scholar 

  • Salehi S, Dehghani M, Mortazavi SM, Singh VP (2020) Trend analysis and change point detection of seasonal and annual precipitation in Iran. Int J Climatol 40(1):308–323

    Google Scholar 

  • Santos CA, Freire PK, Silva RMD, Akrami SA (2019) Hybrid wavelet neural network approach for daily inflow forecasting using tropical rainfall measuring mission data. J Hydrol Eng 24(2):04018062

    Google Scholar 

  • Saray MH, Eslamian SS, Klöve B, Gohari A (2020) Regionalization of potential evapotranspiration using a modified region of influence. Theor Appl Climatol 140(1):115–127

    Google Scholar 

  • Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27(3):379–423

    Google Scholar 

  • Sharafi S, Karim NM (2020) Investigating trend changes of annual mean temperature and precipitation in Iran. Arab J Geosci 13(16):1–11

    Google Scholar 

  • Shataee S, Kalbi S, Fallah A, Pelz D (2012) Forest attribute imputation using machine-learning methods and ASTER data: comparison of k-NN, SVR and random forest regression algorithms. Int J Remote Sens 33(19):6254–6280

    Google Scholar 

  • Singh VP (2018) Hydrologic modeling: progress and future directions. Geosci Lett 5(1):1–18

    Google Scholar 

  • Tang T, Liang Z, Hu Y, Li B, Wang J (2020) Research on flood forecasting based on flood hydrograph generalization and random forest in Qiushui River basin. China J Hydroinform 22(6):1588–1602

    Google Scholar 

  • Torres ME, Colominas MA, Schlotthauer G, Flandrin P (2011) A complete ensemble empirical mode decomposition with adaptive noise. In 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 4144–4147). IEEE.

  • Vidyarthi VK, Jain A, Chourasiya S (2020) Modeling rainfall-runoff process using artificial neural network with emphasis on parameter sensitivity. Model Earth Syst Environ 6:2177–2188

    Google Scholar 

  • Wang J, Wang X, hui Lei X, Wang H, hua Zhang X, jun You J, lian Liu X (2020) Teleconnection analysis of monthly streamflow using ensemble empirical mode decomposition. J Hydrol 582:124411

  • Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv Adapt Data Anal 1(01):1–41

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saeid Mehdizadeh.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ahmadi, F., Mehdizadeh, S. & Nourani, V. Improving the performance of random forest for estimating monthly reservoir inflow via complete ensemble empirical mode decomposition and wavelet analysis. Stoch Environ Res Risk Assess 36, 2753–2768 (2022). https://doi.org/10.1007/s00477-021-02159-x

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-021-02159-x

Keywords

Navigation