Abstract
The function of a sewage treatment plant is to treat the sewage to acceptable standards before being discharged into the receiving waters. To design and operate such plants, it is necessary to measure and predict the influent flow rate. In this research, the influent flow rate of a sewage treatment plant (STP) was modelled and predicted by autoregressive integrated moving average (ARIMA), nonlinear autoregressive network (NAR) and support vector machine (SVM) regression time series algorithms. To evaluate the models’ accuracy, the root mean square error (RMSE) and coefficient of determination (R2) were calculated as initial assessment measures, while relative error (RE), peak flow criterion (PFC) and low flow criterion (LFC) were calculated as final evaluation measures to demonstrate the detailed accuracy of the selected models. An integrated model was developed based on the individual models’ prediction ability for low, average and peak flow. An initial assessment of the results showed that the ARIMA model was the least accurate and the NAR model was the most accurate. The RE results also prove that the SVM model’s frequency of errors above 10% or below − 10% was greater than the NAR model’s. The influent was also forecasted up to 44 weeks ahead by both models. The graphical results indicate that the NAR model made better predictions than the SVM model. The final evaluation of NAR and SVM demonstrated that SVM made better predictions at peak flow and NAR fit well for low and average inflow ranges. The integrated model developed includes the NAR model for low and average influent and the SVM model for peak inflow.
Similar content being viewed by others
References
Adamowski J, Chan HF (2011) A wavelet neural network conjunction model for groundwater level forecasting. J Hydrol 407(1-4):28–40. https://doi.org/10.1016/j.jhydrol.2011.06.013
Akbari-Alashti H, Bozorg Haddad O, Mariño MA (2015) Evaluation of a developed discrete time-series method in flow forecasting models. Water Resour Manag 29(9):3211–3225. https://doi.org/10.1007/s11269-015-0991-1
Amari S, Wu S (1999) Improving support vector machine classifiers by modifying kernel functions. Neural Netw 12(6):783–789. https://doi.org/10.1016/S0893-6080(99)00032-5
Asefa T, Kemblowski M, McKee M, Khalil A (2006) Multi-time scale stream flow predictions: the support vector machines approach. J Hydrol 318(1-4):7–16. https://doi.org/10.1016/j.jhydrol.2005.06.001
Ashrafi O, Yerushalmi L, Haghighat F (2013) Application of dynamic models to estimate greenhouse gas emission by wastewater treatment plants of the pulp and paper industry. Environ Sci Pollut Res 20(3):1858–1869. https://doi.org/10.1007/s11356-012-1310-5
Ayele GT, Teshale EZ, Yu B, Rutherfurd ID, Jeong J (2017) Streamflow and sediment yield prediction for watershed prioritization in the upper Blue Nile river basin, Ethiopia. Water (Switzerland) 9(10). https://doi.org/10.3390/w9100782
Badrzadeh H, Sarukkalige R, Jayawardena AW (2015) Hourly runoff forecasting for flood risk management: application of various computational intelligence models. J Hydrol 529(Part 3):1633–1643. https://doi.org/10.1016/j.jhydrol.2015.07.057
Benmouiza K, Cheknane A (2013) Forecasting hourly global solar radiation using hybrid k-means and nonlinear autoregressive neural network models. Energy Convers Manag 75:561–569. https://doi.org/10.1016/j.enconman.2013.07.003
Box GEP, Jenkins GM (1976) Time series analysis: forecasting and control, 1st edn. Holden-Day, California
Cao S, Wang F, Tam W, Tse LA, Kim JH, Liu J, Lu Z (2013) A hybrid seasonal prediction model for tuberculosis incidence in China. BMC Med Inform Decis Mak 13(1):56. https://doi.org/10.1186/1472-6947-13-56
Ch S, Anand N, Panigrahi BK, Mathur S (2013) Streamflow forecasting by SVM with quantum behaved particle swarm optimization. Neurocomputing 101:18–23. https://doi.org/10.1016/j.neucom.2012.07.017
Chang F-J, Chiang Y-M, Chang L-C (2007) Multi-step-ahead neural networks for flood forecasting. Hydrol Sci J 52(1):114–130. https://doi.org/10.1623/hysj.52.1.114
Collobert R, Bengio S (2001) SVMTorch: support vector machines for large-scale regression problems. J Mach Learn Res 1:143–160
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297. https://doi.org/10.1007/bf00994018
Coulibaly P, Bobée B, Anctil F (2001) Improving extreme hydrologic events forecasting using a new criterion for artificial neural network selection. Hydrol Process 15(8):1533–1536. https://doi.org/10.1002/hyp.445
Daliakopoulos IN, Coulibaly P, Tsanis IK (2005) Groundwater level forecasting using artificial neural networks. J Hydrol 309(1-4):229–240. https://doi.org/10.1016/j.jhydrol.2004.12.001
De Gooijer JG, Hyndman RJ (2006) 25 years of time series forecasting. Int J Forecast 22(3):443–473. https://doi.org/10.1016/j.ijforecast.2006.01.001
Dibike YB, Velickov S, Solomatine D, Abbott MB (2001) Model induction with support vector machines: introduction and applications. J Comput Civ Eng 15(3):208–216. https://doi.org/10.1061/(ASCE)0887-3801(2001)15:3(208)
Drucker H, Burges CJC, Kaufman L, Smola A, Vapnik V (1996) Support vector regression machines. Paper presented at the Proceedings of the 9th International Conference on Neural Information Processing Systems, Denver, Colorado
El-Shafie A, Abdin AE, Noureldin A, Taha MR (2009) Enhancing inflow forecasting model at Aswan high dam utilizing radial basis neural network and upstream monitoring stations measurements. Water Resour Manag 23(11):2289–2315. https://doi.org/10.1007/s11269-008-9382-1
El-Shafie A, Taha MR, Noureldin A (2007) A neuro-fuzzy model for inflow forecasting of the Nile river at Aswan high dam. Water Resour Manag 21(3):533–556. https://doi.org/10.1007/s11269-006-9027-1
Escalas-Cañellas A, Ábrego-Góngora CJ, Barajas-López MG, Houweling D, Comeau Y (2008) A time series model for influent temperature estimation: application to dynamic temperature modelling of an aerated lagoon. Water Res 42(10-11):2551–2562. https://doi.org/10.1016/j.watres.2008.02.016
Ghorbani MA, Zadeh HA, Isazadeh M, Terzi O (2016) A comparative study of artificial neural network (MLP, RBF) and support vector machine models for river flow prediction. Environ Earth Sci 75(6). https://doi.org/10.1007/s12665-015-5096-x
Güldal V, Tongal H (2010) Comparison of recurrent neural network, adaptive neuro-fuzzy inference system and stochastic models in Eğirdir Lake level forecasting. Water Resour Manag 24(1):105–128. https://doi.org/10.1007/s11269-009-9439-9
Guzman SM, Paz JO, Tagert MLM (2017) The use of NARX neural networks to forecast daily groundwater levels. Water Resour Manag 31(5):1591–1603. https://doi.org/10.1007/s11269-017-1598-5
He Z, Wen X, Liu H, Du J (2014) A comparative study of artificial neural network, adaptive neuro fuzzy inference system and support vector machine for forecasting river flow in the semiarid mountain region. J Hydrol 509:379–386. https://doi.org/10.1016/j.jhydrol.2013.11.054
Hipni A, El-shafie A, Najah A, Karim OA, Hussain A, Mukhlisin M (2013) Daily forecasting of dam water levels: comparing a support vector machine (SVM) model with adaptive neuro fuzzy inference system (ANFIS). Water Resour Manag 27(10):3803–3823. https://doi.org/10.1007/s11269-013-0382-4
Joachims T (1999) Making large-scale support vector machine learning practical. In: Bernhard S, lkopf, Christopher JCB, Alexander JS (eds) Advances in kernel methods. MIT Press, pp 169–184
Kalogirou SA (2000) Applications of artificial neural-networks for energy systems. Appl Energy 67(1-2):17–35. https://doi.org/10.1016/S0306-2619(00)00005-2
Kalogirou SA (2001) Artificial neural networks in renewable energy systems applications: a review. Renew Sust Energ Rev 5(4):373–401. https://doi.org/10.1016/S1364-0321(01)00006-5
Khashei M, Bijari M (2010) An artificial neural network (p, d, q) model for timeseries forecasting. Expert Syst Appl 37(1):479–489. https://doi.org/10.1016/j.eswa.2009.05.044
Kim M, Kim Y, Kim H, Piao W, Kim C (2016a) Evaluation of the k-nearest neighbor method for forecasting the influent characteristics of wastewater treatment plant. Front Environ Sci Eng 10(2):299–310. https://doi.org/10.1007/s11783-015-0825-7
Kim M, Kim Y, Kim H, Piao W, Kim C (2016b) Operator decision support system for integrated wastewater management including wastewater treatment plants and receiving water bodies. Environ Sci Pollut Res 23(11):10785–10798. https://doi.org/10.1007/s11356-016-6272-6
Kisi O (2010) Wavelet regression model for short-term streamflow forecasting. J Hydrol 389(3-4):344–353. https://doi.org/10.1016/j.jhydrol.2010.06.013
Kişi Ö (2008) Stream flow forecasting using neuro-wavelet technique. Hydrol Process 22(20):4142–4152. https://doi.org/10.1002/hyp.7014
Kişi Ö (2009) Neural networks and wavelet conjunction model for intermittent streamflow forecasting. Journal of Hydrologic Engineering 14(8):773–782. https://doi.org/10.1061/(ASCE)HE.1943-5584.0000053
Kisi O, Cimen M (2011) A wavelet-support vector machine conjunction model for monthly streamflow forecasting. J Hydrol 399(1-2):132–140. https://doi.org/10.1016/j.jhydrol.2010.12.041
Kusiak A, Wei X, Verma AP, Roz E (2013) Modeling and prediction of rainfall using radar reflectivity data: a data-mining approach. IEEE Trans Geosci Remote Sens 51(4):2337–2342. https://doi.org/10.1109/TGRS.2012.2210429
Legates DR, McCabe GJ Jr (1999) Evaluating the use of 'goodness-of-fit' measures in hydrologic and hydroclimatic model validation. Water Resour Res 35(1):233–241. https://doi.org/10.1029/1998WR900018
Li X, Zeng G, Huang G, Li J, Jiang R (2007) Short-term prediction of the influent quantity time series of wastewater treatment plant based on a chaos neural network model. Front Environ Sci Eng China 1(3):334–338. https://doi.org/10.1007/s11783-007-0057-6
Liu Y, Ganigué R, Sharma K, Yuan Z (2016) Event-driven model predictive control of sewage pumping stations for sulfide mitigation in sewer networks. Water Res 98:376–383. https://doi.org/10.1016/j.watres.2016.04.039
MATLAB and Statistics and Machine Learning Toolbox (2015) Release 2015b. The MathWorks, Inc., Natick
Moriasi DN, Arnold JG, Van Liew MW, Bingner RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50(3):885–900. https://doi.org/10.13031/2013.23153
Nourani V, Hosseini Baghanam A, Adamowski J, Kisi O (2014) Applications of hybrid wavelet–artificial intelligence models in hydrology: a review. J Hydrol 514:358–377. https://doi.org/10.1016/j.jhydrol.2014.03.057
Othman F, Heydari M, Ahmadizadeh M, Nozari H, Sadegh M (2017) Investigating the effectiveness of seasonalization based on statistical parameters in normalizing, modeling and forecasting inflow time series. Fresenius Environ Bull 26:590–597
Patel SS, Ramachandran P (2015) A comparison of machine learning techniques for modeling river flow time series: the case of upper Cauvery River basin. Water Resour Manag 29(2):589–602. https://doi.org/10.1007/s11269-014-0705-0
Patil K, Deo MC, Ghosh S, Ravichandran M (2013) Predicting sea surface temperatures in the North Indian Ocean with nonlinear autoregressive neural networks. Int J Oceanogr 2013:11. https://doi.org/10.1155/2013/302479
Platt JC (1999) Fast training of support vector machines using sequential minimal optimization. Adv Kernel Methods 185–208
Sehgal V, Tiwari MK, Chatterjee C (2014) Wavelet bootstrap multiple linear regression based hybrid modeling for daily river discharge forecasting. Water Resour Manag 28(10):2793–2811. https://doi.org/10.1007/s11269-014-0638-7
Shahverdi K, Khodaei K, Shokohi A (2015) Prediction of monthly inflow to dam reservoir using ANN and ANFIS: a case study of Latyan dam. Int Bull Water Res Dev 3:X–XVIII
Sivapragasam C, Liong S-Y, Pasha MFK (2001) Rainfall and runoff forecasting with SSA–SVM approach. J Hydroinf 3:141–152
Sivapragasam C, Liong SY (2005) Flow categorization model for improving forecasting. Nord Hydrol 36:37–48
Solaimany-Aminabad M, Maleki A, Hadi M (2013) Application of artificial neural network (ANN) for the prediction of water treatment plant influent characteristics. J Adv Environ Health Res 1
Sudheer C, Maheswaran R, Panigrahi BK, Mathur S (2014) A hybrid SVM-PSO model for forecasting monthly streamflow. Neural Comput & Applic 24(6):1381–1389. https://doi.org/10.1007/s00521-013-1341-y
Sweetapple C, Fu G, Butler D (2014) Multi-objective optimisation of wastewater treatment plant control to reduce greenhouse gas emissions. Water Res 55:52–62. https://doi.org/10.1016/j.watres.2014.02.018
Taher A, Faridah O (2017) Time series analysis and forecasting of wastewater inflow into Bandar Tun Razak sewage treatment plant in Selangor, Malaysia. IOP Conf Ser Mater Sci Eng 210:012028
Vapnik V (1995) The nature of statistical learning theory. Springer, New York. https://doi.org/10.1007/978-1-4757-2440-0
Vyas M, Modhera B, Vyas V, Sharma A (2011) Performance forecasting of common effluent treatment plant parameters by artificial neural network. ARPN J Eng Appl Sci 6:38–42
Wei S, Song J, Khan NI (2012a) Simulating and predicting river discharge time series using a wavelet-neural network hybrid modelling approach. Hydrol Process 26(2):281–296. https://doi.org/10.1002/hyp.8227
Wei S, Zuo D, Song J (2012b) Improving prediction accuracy of river discharge time series using a wavelet-NAR artificial neural network. J Hydroinf 14(4):974–991. https://doi.org/10.2166/hydro.2012.143
Wei W, Jiang J, Liang H, Gao L, Liang B, Huang J, Zang N, Liao Y, Yu J, Lai J, Qin F, Su J, Ye L, Chen H (2016) Application of a combined model with autoregressive integrated moving average (ARIMA) and generalized regression neural network (GRNN) in forecasting hepatitis incidence in Heng County, China. PLoS One 11(6):e0156768. https://doi.org/10.1371/journal.pone.0156768
Wei X, Kusiak A, Sadat HR (2012c) Prediction of influent flow rate: data-mining approach. J Energy Eng 139:118–123
Wu CL, Chau KW (2010) Data-driven models for monthly streamflow time series prediction. Eng Appl Artif Intell 23(8):1350–1367. https://doi.org/10.1016/j.engappai.2010.04.003
Yaseen ZM, El-shafie A, Jaafar O, Afan HA, Sayl KN (2015) Artificial intelligence based models for stream-flow forecasting: 2000–2015. J Hydrol 530:829–844. https://doi.org/10.1016/j.jhydrol.2015.10.038
Yu P-S, Chen S-T, Chang IF (2006) Support vector regression for real-time flood stage forecasting. J Hydrol 328(3-4):704–716. https://doi.org/10.1016/j.jhydrol.2006.01.021
Yu X, Liong S-Y, Babovic V (2004) EC-SVM approach for real-time hydrologic forecasting. J Hydroinf 6:209–223
Zhang GP (2003) Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50:159–175. https://doi.org/10.1016/S0925-2312(01)00702-0
Zhang Q, Ji FY, Xu XY (2016) Optimization of nitrate removal from wastewater with a low C/N ratio using solid-phase denitrification. Environ Sci Pollut Res 23(1):698–708. https://doi.org/10.1007/s11356-015-5308-7
Acknowledgments
The authors would like to acknowledge the financial support from the Ministry of Education and University of Malaya for grants FRGS (FP016-2014A) and UMRG (FL001-13SUS), respectively. We would also like to acknowledge the Water Research Centre of University of Malaya for their support. We appreciate the cooperation given by relevant parties and companies for providing the necessary data and assistance. We are most grateful and would like to thank the reviewers for their valuable suggestions, which have led to substantial improvement of the article.
Author information
Authors and Affiliations
Corresponding author
Additional information
Responsible editor: Marcus Schulz
Rights and permissions
About this article
Cite this article
Ansari, M., Othman, F., Abunama, T. et al. Analysing the accuracy of machine learning techniques to develop an integrated influent time series model: case study of a sewage treatment plant, Malaysia. Environ Sci Pollut Res 25, 12139–12149 (2018). https://doi.org/10.1007/s11356-018-1438-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11356-018-1438-z