Skip to main content
Log in

Forecasting financial series using clustering methods and support vector regression

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

This paper proposes a two-stage model for forecasting financial time series. The first stage uses clustering methods in order to segment the time series into its various contexts. The second stage makes use of support vector regressions (SVRs), one for each context, to forecast future values of the series. The series used in the experiments is composed of values of an equity fund of a Brazilian bank. The proposed model is compared to a hierarchical model (HM) presented in the literature. In this series, the HM presented prediction results superior to both a support vector machine (SVM) and a multilayer perceptron (MLP) models. The experiments show that the proposed model is superior to HM, reducing the forecasting error of the HM by 32%. This means that the proposed model is also superior to the SVM and MLP models. An analysis of the construction and use of clusters associated with a series volatility study shows that data obtained from only one type of volatility (low or high) are enough to provide sufficient knowledge to the model so that it is able to forecast future values with good accuracy. Another analysis on the quality of the clusters formed by the model shows that each cluster carries different information about the series. Furthermore, there is always a group of SVRs capable of making adequate forecasts and, for the most part, the SVR used in forecasting is a SVR belonging to this group.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Abu-Mostafa YS, Atiya AF (1996) Introduction to financial forecasting. Appl Intell 6(3):205–213

    Article  Google Scholar 

  • Anderson DR, Sweeney DJ, Williams TA, Camm JD, Cochran JJ, Fry MJ, Ohlmann JW (2016) An introduction to management science: quantitative approaches to decision making. Int J Forecast 8(1):69–80

    Google Scholar 

  • Armano G, Marchesi M, Murru A (2005) A hybrid genetic-neural architecture for stock indexes forecasting. Inf Sci 170(1):3–33 (computational Intelligence in Economics and Finance)

    Article  MathSciNet  Google Scholar 

  • Armstrong J, Collopy F (1992) Error measures for generalizing about forecasting methods: empirical comparisons. Int J Forecast 8(1):69–80

    Article  Google Scholar 

  • Atsalakis GS, Valavanis KP (2009) Surveying stock market forecasting techniques—part II: soft computing methods. Expert Syst Appl 36(3, Part 2):5932–5941. https://doi.org/10.1016/j.eswa.2008.07.006

    Article  Google Scholar 

  • Azad MK, Uddin S, Takruri M (2018) Support vector regression based electricity peak load forecasting. In: 2018 11th international symposium on mechatronics and its applications (ISMA), pp 1–5. https://doi.org/10.1109/ISMA.2018.8330143

  • Bezdek JC, Ehrlich R, Full W (1984) FCM: the Fuzzy C-Means clustering algorithm. Comput Geosci 10(2–3):191–203

    Article  Google Scholar 

  • Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the ACM annual workshop on computational learning theory (COLT), pp 144–152

  • BOVESPA (2017) http://www.bmfbovespa.com.br/pt_br/produtos/indices/indices-amplos/indice-brasil-100-ibrx-100-1.htm

  • Box GEP, Jenkins GM, Reinsel GC (2008) Time series analysis, forecasting and control. Wiley, Hoboken

    Book  MATH  Google Scholar 

  • Bank of Brazil (2010) http://www37.bb.com.br/portalbb/fundosInvestimento/fundosinvestimento/gf07,802,10340,10340,1,0.bbx?fundo=6

  • Cao L (2003) Support vector machines experts for time series forecasting. Neurocomputing 51:321–339

    Article  Google Scholar 

  • Cao L, Tay FEH (2001a) Application of support vector machines in financial time series forecasting. Omega 29(4):309–317

    Article  Google Scholar 

  • Cao L, Tay FEH (2001b) Improved financial time series forecasting by combining support vector machines with self-organizing feature map. Intell Data Anal 5(4):339–354

    Article  MATH  Google Scholar 

  • Cao L, Tay FEH (2003) Support vector machine with adaptive parameters in financial time series forecasting. IEEE Trans Neural Netw 14(6):1506–1518

    Article  Google Scholar 

  • Carpinteiro OAS, Leite JPRR, Pinheiro CAM, Lima I (2012) Forecasting models for prediction in time series. Artif Intell Rev 38(2):163–171

    Article  Google Scholar 

  • Chabaa S, Zeroual A, Antari J (2010) Identification and prediction of internet traffic using artificial neural networks. J Intell Learn Syst Appl 2(3):147–155

    Google Scholar 

  • Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):27

    Article  Google Scholar 

  • Clements MP, Franses PH, Swanson NR (2004) Forecasting economic and financial time-series with non-linear models. Int J Forecast 20(2):169–183

    Article  Google Scholar 

  • Diebold FX, Mariano RS (2002) Comparing predictive accuracy. J Bus Econ Stat 20(1):134–144. https://doi.org/10.1198/073500102753410444

    Article  MathSciNet  Google Scholar 

  • Everitt BS, Landau S, Leese M, Stahl D (2011) Cluster analysis. Wiley, Hoboken

    Book  MATH  Google Scholar 

  • Harvey DI, Leybourne SJ, Newbold P (1998) Tests for forecast encompassing. J Bus Econ Stat 16(2):254–259

    Google Scholar 

  • Haviluddin, Alfred R (2015) A genetic-based backpropagation neural network for forecasting in time-series data. In: Proceedings of the international conference on science in information technology (ICSITech), pp 158–163

  • Haykin S (2009) Neural networks and learning machines. Prentice Hall, Upper Saddle River

    Google Scholar 

  • Huang W, Nakamori Y, Wang SY (2005) Forecasting stock market movement direction with support vector machine. Comput Oper Res 32(10):2513–2522

    Article  MATH  Google Scholar 

  • Kara Y, Boyacioglu MA, Baykan ÖK (2011) Predicting direction of stock price index movement using artificial neural networks and support vector machines: the sample of the istanbul stock exchange. Expert Syst Appl 38(5):5311–5319. https://doi.org/10.1016/j.eswa.2010.027

    Article  Google Scholar 

  • Kim Y, Enke D (2016) Developing a rule change trading system for the futures market using rough set analysis. Expert Syst Appl 59:165–173. https://doi.org/10.1016/j.eswa.2016.04.031

    Article  Google Scholar 

  • Kutics A, O’Connell C, Nakagawa A (2013) Segment-based image classification using layered-SOM. In: Proceedings of the IEEE international conference on image processing, pp 2430–2434

  • Limei L, Xuan H (2017) Study of electricity load forecasting based on multiple kernels learning and weighted support vector regression machine. In: 2017 29th Chinese control and decision conference (CCDC), pp 1421–1424. https://doi.org/10.1109/CCDC.2017.7978740

  • Lin Q, Wang Q, Zhang G, Shi Y, Liu H, Deng L (2018) Maximum daily load forecasting based on support vector regression considering accumulated temperature effect. In: 2018 Chinese control and decision conference (CCDC), pp 5199–5203. https://doi.org/10.1109/CCDC.2018.8408035

  • Liu D, Chen Q, Mori K (2015) Time series forecasting method of building energy consumption using support vector regression. In: 2015 IEEE international conference on information and automation, pp 1628–1632. https://doi.org/10.1109/ICInfA.2015.7279546

  • MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the Berkeley symposium on mathematical statistics and probability

  • Makridakis S, Whellwright SC, Hyndman RJ (1998) Forecasting: methods and appplications. Wiley, Hoboken

    Google Scholar 

  • Mercer J (1909) Functions of positive and negative type, and their connection with the theory of integral equations. Philos Trans R Soc Lond A: Math Phys Eng Sci 209(441–458):415–446

    Article  MATH  Google Scholar 

  • Oliveira JV, Pedrycz W (2007) Advances in fuzzy clustering and its applications. Wiley, Hoboken

    Book  Google Scholar 

  • Patel J, Shah S, Thakkar P, Kotecha K (2015) Predicting stock and stock price index movement using trend deterministic data preparation and machine learning techniques. Expert Syst Appl 42(1):259–268. https://doi.org/10.1016/j.eswa.2014.07.040

    Article  Google Scholar 

  • Perwej Y, Perwej A (2012) Prediction of the Bombay Stock Exchange (BSE) market returns using artificial neural network and genetic algorithm. J Intell Learn Syst Appl 4(2):108–119

    Google Scholar 

  • Popovici R, Andonie R (2015) Music genre classification with self-organizing maps and edit distance. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 1–7

  • Rosowsky YI, Smith RE (2013) Rejection based support vector machines for financial time series forecasting. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 1–7

  • Sapankevych NI, Sankar R (2009) Time series prediction using support vector machines: a survey. IEEE Comput Intell Mag 4(2):24–38

    Article  Google Scholar 

  • Shen W, Xing M (2009) Stock index forecast with back propagation neural network optimized by genetic algorithm. In: Proceedings of the international conference on information and computing science, vol 2, pp 376–379

  • Singh S, Bhambri P, Gill J (2011) Time series based temperature prediction using back propagation with genetic algorithm technique. Int J Comput Sci Issues 8(5):28

    Google Scholar 

  • Small GR, Wong R (2002) The validity of forecasting. In: Pacific rim real estate society international conference Christchurch

  • Tong H (2002) Nonlinear time series analysis since 1990: some personal reflections. Acta Math Appl Sin 18(2):177–184

    Article  MathSciNet  MATH  Google Scholar 

  • Tsay R (2010) Analysis of financial time series, Wiley series in probability and statistics, 3rd edn. Wiley-Interscience, Hoboken

    Book  Google Scholar 

  • Van Gestel T, Suykens JAK, Baestaens DE, Lambrechts A, Lanckriet G, Vandaele B, De Moor B, Vandewalle J (2001) Financial time series prediction using least squares support vector machines within the evidence framework. IEEE Trans Neural Netw 12(4):809–821

    Article  Google Scholar 

  • Vapnik VN (1998) Statistical learning theory. Wiley, Hoboken

    MATH  Google Scholar 

  • Vapnik VN (1999) An overview of statistical learning theory. IEEE Trans Neural Netw 10(5):988–999

    Article  Google Scholar 

  • Wang JZ, Wang JJ, Zhang ZG, Guo SP (2011) Forecasting stock indices with back propagation neural network. Expert Syst Appl 38(11):14346–14355

    Google Scholar 

  • Yang H, Chan L, King I (2002) Support vector machine regression for volatile stock market prediction. In: Proceedings of the international conference on intelligent data engineering and automated learning (IDEAL), pp 391–396

  • Yang H, Huang K, Chan L, King I, Lyu MR (2004) Outliers treatment in support vector regression for financial time series prediction. In: Proceedings of the international conference on neural information processing (ICONIP), pp 1260–1265

  • Yizhen L, Wenhua Z, Ling L, Jun W, Gang L (2011) The forecasting of Shanghai index trend based on genetic algorithm and back propagation artificial neural network algorithm. In: Proceedings of the international conference on computer science education (ICCSE), pp 420–424

  • Yu L, Dai W, Tang L, Wu J (2015) A hybrid grid-GA-based LSSVR learning paradigm for crude oil price forecasting

  • Yu L, Xu H, Tang L (2016) LSSVR ensemble learning with uncertain parameters for crude oil price forecasting

  • Yu L, Zhang X, Wang S (2017) Assessing potentiality of support vector machine method in crude oil price forecasting. Eurasia J Math Sci Technol Educ 13(12):7893–7904. https://doi.org/10.12973/ejmste/77926

    Google Scholar 

  • Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338–353

    Article  MATH  Google Scholar 

  • Zhao C, Yu Z (2017) The research on forecasting model based on support vector machine and discrete grey system. In: 2017 international conference on computing intelligence and information system (CIIS), pp 104–107. https://doi.org/10.1109/CIIS.2017.24

  • Zhong X, Enke D (2017) Forecasting daily stock market return using dimensionality reduction. Expert Syst Appl 67:126–139. https://doi.org/10.1016/j.eswa.2016.09.027

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for their valuable comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lucas F. S. Vilela.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vilela, L.F.S., Leme, R.C., Pinheiro, C.A.M. et al. Forecasting financial series using clustering methods and support vector regression. Artif Intell Rev 52, 743–773 (2019). https://doi.org/10.1007/s10462-018-9663-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-018-9663-x

Keywords

Navigation