Skip to main content

Sales Demand Forecast in E-commerce Using a Long Short-Term Memory Neural Network Methodology

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11955))

Included in the following conference series:

Abstract

Generating accurate and reliable sales forecasts is crucial in the E-commerce business. The current state-of-the-art techniques are typically univariate methods, which produce forecasts considering only the historical sales data of a single product. However, in a situation where large quantities of related time series are available, conditioning the forecast of an individual time series on past behaviour of similar, related time series can be beneficial. Since the product assortment hierarchy in an E-commerce platform contains large numbers of related products, in which the sales demand patterns can be correlated, our attempt is to incorporate this cross-series information in a unified model. We achieve this by globally training a Long Short-Term Memory network (LSTM) that exploits the non-linear demand relationships available in an E-commerce product assortment hierarchy. Aside from the forecasting framework, we also propose a systematic pre-processing framework to overcome the challenges in the E-commerce business. We also introduce several product grouping strategies to supplement the LSTM learning schemes, in situations where sales patterns in a product portfolio are disparate. We empirically evaluate the proposed forecasting framework on a real-world online marketplace dataset from Walmart.com. Our method achieves competitive results on category level and super-departmental level datasets, outperforming state-of-the-art techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hyndman, R., et al.: Forecasting with Exponential Smoothing: The State Space Approach. Springer, Heidelberg (2008). doi: 10.1007/978-3-540-71918-2

    Book  Google Scholar 

  2. Box, G.E.P., et al.: Time Series Analysis: Forecasting and Control. Wiley, Hoboken (2015)

    Google Scholar 

  3. Trapero, J.R., Kourentzes, N., Fildes, R.: On the identification of sales forecasting models in the presence of promotions. J. ORS 66, 299–307 (2015)

    Google Scholar 

  4. Borovykh, A., Bohte, S., Oosterlee, C.W.: Conditional time series forecasting with convolutional neural networks. arXiv [cs.AI] (2017)

    Google Scholar 

  5. Flunkert, V., Salinas, D., Gasthaus, J.: DeepAR: probabilistic forecasting with autoregressive recurrent networks. arXiv [cs.AI] (2017)

    Google Scholar 

  6. Wen, R. et al.: A multi-horizon quantile recurrent forecaster. arXiv [stat.ML] (2017)

    Google Scholar 

  7. Chapados, N.: Effective Bayesian modeling of groups of related count time series. In: Proceedings of the 31st ICML (2014)

    Google Scholar 

  8. Bandara, K., Bergmeir, C., Smyl, S.: Forecasting across time series databases using RNNs on groups of similar series: a clustering approach. arXiv [cs.LG] (2017)

    Google Scholar 

  9. Elman, J.L.: Finding structure in time. Cogn. Sci. 14(2), 179–211 (1990)

    Article  Google Scholar 

  10. Williams, R.J., Zipser, D.: Gradient-based learning algorithms for RNNs and their computational complexity (1995)

    Google Scholar 

  11. Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 2, 157–166 (1994)

    Article  Google Scholar 

  12. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  13. Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., et al. (eds.) Proceedings of the 27th NIPS (2014)

    Google Scholar 

  14. Ben Taieb, S., et al.: A review and comparison of strategies for multi-step ahead time series forecasting based on the NN5 forecasting competition. Expert Syst. Appl. 39(8), 7067–7083 (2012)

    Article  Google Scholar 

  15. Ord, K., Fildes, R.A., Kourentzes, N.: Principles of Business Forecasting, 2nd edn. Wessex Press Publishing Co., New York (2017)

    Google Scholar 

  16. Abadi, M., et al.: TensorFlow: Large-scale machine learning on heterogeneous distributed systems. arXiv [cs.DC] (2016)

    Google Scholar 

  17. Hutter, F., Hoos, H.H., Leyton-Brown, K.: Sequential model-based optimization for general algorithm configuration. In: Proceedings of the 5th International Conference on Learning and Intelligent Optimization, Italy, Rome, pp. 507–523 (2011)

    Google Scholar 

  18. Fernando, Bayesian-optimization: Bayesian Optimization of Hyper-parameters. Github (2017). https://bit.ly/2EssG1r

  19. Snoek, J., Larochelle, H., Adams, R.P.: Practical Bayesian optimization of machine learning algorithms. In: Proceedings of the 25th NIPS (2012)

    Google Scholar 

  20. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv [cs.LG] (2014)

    Google Scholar 

  21. Orabona, F., Tommasi, T.: Training deep networks without learning rates through coin betting. arXiv [cs.LG] (2017)

    Google Scholar 

  22. Hyndman, R.J., Khandakar, Y.: Automatic time series forecasting: the forecast package for R. J. Stat. Softw. 26(3), 1–22 (2008)

    Google Scholar 

  23. Hyndman, R.J., et al.: Time series features R package (2018). https://bit.ly/2GekHql

  24. Kornelson, K.P., Vajjiravel, M., Prasad, R., Clark, P.D., Najm, T.: Method and system for developing extract transform load systems for data warehouses (2006)

    Google Scholar 

  25. Makridakis, S., Spiliotis, E., Assimakopoulos, V.: The M4 competition: results, findings, conclusion and way forward. Int. J. Forecast. 34, 802–808 (2018)

    Article  Google Scholar 

  26. Weller, M., Crone, S.: Supply chain forecasting: best practices & benchmarking study. Technical report, Lancaster Centre for Forecasting (2012)

    Google Scholar 

  27. Taylor, S.J., Letham, B.: Forecasting at scale. Am. Stat. 72(1), 37–45 (2018)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kasun Bandara .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bandara, K., Shi, P., Bergmeir, C., Hewamalage, H., Tran, Q., Seaman, B. (2019). Sales Demand Forecast in E-commerce Using a Long Short-Term Memory Neural Network Methodology. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Lecture Notes in Computer Science(), vol 11955. Springer, Cham. https://doi.org/10.1007/978-3-030-36718-3_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-36718-3_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-36717-6

  • Online ISBN: 978-3-030-36718-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics