skip to main content
10.1145/3269206.3271757acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Network-wide Crowd Flow Prediction of Sydney Trains via Customized Online Non-negative Matrix Factorization

Authors Info & Claims
Published:17 October 2018Publication History

ABSTRACT

Crowd Flow Prediction (CFP) is one major challenge in the intelligent transportation systems of the Sydney Trains Network. However, most advanced CFP methods only focus on entrance and exit flows at the major stations or a few subway lines, neglecting Crowd Flow Distribution (CFD) forecasting problem across the entire city network. CFD prediction plays an irreplaceable role in metro management as a tool that can help authorities plan route schedules and avoid congestion. In this paper, we propose three online non-negative matrix factorization (ONMF) models. ONMF-AO incorporates an Average Optimization strategy that adapts to stable passenger flows. ONMF-MR captures the Most Recent trends to achieve better performance when sudden changes in crowd flow occur. The Hybrid model, ONMF-H, integrates both ONMF-AO and ONMF-MR to exploit the strengths of each model in different scenarios and enhance the models' applicability to real-world situations. Given a series of CFD snapshots, both models learn the latent attributes of the train stations and, therefore, are able to capture transition patterns from one timestamp to the next by combining historic guidance. Intensive experiments on a large-scale, real-world dataset containing transactional data demonstrate the superiority of our ONMF models.

References

  1. Gowtham Atluri, Anuj Karpatne, and Vipin Kumar. 2017. Spatio-Temporal Data Mining: A Survey of Problems and Methods. arXiv:1711.04710 (2017). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Mathieu Blondel, Yotaro Kubo, and Ueda Naonori. 2014. Online passive-aggressive algorithms for non-negative matrix factorization and completion. In Artificial Intelligence and Statistics . 96--104.Google ScholarGoogle Scholar
  3. Bin Cao, Dou Shen, Jian-Tao Sun, Xuanhui Wang, Qiang Yang, and Zheng Chen. 2007. Detect and Track Latent Factors with Online Nonnegative Matrix Factorization.. In IJCAI , Vol. 7. 2689--2694. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Irina Ceapa, Chris Smith, and Licia Capra. 2012. Avoiding the crowds: understanding tube station congestion patterns from trip data. In Proceedings of the ACM SIGKDD international workshop on urban computing. ACM, 134--141. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Mu-Chen Chen and Yu Wei. 2011. Exploring time variants for short-term passenger flow. Journal of Transport Geography , Vol. 19, 4 (2011), 488--498.Google ScholarGoogle ScholarCross RefCross Ref
  6. Freddy Chong Tat Chua, Richard J Oentaryo, and Ee-Peng Lim. 2013. Modeling temporal adoptions using dynamic matrix factorization. In Data Mining (ICDM), 2013 IEEE 13th International Conference on. IEEE, 91--100.Google ScholarGoogle ScholarCross RefCross Ref
  7. Dingxiong Deng, Cyrus Shahabi, Ugur Demiryurek, Linhong Zhu, Rose Yu, and Yan Liu. 2016. Latent Space Model for Road Networks to Predict Time-Varying Traffic. In Proceedings of the 22nd ACM SIGKDD, August 13--17, 2016. 1525--1534. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Sheng Gao, Hao Luo, Da Chen, Shantao Li, Patrick Gallinari, and Jun Guo. 2013. Cross-domain recommendation via cluster-level latent factor model. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 161--176.Google ScholarGoogle ScholarCross RefCross Ref
  9. Aude Hofleitner, Ryan Herring, and Alexandre Bayen. 2012. Probability distributions of travel times on arterial networks: a traffic flow and horizontal queuing theory approach. In 91st Transportation Research Board Annual Meeting .Google ScholarGoogle Scholar
  10. Daniel D Lee and H Sebastian Seung. 2001. Algorithms for non-negative matrix factorization. In Advances in neural information processing systems. 556--562.Google ScholarGoogle Scholar
  11. Biao Leng, Jiabei Zeng, Zhang Xiong, Weifeng Lv, and Yueliang Wan. 2013. Probability tree based passenger flow prediction and its application to the Beijing subway system. Frontiers of Computer Science , Vol. 7, 2 (2013), 195--203. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Kun-Yu Lin, Chang-Dong Wang, Yu-Qin Meng, and Zhi-Lin Zhao. 2017. Multi-view unit intact space learning. In International Conference on Knowledge Science, Engineering and Management. Springer, 211--223.Google ScholarGoogle ScholarCross RefCross Ref
  13. Xiaolei Ma, Yao-Jan Wu, Yinhai Wang, Feng Chen, and Jianfeng Liu. 2013. Mining smart card data for transit riders' travel patterns. Transportation Research Part C: Emerging Technologies , Vol. 36 (2013), 1--12.Google ScholarGoogle ScholarCross RefCross Ref
  14. Ming Ni, Qing He, and Jing Gao. 2017. Forecasting the subway passenger flow under event occurrences with social media. IEEE Transactions on Intelligent Transportation Systems , Vol. 18, 6 (2017), 1623--1632.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Neal Parikh, Stephen Boyd, et almbox. 2014. Proximal algorithms. Foundations and Trends® in Optimization , Vol. 1, 3 (2014), 127--239. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Marie-Pier Pelletier, Martin Trépanier, and Catherine Morency. 2011. Smart card data use in public transit: A literature review. Transportation Research Part C: Emerging Technologies , Vol. 19, 4 (2011), 557--568.Google ScholarGoogle ScholarCross RefCross Ref
  17. Carl Edward Rasmussen and Christopher KI Williams. 2006. Gaussian processes for machine learning . Vol. 1. MIT press Cambridge.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Lijun Sun, Jian Gang Jin, Der-Horng Lee, Kay W Axhausen, and Alexander Erath. 2014. Demand-driven timetable design for metro services. Transportation Research Part C: Emerging Technologies , Vol. 46 (2014), 284--299.Google ScholarGoogle ScholarCross RefCross Ref
  19. Lijun Sun, Yang Lu, Jian Gang Jin, Der-Horng Lee, and Kay W Axhausen. 2015b. An integrated Bayesian approach for passenger flow assignment in metro networks. Transportation Research Part C: Emerging Technologies , Vol. 52 (2015), 116--131.Google ScholarGoogle ScholarCross RefCross Ref
  20. Yuxing Sun, Biao Leng, and Wei Guan. 2015a. A novel wavelet-SVM short-time passenger flow prediction in Beijing subway system. Neurocomputing , Vol. 166 (2015), 109--121. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Evelien Van Der Hurk, Leo Kroon, Gábor Maróti, and Peter Vervest. 2015. Deduction of passengers' route choices from smart card data. IEEE Transactions on Intelligent Transportation Systems , Vol. 16, 1 (2015), 430--440.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Fei Wang, Chenhao Tan, Ping Li, and Arnd Christian König. 2011. Efficient document clustering via online nonnegative matrix factorizations. In Proceedings of the 2011 SIAM International Conference on Data Mining. SIAM, 908--919.Google ScholarGoogle ScholarCross RefCross Ref
  23. Yu Wei and Mu-Chen Chen. 2012. Forecasting the short-term metro passenger flow with empirical mode decomposition and neural networks. Transportation Research Part C: Emerging Technologies , Vol. 21, 1 (2012), 148--162.Google ScholarGoogle ScholarCross RefCross Ref
  24. Billy M Williams and Lester A Hoel. 2003. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. Journal of transportation engineering , Vol. 129, 6 (2003), 664--672.Google ScholarGoogle ScholarCross RefCross Ref
  25. Huaxiu Yao, Xianfeng Tang, Hua Wei, Guanjie Zheng, Yanwei Yu, and Zhenhui Li. 2018a. Modeling Spatial-Temporal Dynamics for Traffic Prediction. arXiv preprint arXiv:1803.01254 (2018).Google ScholarGoogle Scholar
  26. Huaxiu Yao, Fei Wu, Jintao Ke, Xianfeng Tang, Yitian Jia, Siyu Lu, Pinghua Gong, and Jieping Ye. 2018b. Deep multi-view spatial-temporal network for taxi demand prediction. arXiv preprint arXiv:1802.08714 (2018).Google ScholarGoogle Scholar
  27. Xiuwen Yi, Yu Zheng, Junbo Zhang, and Tianrui Li. 2016. ST-MVL: filling missing values in geo-sensory time series data. (2016).Google ScholarGoogle Scholar
  28. Yu Zhang and Dit-Yan Yeung. 2012. Overlapping community detection via bounded nonnegative matrix tri-factorization. In Proceedings of the 18th ACM SIGKDD. ACM, 606--614. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Juanjuan Zhao, Qiang Qu, Fan Zhang, Chengzhong Xu, and Siyuan Liu. 2017. Spatio-Temporal Analysis of Passenger Travel Patterns in Massive Smart Card Data. IEEE Transactions on Intelligent Transportation Systems (2017).Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology (TIST) , Vol. 5, 3 (2014), 38. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Jingbo Zhou and Anthony KH Tung. 2015. Smiler: A semi-lazy time series prediction system for sensors. In 2015 ACM SIGMOD . 1871--1886. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Network-wide Crowd Flow Prediction of Sydney Trains via Customized Online Non-negative Matrix Factorization

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management
      October 2018
      2362 pages
      ISBN:9781450360142
      DOI:10.1145/3269206

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 October 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      CIKM '18 Paper Acceptance Rate147of826submissions,18%Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader