research-article

Network-wide Crowd Flow Prediction of Sydney Trains via Customized Online Non-negative Matrix Factorization

Authors:
Yongshun Gong, University of Technology, Sydney, Sydney, Australia
Zhibin Li, University of Technology, Sydney, Sydney, Australia
Jian Zhang, University of Technology, Sydney, Sydney, Australia
Wei Liu, University of Technology, Sydney, Sydney, Australia
Yu Zheng, JD Finance, Beijing, Beijing, China
Christina Kirsch, Sydney Trains-Operational Technology, Sydney, Australia

CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, October 2018, Pages 1243–1252
https://doi.org/10.1145/3269206.3271757

Published: 17 October 2018

ABSTRACT

Crowd Flow Prediction (CFP) is a major challenge in the intelligent transportation systems of the Sydney Trains Network. However, most advanced CFP methods focus only on entrance and exit flows at major stations or on a few subway lines, neglecting the Crowd Flow Distribution (CFD) forecasting problem across the entire city network. CFD prediction plays an irreplaceable role in metro management as a tool that helps authorities plan route schedules and avoid congestion. In this paper, we propose three online non-negative matrix factorization (ONMF) models. ONMF-AO incorporates an Average Optimization strategy that adapts to stable passenger flows. ONMF-MR captures the Most Recent trends to achieve better performance when sudden changes in crowd flow occur. The hybrid model, ONMF-H, integrates ONMF-AO and ONMF-MR to exploit the strengths of each in different scenarios and to broaden the models' applicability to real-world situations. Given a series of CFD snapshots, the models learn the latent attributes of the train stations and can therefore capture transition patterns from one timestamp to the next by combining historic guidance. Extensive experiments on a large-scale, real-world dataset of transactional data demonstrate the superiority of our ONMF models.
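
To make the factorization idea concrete, here is a minimal sketch. It is not the authors' ONMF-AO, ONMF-MR, or ONMF-H, whose objectives are not given in this abstract. It assumes each CFD snapshot is a non-negative station-by-station matrix X and approximates it as W H using the classical Lee and Seung multiplicative updates, warm-starting each snapshot from the factors of the previous one. The function names, the toy 3-station data, the rank, and the step count are all illustrative assumptions.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]


def frobenius_error(x, w, h):
    """Squared Frobenius norm of x - w h."""
    approx = matmul(w, h)
    return sum((xv - av) ** 2
               for xr, ar in zip(x, approx) for xv, av in zip(xr, ar))


def nmf_step(x, w, h, eps=1e-9):
    """One round of multiplicative updates for min ||x - w h||_F^2, w, h >= 0."""
    wt = transpose(w)
    num, den = matmul(wt, x), matmul(matmul(wt, w), h)
    h = [[hv * n / (d + eps) for hv, n, d in zip(hr, nr, dr)]
         for hr, nr, dr in zip(h, num, den)]
    ht = transpose(h)
    num, den = matmul(x, ht), matmul(w, matmul(h, ht))
    w = [[wv * n / (d + eps) for wv, n, d in zip(wr, nr, dr)]
         for wr, nr, dr in zip(w, num, den)]
    return w, h


def online_nmf(snapshots, rank, steps=50, seed=0):
    """Factor each snapshot in turn, reusing the previous factors as the start."""
    rng = random.Random(seed)
    n, m = len(snapshots[0]), len(snapshots[0][0])
    # Strictly positive initial factors (assumed initialization scheme).
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    errors = []
    for x in snapshots:
        for _ in range(steps):
            w, h = nmf_step(x, w, h)
        errors.append(frobenius_error(x, w, h))
    return w, h, errors


# Hypothetical toy data: entry [i][j] counts trips from station i to station j.
snapshots = [
    [[0, 12, 5], [10, 0, 7], [4, 8, 0]],
    [[0, 14, 6], [11, 0, 9], [5, 7, 0]],
]
w, h, errors = online_nmf(snapshots, rank=2)
print("reconstruction error per snapshot:", errors)
```

Warm-starting is only the simplest way to carry history from one timestamp to the next; the paper's models use their own mechanisms for that, which this sketch does not attempt to reproduce.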

Index Terms

1. Information systems
  1. Information systems applications
    1. Spatial-temporal systems

Published in
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management
October 2018
2362 pages
ISBN: 9781450360142
DOI: 10.1145/3269206
General Chair:
Alfredo Cuzzocrea, University of Trieste, Italy
Program Chairs:
James Allan, University of Massachusetts, USA
Norman Paton, University of Manchester, United Kingdom
Divesh Srivastava, AT&T Labs Research, USA
Rakesh Agrawal, Data Insights Lab, USA
Andrei Broder, Google Research, USA
Mohammed Zaki, Rensselaer Polytechnic Institute, USA
Selcuk Candan, Arizona State University, USA
Alexandros Labrinidis, University of Pittsburgh, USA
Assaf Schuster, Technion, Israel
Haixun Wang, Google Research, USA
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
city trains network
crowd flow prediction
online non-negative matrix factorization
Acceptance Rates
CIKM '18 Paper Acceptance Rate: 147 of 826 submissions, 18%
Overall Acceptance Rate: 1,861 of 8,427 submissions, 22%
Article Metrics
- Total Citations: 35
- Total Downloads: 467
- Downloads (Last 12 months): 37
- Downloads (Last 6 weeks): 4