ABSTRACT
Crowd Flow Prediction (CFP) is one major challenge in the intelligent transportation systems of the Sydney Trains Network. However, most advanced CFP methods only focus on entrance and exit flows at the major stations or a few subway lines, neglecting Crowd Flow Distribution (CFD) forecasting problem across the entire city network. CFD prediction plays an irreplaceable role in metro management as a tool that can help authorities plan route schedules and avoid congestion. In this paper, we propose three online non-negative matrix factorization (ONMF) models. ONMF-AO incorporates an Average Optimization strategy that adapts to stable passenger flows. ONMF-MR captures the Most Recent trends to achieve better performance when sudden changes in crowd flow occur. The Hybrid model, ONMF-H, integrates both ONMF-AO and ONMF-MR to exploit the strengths of each model in different scenarios and enhance the models' applicability to real-world situations. Given a series of CFD snapshots, both models learn the latent attributes of the train stations and, therefore, are able to capture transition patterns from one timestamp to the next by combining historic guidance. Intensive experiments on a large-scale, real-world dataset containing transactional data demonstrate the superiority of our ONMF models.
- Gowtham Atluri, Anuj Karpatne, and Vipin Kumar. 2017. Spatio-Temporal Data Mining: A Survey of Problems and Methods. arXiv:1711.04710 (2017). Google ScholarDigital Library
- Mathieu Blondel, Yotaro Kubo, and Ueda Naonori. 2014. Online passive-aggressive algorithms for non-negative matrix factorization and completion. In Artificial Intelligence and Statistics . 96--104.Google Scholar
- Bin Cao, Dou Shen, Jian-Tao Sun, Xuanhui Wang, Qiang Yang, and Zheng Chen. 2007. Detect and Track Latent Factors with Online Nonnegative Matrix Factorization.. In IJCAI , Vol. 7. 2689--2694. Google ScholarDigital Library
- Irina Ceapa, Chris Smith, and Licia Capra. 2012. Avoiding the crowds: understanding tube station congestion patterns from trip data. In Proceedings of the ACM SIGKDD international workshop on urban computing. ACM, 134--141. Google ScholarDigital Library
- Mu-Chen Chen and Yu Wei. 2011. Exploring time variants for short-term passenger flow. Journal of Transport Geography , Vol. 19, 4 (2011), 488--498.Google ScholarCross Ref
- Freddy Chong Tat Chua, Richard J Oentaryo, and Ee-Peng Lim. 2013. Modeling temporal adoptions using dynamic matrix factorization. In Data Mining (ICDM), 2013 IEEE 13th International Conference on. IEEE, 91--100.Google ScholarCross Ref
- Dingxiong Deng, Cyrus Shahabi, Ugur Demiryurek, Linhong Zhu, Rose Yu, and Yan Liu. 2016. Latent Space Model for Road Networks to Predict Time-Varying Traffic. In Proceedings of the 22nd ACM SIGKDD, August 13--17, 2016. 1525--1534. Google ScholarDigital Library
- Sheng Gao, Hao Luo, Da Chen, Shantao Li, Patrick Gallinari, and Jun Guo. 2013. Cross-domain recommendation via cluster-level latent factor model. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 161--176.Google ScholarCross Ref
- Aude Hofleitner, Ryan Herring, and Alexandre Bayen. 2012. Probability distributions of travel times on arterial networks: a traffic flow and horizontal queuing theory approach. In 91st Transportation Research Board Annual Meeting .Google Scholar
- Daniel D Lee and H Sebastian Seung. 2001. Algorithms for non-negative matrix factorization. In Advances in neural information processing systems. 556--562.Google Scholar
- Biao Leng, Jiabei Zeng, Zhang Xiong, Weifeng Lv, and Yueliang Wan. 2013. Probability tree based passenger flow prediction and its application to the Beijing subway system. Frontiers of Computer Science , Vol. 7, 2 (2013), 195--203. Google ScholarDigital Library
- Kun-Yu Lin, Chang-Dong Wang, Yu-Qin Meng, and Zhi-Lin Zhao. 2017. Multi-view unit intact space learning. In International Conference on Knowledge Science, Engineering and Management. Springer, 211--223.Google ScholarCross Ref
- Xiaolei Ma, Yao-Jan Wu, Yinhai Wang, Feng Chen, and Jianfeng Liu. 2013. Mining smart card data for transit riders' travel patterns. Transportation Research Part C: Emerging Technologies , Vol. 36 (2013), 1--12.Google ScholarCross Ref
- Ming Ni, Qing He, and Jing Gao. 2017. Forecasting the subway passenger flow under event occurrences with social media. IEEE Transactions on Intelligent Transportation Systems , Vol. 18, 6 (2017), 1623--1632.Google ScholarDigital Library
- Neal Parikh, Stephen Boyd, et almbox. 2014. Proximal algorithms. Foundations and Trends® in Optimization , Vol. 1, 3 (2014), 127--239. Google ScholarDigital Library
- Marie-Pier Pelletier, Martin Trépanier, and Catherine Morency. 2011. Smart card data use in public transit: A literature review. Transportation Research Part C: Emerging Technologies , Vol. 19, 4 (2011), 557--568.Google ScholarCross Ref
- Carl Edward Rasmussen and Christopher KI Williams. 2006. Gaussian processes for machine learning . Vol. 1. MIT press Cambridge.Google ScholarDigital Library
- Lijun Sun, Jian Gang Jin, Der-Horng Lee, Kay W Axhausen, and Alexander Erath. 2014. Demand-driven timetable design for metro services. Transportation Research Part C: Emerging Technologies , Vol. 46 (2014), 284--299.Google ScholarCross Ref
- Lijun Sun, Yang Lu, Jian Gang Jin, Der-Horng Lee, and Kay W Axhausen. 2015b. An integrated Bayesian approach for passenger flow assignment in metro networks. Transportation Research Part C: Emerging Technologies , Vol. 52 (2015), 116--131.Google ScholarCross Ref
- Yuxing Sun, Biao Leng, and Wei Guan. 2015a. A novel wavelet-SVM short-time passenger flow prediction in Beijing subway system. Neurocomputing , Vol. 166 (2015), 109--121. Google ScholarDigital Library
- Evelien Van Der Hurk, Leo Kroon, Gábor Maróti, and Peter Vervest. 2015. Deduction of passengers' route choices from smart card data. IEEE Transactions on Intelligent Transportation Systems , Vol. 16, 1 (2015), 430--440.Google ScholarDigital Library
- Fei Wang, Chenhao Tan, Ping Li, and Arnd Christian König. 2011. Efficient document clustering via online nonnegative matrix factorizations. In Proceedings of the 2011 SIAM International Conference on Data Mining. SIAM, 908--919.Google ScholarCross Ref
- Yu Wei and Mu-Chen Chen. 2012. Forecasting the short-term metro passenger flow with empirical mode decomposition and neural networks. Transportation Research Part C: Emerging Technologies , Vol. 21, 1 (2012), 148--162.Google ScholarCross Ref
- Billy M Williams and Lester A Hoel. 2003. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. Journal of transportation engineering , Vol. 129, 6 (2003), 664--672.Google ScholarCross Ref
- Huaxiu Yao, Xianfeng Tang, Hua Wei, Guanjie Zheng, Yanwei Yu, and Zhenhui Li. 2018a. Modeling Spatial-Temporal Dynamics for Traffic Prediction. arXiv preprint arXiv:1803.01254 (2018).Google Scholar
- Huaxiu Yao, Fei Wu, Jintao Ke, Xianfeng Tang, Yitian Jia, Siyu Lu, Pinghua Gong, and Jieping Ye. 2018b. Deep multi-view spatial-temporal network for taxi demand prediction. arXiv preprint arXiv:1802.08714 (2018).Google Scholar
- Xiuwen Yi, Yu Zheng, Junbo Zhang, and Tianrui Li. 2016. ST-MVL: filling missing values in geo-sensory time series data. (2016).Google Scholar
- Yu Zhang and Dit-Yan Yeung. 2012. Overlapping community detection via bounded nonnegative matrix tri-factorization. In Proceedings of the 18th ACM SIGKDD. ACM, 606--614. Google ScholarDigital Library
- Juanjuan Zhao, Qiang Qu, Fan Zhang, Chengzhong Xu, and Siyuan Liu. 2017. Spatio-Temporal Analysis of Passenger Travel Patterns in Massive Smart Card Data. IEEE Transactions on Intelligent Transportation Systems (2017).Google ScholarDigital Library
- Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology (TIST) , Vol. 5, 3 (2014), 38. Google ScholarDigital Library
- Jingbo Zhou and Anthony KH Tung. 2015. Smiler: A semi-lazy time series prediction system for sensors. In 2015 ACM SIGMOD . 1871--1886. Google ScholarDigital Library
Index Terms
- Network-wide Crowd Flow Prediction of Sydney Trains via Customized Online Non-negative Matrix Factorization
Recommendations
Exploiting Multiple Correlations Among Urban Regions for Crowd Flow Prediction
AbstractCrowd flow prediction has become a strategically important task in urban computing, which is the prerequisite for traffic management, urban planning and public safety. However, due to variousness of crowd flows, multiple hidden correlations among ...
STGs: construct spatial and temporal graphs for citywide crowd flow prediction
AbstractCrowd flow prediction is one of the most remarkable issues in a wide range of areas, from traffic control to public safety, and aims to forecast the inflow and outflow of crowds in each region of a city. Most existing studies adopt CNN and its ...
Deep alternating non-negative matrix factorisation
AbstractNon-negative matrix factorisation (NMF) is a promising data-mining technique for non-negative data. NMF achieves feature extraction by factorising the original data matrix into a basis matrix and coding matrix both with non-negative entries. ...
Comments