DOI: 10.1145/3085504.3085511
research-article

Incremental Temporal Pattern Mining Using Efficient Batch-Free Stream Clustering

Published: 27 June 2017

ABSTRACT

This paper addresses the problem of temporal pattern mining from multiple data streams containing temporal events. Temporal events are considered as real-world events aligned with comprehensive starting and ending timing information rather than simple integer timestamps. Predefined relations, such as "before" and "after", describe the heterogeneous relationships hidden in temporal data with only limited diversity. In this work, the relationships among events are instead learned dynamically from the temporal information. Each event is treated as an object with a label and numerical attributes. An online-offline model is used as the primary structure for analyzing the evolving multiple streams. Different distance functions on temporal events and sequences can be applied depending on the application scenario. A prefix tree is introduced for fast incremental pattern updates.
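To make the role of the prefix tree concrete, the following is a minimal sketch of incremental pattern counting, not the authors' data structure. It assumes patterns are plain sequences of event labels and that support is a simple occurrence count; the names `PatternNode`, `PatternTree`, `add`, and `frequent` are hypothetical. The abstract does not say what the tree's nodes hold, so this sketch ignores clustering and temporal attributes.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Tuple


@dataclass
class PatternNode:
    """One node of the prefix tree; the path from the root spells a pattern."""
    count: int = 0
    children: Dict[str, "PatternNode"] = field(default_factory=dict)


class PatternTree:
    """Hypothetical prefix tree storing support counts of label sequences."""

    def __init__(self) -> None:
        self.root = PatternNode()

    def add(self, sequence: Iterable[str]) -> None:
        """Incrementally count every prefix of a newly observed sequence."""
        node = self.root
        for label in sequence:
            node = node.children.setdefault(label, PatternNode())
            node.count += 1

    def frequent(self, min_support: int) -> List[Tuple[Tuple[str, ...], int]]:
        """Return all patterns whose count is at least min_support.

        A child's count never exceeds its parent's, so an infrequent
        node prunes its whole subtree.
        """
        found = []
        stack = [((), self.root)]
        while stack:
            prefix, node = stack.pop()
            for label, child in node.children.items():
                if child.count >= min_support:
                    pattern = prefix + (label,)
                    found.append((pattern, child.count))
                    stack.append((pattern, child))
        return sorted(found)


tree = PatternTree()
for seq in (["A", "B", "C"], ["A", "B"], ["A", "C"]):
    tree.add(seq)  # each arrival only touches one root-to-leaf path
print(tree.frequent(2))  # [(('A',), 3), (('A', 'B'), 2)]
```

Each new sequence updates only the nodes along its own path, so the pattern set is maintained without reprocessing earlier data.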

Events in the real world usually persist for some period. It is more natural to model events as intervals with temporal information rather than as points on the timeline. Based on the representation proposed in this work, our approach can also be extended to handle interval data. Experiments show how the method, with richer information and more accurate results than the state of the art, processes both point-based and interval-based event streams efficiently.
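One way to picture a representation that covers both point-based and interval-based events is sketched below. It is an illustration under stated assumptions, not the paper's definition: the class `TemporalEvent`, the helper `point_event`, and the function `event_distance` are hypothetical names. The Euclidean distance over start and end times is only one of the "different distance functions" the abstract says can be applied.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TemporalEvent:
    """An event as an object with a label and numerical temporal attributes."""
    label: str
    start: float
    end: float

    @property
    def duration(self) -> float:
        return self.end - self.start


def point_event(label: str, timestamp: float) -> TemporalEvent:
    """Assumption: a point event is modeled as a zero-length interval."""
    return TemporalEvent(label, timestamp, timestamp)


def event_distance(a: TemporalEvent, b: TemporalEvent) -> float:
    """Assumed distance: infinite across labels, Euclidean on (start, end)."""
    if a.label != b.label:
        return math.inf
    return math.hypot(a.start - b.start, a.end - b.end)


tv_evening = TemporalEvent("tv_on", 0.0, 3.0)
tv_night = TemporalEvent("tv_on", 3.0, 7.0)
print(event_distance(tv_evening, tv_night))  # 5.0
print(point_event("door_open", 2.0).duration)  # 0.0
```

Because a point event is simply an interval whose start and end coincide, the same distance function serves both kinds of stream in this sketch.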

      Published in
      SSDBM '17: Proceedings of the 29th International Conference on Scientific and Statistical Database Management
      June 2017
      373 pages
      ISBN: 9781450352826
      DOI: 10.1145/3085504

      Copyright © 2017 ACM

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate: 56 of 146 submissions, 38%
