Updating the Built FUSP Trees with Sequence Deletion Based on Prelarge Concept

Lin, Chun-Wei; Gan, Wensheng; Hong, Tzung-Pei; Pan, Jeng-Shyang

doi:10.1007/978-3-662-45071-0_34

Chun-Wei Lin¹⁷,
Wensheng Gan,
Tzung-Pei Hong^18,19 &
…
Jeng-Shyang Pan¹⁷

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 473))

Included in the following conference series:

International Conference, MISNC

1088 Accesses

Abstract

Among various data mining techniques, sequential-pattern mining is used to discover the frequent subsequences from a sequence database. Most research handles the static database in batch mode to discover the desired sequential patterns. Transactions or customer sequences are, however, dynamically changed in real-world applications. In the past, the FUSP tree was designed to maintain and update the discovered information based on Fast UPdated (FUP) approach with sequence insertion and sequence deletion. The original customer sequences is still required to be rescanned if it is necessary. In this paper, the prelarge concept is adopted to maintain and update the built FUSP tree with sequence deletion. When the number of deleted customers is smaller than the safety bound of the prelarge concept, the original database is unnecessary to be rescanned but the sequential patterns can still be actually maintained and updated. Experiments are also conducted to show the performance of the proposed algorithm in terms of execution time and number of tree nodes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A Sequential Pattern Mining Framework (2010), http://www.philippe-fournier-viger.com/spmf/index.php
Agrawal, R., Imielinski, T., Swami, A.: Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineering 5, 914–925 (1993)
Article Google Scholar
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules in Large Databases. In: The International Conference on Very Large Data Bases, pp. 487–499 (1994)
Google Scholar
Agrawal, R., Srikant, R.: Mining Sequential Patterns. In: The International Conference on Data Engineering, pp. 3–14 (1995)
Google Scholar
Bodon, F.: A Fast Apriori Implementation. In: IEEE ICDM Workshop on Frequent Itemset Mining Implementations (2003)
Google Scholar
Chen, M.S., Han, J., Yu, P.S.: Data Mining: An Overview from a Database Perspective. IEEE Transactions on Knowledge and Data Engineering 8, 866–883 (1996)
Article Google Scholar
Cheung, D.W., Han, J., Ng, V., Wong, C.Y.: Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique. In: International Conference on Data Engineering, pp. 106–114 (1996)
Google Scholar
Cheung, D.W., Lee, S.D., Kao, B.: A General Incremental Technique for Maintaining Discovered Association Rules. In: The International Conference on Database Systems for Advanced Applications, pp. 185–194 (1997)
Google Scholar
Guyet, T., Quiniou, R.: Extracting Temporal Patterns from Interval-Based Sequences. In: The International Joint Conference on Artificial Intelligence, pp. 1306–1311 (2011)
Google Scholar
Han, J., Pei, J., Yin, Y., Mao, R.: Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach. Data Mining and Knowledge Discovery 8, 53–87 (2004)
Article MathSciNet Google Scholar
Hong, T.P., Wang, C.Y., Tao, Y.H.: A New Incremental Data Mining Algorithm using Pre-large Itemsets. Intelligent Data Analysis 5, 111–129 (2001)
MATH Google Scholar
Hong, T.P., Lin, C.W., Wu, Y.L.: Incrementally Fast Updated Frequent Pattern Trees. Expert Systems with Applications 34, 2424–2435 (2008)
Article Google Scholar
Hong, T.P., Wang, C.Y., Tseng, S.S.: An Incremental Mining Algorithm for Maintaining Sequential Patterns using Pre-large Sequences. Expert Systems with Applications 38, 7051–7058 (2011)
Article Google Scholar
Kim, C., Lim, J.H., Ng, R.T., Shim, K.: Squire: Sequential Pattern Mining with Quantities. Journal of Systems and Software 80, 1726–1745 (2007)
Article Google Scholar
Lin, C.W., Hong, T.P., Lu, W.H., Lin, W.Y.: An Incremental FUSP-Tree Maintenance Algorithm. In: The International Conference on Intelligent Systems Design and Applications, pp. 445–449 (2008)
Google Scholar
Lin, C.W., Hong, T.P., Lu, W.H.: An Efficient FUSP-Tree Update Algorithm for Deleted Data in Customer Sequences. In: International Conference on Innovative Computing, Information and Control, pp. 1491–1494 (2009)
Google Scholar
Lin, M.Y., Lee, S.Y.: Incremental Update on Sequential Patterns in Large Databases. In: IEEE International Conference on Tools with Artificial Intelligence, pp. 24–31 (1998)
Google Scholar
Mooney, C.H., Roddick, J.F.: Sequential Pattern Mining - Approaches and Algorithms. ACM Computing Surveys 45, 1–39 (2013)
Article Google Scholar
Pei, J., Han, J., Mortazavi-Asl, B., Wang, J., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: Mining Sequential Patterns by Pattern-Growth: the PrefixSpan Approach. IEEE Transactions on Knowledge and Data Engineering 16, 1424–1440 (2004)
Article Google Scholar
Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. In: Apers, P.M.G., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 3–17. Springer, Heidelberg (1996)
Google Scholar
Wang, C.Y., Hong, T.P., Tseng, S.S.: Maintenance of Sequential Patterns for Record Deletion. In: IEEE International Conference on Data Mining, pp. 536–541 (2001)
Google Scholar
Zaki, M.J.: SPADE: An Efficient Algorithm for Mining Frequent Sequences. Machine Learning 42, 31–60 (2001)
Article MATH Google Scholar
Zheng, Z., Kohavi, R., Mason, L.: Real World Performance of Association Rule Algorithms. In: ACM International Conference on Knowledge Discovery and Data Mining, pp. 401–406 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Shenzhen Key Laboratory of Internet Information Collaboration, School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, HIT Campus Shenzhen University Town, Xili, Shenzhen, P.R. China
Chun-Wei Lin & Jeng-Shyang Pan
Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan, R.O.C.
Tzung-Pei Hong
Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung, Taiwan, R.O.C.
Tzung-Pei Hong

Authors

Chun-Wei Lin
View author publications
You can also search for this author in PubMed Google Scholar
Wensheng Gan
View author publications
You can also search for this author in PubMed Google Scholar
Tzung-Pei Hong
View author publications
You can also search for this author in PubMed Google Scholar
Jeng-Shyang Pan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

National University of Kaohsiung, Taiwan
Leon Shyue-Liang Wang
Chung-Ang University, Seoul, Korea
Jason J. June
Dept of Electrical Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan
Chung-Hong Lee
Osaka University, Japan
Koji Okuhara
National University of Kaohsiung, Dept of Information Management, Kaohsiung, Taiwan
Hsin-Chang Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lin, CW., Gan, W., Hong, TP., Pan, JS. (2014). Updating the Built FUSP Trees with Sequence Deletion Based on Prelarge Concept. In: Wang, L.SL., June, J.J., Lee, CH., Okuhara, K., Yang, HC. (eds) Multidisciplinary Social Networks Research. MISNC 2014. Communications in Computer and Information Science, vol 473. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45071-0_34

Download citation

DOI: https://doi.org/10.1007/978-3-662-45071-0_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45070-3
Online ISBN: 978-3-662-45071-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics