Abstract
Sequential pattern mining is one of the most widespread data mining tasks with several real-life decision-making applications. In this mining process, constraints were added to improve the mining efficiency for discovering patterns meeting specific user requirements. Therefore, the temporal constraints, in particular, those that arise from the implicit temporality of sequential patterns, will have the ability to efficiently apply temporary restrictions such as, window and gap constraints. In this paper, we propose a novel window and gap constrained algorithms based on the well-known PrefixSpan algorithm. For this purpose, we introduce the virtual multiplication operation aiming for a generalized window mining algorithm that preserves other constraints. We also extend the PrefixSpan Pseudo-Projection algorithm to mining patterns under the gap-constraint. Our performance study shows that these extensions have the same time complexity as PrefixSpan and good linear scalability.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Specifically, we modify the appending candidate search function range coupled to the union of ranges generated from the intervals starting on the pointers used by the Pseudo-projections and ending \(\gamma \) positions away.
References
Alatrista-Salas, H., Bringay, S., Flouvat, F., Selmaoui-Folcher, N., Teisseire, M.: Spatio-sequential patterns mining: beyond the boundaries. Intell. Data Anal. 20(2), 293–316 (2016)
Chaudhari, M., Mehta, C.: Extension of prefix span approach with grc constraints for sequential pattern mining. In: 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT), pp. 2496–2498 (2016)
George, A., Binu, D.: DRL-prefixspan: a novel pattern growth algorithm for discovering downturn, revision and launch (DRL) sequential patterns. Cent. Eur. J. Comput. Sci. 2, 426–439 (2012)
Guevara-Cogorno, A., Flamand, C., Alatrista-Salas, H.: Copper - constraint optimized prefixspan for epidemiological research. Procedia Comput. Sci. 63, 433–438 (2015). https://doi.org/10.1016/j.procs.2015.08.364. The 6th International Conference on Emerging Ubiquitous Systems and Pervasive Networks (EUSPN 2015)/ The 5th International Conference on Current and Future Trends of Information and Communication Technologies in Healthcare (ICTH-2015)
He, D., Stone, L.: Spatio-temporal synchronization of recurrent epidemics. Proc. Biol. Sci. R. Soc. 270(1523), 1519–1526 (2003). https://doi.org/10.1098/rspb.2003.2366
Lin, M.-Y., Lee, S.-Y.: Efficient mining of sequential patterns with time constraints by delimited pattern growth. Knowl. Inf. Syst. 7(4), 499–514 (2004). https://doi.org/10.1007/s10115-004-0182-5
Mari, J.F., Ber, F.L.: Temporal and spatial data mining with second-order hidden markov models. Soft Comput. Fus. Found. Methodol. Appl. 10(5), 406–414 (2006)
Masseglia, F., Poncelet, P., Teisseire, M.: Efficient mining of sequential patterns with time constraints: reducing the combinations. Exp. Syst. Appl. 36(2), 2677–2690 (2009). https://doi.org/10.1016/j.eswa.2008.01.021
Mennis, J., Liu, J.W.: Mining association rules in spatio-temporal data: an analysis of urban socioeconomic and land cover change. Trans. GIS 9(1), 5–17 (2005). https://doi.org/10.1111/j.1467-9671.2005.00202.x
Pedreshi, D., Ruggieri, S., Turini, F.: Discrimination-aware data mining. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2008, pp. 560–568. Association for Computing Machinery, New York (2008). https://doi.org/10.1145/1401890.1401959
Pei, J., et al.: Mining sequential patterns by pattern-growth: the prefixspan approach. IEEE Trans. Knowl. Data Eng. 6(11), 1424–1440 (2004). https://doi.org/10.1109/TKDE.2004.77
Pei, J., et al.: Mining sequential patterns by pattern-growth: the prefixspan approach. Proc. IEEE TKDE 16(11), 1424–1440 (2004)
Srikant, R., Agrawal, R.: Mining sequential patterns: generalizations and performance improvements. In: Apers, P., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 1–17. Springer, Heidelberg (1996). https://doi.org/10.1007/BFb0014140
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Alatrista-Salas, H., Guevara-Cogorno, A., Maehara, Y., Nunez-del-Prado, M. (2020). Efficiently Mining Gapped and Window Constraint Frequent Sequential Patterns. In: Torra, V., Narukawa, Y., Nin, J., Agell, N. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2020. Lecture Notes in Computer Science(), vol 12256. Springer, Cham. https://doi.org/10.1007/978-3-030-57524-3_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-57524-3_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-57523-6
Online ISBN: 978-3-030-57524-3
eBook Packages: Computer ScienceComputer Science (R0)