Synonyms
Frequent subsequences
Definition
A sequence database D = {S1, S2,…,Sn} for sequential pattern mining consists of n input sequences (where n ≥ 1), and an input sequence Si = 〈ei1, ei2, … , eim〉(1 ≤ i ≤ n) is an ordered list of m events (where m ≥1). Each event\( {e}_{i_j}\left(1\le i\le n,1\le j\le m\right) \) is a non-empty set of items. Given two sequences, Sa = 〈ea1, ea2, … , eak〉 and Sb = 〈eb1, eb2, … , ebl〉, if k ≤ l and there exist integers 1≤x1<x2< … < xk ≤l such that \( {e}_{a1}\subseteq {e}_{b_{x1}},{e}_{a2}\subseteq {e}_{b_{x2}},\ldots,{e}_{ak}\subseteq {e}_{b{{}_x}_k},{S}_b \) is said to contain Sa (or equivalently, Sa is said to be contained in Sb). The number of input sequences in D that contain sequence S is called the support of S in D, denoted by supD (S). Given a user-specified minimum support threshold min_sup, S is called a sequential pattern (or a frequent subsequence) in D if supD (S)≥min_sup. If there exists no proper supersequence of a sequential pattern S...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Agrawal R, Srikant R. Mining sequential patterns. In: Proceedings of the 11th International Conference on Data Engineering; 1995.
Aggarwal CC, Ta N, Wang J, Feng J, Zaki MJ. XProj: a framework for projected structural clustering of XML documents. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2007.
Ayres J, Gehrke J, Yiu T, Flannick J. Sequential pattern mining using a bitmap representation. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2002.
Han J, Pei J, Mortazavi-Asl B, Chen Q, Dayal U, Hsu MC. FreeSpan: frequent pattern-projected sequential pattern mining. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2000.
Li Z, Chen Z, Srinivasan S, Zhou Y. C-Miner: mining block correlations in storage systems. In: Proceedings of the 3rd USENIX Conference of on File and Storage Technologies; 2004.
Li Z, Lu S, Myagmar S, Zhou Y. CP-Miner: finding copy-paste and related bugs in large-scale software code. IEEE Trans Softw Eng. 2006;32(3):176–92.
Lo D, Khoo SC SMArTIC: towards building an accurate, robust and scalable specification miner. In: Proceedings of the 14th ACM SIGSOFT International Symposium on Foundations of Software Engineering; 2006.
Pei J, Han J, Mortazavi-Asl B, Pinto H, Chen Q, Dayal U, Hsu MC. PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern-growth. In: Proceedings of the 17th International Conference on Data Engineering; 2001.
She R, Chen F, Wang K, Ester M, Gardy JL, Brinkman FSL. Frequent-subsequence-based prediction of outer membrane proteins. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2003.
Srikant R, Agrawal R Mining sequential patterns: generalizations and performance improvements. In: Advances in Database Technology, Proceedings of the 5th International Conference on Extending Database Technology; 1996.
Sun G, Liu X, Cong G, Zhou M, Xiong Z, Lee J, Lin CY. Detecting erroreous sentences using automatically mined sequential patterns. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics; 2007.
Wang J, Han J, Li C. Frequent closed sequence mining without candidate maintenance. IEEE Trans Knowl Data Eng. 2007;19(8):1042–56.
Xie T, Pei J. Data mining for software engineering. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2006.
Yan X, Han J, Afshar R CloSpan: mining closed sequential patterns in large databases. In: Proceedings of the 2003 SIAM International Conference on Data Mining; 2003.
Zaki MJ. SPADE: an efficient algorithm for mining frequent sequences. Mach Learn. 2001;42(1/2):31–60.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Wang, J. (2018). Sequential Patterns. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_343
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_343
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering