Methods for Mining Frequent Sequential Patterns

Jiang, Linhui; Hamilton, Howard J.

doi:10.1007/3-540-44886-1_38

Linhui Jiang⁵ &
Howard J. Hamilton⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2671))

Included in the following conference series:

Conference of the Canadian Society for Computational Studies of Intelligence

1065 Accesses
3 Citations

Abstract

We consider the problem of finding frequent subsequences in sequential data. We examine three algorithms using a trie with K levels. The O(K ² n) breadth-first (BF) algorithm inserts a pattern into the trie at level k only if level k-1 has been completed. The O(Kn) depth-first (DF) algorithm inserts a pattern and all its prefixes into the trie before examining another pattern. A threshold is used to store only frequent subsequences. Since DF cannot apply the threshold until the trie is complete, it makes poor use of memory. The heuristic depth-first (HDF) algorithm, a variant of DF, uses the threshold in the same manner as BF. HDF gains efficiency but loses a predictable amount of accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., and Srikant, R.,“Mining Sequential Patterns.” Proceedings IEEE International Conference on Data Engineering, Taipei, Taiwan, 1995.
Google Scholar
Hand, D. J., Mannila, H., and Smyth, P., Principles of Data Mining, MIT Press, Cambridge, Massachusetts, 2001.
Google Scholar
Jiang, L., “A Quick Look at Methods for Mining Long Subsequences”, Proceedings AI’2003, this volume.
Google Scholar
Srikant, R., and Agrawal, R., Mining Sequential Patterns: Generalizations and Performance Improvements. Research Report RJ9994, IBM Almaden Research Center, San Jose, California, December 1995.
Google Scholar
Vilo, J., Discovering Frequent Patterns from Strings, Technical Report C-1998-9, Department of Computer Science, University of Helsinki, FIN-00014, University of Helsinki, May 1998.
Google Scholar
Zaki, M. J., Fast Mining of Sequential Patterns in Very Large Databases, Technical Report 668, Computer Science Department, University of Rochester, Rochester, New York, Nov. 1999
Google Scholar

Download references

Author information

Authors and Affiliations

Departmentof Computer Science, University of Regina, Regina, SK, Canada, S4S 0A2
Linhui Jiang & Howard J. Hamilton

Authors

Linhui Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Howard J. Hamilton
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computing and Information Science, College of Physical and Engineering Science, University of Guelph, Guelph, Ontario, Canada, N1G 2W1
Yang Xiang
Dépt. Informatique-Génie Logiciel, Université Laval, Pavillon Pouliot, Ste-Foy, PQ, Canada, G1K 7P4
Brahim Chaib-draa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, L., Hamilton, H.J. (2003). Methods for Mining Frequent Sequential Patterns. In: Xiang, Y., Chaib-draa, B. (eds) Advances in Artificial Intelligence. Canadian AI 2003. Lecture Notes in Computer Science, vol 2671. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44886-1_38

Download citation

DOI: https://doi.org/10.1007/3-540-44886-1_38
Published: 27 May 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40300-5
Online ISBN: 978-3-540-44886-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics