Discovering Maximal Frequent Patterns in Sequence Groups

Guan, J. W.; Bell, David A.; Liu, Dayou

doi:10.1007/978-3-540-25929-9_74

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3066)

Included in the following conference series:

International Conference on Rough Sets and Current Trends in Computing

Abstract

In this paper, we give a general treatment of several kinds of sequences, such as customer sequences, document sequences, and DNA sequences. Large collections of transaction, document, and genomic information have been accumulated in recent years, and latent in them is potentially significant knowledge for exploitation in the retailing industry, in information retrieval, and in medicine and the pharmaceutical industry, respectively. The approach taken here to distilling such knowledge is to detect strings that appear frequently in sequences, either within a given sequence (e.g., for a particular customer, document, or patient) or across sequences (e.g., from different customers, documents, or patients sharing a particular transaction, information retrieval, or medical diagnosis, respectively).
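
The idea of detecting strings that appear frequently across a group of sequences can be illustrated with a minimal brute-force sketch. It is not the authors' algorithm, which this preview does not give. The sketch rests on three assumptions: a pattern is a contiguous substring, its support is the number of sequences in the group that contain it, and a frequent pattern is maximal when no longer frequent pattern contains it. The function names, the `min_support` parameter, and the sample DNA-like strings are all hypothetical.

```python
from collections import defaultdict


def frequent_substrings(sequences, min_support):
    """Map each contiguous substring to the number of sequences containing it,
    keeping only substrings found in at least min_support sequences."""
    counts = defaultdict(int)
    for seq in sequences:
        # Collect distinct substrings so each sequence counts at most once.
        seen = set()
        for i in range(len(seq)):
            for j in range(i + 1, len(seq) + 1):
                seen.add(seq[i:j])
        for sub in seen:
            counts[sub] += 1
    return {s: c for s, c in counts.items() if c >= min_support}


def maximal_patterns(frequent):
    """Keep frequent substrings that are not contained in a longer frequent one."""
    return sorted(
        p for p in frequent
        if not any(p != q and p in q for q in frequent)
    )


# Hypothetical sequence group (illustrative data only).
group = ["ACGTAC", "TACGTT", "GACGTA"]
frequent = frequent_substrings(group, min_support=3)
maximal = maximal_patterns(frequent)
print(maximal)  # ['ACGT', 'TA']
```

Enumerating every substring is quadratic in the sequence length, so this is only suitable for small examples. Counting repeats within a single sequence, the other case the abstract mentions, would require counting occurrences rather than distinct sequences.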

Author information

Authors and Affiliations

School of Computer Science, The Queen’s University of Belfast, BT7 1NN, Northern Ireland, UK
J. W. Guan & David A. Bell
College of Computer Science and Technology, Jilin University, 130012, Changchun, P.R. China
J. W. Guan & Dayou Liu

Editor information

Editors and Affiliations

Shimane University, 89-1 Enya-cho Izumo, 6938501, Shimane, Japan
Shusaku Tsumoto
Systems Research Institute, Polish Academy of Sciences, 01-447, Warsaw, Poland
Roman Słowiński
The Linnaeus Centre for Bioinformatics, Uppsala University, Uppsala, Sweden
Jan Komorowski
Institute of Computer Science, Polish Academy of Sciences, 01–237, Warsaw, Poland
Jerzy W. Grzymała-Busse

Cite this paper

Guan, J.W., Bell, D.A., Liu, D. (2004). Discovering Maximal Frequent Patterns in Sequence Groups. In: Tsumoto, S., Słowiński, R., Komorowski, J., Grzymała-Busse, J.W. (eds) Rough Sets and Current Trends in Computing. RSCTC 2004. Lecture Notes in Computer Science, vol 3066. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25929-9_74

DOI: https://doi.org/10.1007/978-3-540-25929-9_74
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22117-3
Online ISBN: 978-3-540-25929-9
eBook Packages: Springer Book Archive
