Constrained Sequential Pattern Knowledge in Multi-relational Learning

Ferreira, Carlos Abreu; Gama, João; Costa, Vítor Santos

doi:10.1007/978-3-642-24769-9_21

Constrained Sequential Pattern Knowledge in Multi-relational Learning

Carlos Abreu Ferreira²¹,
João Gama²² &
Vítor Santos Costa²³

Conference paper

1429 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7026))

Abstract

In this work we present XMuSer, a multi-relational framework suitable to explore temporal patterns available in multi-relational databases. XMuSer’s main idea consists of exploiting frequent sequence mining, using an efficient and direct method to learn temporal patterns in the form of sequences. Grounded on a coding methodology and on the efficiency of sequential miners, we find the most interesting sequential patterns available and then map these findings into a new table, which encodes the multi-relational timed data using sequential patterns. In the last step of our framework, we use an ILP algorithm to learn a theory on the enlarged relational database that consists on the original multi-relational database and the new sequence relation.

We evaluate our framework by addressing three classification problems. Moreover, we map each one of three different types of sequential patterns: frequent sequences, closed sequences or maximal sequences.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases, pp. 487–499. Morgan Kaufmann, Santiago de Chile (1994)
Google Scholar
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Eleventh International Conference on Data Engineering, Taipei, Taiwan, pp. 3–14 (1995)
Google Scholar
de Amo, S., Furtado, D.A.: First-order temporal pattern mining with regular expression constraints. Data & Knowledge Engineering 62(3), 401–420 (2007); including special issue: 20th Brazilian Symposium on Databases (SBBD 2005)
Article Google Scholar
Blockeel, H., Sebag, M.: Scalability and efficiency in multi-relational data mining. SIGKDD Explorations 5(1), 17–30 (2003)
Article Google Scholar
Davis, J., Burnside, E., Ramakrishnan, R., Costa, V.S., Shavlik, J.: View learning for statistical relational learning: With an application to mammography. In: Proc. of the 19th International Joint Conference on Artificial Intelligence, Professional Book Center, Edinburgh, Scotland, UK, pp. 677–683 (2005)
Google Scholar
Dehaspe, L., Toivonen, H.: Discovery of frequent DATALOG patterns. Data Mining and Knowledge Discovery 3(1), 7–36 (1999)
Article Google Scholar
Esposito, F., Di Mauro, N., Basile, T.M.A., Ferilli, S.: Multi-dimensional relational sequence mining. Fundamenta Informaticae 89(1), 23–43 (2009)
MATH Google Scholar
Ferreira, C.A., Gama, J., Costa, V.S.: Sequential Pattern Mining in Multi-Relational Datasets. In: Meseguer, P., Mandow, L., Gasca, R.M. (eds.) CAEPIA 2009. LNCS, vol. 5988, pp. 121–130. Springer, Heidelberg (2010)
Chapter Google Scholar
Garofalakis, M., Rastogi, R., Shim, K.: Mining sequential patterns with regular expression constraints. IEEE Transactions on Knowledge and Data Engineering 14(3), 530–552 (2002)
Article Google Scholar
Jian, P., Han, J., Mortazavi-asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.: Prefixspan: Mining sequential patterns efficiently by prefix-projected pattern growth. In: Proc. of the 17th International Conference on Data Engineering, pp. 215–224. IEEE Computer Society, Heidelberg (2001)
Chapter Google Scholar
Dan Lee, S., De Raedt, L.: Constraint Based Mining of First Order Sequences in SeqLog. In: Meo, R., Lanzi, P.L., Klemettinen, M. (eds.) Database Support for Data Mining Applications. LNCS (LNAI), vol. 2682, pp. 154–173. Springer, Heidelberg (2004)
Chapter Google Scholar
Ohara, K., Yoshida, T., Geamsakul, W., Motoda, H., Washio, T., Yokoi, H., Takabayashi, K.: Analysis of Hepatitis Dataset by Decision Tree Graph-Based Induction. In: Proceedings of Discovery Challenge, pp. 173–184 (2004)
Google Scholar
Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. In: Apers, P.M.G., Bouzeghoub, M., Gardarin, G. (eds.) EDBT 1996. LNCS, vol. 1057, pp. 3–17. Springer, Heidelberg (1996)
Google Scholar
Yan, X., Han, J., Afshar, R.: Clospan: Mining closed sequential patterns in large datasets. In: Proc. of the Third SIAM International Conference on Data Mining, pp. 166–177. SIAM, San Francisco (2003)
Google Scholar
Yu, L., Liu, H.: Feature selection for high-dimensional data: A fast correlation-based filter solution. In: 20th Int. Conf. on Machine Learning, pp. 856–863 (2003)
Google Scholar
Zelezny, F., Lavrac, N.: Propositionalization-Based Relational Subgroup Discovery with RSD. Machine Learning 62(1-2), 33–63 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

LIAAD-INESC and ISEP, Polytechnic Institute of Porto, Porto, Portugal
Carlos Abreu Ferreira
LIAAD-INESC and Faculty of Economics, University of Porto, Porto, Portugal
João Gama
CRACS-INESC and Faculty of Sciences, University of Porto, Porto, Portugal
Vítor Santos Costa

Authors

Carlos Abreu Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
João Gama
View author publications
You can also search for this author in PubMed Google Scholar
Vítor Santos Costa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculdade de Ciências, Departamento de Informática, GUESS/LabMAg/Universidade de Lisboa, Campo Grande, 749-016, Lisboa, Portugal
Luis Antunes
Department of Computer Science and Engineering, INESC-ID, Instituto Superior Técnico, IST, Avenida Rovisco Pais, 1049-001, Lisboa, Portugal
H. Sofia Pinto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ferreira, C.A., Gama, J., Costa, V.S. (2011). Constrained Sequential Pattern Knowledge in Multi-relational Learning. In: Antunes, L., Pinto, H.S. (eds) Progress in Artificial Intelligence. EPIA 2011. Lecture Notes in Computer Science(), vol 7026. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24769-9_21

Download citation

DOI: https://doi.org/10.1007/978-3-642-24769-9_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24768-2
Online ISBN: 978-3-642-24769-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics