Combining Pattern Discovery and Probabilistic Modeling in Data Mining

Mannila, Heikki

doi:10.1007/3-540-45471-3_2

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2368)

Included in the following conference series:

Scandinavian Workshop on Algorithm Theory

Abstract

Data mining has in recent years emerged as an interesting area on the boundary between algorithms, probabilistic modeling, statistics, and databases. Data mining research has come from two different traditions. The global approach aims at modeling the joint distribution of the data, while the local approach aims at efficient discovery of frequent patterns from the data. Among the global modeling techniques, mixture models have emerged as a strong unifying theme, and methods exist for fitting such models to large data sets. For pattern discovery, methods for finding frequently occurring positive conjunctions have been applied in various domains. An interesting open issue is how to combine the two approaches, e.g., by inferring joint distributions from pattern frequencies. Some promising results have been achieved using maximum entropy approaches. In this talk we describe some basic techniques in the global and local approaches to data mining and present a selection of open problems.
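
As an illustration of the idea of inferring a joint distribution from pattern frequencies, the following is a minimal sketch and not the method presented in the talk. It assumes binary (0/1) data, treats a pattern as a positive conjunction of attributes (an itemset), and fits a maximum entropy distribution with plain iterative proportional fitting over all 2^n states, which is only feasible for a handful of attributes. The function names, the toy data, and the choice of constraints are all hypothetical.

```python
from itertools import product


def itemset_frequency(rows, itemset):
    """Fraction of rows in which every attribute in `itemset` equals 1."""
    return sum(all(r[i] for i in itemset) for r in rows) / len(rows)


def maxent_from_itemsets(n_attrs, constraints, sweeps=200):
    """Fit a maximum entropy distribution by iterative proportional fitting.

    `constraints` maps a non-empty itemset (tuple of attribute indices) to
    its target frequency. Assumption: every target lies strictly between
    0 and 1 and the targets are mutually consistent.
    """
    states = list(product((0, 1), repeat=n_attrs))
    p = {x: 1.0 / len(states) for x in states}  # start from uniform
    for _ in range(sweeps):
        for itemset, target in constraints.items():
            # Current probability that all attributes of the itemset are 1.
            current = sum(p[x] for x in states if all(x[i] for i in itemset))
            for x in states:
                if all(x[i] for i in itemset):
                    p[x] *= target / current
                else:
                    p[x] *= (1.0 - target) / (1.0 - current)
    return p


# Toy data set (hypothetical): 8 rows over 3 binary attributes.
rows = [
    (1, 1, 0), (1, 1, 1), (0, 1, 0), (1, 0, 0),
    (0, 0, 1), (1, 1, 0), (0, 1, 1), (1, 1, 1),
]

# Local step: measure the frequencies of a few chosen patterns.
patterns = [(0,), (1,), (2,), (0, 1)]
constraints = {s: itemset_frequency(rows, s) for s in patterns}

# Global step: the maximum entropy joint distribution matching them.
p = maxent_from_itemsets(3, constraints)
```

The fitted `p` reproduces the chosen pattern frequencies while assuming nothing further about the data. With only single-attribute constraints, the same procedure returns the independence model.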

Author information

Authors and Affiliations

HIIT Basic Research Unit, University of Helsinki, Department of Computer Science, PO Box 26, FIN-00014, Finland
Heikki Mannila
Laboratory of Computer and Information Science, Helsinki University of Technology, PO Box 5400, FIN-02015, HUT, Finland
Heikki Mannila

Editor information

Editors and Affiliations

Department of Computer Science and Applied Mathematics, University of Kuopio, P.O. Box 1627, 70211, Kuopio, Finland
Martti Penttonen
BRICS, University of Aarhus, Department of Computer Science, Ny Munkegade, 8000, Aarhus C, Denmark
Erik Meineche Schmidt

About this paper

Cite this paper

Mannila, H. (2002). Combining Pattern Discovery and Probabilistic Modeling in Data Mining. In: Penttonen, M., Schmidt, E.M. (eds) Algorithm Theory — SWAT 2002. SWAT 2002. Lecture Notes in Computer Science, vol 2368. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45471-3_2

DOI: https://doi.org/10.1007/3-540-45471-3_2
Published: 21 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43866-3
Online ISBN: 978-3-540-45471-7
eBook Packages: Springer Book Archive
