Detecting Interesting Instances

Morik, Katharina

doi:10.1007/3-540-45728-3_2

Katharina Morik²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2447))

461 Accesses
11 Citations

Abstract

Most valid rules that are learned from very large and high dimensional data sets are not interesting, but are already known to the users. The dominant model of the overall data set may well suppress the interesting local patterns. The search for interesting local patterns can be implemented by a two step learning approach which first acquires the global models before it focuses on the rest in order to detect local patterns. In this paper, three sets of interesting instances are distinguished. For these sets, the hypothesis space is enlarged in order to characterize local patterns in a second learning step.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Rakesh Agrawal, Heikki Mannila, Ramakrishnan Srikant, Hannu Toivonen, and A. Inkeri Verkamo. Fast discovery of association rules. In Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, eds., Advances in Knowledge Discovery and Data Mining, chapter 12, pages 307–328. AAAI Press/The MIT Press, Cambridge Massachusetts, London England, 1996.
Google Scholar
Peter Brockhausen and Katharina Morik. Direct access of an ILP algorithm to a database management system. In Bernhard Pfaringer and Johannes Fürnkranz, eds., Data Mining with Inductive Logic Programming (ILP for KDD), MLnet Sponsored Familiarization Workshop, pages 95–110, Bari, Italy, jul 1996.
Google Scholar
L. DeRaedt and M. Bruynooghe. An overview of the interactive concept-learner and theory revisor CLINT. In Stephen Muggleton, ed., Inductive Logic Programming., number 38 in The A.P.I.C. Series, chapter 8, pages 163–192. Academic Press, London [u.a.], 1992.
Google Scholar
T. Fawcett and F. Provost. Adaptive fraud detection. Data Mining and Knowledge Discovery, 1(3):291–316, 1997.
Article Google Scholar
P. A. Flach. A framework for inductive logic programming. In Stephen Muggleton, ed., Inductive Logic Programming., number 38 in The A.P.I.C. Series, chapter 9, pages 193–212. Academic Press, London [u.a.], 1992.
Google Scholar
Dragan Gamberger and Nada Lavrac. Filtering noisy instances and outliers. In Huan Liu and Hiroshi Motoda, eds., Instance Selection and Construction for Data Mining, pages 375–394. Kluwer, 2001.
Google Scholar
Isabelle Guyon, Nada Matic, and Vladimir Vapnik. Discovering informative patterns and data cleaning. In Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, eds., Advances in Knowledge Discovery and Data Mining, chapter 2, pages 181–204. AAAI Press/The MIT Press, Menlo Park, California, 1996.
Google Scholar
David Hand, Heikki Mannila, and Padhraic Smyth. Principles of Data Mining. Massachusetts Institute of Technology, 2001.
Google Scholar
Nicolas Helft. Inductive generalisation: A logical framework. In Procs. of the 2nd European Working Session on Learning, 1987.
Google Scholar
J.-U. Kietz and S. Wrobel. Controlling the complexity of learning in logic through syntactic and task-oriented models. In Stephen Muggleton, ed., Inductive Logic Programming., number 38 in The A.P.I.C. Series, chapter 16, pages 335–360. Academic Press, London [u.a.], 1992.
Google Scholar
Jörg Uwe Kietz. Induktive Analyse relationaler Daten. PhD thesis, Technische Universität Berlin, Berlin, oct 1996.
Google Scholar
Willi Klösgen. Handbook of Knowledge Discovery and Data Mining, chapter Subgroup patterns. Oxford University Press, London, 2000. 2000 to appear.
Google Scholar
Huan Liu and Hiroshi Motoda. Instance Selection and Construction for Data Mining. Kluwer Publishers, 2001.
Google Scholar
Katharina Morik and Peter Brockhausen. A multistrategy approach to relational knowledge discovery in databases. Machine Learning Journal, 27(3):287–312, jun 1997.
Article MATH Google Scholar
Gordon D. Plotkin. A note on inductive generalization. In B. Meltzer and D. Michie, eds., Machine Intelligence, chapter 8, pages 153–163. American Elsevier, 1970.
Google Scholar
Tobias Scheffer and Stefan Wrobel. A Sequential Sampling Algorithm for a General Class of Utility Criteria. In Proceedings of the International Conference on Knowledge Discovery and Data Mining, 2000.
Google Scholar
Stefan Wrobel. Concept Formation and Knowledge Revision. Kluwer Academic Publishers, Dordrecht, 1994.
MATH Google Scholar
Stefan Wrobel. An algorithm for multi-relational discovery of subgroups. In J. Komorowski and J. Zytkow, eds., Principles of Data Minig and Knowledge Discovery: First European Symposium (PKDD 97), pages 78–87, Berlin, New York, 1997. Springer.
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, LS VIII, Univ. Dortmund, Germany
Katharina Morik

Authors

Katharina Morik
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics, Imperial College of Science, Technology and Medicine, Huxley Building, 180 Queen’s Gate, SW7 2BZ, London, UK
David J. Hand , Niall M. Adams & Richard J. Bolton , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Morik, K. (2002). Detecting Interesting Instances. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds) Pattern Detection and Discovery. Lecture Notes in Computer Science(), vol 2447. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45728-3_2

Download citation

DOI: https://doi.org/10.1007/3-540-45728-3_2
Published: 02 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44148-9
Online ISBN: 978-3-540-45728-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics