Skip to main content

Detecting Interesting Instances

  • Conference paper
  • First Online:
Pattern Detection and Discovery

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2447))

Abstract

Most valid rules that are learned from very large and high dimensional data sets are not interesting, but are already known to the users. The dominant model of the overall data set may well suppress the interesting local patterns. The search for interesting local patterns can be implemented by a two step learning approach which first acquires the global models before it focuses on the rest in order to detect local patterns. In this paper, three sets of interesting instances are distinguished. For these sets, the hypothesis space is enlarged in order to characterize local patterns in a second learning step.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rakesh Agrawal, Heikki Mannila, Ramakrishnan Srikant, Hannu Toivonen, and A. Inkeri Verkamo. Fast discovery of association rules. In Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, eds., Advances in Knowledge Discovery and Data Mining, chapter 12, pages 307–328. AAAI Press/The MIT Press, Cambridge Massachusetts, London England, 1996.

    Google Scholar 

  2. Peter Brockhausen and Katharina Morik. Direct access of an ILP algorithm to a database management system. In Bernhard Pfaringer and Johannes Fürnkranz, eds., Data Mining with Inductive Logic Programming (ILP for KDD), MLnet Sponsored Familiarization Workshop, pages 95–110, Bari, Italy, jul 1996.

    Google Scholar 

  3. L. DeRaedt and M. Bruynooghe. An overview of the interactive concept-learner and theory revisor CLINT. In Stephen Muggleton, ed., Inductive Logic Programming., number 38 in The A.P.I.C. Series, chapter 8, pages 163–192. Academic Press, London [u.a.], 1992.

    Google Scholar 

  4. T. Fawcett and F. Provost. Adaptive fraud detection. Data Mining and Knowledge Discovery, 1(3):291–316, 1997.

    Article  Google Scholar 

  5. P. A. Flach. A framework for inductive logic programming. In Stephen Muggleton, ed., Inductive Logic Programming., number 38 in The A.P.I.C. Series, chapter 9, pages 193–212. Academic Press, London [u.a.], 1992.

    Google Scholar 

  6. Dragan Gamberger and Nada Lavrac. Filtering noisy instances and outliers. In Huan Liu and Hiroshi Motoda, eds., Instance Selection and Construction for Data Mining, pages 375–394. Kluwer, 2001.

    Google Scholar 

  7. Isabelle Guyon, Nada Matic, and Vladimir Vapnik. Discovering informative patterns and data cleaning. In Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, eds., Advances in Knowledge Discovery and Data Mining, chapter 2, pages 181–204. AAAI Press/The MIT Press, Menlo Park, California, 1996.

    Google Scholar 

  8. David Hand, Heikki Mannila, and Padhraic Smyth. Principles of Data Mining. Massachusetts Institute of Technology, 2001.

    Google Scholar 

  9. Nicolas Helft. Inductive generalisation: A logical framework. In Procs. of the 2nd European Working Session on Learning, 1987.

    Google Scholar 

  10. J.-U. Kietz and S. Wrobel. Controlling the complexity of learning in logic through syntactic and task-oriented models. In Stephen Muggleton, ed., Inductive Logic Programming., number 38 in The A.P.I.C. Series, chapter 16, pages 335–360. Academic Press, London [u.a.], 1992.

    Google Scholar 

  11. Jörg Uwe Kietz. Induktive Analyse relationaler Daten. PhD thesis, Technische Universität Berlin, Berlin, oct 1996.

    Google Scholar 

  12. Willi Klösgen. Handbook of Knowledge Discovery and Data Mining, chapter Subgroup patterns. Oxford University Press, London, 2000. 2000 to appear.

    Google Scholar 

  13. Huan Liu and Hiroshi Motoda. Instance Selection and Construction for Data Mining. Kluwer Publishers, 2001.

    Google Scholar 

  14. Katharina Morik and Peter Brockhausen. A multistrategy approach to relational knowledge discovery in databases. Machine Learning Journal, 27(3):287–312, jun 1997.

    Article  MATH  Google Scholar 

  15. Gordon D. Plotkin. A note on inductive generalization. In B. Meltzer and D. Michie, eds., Machine Intelligence, chapter 8, pages 153–163. American Elsevier, 1970.

    Google Scholar 

  16. Tobias Scheffer and Stefan Wrobel. A Sequential Sampling Algorithm for a General Class of Utility Criteria. In Proceedings of the International Conference on Knowledge Discovery and Data Mining, 2000.

    Google Scholar 

  17. Stefan Wrobel. Concept Formation and Knowledge Revision. Kluwer Academic Publishers, Dordrecht, 1994.

    MATH  Google Scholar 

  18. Stefan Wrobel. An algorithm for multi-relational discovery of subgroups. In J. Komorowski and J. Zytkow, eds., Principles of Data Minig and Knowledge Discovery: First European Symposium (PKDD 97), pages 78–87, Berlin, New York, 1997. Springer.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Morik, K. (2002). Detecting Interesting Instances. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds) Pattern Detection and Discovery. Lecture Notes in Computer Science(), vol 2447. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45728-3_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-45728-3_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44148-9

  • Online ISBN: 978-3-540-45728-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics