Skip to main content

Refutable/Inductive Learning from Neighbor Examples and Its Application to Decision Trees over Patterns

  • Chapter
  • First Online:
Progress in Discovery Science

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2281))

  • 496 Accesses

Abstract

The paper develops the theory of refutable/inductive learning as a foundation of discovery science from examples. We consider refutable/inductive language learning from positive examples, some of which may be incorrect. The error or incorrectness we consider is the one described uniformly in terms of a distance over strings. We define a k-neighbor closure of a language L as the collection of strings each of which is at most k distant from some string in L. In ordinary learning paradigm, a target language is assumed to belong to a hypothesis space without any guarantee. In this paper, we allow an inference machine to infer a neighbor closure instead of the original language as an admissible approximation. We formalize such kind of learning, and give some sufficient conditions for a hypothesis space.

As its application to concrete problems, we deal with languages defined by decision trees over patterns. The problem of learning decision trees over patterns has been studied from a viewpoint of knowledge discovery for Genome information processing in the framework of PAC learning from both positive and negative examples. We investigate their learnability in the limit from neighbor examples as well as refutable learnability from complete examples, i.e., from both positive and negative examples. Furthermore, we present some procedures which plays an important role for designing efficient learning algorithms for decision trees over regular patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. D. Angluin: Finding patterns common to a set of strings, in Proceedings of the Eleventh Annual Symposium on Theory of Computing (1979) 130–141.

    Google Scholar 

  2. D. Angluin: Inductive inference of formal languages from positive data, Information and Control 45 (1980) 117–135.

    Article  MATH  MathSciNet  Google Scholar 

  3. S. Arikawa, S. Kuhara, S. Miyano, Y. Mukouchi, Y. Shinohara and T. Shinohara: A machine discovery from amino acid sequences by decision trees over regular patterns, New Generation Computing 11(3,4) (1993) 361–375.

    Article  MATH  Google Scholar 

  4. H. Arimura, T. Shinohara and S. Otsuki: Finding minimal generalizations for unions of pattern languages and its application to inductive inference from positive data, in Proceedings of the Eleventh Symposium on Theoretical Aspects of Computer Science, Lecture Notes in Computer Science 775 (1994) 646–660.

    Google Scholar 

  5. J. Case and S. Jain: Synthesizing learners tolerant computable noisy data, in Proceedings of the Ninth International Conference on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 1501 (1998) 205–219.

    Google Scholar 

  6. A. Gabrielian: Pure grammars and pure languages, International Journal of Computer Mathematics 9 (1981) 3–16.

    Article  MATH  MathSciNet  Google Scholar 

  7. E.M. Gold: Language identification in the limit, Information and Control 10 (1967) 447–474.

    Article  MATH  Google Scholar 

  8. R. Hamming: “Coding and Information Theory,” Prentice Hall, englewood Cliffs, NJ (1982).

    Google Scholar 

  9. S. Jain: Program synthesis in the presence of infinite number of inaccuracies, in Proceedings of the Fifth International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 872 (1994) 333–348.

    Google Scholar 

  10. S. Jain: Learning with refutation, in Proceedings of the Ninth International Conference on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 1501 (1998) 291–305.

    Google Scholar 

  11. S. Kobayashi and T. Yokomori: On approximately identifying concept classes in the limit, in Proceedings of the Sixth International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 997 (1995) 298–312.

    Google Scholar 

  12. S. Lange and T. Zeugmann: Types of monotonic language learning and their characterization, in Proceedings of the Fifth Annual ACMWorkshop on Computational Learning Theory (1992) 377–390.

    Google Scholar 

  13. S. Lange and P. Watson: Machine discovery in the presence of incomplete or ambiguous data, in Proceedings of the Fifth International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 872 (1994) 438–452.

    Google Scholar 

  14. A. Lindenmayer: Developmental systems for cellular interactions, their languages and grammars, Journal of Theoretical Biology 21 (1971) 455–484.

    Article  Google Scholar 

  15. S. Miyano: Learning theory towards Genome informatics, in Proceedings of the Fourth International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 744 (1993) 19–36.

    Google Scholar 

  16. T. Moriyama and M. Sato: Properties of language classes with finite elasticity, in Proceedings of the Fourth International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 744 (1993) 187–196.

    Google Scholar 

  17. T. Motoki, T. Shinohara and K. Wright: The correct definition of finite elasticity: corrigendum to identification of unions, in Proceedings of the Fourth Annual ACM Workshop on Computational Learning Theory (1991) 375–375.

    Google Scholar 

  18. Y. Mukouchi: Inductive inference of an approximate concept from positive data, in Proceedings of the Fifth International Workshop on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 872 (1994) 484–499.

    Google Scholar 

  19. Y. Mukouchi and S. Arikawa: Towards a mathematical theory of machine discovery from facts, Theoretical Computer Science 137 (1995) 53–84.

    Article  MATH  MathSciNet  Google Scholar 

  20. Y. Mukouchi, I. Yamaue and S. Sato: Inferring a rewriting system from examples, in Proceedings of the First International Conference on Discovery Science, Lecture Notes in Artificial Intelligence 1532 (1998) 93–104.

    Google Scholar 

  21. Y. Mukouchi: Refutable inference with a restricted target class, Mathematica Japonica 49(3) (1999) 363–372.

    MATH  MathSciNet  Google Scholar 

  22. Y. Mukouchi and M. Sato: Language learning with a neighbor system, in Proceedings of the Third International Conference on Discovery Science, Lecture Notes in Artificial Intelligence 1967 (2000) 183–196.

    Google Scholar 

  23. Y. Mukouchi and M. Sato: Refutable language learning with a neighbor system, to appear in Theoretical Computer Science.

    Google Scholar 

  24. Y. Sakakibara and R. Siromoney: A noise model on learning sets of strings, in Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory (1992) 295–302.

    Google Scholar 

  25. R. A. Wagner and M. J. Fischer: The string-to-string correction problem, Journal of the ACM 21(1) (1974) 168–173.

    Article  MATH  MathSciNet  Google Scholar 

  26. M. Sato: Inductive inference of formal languages, Bulletin of Informatics and Cybernetics 27(1) (1995) 85–106.

    MATH  MathSciNet  Google Scholar 

  27. M. Sato, Y. Mukouchi and D. Zheng: Characteristic sets for unions of regular pattern languages and compactness, in Proceedings of the Ninth International Conference on Algorithmic Learning Theory, Lecture Notes in Artificial Intelligence 1501 (1998) 220–233.

    Google Scholar 

  28. M. Sato and Y. Mukouchi: Inductive inference of languages generated by regular pattern languages, in Proceedings of JSAI Meeting at Okinawa Convention Center (1999).

    Google Scholar 

  29. T. Shinohara: Polynomial time inference of pattern languages and its applications, in Proceedings of the Seventh IBM Symposium on Mathematical Foundations of Computer Sciences (1982) 191–209.

    Google Scholar 

  30. T. Shinohara: Inductive inference from negative data, Bulletin of Informatics and Cybernetics 21(3,4) (1985) 67–70.

    MathSciNet  MATH  Google Scholar 

  31. F. Stephan: Noisy inference and oracles, in Proceedings of the Sixth International Workshop on Algorithmic Learning Theory, Lecture Notes on Artificial Intelligence 997 (1995) 185–200.

    Google Scholar 

  32. R. M. Smullyan: “Elementary Formal Systems,” Princeton University Press (1961).

    Google Scholar 

  33. M. Terada, Y. Mukouchi and M. Sato: Inductive inference of decision trees over patterns from positive examples, Transactions of IEICE J83-D-I(1) 60–67, in Japanese.

    Google Scholar 

  34. K. Wright: Identifition of unions of languages drawn from an identifile class, in Proceedings of the Second Annual ACM Workshop on Computational Learning Theory (1989) 328–333.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Sato, M., Mukouchi, Y., Terada, M. (2002). Refutable/Inductive Learning from Neighbor Examples and Its Application to Decision Trees over Patterns. In: Arikawa, S., Shinohara, A. (eds) Progress in Discovery Science. Lecture Notes in Computer Science(), vol 2281. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45884-0_12

Download citation

  • DOI: https://doi.org/10.1007/3-540-45884-0_12

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43338-5

  • Online ISBN: 978-3-540-45884-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics