Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

Shibata, Takeshi; Yoshinaka, Ryo; Chikayama, Takashi

doi:10.1007/11894841_28

Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

Takeshi Shibata²¹,
Ryo Yoshinaka²² &
Takashi Chikayama²¹

Conference paper

749 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4264))

Abstract

Recently, some non-regular subclasses of context-free grammars have been found to be efficiently learnable from positive data. In order to use these efficient algorithms to infer probabilistic languages, one must take into account not only equivalences between languages but also probabilistic generalities of grammars. The probabilistic generality of a grammar G is the class of the probabilistic languages generated by probabilistic grammars constructed on G. We introduce a subclass of simple grammars (SGs), referred as to unifiable simple grammars (USGs), which is a superclass of an efficiently learnable class, right-unique simple grammars (RSGs). We show that the class of RSGs is unifiable within the class of USGs, whereas SGs and RSGs are not unifiable within the class of SGs and RSGs, respectively. We also introduce simple context-free decision processes, which are a natural extension of finite Markov decision processes and intuitively may be thought of a Markov decision process with stacks. We propose a reinforcement learning method on simple context-free decision processes, as an application of the learning and unification algorithm for RSGs from positive data.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Angluin, D.: Inductive inference of formal languages from positive data. Information and Control 45, 117–135 (1980)
Article MATH MathSciNet Google Scholar
Angluin, D.: Inference of reversible languages. Journal of the Association for Computing Machinery 29, 741–765 (1982)
MATH MathSciNet Google Scholar
Barto, A.G., Mahadevan, S.: Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems: Theory and Applications 13, 41–77 (2003)
Article MATH MathSciNet Google Scholar
Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-dynamic Programming. Athena Scientific, Sec. 5.6 (1996)
Google Scholar
Hirshfeld, Y., Jerrum, M., Moller, F.: A polynomial algorithm for deciding bisimilarity of normed context-free processes. Theoretical Computer Science 158, 143–159 (1996)
Article MATH MathSciNet Google Scholar
Kobayashi, S.: Iterated transductions and efficient learning from positive data: A unifying view. In: Oliveira, A.L. (ed.) ICGI 2000. LNCS, vol. 1891, pp. 157–170. Springer, Heidelberg (2000)
Chapter Google Scholar
Kaelbling, L.P., Littman, M.L., Cassandra, A.R.: Planning and acting in partially observable stochastic domains. Artificial Intelligence 101, 99–134 (1998)
Article MATH MathSciNet Google Scholar
Sakakibara, Y.: Recent advances of grammatical inference. Theoretical Computer Science 185, 15–45 (1997)
Article MATH MathSciNet Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Wakatsuki, M., Teraguchi, K., Tomita, E.: Polynomial time identification of strict deterministic restricted one-counter automata in some class from positive data. In: Paliouras, G., Sakakibara, Y. (eds.) ICGI 2004. LNCS, vol. 3264, pp. 260–272. Springer, Heidelberg (2004)
Chapter Google Scholar
Wetherell, C.S.: Probabilistic languages: A review and some open questions. Computing Surveys 12(4), 361–379 (1980)
Article MATH MathSciNet Google Scholar
Yokomori, T.: Polynomial-time identification of very simple grammars from positive data. Theoretical Computer Science 298, 179–206 (2003)
Article MATH MathSciNet Google Scholar
Yoshinaka, R.: Polynomial-Time Identification of an Extension of Very Simple Grammars from Positive Data. In: Sakakibara, Y., Kobayashi, S., Sato, K., Nishino, T., Tomita, E. (eds.) ICGI 2006. LNCS, vol. 4201, pp. 45–58. Springer, Heidelberg (2006)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronic Engineering, the University of Tokyo,
Takeshi Shibata & Takashi Chikayama
Graduate School of Interdisciplinary Information Studies, the University of Tokyo,
Ryo Yoshinaka

Authors

Takeshi Shibata
View author publications
You can also search for this author in PubMed Google Scholar
Ryo Yoshinaka
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Chikayama
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departament de Llenguatges i Sistemes Informàtics Laboratori d’Algorísmica Relacional, Complexitat i Aprenentatge, Universitat Politècnica de Catalunya, Barcelona,
José L. Balcázar
Google, 1600 Amphitheatre Parkway, 94043, Mountain View, CA, USA
Philip M. Long
Department of Computer Science and Department of Mathematics, National University of Singapore, 117543, Singapore, Republic of Singapore
Frank Stephan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shibata, T., Yoshinaka, R., Chikayama, T. (2006). Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning. In: Balcázar, J.L., Long, P.M., Stephan, F. (eds) Algorithmic Learning Theory. ALT 2006. Lecture Notes in Computer Science(), vol 4264. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11894841_28

Download citation

DOI: https://doi.org/10.1007/11894841_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46649-9
Online ISBN: 978-3-540-46650-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics