Abstract
This paper describes a formalization based on tree automata for incremental learning of context-free grammars from positive samples of their structural descriptions. A structural description of a context-free grammar is a derivation tree of the grammar in which labels are removed. The tree automata based learning in this paradigm is early introduced by Sakakibara in 1992, however his scheme assumes that all training examples are available to the learning algorithm at the beginning (i.e., it cannot be employed as an online learning) and also it doesn’t optimize the storage requirements as well. Our model has several desirable features that runs in O(n 3) time in the sum of the sizes of the input examples, obtains O(n) storage space saving, achieves good incremental behavior by updating a guess incrementally and infers a grammar from positive-only examples efficiently. Several examples and experimental results are given to illustrate the scheme and its efficient execution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aho, A.V., Hopcroft, J.E., Ullman, J.D.: Data Structures and Algorithms. Addison-Wesley, Reading (1983)
Angluin, D.: Inference of Reversible Languages. J. ACM 29, 741–765 (1982)
Crespi-Reghizzi, S., Melkanoff, M.A., Lichten, L.: The use of Grammatical Inference for Designing Programming Languages. Comm. ACM 16, 83–90 (1973)
Gold, E.M.: Language Identification in the Limit. Inform. and Control 10, 447–474 (1967)
de la Higuera, C.: Current Trends in Grammatical Inference. In: Amin, A., Pudil, P., Ferri, F.J., Iñesta, J.M. (eds.) SPR 2000 and SSPR 2000. LNCS, vol. 1876, pp. 28–31. Springer, Heidelberg (2000)
Higuera, C., de la Higuera, C.: A Bibliographical Study of Grammatical Inference. Pattern Recognition 38, 1332–1348 (2005)
Lee, S.: Learning of Context-Free Languages: A Survey of the Literatures. Technical Report TR-12-96, Centre for Research in Computing Technology, Harvard University Cambridge, Massachusetts (1996)
Pereira, F., Schabes, Y.: Inside-Outside Reestimation for Partially Bracketed Corpora. In: Proc. 30th Ann. Meeting of the Assoc. for the Comput. Linguistics, pp. 128–135 (1992)
Richetin, M., Vernadat, F.: Efficient Regular Grammatical Inference for Pattern Recognition. Pattern Recognition 17(2), 245–250 (1984)
Sakakibara, Y.: Efficient Learning of Context-Free Grammars from Positive Structural Examples. Inform. and Comput. 97, 23–60 (1992)
Sakakibara, Y.: Grammatical Inference in Bioinformatics. IEEE Transactions on Pattern Analysis and Machine Intelligence 27, 1051–1062 (2005)
Sakakibara, Y., Brown, M., Underwood, R.C., Mian, I.S., Haussler, D.: Stochastic CFGs for Modeling RNA. In: Proc. of the 27th Hawaii Int. Conf. on Syst. Sciences, pp. 284–293 (1994)
Sakakibara, Y.: Recent Advances of Grammatical Inference. Theoret. Comput. Sci. 185, 15–45 (1997)
Searls, D.: The Language of Genes. Nature 420, 211–217 (2002)
Valiant, L.G.: A Theory of the Learnable. Comm. ACM 27, 1134–1142 (1984)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Prajapati, G.L., Chaudhari, N.S., Chandwani, M. (2008). Efficient Incremental Model for Learning Context-Free Grammars from Positive Structural Examples. In: Darzentas, J., Vouros, G.A., Vosinakis, S., Arnellos, A. (eds) Artificial Intelligence: Theories, Models and Applications. SETN 2008. Lecture Notes in Computer Science(), vol 5138. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87881-0_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-87881-0_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87880-3
Online ISBN: 978-3-540-87881-0
eBook Packages: Computer ScienceComputer Science (R0)