Abstract
In this paper we present eg-GRIDS, an algorithm for inducing context-free grammars that is able to learn from positive sample sentences. The presented algorithm, similar to its GRIDS predecessors, uses simplicity as a criterion for directing inference, and a set of operators for exploring the search space. In addition to the basic beam search strategy of GRIDS, eg-GRIDS incorporates an evolutionary grammar selection process, aiming to explore a larger part of the search space. Evaluation results are presented on artificially generated data, comparing the performance of beam search and genetic search. These results show that genetic search performs better than beam search while being significantly more efficient computationally.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Angluin, D.: Inference of reversible languages. Journal of ACM 29, 741–765 (1982)
Emerald, J.D., Subramanian, K.G., Thomas, D.G.: Learning Code regular and Code linear languages. In: Miclet, L., de la Higuera, C. (eds.) ICGI 1996. LNCS, vol. 1147, pp. 211–221. Springer, Heidelberg (1996)
Garc’ia, P., Vidal, E.: Inference of K-testable languages in the strict sense and applications to syntactic pattern recognition. Journal of IEEE Transactions on Pattern Analysis and Machine Intelligence 12(9), 920–925 (1990)
Hopcroft, J., Ullman, J.: Introduction to Automata Theory, Languages and Computation. Addison – Wesley, Reading (1979)
Koshiba, T., Makinen, E., Takada, Y.: Inferring pure context-free languages from positive data., Technical report A-1997-14, Department of Computer Science, University of Tampere (1997)
Langley, P., Stromsten, S.: Learning Context-Free Grammars with a Simplicity Bias. In: Lopez de Mantaras, R., Plaza, E. (eds.) ECML 2000. LNCS (LNAI), vol. 1810, pp. 220–228. Springer, Heidelberg (2000)
Mäkinen, F.: On the structural grammatical inference problem for some classes of context-free grammars. Information Processing Letters 42, 193–199 (1992)
Nakamura, K., Ishiwata, T.: Synthesizing Context Free Grammars from Simple Strings Based on Inductive CYK Algorithm. In: Oliveira, A.L. (ed.) ICGI 2000. LNCS (LNAI), vol. 1891, pp. 186–195. Springer, Heidelberg (2000)
Parekh, R., Honavar, V.: Grammar Inference, Automata Induction, and Language Acquisition. In: Dale, R., Moisl, H., Somers, H. (eds.) Handbook of Natural Language Processing, ch.29, pp. 727–764. Marcel Dekker Inc, New York (2000)
Petasis, G., Paliouras, G., Karkaletsis, V., Halatsis, C., Spyropoulos, C.D.: e- GRIDS: Computationally Efficient Grammatical Inference from Positive Examples. Grammars, Special Issue (2004), Available from http://217.125.102.104/special4.asp
Rissanen, J.: Stochastic Complexity in Statistical Inquiry. World Scientific Publishing Co, Singapore (1989)
Rulot, H., Vidal, E., Devijer, Kittler: Modelling (sub)string-length-based constraints through grammatical inference methods. Springer, Heidelberg (1987)
Sakakibara, Y.: Efficient learning of context-free grammars from positive structural examples. Information and Computation 97, 23–60 (1992)
Sakakibara, Y., Muramatsu, H.: Learning Context-Free Grammars from Partially Structured Examples. In: Oliveira, A.L. (ed.) ICGI 2000. LNCS (LNAI), vol. 1891, pp. 229–240. Springer, Heidelberg (2000)
Stolcke, A.: Bayesian Learning of Probabilistic Language Models., PhD Thesis, University of California at Berkley (1994)
Stolcke, A., Omohundro, S.: Inducing Probabilistic Grammars by Bayesian Model Merging. In: Carrasco, R.C., Oncina, J. (eds.) ICGI 1994. LNCS, vol. 862, pp. 106–118. Springer, Heidelberg (1994)
Tanida, N., Yokomori, T.: AII 1994 and ALT 1994. LNCS, vol. 872, pp. 560–573. Springer, Heidelberg (1994)
Wolff, G.: Grammar Discovery as data compression. In: Proceedings of the AISB/GI Conference on Artificial Intelligence, pp. 375–379. Hamburg, West Germany (1978)
Wolff, G.: Language Acquisition, Data Compression and Generalisation. Language and Communication 2, 57–89 (1982)
Yokomori, T.: On Polynomial-Time Learnability in the Limit of Strictly Deterministic Automata. Journal of Machine Learning 19, 153–179 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Petasis, G., Paliouras, G., Spyropoulos, C.D., Halatsis, C. (2004). eg-GRIDS: Context-Free Grammatical Inference from Positive Examples Using Genetic Search. In: Paliouras, G., Sakakibara, Y. (eds) Grammatical Inference: Algorithms and Applications. ICGI 2004. Lecture Notes in Computer Science(), vol 3264. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30195-0_20
Download citation
DOI: https://doi.org/10.1007/978-3-540-30195-0_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23410-4
Online ISBN: 978-3-540-30195-0
eBook Packages: Springer Book Archive