Abstract
We consider the problem of learning a finite automaton from positive evidence with a recurrent neural network. We train an Elman recurrent neural network on a set of sentences from a language and extract a finite automaton by clustering the hidden states of the trained network. We observe that the generalizations beyond the training set in the language recognized by the extracted automaton are due to the training regime: the network performs a "loose" minimization of the prefix DFA of the training set, the automaton that has one state for each prefix of the sentences in the set.
This research is supported by DARPA/AFOSR and DARPA under contracts No. F49620-97-1-0485 and No. N66001-96-C-8504. The U.S. Government is authorized to reproduce and distribute reprints for governmental purposes notwithstanding any copyright notation hereon. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of DARPA or the U.S. Government.
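As a concrete illustration of the two constructions the abstract names, the following minimal Python sketch builds the prefix DFA of a training set and then extracts a smaller automaton by clustering the vector reached after each prefix. It is not the authors' code: the encode stand-in for the trained Elman network's hidden state, the toy language, and the cluster count k are assumptions made only for illustration.

    import numpy as np

    def prefix_dfa(sentences):
        # One state per distinct prefix of the training sentences; the empty
        # prefix is the start state and full sentences are accepting.
        prefixes = {()}
        delta = {}                      # (prefix, symbol) -> extended prefix
        accepting = set()
        for s in sentences:
            p = ()
            for sym in s:
                q = p + (sym,)
                prefixes.add(q)
                delta[(p, sym)] = q
                p = q
            accepting.add(p)
        return sorted(prefixes, key=len), delta, accepting

    def kmeans(vectors, k, iters=100, seed=0):
        # Plain k-means; a stand-in for whatever clustering is applied to
        # the network's hidden-state vectors.
        rng = np.random.default_rng(seed)
        centroids = vectors[rng.choice(len(vectors), size=k, replace=False)]
        for _ in range(iters):
            dist = ((vectors[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
            labels = dist.argmin(axis=1)
            for j in range(k):
                if (labels == j).any():
                    centroids[j] = vectors[labels == j].mean(axis=0)
        return labels

    def extract_dfa(sentences, encode, k):
        # Clusters of hidden states become the states of the extracted DFA,
        # merging prefix-DFA states and hence generalizing beyond the
        # training set.
        prefixes, delta, accepting = prefix_dfa(sentences)
        labels = kmeans(np.stack([encode(p) for p in prefixes]), k)
        cluster = dict(zip(prefixes, labels))
        merged = {}
        for (p, sym), q in delta.items():
            # Conflicting cluster transitions would make the result
            # nondeterministic; a full extraction must resolve or report them.
            merged.setdefault((int(cluster[p]), sym), int(cluster[q]))
        return int(cluster[()]), merged, {int(cluster[p]) for p in accepting}

    # Toy run with a hypothetical encoder in place of the trained network:
    sentences = [("a", "b"), ("a", "a", "b"), ("a", "a", "a", "b")]
    encode = lambda p: np.array([len(p) % 2, float(p[-1:] == ("b",))])
    print(extract_dfa(sentences, encode, k=3))

In this sketch, choosing k equal to the number of distinct hidden vectors reproduces the prefix DFA itself; shrinking k forces state merges, which is where the generalization beyond the training set described in the abstract comes from.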
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Firoiu, L., Oates, T., Cohen, P.R. (1998). Learning a deterministic finite automaton with a recurrent neural network. In: Honavar, V., Slutzki, G. (eds) Grammatical Inference. ICGI 1998. Lecture Notes in Computer Science, vol 1433. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0054067
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64776-8
Online ISBN: 978-3-540-68707-8