Abstract
Finite-state transducers can be used to map a language onto a set of values. This paper proposes an alternate representation method for such a mapping, consisting of associating a finite-state automaton accepting the input language with a decision tree representing the output values. The advantages of this approach are that it leads to more compact representations than transducers, and that decision trees can easily be synthesized by machine learning techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aoe, J-I., Morimoto, K., Shishibori, M., and Park, K. A trie compaction algorithm for a large set of keys. IEEE Transaction on Knowledge and Data Engineering8, 3 (1996), 476–491.
Bentley, J., and Sedgewick B. Fast algorithms for sorting and searching strings. In Annual ACM-SIAM Symposium on Discrete Algorithms (1996), pp. 1–10.
Bescherelle. L’art de Conjuguer. Hatier, Paris, 1966.
Czech, Z. J., and Majewski, B. S. An optimal algorithm for generating minimal perfect hash function. Information Processing Letters 43 (1992), 257–264.
Daciuk, J., Mihov, S., Watson, B. W., and Watson, R. E. Incremental construction of finite-state automata. Association for Computational Linguistics 26, 1 (2000), 3–16.
Brain, M. D., and Tharp, A. L. Using trie to eliminate pattern collision in perfect hashing. IEEE Transaction on Knowledge and Data Engineering 6 (1994), 476–491.
Fatholahzadeh, A. DAWG-ID3: Retrieving key-information using graph and classification algorithms. In International Symposium on Database Technology & Software Engineering, WEB and Cooperative Systems (2000), Lasker G. E. and Gerhard W. (eds.), 117–124.
Fatholahzadeh, A. Experiments with automata and information gain. In International Conference on Implementation and Application of Automata (2000), Daley M., Eramian E., and Yu S. (eds.), 252.
Fatholahzadeh, A. Nationality word graph for fast information retrieval. In Collaborative Electronic Commerce Technology and Research (2000), Williams M. A. and P. Swatman P. (eds.), International Conference on Knowledge Representation, 1–12. Available via http://www.collecter.org/collUSA.
Fatholahzadeh, A. Tagging nationality words using automata. In ROMMAND: RObust Methods in Analysis of Natural Language Data (Swiss 2000), Ballim A., Pallotta V., and Ghorbel H. (eds.), 69–76.
Fatholahzadeh, A. Online treatment of official names. In International Conference on Artificial Intelligence (2001), Arabnia H. R.(ed.), 203–209.
Kempe, A. Factorizations of ambiguous finite-state transducers. In International Conference on Implementation and Application of Automata (2000), Daley M., Eramian M., and Yu S. (eds.), 157–164.
Mihov, S., and Murel, D. Direct construction of minimal acyclic sub-sequential transducers. In International Conference on Implementation and Application of Automata (2000), Daley M., Eramian E., and S. Yu (eds.), 150–156.
Mitchell, T. M. Machine Learning. McGraw-Hill, 1997.
Mohri, M. On some application of finite-state automata theory to natural language. Natural Language Engineering 2, 1 (1996), 1–20.
Mohri, M. Finite-state transducers in language and speech processing. Computational Linguistics 23, 2 (1997), 269–311.
Mohri, M. Generic ∈-removal algorithm for weighted automata. In International Conference on Implementation and Application of Automata (2000), Daley M., Eramian E., and Yu S. (eds.) 26–35.
Quinlan, R. C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.
Rozenberg G. and Salomaa A. (eds.) Handbook of Formal Language. Springer-Verlag, Berlin Heidelberg, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fatholahzadeh, A. (2003). Implementation of Dictionaries via Automata and Decision Trees. In: Champarnaud, JM., Maurel, D. (eds) Implementation and Application of Automata. CIAA 2002. Lecture Notes in Computer Science, vol 2608. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44977-9_9
Download citation
DOI: https://doi.org/10.1007/3-540-44977-9_9
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40391-3
Online ISBN: 978-3-540-44977-5
eBook Packages: Springer Book Archive