Inferring Deterministic Linear Languages

  • Colin de la Higuera
  • Jose Oncina
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2375)


Linearity and determinism seem to be two essential conditions for polynomial language learning to be possible. We compare several definitions of deterministic linear grammars, and for a reasonable definition prove the existence of a canonical normal form. This enables us to obtain positive learning results in case of polynomial learning from a given set of both positive and negative examples. The resulting class is the largest one for which this type of results has been obtained so far.


Polynomial Time Regular Language Terminal Symbol Grammatical Inference Regular Grammar 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Oncina, J., García, P.: Identifying regular languages in polynomial time. In Bunke, H., ed.: Advances in Structural and Syntactic Pattern Recognition. Volume 5 of Series in Machine Perception and Artificial Intelligence. World Scientific (1992) 99–108Google Scholar
  2. 2.
    Pitt, L., Warmuth, M.: The minimum consistent DFA problem cannot be approximated within any polynomial. Journal of the Association for Computing Machinery 40 (1993) 95–142zbMATHMathSciNetGoogle Scholar
  3. 3.
    Kearns, M., Valiant, L.: Cryptographic limitations on learning boolean formulae and finite automata. In: 21st ACM Symposium on Theory of Computing. (1989) 433–444Google Scholar
  4. 4.
    Sakakibara, Y.: Recent advances of grammatical inference. Theoretical Computer Science 185 (1997) 15–45zbMATHCrossRefMathSciNetGoogle Scholar
  5. 5.
    Lee, S.: Learning of context-free languages: A survey of the literature (1996)Google Scholar
  6. 6.
    de la Higuera, C.: Characteristic sets for polynomial grammatical inference. Machine Learning 27 (1997) 125–138zbMATHCrossRefGoogle Scholar
  7. 7.
    Sempere, J., García, P.: A characterisation of even linear languages and its application to the learning problem. In: Grammatical Inference and Applications, ICGI’94. Number 862 in Lecture Notes in Artificial Intelligence, Springer Verlag (1994) 38–44Google Scholar
  8. 8.
    Takada, Y.: A hierarchy of language families learnable by regular language learners. In: Grammatical Inference and Applications, ICGI’94. Number 862 in Lecture Notes in Artificial Intelligence, Springer Verlag (1994) 16–24Google Scholar
  9. 9.
    Sakakibara, Y., Kondo, M.: Ga-based learning of context-free grammars using tabular representations. In: Proceedings of 16th International Conference on Machine Learning (ICML-99). (1999) 354–360Google Scholar
  10. 10.
    Sakakibara, Y.: Learning context-free grammars from structural data in polynomial time. Theoretical Computer Science 76 (1990) 223–242CrossRefMathSciNetzbMATHGoogle Scholar
  11. 11.
    Sakakibara, Y., Brown, M., Hughley, R., Mian, I., Sjolander, K., Underwood, R., Haussler, D.: Stochastic context-free grammars for trna modeling. Nuclear Acids Res. 22 (1994) 5112–5120CrossRefGoogle Scholar
  12. 12.
    Ishizaka, I.: Learning simple deterministic languages. In: Proceedings of COLT 89. (1989)Google Scholar
  13. 13.
    Angluin, D.: Learning regular sets from queries and counterexamples. Information and Control 39 (1987) 337–350CrossRefMathSciNetGoogle Scholar
  14. 14.
    Pitt, L.: Inductive inference, DFA’s, and computational complexity. In: Analogical and Inductive Inference. Number 397 in Lecture Notes in Artificial Intelligence. Springer-Verlag, Berlin (1989) 18–44Google Scholar
  15. 15.
    Autebert, J., Berstel, J., Boasson, L.: Context-free languages and pushdown automata. In Salomaa, A., Rozenberg, G., eds.: Handbook of Formal Languages. Volume 1, Word Language Grammar. Springer-Verlag, Berlin (1997) 111–174Google Scholar
  16. 16.
    Nasu, M., Honda, N.: Mappings induced by pgsm-mappings and some recursively unsolvable problems of finite probabilistic automata. Information and Control 15 (1969) 250–273zbMATHCrossRefMathSciNetGoogle Scholar
  17. 17.
    Ibarra, O. H., Jiang, T., Ravikumar, B.: Some subclasses of context-free languages in nc1. Information Processing Letters 29 (1988) 111–117zbMATHCrossRefMathSciNetGoogle Scholar
  18. 18.
    Holzer, M., Lange, K. J.: On the complexities of linear ll(1) and lr(1) grammars. In: Proceedings of the 9th International Conference on Fundamentals of Computation Theory. Number 710 in Lecture Notes in Computer Science, Springer Verlag (1993) 299–308Google Scholar
  19. 19.
    Valiant, L.: A theory of the learnable. Communications of the Association for Computing Machinery 27 (1984) 1134–1142zbMATHGoogle Scholar
  20. 20.
    Gold, E.: Complexity of automaton identification from given data. Information and Control 37 (1978) 302–320zbMATHCrossRefMathSciNetGoogle Scholar
  21. 21.
    Parekh, R., Honavar, V.: Learning dfa from simple examples. Machine Learning Journal 44 (2001) 9–35zbMATHCrossRefGoogle Scholar
  22. 22.
    Parekh, R., Honavar, V.: On the relationship between models for learning in helpful environments. In Oliveira, A., ed.: Proceedings of ICGI 2000. Volume 1891 of Lecture Notes in Artificial Intelligence., Berlin Heidelberg New York, Springer Verlag (2000) 207–220Google Scholar
  23. 23.
    Ginsburg, S., Spanier, E.: Finite-turn pushdown automata. SIAM Control 4 (1966) 429–453zbMATHCrossRefMathSciNetGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Colin de la Higuera
    • 1
  • Jose Oncina
    • 2
  1. 1.EURISEUniversité de Saint-EtienneSaint-EtienneFrance
  2. 2.Departamento de Lenguajes y Sistemas InformáticosUniversidad de AlicanteAlicanteSpain

Personalised recommendations