Skip to main content

Introduction

  • Chapter
  • First Online:
Grammatical Inference

Part of the book series: Studies in Computational Intelligence ((SCI,volume 673))

  • 862 Accesses

Abstract

The introductory chapter contains the different formulations of grammatical inference and indicates such formulations that will be considered in this book. Basically, the problem of grammatical inference within the present book is to be studied from machine learning and combinatorial optimization perspectives. In addition to this, the different representations of languages are assumed: deterministic and non-deterministic finite-state automata, regular expressions, and context-free grammars. As regards the machine learning approach, the design and analysis of learning experiments for comparing GI algorithms to each other or with machine learning methods will be discussed. Typical applications of the GI field are presented in this chapter as well.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 129.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    This technique can also be used for algorithms that work on \(S = (S_+, S_-)\), provided that their computational complexity primarily depends on the size of \(S_+\).

  2. 2.

    The median is the number separating the higher half of a data sample, a population, or a probability distribution, from the lower half. If there is an even number of observations, then there is no single middle value; the median is then usually defined to be the mean of the two middle values.

  3. 3.

    Scikit-learn provides a range of supervised and unsupervised learning algorithms via a consistent interface in Python. It is licensed under a permissive simplified BSD license. Its web page is http://scikit-learn.org/stable/.

References

  • Alpaydin E (2010) Introduction to machine learning, 2nd edn. The MIT Press

    Google Scholar 

  • Angluin D (1976) An application of the theory of computational complexity to the study of inductive inference. PhD thesis, University of California

    Google Scholar 

  • Angluin D (1988) Queries and concept learning. Mach Learn 2(4):319–342

    MathSciNet  Google Scholar 

  • Book RV, Otto F (1993) String-rewriting systems. Springer, Text and Monographs in Computer Science

    Google Scholar 

  • Bunke H, Sanfelieu A (eds) (1990) Grammatical inference. World Scientific, pp 237–290

    Google Scholar 

  • Charikar M, Lehman E, Liu D, Panigrahy R, Prabhakaran M, Sahai A, Shelat A (2005) The smallest grammar problem. IEEE Trans Inf Theory 51(7):2554–2576

    Article  MathSciNet  MATH  Google Scholar 

  • de la Higuera C (2005) A bibliographical study of grammatical inference. Pattern Recogn 38(9):1332–1348

    Article  Google Scholar 

  • de la Higuera C (2010) Grammatical inference: learning automata and grammars. Cambridge University Press, New York, NY, USA

    Book  MATH  Google Scholar 

  • Domaratzki M, Kisman D, Shallit J (2002) On the number of distinct languages accepted by finite automata with \(n\) states. J Autom Lang Comb 7:469–486

    MathSciNet  MATH  Google Scholar 

  • Dupont P (1994) Regular grammatical inference from positive and negative samples by genetic search: the GIG method. In: Proceedings of 2nd international colloquium on grammatical inference, ICGI ’94, Lecture notes in artificial intelligence, vol 862. Springer, pp 236–245

    Google Scholar 

  • Eyraud R, de la Higuera C, Janodet J (2007) Lars: a learning algorithm for rewriting systems. Mach Learn 66(1):7–31

    Article  Google Scholar 

  • Gold EM (1967) Language identification in the limit. Inf Control 10:447–474

    Article  MATH  Google Scholar 

  • Gold EM (1978) Complexity of automaton identification from given data. Inf Control 37:302–320

    Article  MathSciNet  MATH  Google Scholar 

  • Grune D, Jacobs CJ (2008) Parsing techniques: a practical guide, 2nd edn. Springer

    Google Scholar 

  • Heinz J, de la Higuera C, van Zaanen M (2015) Grammatical inference for computational linguistics. Synthesis lectures on human language technologies. Morgan & Claypool Publishers

    Google Scholar 

  • Hopcroft JE, Motwani R, Ullman JD (2001) Introduction to automata theory, languages, and computation, 2nd edn. Addison-Wesley

    Google Scholar 

  • Hunt HB III, Rosenkrantz DJ, Szymanski TG (1976) On the equivalence, containment, and covering problems for the regular and context-free languages. J Comput Syst Sci 12:222–268

    Article  MathSciNet  MATH  Google Scholar 

  • Imada K, Nakamura K (2009) Learning context free grammars by using SAT solvers. In: Proceedings of the 2009 international conference on machine learning and applications, IEEE computer society, pp 267–272

    Google Scholar 

  • Japkowicz N, Shah M (2011) Evaluating learning algorithms: a classification perspective. Cambridge University Press

    Google Scholar 

  • Jiang T, Ravikumar B (1993) Minimal NFA problems are hard. SIAM J Comput 22:1117–1141

    Article  MathSciNet  MATH  Google Scholar 

  • Kuroda S (1964) Classes of languages and linear-bounded automata. Inf Control 7(2):207–223

    Article  MathSciNet  MATH  Google Scholar 

  • Maurer-Stroh S, Debulpaep M, Kuemmerer N, Lopez de la Paz M, Martins IC, Reumers J, Morris KL, Copland A, Serpell L, Serrano L et al (2010) Exploring the sequence determinants of amyloid structure using position-specific scoring matrices. Nat Methods 7(3):237–242

    Article  Google Scholar 

  • Meyer AR, Stockmeyer LJ (1972) The equivalence problem for regular expressions with squaring requires exponential space. In: Proceedings of the 13th annual symposium on switching and automata theory, pp 125–129

    Google Scholar 

  • Moore C, Eppstein D (2003) One-dimensional peg solitaire, and duotaire. In: More games of no chance. Cambridge University Press, pp 341–350

    Google Scholar 

  • Nowakowski RJ (ed) (1996) Games of no chance. Cambridge University Press

    Google Scholar 

  • Rozenberg G, Salomaa A (eds) (1997) Handbook of formal languages, vol 3. Beyond words. Springer

    Google Scholar 

  • Trakhtenbrot B, Barzdin Y (1973) Finite automata: behavior and synthesis. North-Holland Publishing Company

    Google Scholar 

  • Valiant LG (1984) A theory of the learnable. Commun ACM 27:1134–1142

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wojciech Wieczorek .

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Wieczorek, W. (2017). Introduction. In: Grammatical Inference. Studies in Computational Intelligence, vol 673. Springer, Cham. https://doi.org/10.1007/978-3-319-46801-3_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-46801-3_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46800-6

  • Online ISBN: 978-3-319-46801-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics