Skip to main content

On Categorial Grammatical Inference and Logical Information Systems

  • Chapter
  • First Online:
Book cover Logic and Algorithms in Computational Linguistics 2018 (LACompLing2018)

Part of the book series: Studies in Computational Intelligence ((SCI,volume 860))

Abstract

We consider several classes of categorial grammars and discuss their learnability. We review results where learning is viewed as a symbolic issue in an unsupervised setting, from raw or from structured data, for some variants of Lambek grammars and of categorial dependency grammars. In that perspective, we discuss for these frameworks different type connectives and structures, some limitations (negative results) but also some algorithms (positive results) under some hypothesis. On the experimental side, we also consider the Logical Information Systems approach, that allows for navigation, querying, updating, and analysis of heterogeneous data collections where data are given (logical) descriptors. Categorial grammars can be seen as a particular case of Logical Information System.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 159.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 159.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    \(^*\) in \(V^*\) is the Kleene star unary operation on V.

  2. 2.

    \(\mathbb {N}\) is the set of natural numbers \(\{0, 1, 2, 3, \ldots \}\).

  3. 3.

    Usually a lexicon is a subset of the words of a natural language and an entry is one of the words of the lexicon.

  4. 4.

    A list of entries can be compatible with none, one or several types.

  5. 5.

    Transitivity means that it is possible to derive the type \(A {/}C\) from the types \(A {/}B\) and \(B {/}C\) or the type \(A {\setminus }C\) from the types \(A {\setminus }B\) and \(B {\setminus }C\).

  6. 6.

    RG means Rigid Grammar.

  7. 7.

    Trivial classes are for instance finite sets of grammars, for which learnability is obvious.

  8. 8.

    The closure properties are satisfied as mentioned in [43]:

    \(X \vdash X^Y\), \((X^Y)^Y \vdash X^Y\), if \(X \vdash Z\) then \(X^Y \vdash Z^Y\)

  9. 9.

    www.irisa.fr/LIS/softwares.

  10. 10.

    Lefff stands for: “Lexique des Formes Fléchies du Français/Lexicon of French inflected forms” (see http://alpage.inria.fr/~sagot/lefff-en.html).

  11. 11.

    http://universaldependencies.org/format.html.

  12. 12.

    This complies with Sequoia data, but may be a simplification for other corpora.

  13. 13.

    https://universaldependencies.org/conllu_viewer.html.

  14. 14.

    http://talc2.loria.fr/grew/.

References

  1. Angluin, D.: Inductive inference of formal languages from positive data. Inf. Control 45, 117–135 (1980)

    Article  MathSciNet  Google Scholar 

  2. Baader, F., Calvanese, D., McGuinness, D.L., Nardi, D., Patel-Schneider, P.F. (eds.): The Description Logic Handbook: Theory, Implementation, and Applications. Cambridge University Press (2003)

    Google Scholar 

  3. Bar-Hillel, Y.: A quasi arithmetical notation for syntactic description. Language 29, 47–58 (1953)

    Article  Google Scholar 

  4. Béchet, D.: k-valued link grammars are learnable from strings. In: Jäger, G., Monachesi, P., Penn, G., Wintner, S. (eds.) Proceedings of the 8th conference on Formal Grammar (FGVienna), Vienna, Austria, pp. 9–18 (2003)

    Google Scholar 

  5. Béchet, D., Dikovsky, A., Foret, A.: Dependency structure grammar. In: Blache, P., Stabler, E., Busquets, J., Moot, R. (eds.) Logical Aspects of Computational Linguistics, 5th International Conference, LACL 2005, Bordeaux, France, April 28–30, 2005, Proceedings. Lecture Notes in Artificial Intelligence (LNAI), vol. 3492, pp. 18–34. Springer (2005). https://doi.org/10.1007/b136076. http://signes.labri.fr/LACL/cfp.htm

    Google Scholar 

  6. Béchet, D., Dikovsky, A., Foret, A.: On dispersed and choice iteration in incrementally learnable dependency types. In: Logical Aspects of Computational Linguistics - 6th International Conference, LACL 2011, Montpellier, France. Lecture Notes in Computer Science (LNCS), vol. 6736, pp. 80–95. Springer (2011)

    Google Scholar 

  7. Béchet, D., Dikovsky, A., Foret, A.: Sur les itérations dispersées et les choix itérés pour l’apprentissage incrémental des types dans les grammaires de dépendances. In: Conférence Francophone d’Apprentissage 2011 (CAP), Chambéry, France (2011)

    Google Scholar 

  8. Béchet, D., Dikovsky, A., Foret, A.: Two models of learning iterated dependencies. In: de Groote, P., Nederhof, M.J. (eds.) Formal Grammar, 15th and 16th International Conferences, FG 2010, Copenhagen, Denmark, August 2010, FG 2011, Lubljana, Slovenia, August 2011, Revised Selected Papers. Lecture Notes in Computer Science (LNCS), vol. 7395, pp. 17–32. Springer (2012). https://doi.org/10.1007/978-3-642-32024-8

    Google Scholar 

  9. Béchet, D., Dikovsky, A., Foret, A., Moreau, E.: On learning discontinuous dependencies from positive data. In: Proceedings of the Formal Grammar Conference (FG 2004) (2004)

    Google Scholar 

  10. Béchet, D., Foret, A.: Apprentissage des grammaires de Lambek rigides et d’arité bornée pour le traitement automatique des langues. In: Actes de la Conférence d’APprentissage 2003 (CAP’2003) (2003)

    Google Scholar 

  11. Béchet, D., Foret, A.: k-valued non-associative Lambek categorial grammars are not learnable from strings. In: ACL (ed.) Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003) (2003)

    Google Scholar 

  12. Béchet, D., Foret, A.: k-valued non-associative Lambek grammars are learnable from function-argument structures. In: Proceedings of the 10th Workshop on Logic, Language, Information and Computation (WoLLIC’2003). Electronic Notes in Theoretical Computer Science, vol. 85 (2003)

    Article  Google Scholar 

  13. Béchet, D., Foret, A.: Remarques et perspectives sur les langages de prégroupe d’ordre 1/2. In: Actes, Dixième conférence de Traitement Automatique des Langues Naturelles (TALN 2003), pp. 309–314. ATALA (2003). (Poster)

    Google Scholar 

  14. Béchet, D., Foret, A.: k-valued non-associative Lambek grammars (without product) form a strict hierarchy of languages. In: Blache, P., Stabler, E., Busquets, J., Moot R. (eds.) Logical Aspects of Computational Linguistics, 5th International Conference, LACL 2005, Bordeaux, France, April 28–30, 2005, Proceedings. Lecture Notes in Artificial Intelligence (LNAI), vol. 3492, pp. 1–16. Springer (2005). https://doi.org/10.1007/b136076. http://signes.labri.fr/LACL/cfp.htm

    Google Scholar 

  15. Béchet, D., Foret, A.: On rigid NL Lambek grammars inference from generalized functor-argument data. In: FGMOL’05, the Tenth Conference on Formal Grammar and the Ninth on the Mathematics of Language, Edinburgh, Scotland (2005)

    Google Scholar 

  16. Béchet, D., Foret, A.: k-valued non-associative Lambek grammars are learnable from generalized functor-argument structures. J. Theor. Comput. Sci. 355(2) (extended version of [12]) (2006)

    Article  MathSciNet  Google Scholar 

  17. Béchet, D., Foret, A.: A pregroup toolbox for parsing and building grammars of natural languages. Linguist. Anal. J. 36 (2010)

    Google Scholar 

  18. Béchet, D., Foret, A.: Categorial dependency grammars with iterated sequences. In: Amblard, M., de Groote, P., Pogodalla, S., Retoré, C. (eds.) Logical Aspects of Computational Linguistics. Celebrating 20 Years of LACL (1996–2016) - 9th International Conference, LACL 2016, Nancy, France, December 5–7, 2016, Proceedings. Lecture Notes in Computer Science (LNCS), vol. 10054, pp. 34–51 (2016). https://doi.org/10.1007/978-3-662-53826-5_3

    Chapter  Google Scholar 

  19. van Benthem, J., ter Meulen, A. (eds.): Handbook of Logic and Language. North-Holland Elsevier, Amsterdam (1997)

    Google Scholar 

  20. Bonato, R., Retoré, C.: Learning rigid Lambek grammars and minimalist grammars from structured sentences. In: Third workshop on Learning Language in Logic, Strasbourg (2001)

    Google Scholar 

  21. Buszkowski, W., Penn, G.: Categorial grammars determined from linguistic data by unification. Studia Logica 49, 431–454 (1990)

    Article  MathSciNet  Google Scholar 

  22. Dekhtyar, M., Dikovsky, A.: Categorial dependency grammars. In: Proceedings of International Conference on Categorial Grammars, pp. 76–91. Montpellier (2004)

    Google Scholar 

  23. Dikovsky, A.: Polarized non-projective dependency grammars. In: de Groote, P., Morill, G., Retoré, C. (eds.) Proceedings of the Fourth International Conference on Logical Aspects of Computational Linguistics. Lecture Notes in Artificial Intelligence (LNAI), vol. 2099, pp. 139–157. Springer, Le Croisic, France (2001)

    Chapter  Google Scholar 

  24. Dikovsky, A.: Dependencies as categories. In: Kruiff, G.J., Duchier, D. (eds.) Proceedings of Workshop “Recent Advances in Dependency Grammars”. In conjunction with COLING 2004, pp. 90–97. Geneva, Switzerland (2004)

    Google Scholar 

  25. Dudau-Sofronie, D., Tellier, I., Tommasi, M.: Learning categorial grammars from semantic types. In: 13th Amsterdam Colloquium (2001)

    Google Scholar 

  26. Ferré, S.: Camelis: a logical information system to organize and browse a collection of documents. Int. J. Gen. Syst. 38(4) (2009)

    Article  Google Scholar 

  27. Ferré, S., Ridoux, O.: An introduction to logical information systems. Inf. Process. Manag. 40(3), 383–419 (2004)

    Article  Google Scholar 

  28. Ferré, S., Ridoux, O.: Logic functors: a toolbox of components for building customized and embeddable logics. Research Report RR-5871, INRIA, 103 pp. (2006). http://www.inria.fr/rrrt/rr-5871.html

  29. Florêncio, C.C.: Consistent identification in the limit of the class \(k\)-valued is NP-hard. In: de Groote, P., Morill, G., Retoré, C. (eds.) Proceedings of the Fourth International Conference on Logical Aspects of Computational Linguistics. Lecture Notes in Artificial Intelligence (LNAI), vol. 2099, pp. 125–138. Springer, Le Croisic, France (2001)

    Google Scholar 

  30. Foret, A.: Pregroup calculus as a logical functor. In: Proceedings of WOLLIC 2007. Lecture Notes in Computer Science (LNCS), vol. 4576. Springer (2007)

    Google Scholar 

  31. Foret, A.: A modular and parameterized presentation of pregroup calculus. Inf. Comput. J. 208(5), 395–604 (2010)

    Article  MathSciNet  Google Scholar 

  32. Foret, A., Ferré, S.: On categorial grammars as logical information systems. In: Kwuida, L., Sertkaya, B. (eds.) International Conference on Formal Concept Analysis. Lecture Notes in Computer Science (LNCS), vol. 5986, pp. 225–240. Springer (2010)

    Google Scholar 

  33. Foret, A., Ferré, S.: On categorial grammars and logical information systems: using camelis with linguistic data. In: System Demonstration at LACL 2012 (2012). https://lacl2012.sciencesconf.org/conference/lacl2012/demo_proceedings.pdf

  34. Foret, A., Nir, Y.L.: Lambek rigid grammars are not learnable from strings. In: COLING’2002, 19th International Conference on Computational Linguistics. Taipei, Taiwan (2002)

    Google Scholar 

  35. Foret, A., Nir, Y.L.: On limit points for some variants of rigid Lambek grammars. In: ICGI’2002, the 6th International Colloquium on Grammatical Inference. Lecture Notes in Artificial Intelligence (LNAI), vol. 2484. Springer, Amsterdam, the Netherlands (2002)

    Chapter  Google Scholar 

  36. Girard, J.Y.: Linear logic: its syntax and semantics. In: Advances in Linear Logic, pp. 1–42. Cambridge University Press (1995). http://iml.univ-mrs.fr/~girard/Articles.html

  37. Gold, E.M.: Language identification in the limit. Inf. Control 10, 447–474 (1967)

    Article  MathSciNet  Google Scholar 

  38. de la Higuera, C.: Grammatical Inference: Learning Automata and Grammars. Cambridge University Press (2010)

    Google Scholar 

  39. Kanazawa, M.: Learnable classes of categorial grammars. Studies in Logic, Language and Information. FoLLI & CSLI (1998)

    Google Scholar 

  40. Lambek, J.: The mathematics of sentence structure. Am. Math. Mon. 65, 154–169 (1958)

    Article  MathSciNet  Google Scholar 

  41. Lambek, J.: On the calculus of syntactic types. In: Jakobson, R. (ed.) Structure of Language and Its Mathematical Aspects, pp. 166–178. American Mathematical Society (1961)

    Google Scholar 

  42. Lecomte, A.: Grammaire et théorie de la preuve: une introduction. Traitement Automatique des Langues 37(2), 1–38 (1996)

    Google Scholar 

  43. Moortgat, M.: Categorial type logic. In: van Benthem and ter Meulen [19] , Chap. 2, pp. 93–177

    Google Scholar 

  44. Moot, R., Retoré, C.: The logic of categorial grammars. In: LNCS FOLLI 6850. Springer (2011). http://hal.archives-ouvertes.fr/hal-00607670. LCNS 6850

  45. Nicolas, J.: Grammatical inference as unification. In: Rapport de Recherche RR-3632, INRIA (1999). http://www.inria.fr/RRRT/publications-eng.html

  46. Partee, B.: Montague grammar. In: van Benthem and ter Meulen [19], Chap. 1, pp. 5–92

    Google Scholar 

  47. Pentus, M.: Lambek grammars are context-free. In: Logic in Computer Science. IEEE Computer Society Press (1993)

    Google Scholar 

  48. Shinohara, T.: Inductive inference from positive data is powerful. In: The 1990 Workshop on Computational Learning Theory, pp. 97–110. Morgan Kaufmann, San Mateo, California (1990)

    Google Scholar 

  49. Sleator, D., Temperley, D.: Parsing English with a link grammar. In: Third International Workshop on Parsing Technologies (1993). ftp://bobo.link.cs.cmu.edu/pub/sleator/link-grammar/

  50. Wright, K.: Identifications of unions of languages drawn from an identifiable class. In: The 1989 Workshop on Computational Learning Theory, pp. 328–333. San Mateo, California (1989)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Annie Foret .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Foret, A., Béchet, D. (2020). On Categorial Grammatical Inference and Logical Information Systems. In: Loukanova, R. (eds) Logic and Algorithms in Computational Linguistics 2018 (LACompLing2018). Studies in Computational Intelligence, vol 860. Springer, Cham. https://doi.org/10.1007/978-3-030-30077-7_6

Download citation

Publish with us

Policies and ethics