Skip to main content

Gramateca: Corpus-Based Grammar of Portuguese

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8775))

  • 632 Accesses

Abstract

This paper has two aims: to present Gramateca, an initiative for corpus-based grammar of Portuguese recently launched by Linguateca, discussing its rationale and aims, and to present a small pilot of an advanced corpus description for Portuguese, which we believe to be a pre-requisite to do sound grammar work, “know thy corpus”.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Santos, D.: Linguateca’s infrastructure for Portuguese and how it allows the detailed study of language varieties. In: Johannessen, J. (ed.) Language Variation Infrastructure: Papers on Selected Projects, vol. 3, pp. 113–128 (2011)

    Google Scholar 

  2. Varejão, F. de O.A.: O português do Brasil: Revisitando a História. In: Cadernos de Letras da UFF Dossiê: Difusão da Língua Portuguesa, vol. 39, pp. 119–137 (2009)

    Google Scholar 

  3. Ilari, R. (ed.): Palavras de classe aberta. da Gramática do Portugués Culto Falado no Brasil, vol. 3. Editora Contexto (2014)

    Google Scholar 

  4. Raposo, E.B.P., do Nascimento, M.F.B., da Mota, M.A.C., Segura, L., Mendes, A., Vicente, G., Veloso, R. (eds.): Gramática do Português I e II. Gulbenkian (2013)

    Google Scholar 

  5. Bick, E.: Portuguese syntax: Teaching manual (2000)

    Google Scholar 

  6. Galves, C.: Tycho Brahe Parsed Corpus of Historical Portuguese: Syntactic Annotation System (2007-2008), http://www.tycho.iel.unicamp.br/~tycho/corpus/manual/syn-frm.html

  7. Biber, D.: Variation across speech and writing. Cambridge University Press (1988)

    Google Scholar 

  8. Biber, D., Johansson, S., Leech, G., Conrad, S., Finegan, E.: The Longman grammar of spoken and written English. Longman (1999)

    Google Scholar 

  9. Crystal, D.: Internet Linguistics: A Student Guide. Routledge (2011)

    Google Scholar 

  10. Bick, E.: The Parsing System “Palavras”: Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. PhD thesis, Aarhus University, Aarhus, Denmark (November 2000)

    Google Scholar 

  11. Santos, D., Bick, E.: Providing Internet access to Portuguese corpora: The AC/DC project. In: Gavrilidou, M., Carayannis, G., Markantonatou, S., Piperidis, S., Stainhauer, G. (eds.) Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000), May 31-June 2, pp. 205–210 (2000)

    Google Scholar 

  12. Costa, L., Santos, D., Rocha, P.A.: Estudando o portugués tal como é usado: o serviço AC/DC. In: The 7th Brazilian Symposium in Information and Human Language Technology (STIL 2009), September 8-11 (2009)

    Google Scholar 

  13. Santos, D.: Podemos contar com as contas? In: Aluísio, S., Tagnin, S. (eds.) New Language Technologies and Linguistic Research: A Two-way Road. Cambridge Scholars Publishing (2014)

    Google Scholar 

  14. Santos, D.: Comparing oral (transcribed) and written corpora in portuguese, Presentation at GSCP 2014, Stockholm (2014), http://www.linguateca.pt/Diana/download/CompOralEstoc.pdf

  15. Santos, D., Silva, R., Freitas, C.: Pluralidades na cor: contrastando a língua do Brasil e de Portugal. In: da Silva, A.S., Torres, A., Gonçalves, M. (eds.) Línguas Pluricêntricas: Variação Linguística e Dimensões Sociocognitivas, Braga, Aletheia, Publicações da Faculdade de Filosofia da Universidade Católica Portuguesa, pp. 555–572 (2011)

    Google Scholar 

  16. Marques, R.: Modalidade e condicionais em português (submitted, 2014)

    Google Scholar 

  17. Freitas, C., Santos, D., Sousa, R., Jansen, H., Mota, C.: Esqueleto: Body language in Portuguese (submitted, 2014)

    Google Scholar 

  18. Santos, D., Mota, C.: Opinions and sentiment analysis in Gramateca (submitted, 2014)

    Google Scholar 

  19. Simões, A.: Comparador: forma de auscultar corpos no AC/DC (2014), http://www.linguateca.pt/documentos/Comparador.pdf

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Santos, D. (2014). Gramateca: Corpus-Based Grammar of Portuguese. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09761-9_24

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09760-2

  • Online ISBN: 978-3-319-09761-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics