Advertisement

Analysis of Interrogatives in Different Domains

  • Helena Moniz
  • Fernando Batista
  • Isabel Trancoso
  • Ana Isabel Mata
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6456)

Abstract

The aim of this work is twofold: to quantify the distinct interrogative types in different domains for European Portuguese, and to discuss the weight of the linguistic features that best describe these structures, in order to model interrogatives in speech.

We analyzed spoken dialogue, university lectures, and broadcast news corpora, and, for the sake of comparison, newspaper texts. The statistical analysis confirms that the percentage of the different types of interrogative is highly dependent on the nature of the corpus. Experiments on the automatic detection of interrogatives for European Portuguese, using only lexical cues, show results that are strongly correlated with the detection of a specific type of interrogatives (namely wh- questions). When acoustic and prosodic features (pitch, energy and duration) are added, yes/no and tag questions are then increasingly identified, showing the advantages of combining both lexical, acoustic and prosodic information.

Keywords

Interrogatives punctuation prosody 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Amaral, R., Meinedo, H., Caseiro, D., Trancoso, I., Neto, J.: A prototype system for selective dissemination of broadcast news in European Portuguese. EURASIP. Journal of Advances in Signal Processing 37507 (2007)Google Scholar
  2. 2.
    Batista, F., Caseiro, D., Mamede, N., Trancoso, I.: Recovering capitalization and punctuation marks for automatic speech recognition: Case study for Portuguese broadcast news. Speech Communication 50(10), 847–862 (2008)CrossRefGoogle Scholar
  3. 3.
    Batista, F., Moniz, H., Trancoso, I., Meinedo, H., Mata, A.I., Mamede, N.: Extending the punctuation module for European Portuguese. In: Interspeech 2010 (2010)Google Scholar
  4. 4.
    Boakye, K., Favre, B., Hakkani-Tür, D.: Any questions? Automatic question detection in meetings. In: ASRU, Merano, Italy (2009)Google Scholar
  5. 5.
    Carletta, J.: Assessing agreement on classification tasks: The kappa statistic. Computational Linguistics 22, 249–254 (1996)Google Scholar
  6. 6.
    Cruz-Ferreira, M.: Intonation in European Portuguese. In: Hirst, D., Di Cristo, A. (eds.) Intonation systems, pp. 167–178. Cambridge University Press, Cambridge (1965)Google Scholar
  7. 7.
    Daumé III, H.: Notes on CG and LM-BFGS optimization of logistic regression (2004), http://hal3.name/megam/
  8. 8.
    Falé, I.: Fragmento da Prosódia do Português Europeu: as Estruturas Coordenadas. Master’s thesis, University of Lisbon (1995)Google Scholar
  9. 9.
    Frota, S.: Prosody and Focus in European Portuguese. Phonological Phrasing and Intonation. Garland Publishing, New York (2000)Google Scholar
  10. 10.
    Frota, S.: Nuclear falls and rises in European Portuguese: a phonological analysis of declarative and question intonation. Probus (14), 113–146 (2002)Google Scholar
  11. 11.
    Frota, S.: The intonational phonology of European Portuguese. In: Sun-uh (ed.) Prosodic Typology II. Oxford University Press, Oxford (2009)Google Scholar
  12. 12.
    Liscombe, J., Venditti, J., Hirschberg, J.: Classifying the form and function of student questions in spoken tutorial dialogs. Elsevier, Amsterdam (submitted)Google Scholar
  13. 13.
    Makhoul, J., Kubala, F., Schwartz, R., Weischedel, R.: Performance measures for information extraction. In: Proc. of the DARPA BN Workshop (1999)Google Scholar
  14. 14.
    Martins, F.: Entoação e organização do enunciado. Master’s thesis, University of Lisbon (1986)Google Scholar
  15. 15.
    Mata, A.I.: Questões de entoação e interrogação no Português. Isso é uma pergunta?. Master’s thesis, University of Lisbon (1990)Google Scholar
  16. 16.
    Mata, A.I., Santos, A.L.: On the intonation of confirmation-seeking requests in child-directed speech. In: Speech prosody (2010)Google Scholar
  17. 17.
    Moniz, H., Batista, F., Meinedo, H., Abad, A., Trancoso, I., Mata, A.I., Mamede, N.: Prosodically-based automatic segmentation and punctuation. In: Speech Prosody 2010 (2010)Google Scholar
  18. 18.
    Neto, J., Meinedo, H., Amaral, R., Trancoso, I.: The development of an automatic system for selective dissemination of multimedia information. In: International Workshop on Content-Based Multimedia Indexing (2003)Google Scholar
  19. 19.
    Shriberg, E., et al.: Can prosody aid the automatic classification of dialog acts in conversational speech? Language and Speech (41), 439–487 (1998)Google Scholar
  20. 20.
    Shriberg, E., Favre, B., Fung, J., Hakkani-Tur, D., Cuendet, S.: Prosodic similarities of dialog act boundaries across speaking styles. In: Tseng, S.C. (ed.) Linguistic Patterns in Spontaneous Speech, pp. 213–239. Institute of Linguistics, Academia Sinica, Taipei (2009)Google Scholar
  21. 21.
    Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., Pierrehumbert, J., Hirschberg, J.: ToBI: a standard for labeling English prosody. In: International Conference on Spoken Language Processing, Banff, Canada (1992)Google Scholar
  22. 22.
    Trancoso, I., Martins, R., Moniz, H., Mata, A.I., Viana, M.C.: The Lectra corpus - classroom lecture transcriptions in European Portuguese. In: LREC 2008 - Language Resources and Evaluation Conference, Marrakesh, Morocco (May 2008)Google Scholar
  23. 23.
    Vassière, J.: Language-independent prosodic features. In: Cutler, A., Ladd, R. (eds.) Prosody: modules and measurements, pp. 55–66. Springer, Heidelberg (1983)Google Scholar
  24. 24.
    Viana, M.C.: Para a Síntese da Entoação do Português. Ph.D. thesis, University of Lisbon (1987)Google Scholar
  25. 25.
    Viana, M.C., Frota, S., Falé, I., Mascarenhas, I., Mata, A.I., Moniz, H., Vigário, M.: Towards a P_ToBI. In: Unpublished Workshop of the Transcription of Intonation in the Ibero-Romance Languages, PaPI 2007 (2007), http://www2.ilch.uminho.pt/eventos/PaPI2007/Extended-Abstract-P-ToBI.PDF
  26. 26.
    Viana, M.C., Trancoso, I., Mascarenhas, I., Duarte, I., Matos, G., Oliveira, L., Campos, H., Correia, C.: Apresentação do projecto Coral - corpus de diálogo etiquetado. In: Workshop de Linguística Computacional, Lisbon, Portugal (April 1998)Google Scholar
  27. 27.
    Vigário, M.: Aspectos da prosódia do Portugês Europeu. Estruturas com advérbios de exclusão e negação frásica. Master’s thesis, University of Minho (1995)Google Scholar
  28. 28.
    Wang, D., Narayanan, S.: A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues. In: ICASSP (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Helena Moniz
    • 1
    • 2
  • Fernando Batista
    • 2
    • 3
  • Isabel Trancoso
    • 2
    • 4
  • Ana Isabel Mata
    • 1
  1. 1.Faculdade de Letras da Universidade de Lisboa (FLUL)Centro de Linguística da Universidade de Lisboa (CLUL), Alameda da UniversidadePortugal
  2. 2.INESC-IDLisboaPortugal
  3. 3.ISCTE-IUL - Instituto Universitário de LisboaLisboaPortugal
  4. 4.Instituto Superior TécnicoUniversidade Técnica de LisboaLisboaPortugal

Personalised recommendations