Computer-Supported Human-Human Multilingual Communication

Waibel, Alex; Bernardin, Keni; Wölfel, Matthias

doi:10.1007/978-3-540-77296-5_25

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4850)

Abstract

Computers have become an essential part of modern life, providing services in a multiplicity of ways. Access to these services, however, comes at a price: human attention is bound and directed toward a technical artifact in a human-machine interaction setting, at the expense of time and attention for other humans. This paper explores a new class of computer services that support human-human interaction and communication implicitly and transparently. Such Computers in the Human Interaction Loop (CHIL) require consideration of all communication modalities, multimodal integration, and more robust performance. We review the technologies and several CHIL services providing human-human support. Among them, we specifically highlight advanced computer services for cross-lingual communication.

Author information

Authors and Affiliations

InterACT, International Center for Advanced Communication Technology, Universität Karlsruhe (TH), Karlsruhe, Germany
Alex Waibel, Keni Bernardin & Matthias Wölfel
InterACT, International Center for Advanced Communication Technology, Carnegie Mellon University, Pittsburgh, PA, USA
Alex Waibel

Editor information

Max Lungarella, Fumiya Iida, Josh Bongard, Rolf Pfeifer

About this chapter

Cite this chapter

Waibel, A., Bernardin, K., Wölfel, M. (2007). Computer-Supported Human-Human Multilingual Communication. In: Lungarella, M., Iida, F., Bongard, J., Pfeifer, R. (eds) 50 Years of Artificial Intelligence. Lecture Notes in Computer Science, vol 4850. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77296-5_25

DOI: https://doi.org/10.1007/978-3-540-77296-5_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77295-8
Online ISBN: 978-3-540-77296-5
eBook Packages: Computer Science, Computer Science (R0)
