Advertisement

Speechreading by Humans and Machines

Models, Systems, and Applications

  • David G. Stork
  • Marcus E. Hennecke

Part of the NATO ASI Series book series (NATO ASI F, volume 150)

Table of contents

  1. Front Matter
    Pages I-XV
  2. Speechreading by Humans

    1. Front Matter
      Pages 1-1
    2. P. M. T. Smeele
      Pages 3-15
    3. Lynne E. Bernstein, Edward T. Auer Jr.
      Pages 17-26
    4. Barbara Dodd, Beth McIntosh, Lynn Woodhouse
      Pages 27-41
    5. Dominic W. Massaro
      Pages 79-101
    6. Helena M. Saldaña, David B. Pisoni, Jennifer M. Fellowes, Robert E. Remez
      Pages 145-151
    7. Michael M. Cohen, Rachel L. Walker, Dominic W. Massaro
      Pages 153-168
    8. Beatrice de Gelder, Paul Bertelson, Jean Vroomen
      Pages 179-191
    9. Jordi Robert-Ribes, Michel Piquemal, Jean-Luc Schwartz, Pierre Escudier
      Pages 193-210
    10. Marie-Agnès Cathiard, Mohamed-Tahar Lallouache, Christian Abry
      Pages 211-219
    11. Eric Vatikiotis-Bateson, Kevin G. Munhall, Makoto Hirayama, Y. Victor Lee, Demetri Terzopoulos
      Pages 221-232
    12. Shizuo Hiki, Yumiko Fukuda
      Pages 239-246
    13. Christian Abry, Mohamed-Tahar Lallouache, Marie-Agnès Cathiard
      Pages 247-255
    14. Margaret Kathleen Pichora-Fuller
      Pages 257-274
    15. Helena M. Saldaña, Lynne C. Nygaard, David B. Pisoni
      Pages 275-281
    16. C. Benoît, T. Guiard-Marigny, B. Le Goff, A. Adjoudani
      Pages 315-328
  3. Speechreading by Machines

    1. Front Matter
      Pages 329-329
    2. Marcus E. Hennecke, David G. Stork, K. Venkatesh Prasad
      Pages 331-349
    3. Barney Dalton, Robert Kaucic, Andrew Blake
      Pages 373-382
    4. Juergen Luettin, Neil A. Thacker, Steve W. Beet
      Pages 383-390
    5. Tarcisio Coianiz, Lorenzo Torresani, Bruno Caprile
      Pages 391-398
    6. Christoph Bregler, Stephen M. Omohundro, Jianbo Shi, Yochai Konig
      Pages 409-423
    7. Peter L. Silsbee, Qin Su
      Pages 489-496
    8. Alan J. Goldschen, Oscar N. Garcia, Eric D. Petajan
      Pages 505-515
  4. Panel discussions

    1. Front Matter
      Pages 517-517
    2. David G. Stork, Marcus E. Hennecke
      Pages 519-523
    3. David G. Stork, Marcus E. Hennecke
      Pages 525-531
    4. David G. Stork, Marcus E. Hennecke
      Pages 533-540
    5. David G. Stork, Marcus E. Hennecke
      Pages 541-548
    6. David G. Stork, Marcus E. Hennecke
      Pages 549-555
  5. Back Matter
    Pages 557-695

About this book

Introduction

This book is one outcome of the NATO Advanced Studies Institute (ASI) Workshop, "Speechreading by Man and Machine," held at the Chateau de Bonas, Castera-Verduzan (near Auch, France) from August 28 to Septem­ ber 8, 1995 - the first interdisciplinary meeting devoted the subject of speechreading ("lipreading"). The forty-five attendees from twelve countries covered the gamut of speechreading research, from brain scans of humans processing bi-modal stimuli, to psychophysical experiments and illusions, to statistics of comprehension by the normal and deaf communities, to models of human perception, to computer vision and learning algorithms and hardware for automated speechreading machines. The first week focussed on speechreading by humans, the second week by machines, a general organization that is preserved in this volume. After the in­ evitable difficulties in clarifying language and terminology across disciplines as diverse as human neurophysiology, audiology, psychology, electrical en­ gineering, mathematics, and computer science, the participants engaged in lively discussion and debate. We think it is fair to say that there was an atmosphere of excitement and optimism for a field that is both fascinating and potentially lucrative. Of the many general results that can be taken from the workshop, two of the key ones are these: • The ways in which humans employ visual image for speech recogni­ tion are manifold and complex, and depend upon the talker-perceiver pair, severity and age of onset of any hearing loss, whether the topic of conversation is known or unknown, the level of noise, and so forth.

Keywords

Active Shape Model Hidden Markov Model Markov model animation behavior cognition computer vision fuzzy knowledge learning memory neural networks perception psychophysics speech recognition

Editors and affiliations

  • David G. Stork
    • 1
    • 2
  • Marcus E. Hennecke
    • 1
    • 2
  1. 1.Ricoh California Research CenterMenlo ParkUSA
  2. 2.Department of Electrical EngineeringStanford UniversityStanfordUSA

Bibliographic information

  • DOI https://doi.org/10.1007/978-3-662-13015-5
  • Copyright Information Springer-Verlag Berlin Heidelberg 1996
  • Publisher Name Springer, Berlin, Heidelberg
  • eBook Packages Springer Book Archive
  • Print ISBN 978-3-642-08252-8
  • Online ISBN 978-3-662-13015-5
  • Series Print ISSN 0258-1248
  • Buy this book on publisher's site
Industry Sectors
Pharma
Automotive
Biotechnology
Finance, Business & Banking
Electronics
IT & Software
Telecommunications
Consumer Packaged Goods
Aerospace
Oil, Gas & Geosciences
Engineering