Abstract
Both the face and the voice provide us not only with linguistic information but also with a wealth of paralinguistic information, including gender cues. However, the way in which we integrate these two sources in our perception of gender has remained largely unexplored. In this study, we used a bimodal perception paradigm in which varying degrees of incongruence were created between facial and vocal information within audiovisual stimuli. We found that, in general, participants were able to combine both sources of information, with the gender of the face being influenced by that of the voice and vice versa. However, in conditions that directed attention to one modality, we observed that participants were unable to ignore the gender of the voice, even when instructed to. Overall, our results point to a larger role for the voice in gender perception when more controlled visual stimuli are used.
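The paradigm described above crosses face and voice gender continua so that audiovisual pairs vary in their degree of incongruence. The sketch below illustrates this kind of design; the specific morph steps (0–100% "male" in increments of 25%) and all names are hypothetical, not taken from the chapter.

```python
from itertools import product

# Illustrative morph levels along a female-to-male continuum (assumed, not
# the chapter's actual stimulus parameters).
MORPH_LEVELS = [0, 25, 50, 75, 100]  # percent "male" in the morph

def build_stimulus_set(levels):
    """Cross every face morph level with every voice morph level.

    Returns one dict per audiovisual stimulus, with an 'incongruence'
    field: the absolute gender mismatch between the two modalities.
    """
    stimuli = []
    for face, voice in product(levels, levels):
        stimuli.append({
            "face_pct_male": face,
            "voice_pct_male": voice,
            "incongruence": abs(face - voice),
        })
    return stimuli

stimuli = build_stimulus_set(MORPH_LEVELS)
congruent = [s for s in stimuli if s["incongruence"] == 0]
print(len(stimuli), len(congruent))  # 25 audiovisual pairs, 5 fully congruent
```

Crossing the two continua in this way yields a graded incongruence variable (here 0–100), which is what allows the relative weight of face and voice cues to be estimated from participants' gender judgments.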
Copyright information
© 2013 Springer Science+Business Media New York
About this chapter
Cite this chapter
Watson, R., Charest, I., Rouger, J., Casper, C., Latinus, M., Belin, P. (2013). Audiovisual Integration of Face–Voice Gender Studied Using “Morphed Videos”. In: Belin, P., Campanella, S., Ethofer, T. (eds) Integrating Face and Voice in Person Perception. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-3585-3_7
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-3584-6
Online ISBN: 978-1-4614-3585-3
eBook Packages: Biomedical and Life Sciences (R0)