Cross-modal integration during vowel identification in audiovisual speech: a functional magnetic resonance imaging study

Mika Murase; Daisuke N Saito; Takanori Kochiyama; Hiroki C Tanabe; Satoshi Tanaka; Tokiko Harada; Yu Aramaki; Manabu Honda; Norihiro Sadato

doi:10.1016/j.neulet.2008.01.044

Cross-modal integration during vowel identification in audiovisual speech: a functional magnetic resonance imaging study

Neurosci Lett. 2008 Mar 21;434(1):71-6. doi: 10.1016/j.neulet.2008.01.044. Epub 2008 Jan 26.

Authors

Mika Murase¹, Daisuke N Saito, Takanori Kochiyama, Hiroki C Tanabe, Satoshi Tanaka, Tokiko Harada, Yu Aramaki, Manabu Honda, Norihiro Sadato

Affiliation

¹ Department of Physiological Sciences, The Graduate University for Advanced Studies (Sokendai), Kanagawa, Japan.

PMID: 18280656
DOI: 10.1016/j.neulet.2008.01.044

Abstract

To investigate the neural substrates of the perception of audiovisual speech, we conducted a functional magnetic resonance imaging study with 28 normal volunteers. We hypothesized that the constraint provided by visually-presented articulatory speech (mouth movements) would lessen the workload for speech identification if the two were concordant, but would increase the workload if the two were discordant. In auditory attention sessions, subjects were required to identify vowels based on auditory speech. Auditory vowel stimuli were presented with concordant or discordant visible articulation movements, unrelated lip movements, and without visual input. In visual attention sessions, subjects were required to identify vowels based on the visually-presented vowel articulation movements. The movements were presented with concordant or discordant uttered vowels and noise, and without sound. Irrespective of the attended modality, concordant conditions significantly shortened the reaction time, whereas discordant conditions lengthened the reaction time. Within the neural substrates that were commonly activated by auditory and visual tasks, the mid superior temporal sulcus showed greater activity for discordant stimuli than concordant stimuli. These findings suggest that the mid superior temporal sulcus plays an important role in the auditory-visual integration process underlying vowel identification.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Acoustic Stimulation
Adult
Attention / physiology
Brain / anatomy & histology
Brain / physiology*
Brain Mapping
Dominance, Cerebral / physiology
Female
Humans
Language Tests
Language*
Magnetic Resonance Imaging
Male
Nerve Net / anatomy & histology
Nerve Net / physiology
Neural Pathways / anatomy & histology
Neural Pathways / physiology
Pattern Recognition, Visual / physiology
Phonetics*
Photic Stimulation
Reaction Time / physiology
Reading*
Speech Perception / physiology*
Temporal Lobe / anatomy & histology
Temporal Lobe / physiology
Verbal Behavior / physiology*