Temporal Encoding Of The Voice Onset Time (VOT) Phonetic Parameter By Field Potentials Recorded Directly From Human Auditory Cortex. Mitchell Steinschneider1, Igor O. Volkov2, M. Daniel Noh2, P. Charles Garell2, and Matthew A. Howard III2. 1Departments of Neurology and Neuroscience, Albert Einstein College of Medicine, Bronx, New York 10461 and 2Department of Surgery (Division of Neurosurgery), University of Iowa College of Medicine, Iowa City, Iowa 52242.
APStracts 6:0382N, 1999.
Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, while unvoiced stop consonants (/p/, /k/, and /t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20-40 msec. This abrupt change in consonant identification is an example of categorical speech perception, and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant-vowel (CV) syllables were recorded directly from Heschl's gyrus, the planum temporale and the superior temporal gyrus in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyrus. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60 or 80 msec evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 msec evoked a markedly diminished response to voicing onset, and elicited an AEP very similar in morphology to that evoked by the syllable with a 0 msec VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with inter-pulse-intervals of 20-25 msec. Speech-evoked AEPs recorded from the posterior superior temporal gyrus lateral to Heschl's gyrus displayed comparable response features, while field potentials recorded from 3 locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT is at least partially represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.

Received 26 January 1999; accepted in final form 5 August 1999.
APS Manuscript Number J064-9.
Article publication pending Journal of Neurophysiology.
ISSN 1080-4757 Copyright 1999 The American Physiological Society.
Published in APStracts on 13 August 1999