Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex

Mitchell Steinschneider; Igor O. Volkov; M. Daniel Noh; P. Charles Garell; Matthew A. Howard

doi:10.1152/jn.1999.82.5.2346

Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex

Mitchell Steinschneider, Igor O. Volkov, M. Daniel Noh, P. Charles Garell, Matthew A. Howard

Neurology

Research output: Contribution to journal › Article › peer-review

160 Scopus citations

Abstract

Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20-40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant- vowel (CV) syllables were recorded directly from Heschl's gyms, the planum temporale, and the superior temporal gyms in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyms. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a O-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20-25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyms lateral to Heschl's gyms displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT at least partially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.

Original language	English (US)
Pages (from-to)	2346-2357
Number of pages	12
Journal	Journal of neurophysiology
Volume	82
Issue number	5
DOIs	https://doi.org/10.1152/jn.1999.82.5.2346
State	Published - 1999

ASJC Scopus subject areas

General Neuroscience
Physiology

Access to Document

10.1152/jn.1999.82.5.2346

Cite this

@article{9c8fdf4f3229475d936b51da64edda39,

title = "Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex",

abstract = "Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20-40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant- vowel (CV) syllables were recorded directly from Heschl's gyms, the planum temporale, and the superior temporal gyms in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyms. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a O-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20-25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyms lateral to Heschl's gyms displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT at least partially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.",

author = "Mitchell Steinschneider and Volkov, {Igor O.} and Noh, {M. Daniel} and Garell, {P. Charles} and Howard, {Matthew A.}",

year = "1999",

doi = "10.1152/jn.1999.82.5.2346",

language = "English (US)",

volume = "82",

pages = "2346--2357",

journal = "Journal of neurophysiology",

issn = "0022-3077",

publisher = "American Physiological Society",

number = "5",

}

TY - JOUR

T1 - Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex

AU - Steinschneider, Mitchell

AU - Volkov, Igor O.

AU - Noh, M. Daniel

AU - Garell, P. Charles

AU - Howard, Matthew A.

PY - 1999

Y1 - 1999

N2 - Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20-40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant- vowel (CV) syllables were recorded directly from Heschl's gyms, the planum temporale, and the superior temporal gyms in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyms. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a O-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20-25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyms lateral to Heschl's gyms displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT at least partially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.

AB - Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20-40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant- vowel (CV) syllables were recorded directly from Heschl's gyms, the planum temporale, and the superior temporal gyms in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyms. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a O-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20-25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyms lateral to Heschl's gyms displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT at least partially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.

UR - http://www.scopus.com/inward/record.url?scp=0032719109&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0032719109&partnerID=8YFLogxK

U2 - 10.1152/jn.1999.82.5.2346

DO - 10.1152/jn.1999.82.5.2346

M3 - Article

C2 - 10561410

AN - SCOPUS:0032719109

SN - 0022-3077

VL - 82

SP - 2346

EP - 2357

JO - Journal of neurophysiology

JF - Journal of neurophysiology

IS - 5

ER -

Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this