Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Comparing pre-linguistic normalization models against US English listeners' vowel perception
Stockholm University, Faculty of Humanities, Department of Swedish Language and Multilingualism, Scandinavian Languages.ORCID iD: 0000-0001-5226-8568
University of Rochester.ORCID iD: 0000-0002-1158-7308
2023 (English)In: 184th Meeting of the Acoustical Society of America: Abstracts, 2023, Vol. 153, article id A77Conference paper, Poster (with or without abstract) (Other academic)
Abstract [en]

One of the central computational challenges for speech perception is that talkers differ in pronunciation--i.e., how they map linguistic categories and meanings onto the acoustic signal. Yet, listeners typically overcome these difficulties within minutes (Clarke & Garrett, 2004; Xie et al., 2018). The mechanisms that underlie these adaptive abilities remain unclear. One influential hypothesis holds that listeners achieve robust speech perception across talkers through low-level pre-linguistic normalization. We investigate the role of normalization in the perception of L1-US English vowels. We train ideal observers (IOs) on unnormalized or normalized acoustic cues using a phonetic database of 8 /h-VOWEL-d/ words of US English (N = 1240 recordings from 16 talkers, Xie & Jaeger, 2020). All IOs had 0 DFs in predicting perception—i.e., their predictions are completely determined by pronunciation statistics. We compare the IOs’ predictions against L1-US English listeners’ 8-way categorization responses for /h-VOWEL-d/ words in a web-based experiment. We find that (1) pre-linguistic normalization substantially improves the fit to human responses from 74% to 90% of best-possible performance (chance = 12.5%); (2) the best-performing normalization accounts centered and/or scaled formants by talker; and (3) general purpose normalization (C-CuRE, McMurray & Jongman, 2011) performed as well as vowel-specific normalization. © 2023 Acoustical Society of America.

  

 

Place, publisher, year, edition, pages
2023. Vol. 153, article id A77
National Category
General Language Studies and Linguistics
Research subject
Scandinavian Languages
Identifiers
URN: urn:nbn:se:su:diva-233435DOI: 10.1121/10.0018218OAI: oai:DiVA.org:su-233435DiVA, id: diva2:1897463
Conference
184th Meeting of the Acoustical Society of America, Chicago, Illinois, May 8-12, 2023
Available from: 2024-09-13 Created: 2024-09-13 Last updated: 2024-09-13Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Persson, Anna

Search in DiVA

By author/editor
Persson, AnnaJaeger, T. Florian
By organisation
Scandinavian Languages
General Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 24 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf