Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Comparing pre-linguistic normalization models against US English listeners' vowel perception
Stockholms universitet, Humanistiska fakulteten, Institutionen för svenska och flerspråkighet, Svenska/Nordiska språk.ORCID-id: 0000-0001-5226-8568
University of Rochester.ORCID-id: 0000-0002-1158-7308
2023 (Engelska)Ingår i: 184th Meeting of the Acoustical Society of America: Abstracts, 2023, Vol. 153, artikel-id A77Konferensbidrag, Poster (med eller utan abstract) (Övrigt vetenskapligt)
Abstract [en]

One of the central computational challenges for speech perception is that talkers differ in pronunciation--i.e., how they map linguistic categories and meanings onto the acoustic signal. Yet, listeners typically overcome these difficulties within minutes (Clarke & Garrett, 2004; Xie et al., 2018). The mechanisms that underlie these adaptive abilities remain unclear. One influential hypothesis holds that listeners achieve robust speech perception across talkers through low-level pre-linguistic normalization. We investigate the role of normalization in the perception of L1-US English vowels. We train ideal observers (IOs) on unnormalized or normalized acoustic cues using a phonetic database of 8 /h-VOWEL-d/ words of US English (N = 1240 recordings from 16 talkers, Xie & Jaeger, 2020). All IOs had 0 DFs in predicting perception—i.e., their predictions are completely determined by pronunciation statistics. We compare the IOs’ predictions against L1-US English listeners’ 8-way categorization responses for /h-VOWEL-d/ words in a web-based experiment. We find that (1) pre-linguistic normalization substantially improves the fit to human responses from 74% to 90% of best-possible performance (chance = 12.5%); (2) the best-performing normalization accounts centered and/or scaled formants by talker; and (3) general purpose normalization (C-CuRE, McMurray & Jongman, 2011) performed as well as vowel-specific normalization. © 2023 Acoustical Society of America.

  

 

Ort, förlag, år, upplaga, sidor
2023. Vol. 153, artikel-id A77
Nationell ämneskategori
Jämförande språkvetenskap och allmän lingvistik
Forskningsämne
nordiska språk
Identifikatorer
URN: urn:nbn:se:su:diva-233435DOI: 10.1121/10.0018218OAI: oai:DiVA.org:su-233435DiVA, id: diva2:1897463
Konferens
184th Meeting of the Acoustical Society of America, Chicago, Illinois, May 8-12, 2023
Tillgänglig från: 2024-09-13 Skapad: 2024-09-13 Senast uppdaterad: 2024-09-13Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltext

Person

Persson, Anna

Sök vidare i DiVA

Av författaren/redaktören
Persson, AnnaJaeger, T. Florian
Av organisationen
Svenska/Nordiska språk
Jämförande språkvetenskap och allmän lingvistik

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 24 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf