Evaluating normalization accounts against the dense vowel space of Stockholm Swedish
Stockholm University, Faculty of Humanities, Department of Swedish Language and Multilingualism, Scandinavian Languages. ORCID iD: 0000-0001-5226-8568
2023 (English). In: 184th Meeting of the Acoustical Society of America: Abstracts, 2023, Vol. 153, article id A370.
Conference paper, Poster (with or without abstract) (Other academic)
Abstract [en]

Talkers vary in the phonetic realization of their vowels. One influential hypothesis holds that listeners overcome this inter-talker variability through pre-linguistic auditory mechanisms that normalize the acoustic or phonetic cues that form the input to speech recognition. Dozens of competing normalization accounts exist, including both vowel-specific accounts (e.g., Lobanov, 1971; Nearey, 1978; Syrdal and Gopal, 1986) and general-purpose accounts applicable to any type of phonetic cue (McMurray and Jongman, 2011). We add to the cross-linguistic literature by comparing normalization accounts against a new database of Swedish, a language with a particularly dense inventory of 21 vowels differing in quality and quantity. We train Bayesian ideal observers (IOs) on unnormalized or normalized vowel data under different assumptions about the relevant cues to vowel identity (F0-F3, vowel duration), and evaluate their performance in predicting the category intended by the talker. The results indicate that the best-performing normalization accounts center and/or scale formants by talker (e.g., Lobanov), replicating previous findings for other languages with less dense vowel spaces. The relative advantage of Lobanov decreased when additional cues were included, indicating that simple centering relative to the talker’s mean (e.g., C-CuRE) might be sufficient to achieve robust inter-talker perception.
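
To make the distinction between centering and centering plus scaling concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each vowel token is a pair of a talker label and a list of cue values in Hz; the function name `normalize_by_talker`, the `scale` flag, and the example values are hypothetical and are not taken from the Swedish database. With `scale=False` each cue is centered on the talker's mean, which corresponds to the simple centering the abstract describes. With `scale=True` each centered cue is also divided by the talker's standard deviation, as in Lobanov-style z-scoring. The sample standard deviation is used here; implementations differ on that choice.

```python
from collections import defaultdict
from statistics import mean, stdev


def normalize_by_talker(tokens, scale=True):
    """Center (and optionally scale) each cue within each talker.

    tokens: list of (talker, [cue1, cue2, ...]) pairs, e.g. formants in Hz.
    Returns a list of (talker, normalized_cues) in the original order.
    Assumes every talker has at least two tokens and non-constant cues
    when scale=True (otherwise the standard deviation is undefined or zero).
    """
    # Group the cue vectors by talker.
    by_talker = defaultdict(list)
    for talker, cues in tokens:
        by_talker[talker].append(cues)

    # Per-talker mean and (optionally) standard deviation for each cue.
    stats = {}
    for talker, rows in by_talker.items():
        columns = list(zip(*rows))
        means = [mean(col) for col in columns]
        if scale:
            sds = [stdev(col) for col in columns]
        else:
            sds = [1.0] * len(columns)
        stats[talker] = (means, sds)

    # Apply the talker-specific transformation to every token.
    normalized = []
    for talker, cues in tokens:
        means, sds = stats[talker]
        normalized.append(
            (talker, [(c - m) / s for c, m, s in zip(cues, means, sds)])
        )
    return normalized


# Invented F1/F2 values for two hypothetical talkers, for illustration only.
example = [
    ("A", [300.0, 2200.0]),
    ("A", [700.0, 1200.0]),
    ("B", [400.0, 2600.0]),
    ("B", [900.0, 1400.0]),
]
centered = normalize_by_talker(example, scale=False)
z_scored = normalize_by_talker(example, scale=True)
```

After either transformation, each talker's mean on every cue is zero, so differences between talkers in overall formant range no longer separate tokens of the same vowel category.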

Place, publisher, year, edition, pages
2023. Vol. 153, article id A370
National Category
General Language Studies and Linguistics
Research subject
Scandinavian Languages
Identifiers
URN: urn:nbn:se:su:diva-233434
DOI: 10.1121/10.0019201
OAI: oai:DiVA.org:su-233434
DiVA, id: diva2:1897460
Conference
184th Meeting of the Acoustical Society of America, Chicago, Illinois, May 8-12, 2023
Available from: 2024-09-13 Created: 2024-09-13 Last updated: 2024-09-13 Bibliographically approved
Authors
Persson, Anna; Jaeger, T. Florian