Statistical sequence and parsing models for descriptive linguistics and psycholinguistics
2016 (English)In: New Approaches to English Linguistics: Building bridges / [ed] Olga Timofeeva, Anne-Christine Gardner, Alpo Honkapohja, Sarah Chevalier, John Benjamins Publishing Company, 2016, 281-320 p.Chapter in book (Refereed)
This study shows that using computational linguistic models is beneficial for descriptive linguistics and psycholinguistics. It applies two models to various English genres and learner language: 1) surprisal and 2) a syntactic parser, allowing us to investigate the role of ambiguity and the interplay between idiom and syntax principles. We find that surprisal and ambiguity are higher for learner language, while parser scores and model fit are lower. In addition, the random application of alternations leads to more ambiguous sentences. Failures to generate optimal orderings in the sense of relevance theory, such as nonnative-like utterances by language learners exhibit, increase processing load, both for human and automatic processors. As human and automatic parsing difficulties correlate, we suggest syntactic parsers as psycholinguistic processing models.
Place, publisher, year, edition, pages
John Benjamins Publishing Company, 2016. 281-320 p.
, Studies in Language Companion Series, ISSN 0165-7763 ; 177
syntactic parsing, ambiguity, idiom and syntax principle, statistical models, language processing
General Language Studies and Linguistics
Research subject Linguistics
IdentifiersURN: urn:nbn:se:su:diva-134664ISBN: 9789027259424OAI: oai:DiVA.org:su-134664DiVA: diva2:1037150