Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
From lexical bundles to surprisal: Measuring the idiom principle
Stockholms universitet, Humanistiska fakulteten, Institutionen för lingvistik, Avdelningen för datorlingvistik.ORCID-id: 0000-0003-2815-395X
2014 (engelsk)Inngår i: Lexical bundles in English non-fiction writing: forms and functions, 2014Konferansepaper, Oral presentation with published abstract (Fagfellevurdert)
Abstract [en]

Lexical bundles (LB) testify to Sinclair's idiom principle (SIP), and measure formulaicity, complexity and (non-) creativity (FCN). We exploit the information-theoretic measure of surprisal to analyze these.Frequency as measure of LB has been criticized (McEnery et al, 2006:208–220), instead collocation measures were suggested until Biber (2009:286–290) raised three criticisms. First, MI ranks rare collocations, which often include idioms, highest. We answer that also idioms are formulaic, and there are collocation measures which have a bias towards frequent collocations.Second, MI doesn't respect word order. We thus use directed word transition probabilities like surprisal (Levy and Jaeger 2007):3-gram surprisal =Third, formulaic sequences are often discontinuous. We thus sum over sequences, use 3-grams as atoms, and address syntactic surprisal.We argue that abstracting to surprisal as measure of LB and FCN is appropriate, as it expresses reader expectations and text entropy. We use surprisal to analyse differences between:

  1. spoken and written learner language (L2);
  2. L2 across proficiency levels;
  3. L2 compared with L1

We test Pawley and Syder (1983)'s and Levy and Jaeger (2007)'s hypothesis that native speakers play the tug-of-war between formulaicity and expressiveness best, thus minimizing comprehension difficulty, according to the uniform information density principle.

sted, utgiver, år, opplag, sider
2014.
HSV kategori
Identifikatorer
URN: urn:nbn:se:su:diva-109372OAI: oai:DiVA.org:su-109372DiVA, id: diva2:764522
Konferanse
12th ESSE conference, Kosice, Slovakia, August 29–September 2, 2014
Tilgjengelig fra: 2014-11-19 Laget: 2014-11-19 Sist oppdatert: 2018-03-20bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

Søk i DiVA

Av forfatter/redaktør
Grigonyte, Gintare
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric

urn-nbn
Totalt: 462 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf