Inferring the location of authors from words in their texts
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics. ORCID iD: 0000-0002-6027-4156
Stockholm University, Faculty of Humanities, Department of Linguistics, General Linguistics. ORCID iD: 0000-0002-0840-1357
2015 (English). In: Proceedings of the 20th Nordic Conference of Computational Linguistics: NODALIDA 2015 / [ed] Beáta Megyesi, Linköping: Linköping University Electronic Press, ACL Anthology, 2015, pp. 211-218. Conference paper, published paper (peer-reviewed)
Abstract [en]

For the purposes of computational dialectology or other geographically bound text analysis tasks, texts must be annotated with their own location or that of their authors. Many texts are locatable, but most have no explicit annotation of place. This paper describes a series of experiments to determine how positionally annotated microblog posts can be used to learn location-indicating words, which can then be used to locate blog texts and their authors. A Gaussian distribution is used to model the locational qualities of words. We introduce the notion of placeness to describe how locational words are.

We find that the most useful results come from combining three steps: modelling word distributions to account for several locations, and thus several Gaussian distributions per word; defining a filter which picks out words with high placeness based on their local distributional context; and aggregating locational information in a centroid for each text. The results are applied to data in the Swedish language.
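
The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record does not give the paper's formulas, so everything here is an assumption made for illustration. In particular, each word gets a single axis-aligned Gaussian summary (mean and spread) rather than the several Gaussians per word that the abstract reports as most useful, the `placeness` score is a hypothetical inverse-spread measure rather than the paper's context-based filter, and the function names, the `min_placeness` threshold, and the toy posts are all invented.

```python
from collections import defaultdict
from statistics import fmean, pstdev


def fit_word_models(posts):
    """Summarise each word by the mean and spread of the coordinates
    of the geotagged posts it occurs in.

    posts: iterable of (text, (lat, lon)) pairs.
    Returns a dict: word -> (mean_lat, mean_lon, spread).
    Assumption: one Gaussian-like summary per word, in raw degrees.
    """
    coords = defaultdict(list)
    for text, (lat, lon) in posts:
        for word in set(text.lower().split()):
            coords[word].append((lat, lon))

    models = {}
    for word, points in coords.items():
        if len(points) < 2:
            continue  # too little evidence to estimate a spread
        lats = [p[0] for p in points]
        lons = [p[1] for p in points]
        spread = pstdev(lats) + pstdev(lons)
        models[word] = (fmean(lats), fmean(lons), spread)
    return models


def placeness(spread):
    """Hypothetical score in (0, 1]: tightly clustered words score high."""
    return 1.0 / (1.0 + spread)


def locate(text, models, min_placeness=0.5):
    """Estimate a text's location as the placeness-weighted centroid
    of the words that pass the filter; None if no word qualifies."""
    weighted = []
    for word in text.lower().split():
        if word in models:
            lat, lon, spread = models[word]
            score = placeness(spread)
            if score >= min_placeness:
                weighted.append((score, lat, lon))
    if not weighted:
        return None
    total = sum(score for score, _, _ in weighted)
    lat = sum(score * lat for score, lat, _ in weighted) / total
    lon = sum(score * lon for score, _, lon in weighted) / total
    return (lat, lon)


# Invented toy data: two posts near Stockholm, two near Gothenburg.
posts = [
    ("fika i stockholm", (59.3, 18.1)),
    ("stockholm idag", (59.3, 18.1)),
    ("fika i göteborg", (57.7, 12.0)),
    ("göteborg idag", (57.7, 12.0)),
]
models = fit_word_models(posts)
estimate = locate("fika i stockholm", models)
```

In this toy data the words "fika", "i" and "idag" occur in both cities, so their spread is large and the filter drops them; only the city names contribute to the centroid. A closer approximation of the multi-location modelling described in the abstract would replace the single summary per word with a mixture of Gaussians.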

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, ACL Anthology, 2015. pp. 211-218
Series
Linköping Electronic Conference Proceedings, ISSN 1650-3638 ; 109
HSV category
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:su:diva-127529
ISBN: 978-91-7519-098-3 (print)
OAI: oai:DiVA.org:su-127529
DiVA, id: diva2:909564
Conference
20th Nordic Conference of Computational Linguistics (NODALIDA 2015), Vilnius, Lithuania, May 11–13, 2015
Projects
SINUS (Spridning av innovationer i nutida svenska)
Research funder
Swedish Research Council
Available from: 2016-03-07. Created: 2016-03-07. Last updated: 2018-01-10. Bibliographically approved.

Author/editor
Karlgren, Jussi; Östling, Robert; Parkvall, Mikael