Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Bayesian model for joint word alignment and part-of-speech transfer
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics. University of Helsinki, Finland.ORCID iD: 0000-0002-6027-4156
2016 (English)In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Osaka, Japan: Association for Computational Linguistics, 2016, p. 620-629Conference paper, Published paper (Refereed)
Abstract [en]

Current methods for word alignment require considerable amounts of parallel text to deliver accurate results, a requirement which is met only for a small minority of the world’s approximately 7,000 languages. We show that by jointly performing word alignment and annotation transfer in a novel Bayesian model, alignment accuracy can be improved for language pairs where annotations are available for only one of the languages—a finding which could facilitate the study and processing of a vast number of low-resource languages. We also present an evaluation where our method is used to perform single-source and multi-source part-of-speech transfer with 22 translations of the same text in four different languages. This allows us to quantify the considerable variation in accuracy depending on the specific source text(s) used, even with different translations into the same language.

Place, publisher, year, edition, pages
Osaka, Japan: Association for Computational Linguistics, 2016. p. 620-629
Keywords [en]
bayesian learning, word alignment, part of speech tagging, multilingual nlp
National Category
Language Technology (Computational Linguistics)
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:su:diva-159763ISBN: 978-4-87974-702-0 (print)OAI: oai:DiVA.org:su-159763DiVA, id: diva2:1245604
Conference
COLING 2016, the 26th International Conference on Computational Linguistics, Osaka, Japan, December 11-17, 2016
Available from: 2018-09-05 Created: 2018-09-05 Last updated: 2019-04-01Bibliographically approved

Open Access in DiVA

fulltext(194 kB)7 downloads
File information
File name FULLTEXT01.pdfFile size 194 kBChecksum SHA-512
b4e0e2fea0b099159561b421d8ebbcef0ba21349c9f8810c544d2f23c388fcbadd3dd3bcd46861dac0ff011ac1a4c60392af1911744881905a7a7e7d3f7e6ec1
Type fulltextMimetype application/pdf

Other links

Free full text

Search in DiVA

By author/editor
Östling, Robert
By organisation
Computational Linguistics
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 7 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 103 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf