Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Consistency Checking for Treebank Alignment
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics.
Department of Linguistics, Indiana University .
2010 (English)In: Proceedings of the Fourth Linguistic Annotation Workshop / [ed] Nianwen Xue and Massimo Poesio, Association for Computational Linguistics , 2010, 38-46 p.Conference paper, Published paper (Refereed)
Abstract [en]

This paper explores ways to detect errors in aligned corpora, using very little technology. In the first method, applicableto any aligned corpus, we consider alignment as a string-to-string mapping. Treating the target string as a label, we examine each source string to find inconsistencies in alignment. Despite setting up the problem on a par with grammatical annotation, we demonstrate crucial differences in sorting errors from legitimate variations. The second method examines phrase nodes which are predicted to be aligned, based on the alignment of their yields. Both methods are effective in complementary ways.

Place, publisher, year, edition, pages
Association for Computational Linguistics , 2010. 38-46 p.
Keyword [en]
alignment, consistency
National Category
Language Technology (Computational Linguistics)
Research subject
Computational Linguistics
Identifiers
URN: urn:nbn:se:su:diva-53545OAI: oai:DiVA.org:su-53545DiVA: diva2:390905
Conference
ACL 2010
Available from: 2011-01-24 Created: 2011-01-24 Last updated: 2011-06-16Bibliographically approved

Open Access in DiVA

No full text

Other links

http://www.aclweb.org/anthology/W10-18

Search in DiVA

By author/editor
Samuelsson, Yvonne
By organisation
Computational Linguistics
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 37 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf