Alignment Tools for Parallel Treebanks
2007 (English)In: Data Structures for Linguistic Resources and Applications: Proceedings of the Biennial GLDV Conference 2007, 2007Conference paper (Refereed)
This paper reports about our efforts in creating a tri-lingual parallel treebank. The focal points are consistency checking and all aspects of sub-sentential alignment. We discuss the alignment guidelines, the importance of quality checks, and special alignment problems. Then we look at alignment algorithms and alignment visualization tools and we compare our own TreeAligner with other alignment tools. Our constituent structure treebanks contain just over 1,000 sentences and around 18,000 tokens in each language.
Place, publisher, year, edition, pages
parallel treebank, alignment
Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:su:diva-10560ISBN: 978-3-8233-6314-9OAI: oai:DiVA.org:su-10560DiVA: diva2:177079
Biennial GLDV Conference 2007