TED Multilingual Discourse Bank (TED-MDB): a parallel corpus annotated in the PDTB style
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics. Middle East Technical University, Turkey.
2019 (English) In: Language resources and evaluation, ISSN 1574-020X, E-ISSN 1574-0218. Article in journal (Refereed), Epub ahead of print
Abstract [en]

TED-Multilingual Discourse Bank, or TED-MDB, is a multilingual resource in which TED talks are annotated at the discourse level in six languages (English, Polish, German, Russian, European Portuguese, and Turkish), following the aims and principles of the PDTB. We explain the corpus design criteria, which rest on three main considerations: the linguistic characteristics of the languages involved, the interactive nature of TED talks (which led us to annotate Hypophora), and the decision to avoid projection. We report our annotation consistency and post-annotation alignment experiments, and provide a cross-lingual comparison based on corpus statistics.

Place, publisher, year, edition, pages
2019.
Keywords [en]
Discourse, Discourse relations, Corpus creation, Annotation, Multilingual corpus
National Category
General Language Studies and Linguistics
Research subject
Linguistics
Identifiers
URN: urn:nbn:se:su:diva-173474
DOI: 10.1007/s10579-019-09445-9
OAI: oai:DiVA.org:su-173474
DiVA, id: diva2:1354083
Available from: 2019-09-24 Created: 2019-09-24 Last updated: 2019-09-25

Open Access in DiVA

No full text in DiVA

Search in DiVA

By author/editor
Kurfali, Murathan