Home
TED-Multilingual Discourse Bank is a corpus consisting of TED
talks transcripts in 6 languages (English, German, Polish,
European Portuguese, Russian and Turkish), with the ultimate aim
to provide a clearly described level of discourse structure and
semantics in multiple languages. The corpus is manually annotated
following the goals and principles of PDTB, involving explicit and
implicit discourse connectives, entity relations, alternative
lexicalizations and no relations.