TED-MDB

Home

TED-Multilingual Discourse Bank (TED-MDB) is a corpus of TED talk transcripts in six languages (English, German, Polish, European Portuguese, Russian and Turkish). Its ultimate aim is to provide a clearly described level of discourse structure and semantics in multiple languages. The corpus is manually annotated following the goals and principles of the Penn Discourse Treebank (PDTB). The annotation covers explicit and implicit discourse connectives, entity relations (EntRel), alternative lexicalizations (AltLex), and cases where no relation holds (NoRel).