The Prague Dependency Treebank – Consolidated (PDT-C) will be a consolidated release of the existing PDT-corpora of Czech data with manual annotation at all three PDT-annotation layers (morphological, surface syntax and deep syntax layer).

 

PDT-corpora included in PDT-C:

There is manual annotation of deep syntax in all four treebanks included. The morphological and surface syntax layers are manually annotated only in PDT 3.5. Within the project of PDT-C, we want to manually annotate morphology and surface syntax in PDTSC, Czech part of PCEDT and PDT-Faust, and make corrections in PDT.