PDTSC - Prague Dependency Treebank of Spoken Czech

A project description and history, with a comprehensive overview of the problem as reported in corpora and in the literature, is available as UFAL Technical Report No. 33, or as a PDF file here (in Czech). The current PDTSC Annotation Guidelines are available as UFAL Technical Report No. 38 (also available electronically here as a PDF).

The data is currently available after registration (see also the menu on the left).

Internal (yet publicly accessible) web pages about the project are available at https://wiki.ufal.ms.mff.cuni.cz/pdtsc:start.

The data is best viewed and searched with the Speech Reconstruction editor MeD, which is publicly available here. When redirected to download MeD, please ignore the certificate warning (click Continue even if it says Not Recommended).

Contents of the project web pages: Jan Hajic.
Authors and contributors: Jan Hajic, Silvie Cinkova, Marie Mikulova, Petr Pajas, Petr Podvesky, Martina Otradovcova, Jan Ptacek, Josef Toman, Zdenka Uresova and all the annotators: Anna Hlavacova, Heather McGadie, Petra Mickova, Christine Warkentin, Helena Glucksmannova, Ludmila Kaplanova, Michaela Lunackova, Jana Grollova, Anna Kapsova, Petra Schnaubertova, Hana Stepankova and Jan Ures.
This work was funded in part by the Companions project (www.companions-project.org), sponsored by the European Commission as part of the Information Society Technologies (IST) programme under EC grant number IST-FP6-034434; by projects MSM0021620838, ME838 and LC536 of the Ministry of Education, Youth and Sports of the Czech Republic; and by project GA405/06/0589 of the Grant Agency of the Czech Republic. The data themselves are the sole result of the project GACR GA405/06/0589. The ME838 project is attached to project No. 0530118 of the PIRE program of the NSF/OISE.
