PDT-Vallex Valency Lexicon
News
Jan. 9, 2012: Dissertation errata added to the web.
Dec. 9, 2011: Publications about PDT-Vallex, Intro, additional links added.
Nov. 8, 2011: PDF of PDT-Vallex and some links added.
Sep. 16, 2011: Pages created.
About PDT-Vallex
The valency lexicon PDT-Vallex has been built in close connection with the annotation of the Prague Dependency Treebank project (PDT) and its successors (mainly the Prague Czech-English Dependency Treebank project, PCEDT). It contains over 11000 valency frames for more than 7000 verbs which occurred in the PDT or PCEDT. It is available in electronically processable format (XML) together with the aforementioned treebanks (to be viewed and edited by TrEd, the PDT/PCEDT main annotation tool) , and also in more human readable form (see the links above and below). The main feature of the lexicon is its linking to the annotated corpora - each occurrence of each verb is linked to the appropriate valency frame with additional (generalized) information about its usage and surface morphosyntactic form alternatives.Resources and access
Quick links (but please read below the basic facts):
- Web-based browsing of PDT-Vallex 2.0
- PDT 2.0 (PDT-Vallex 1.0 is part of it)
- PDT-Vallex in PDF format (with intro and format description in Czech)
- LINDAT repository with more resources
PDT-Vallex is available in several forms. First and foremost, it is part of the Prague Dependency Treebank 2.0. After the extensions described in the PDT-Vallex PhD dissertation, and later published also as two books (please see our list of UFAL published books, and look for author "Uresova"). It has been also converted into a human-readable form in the pdf format. Lastly, it is also available in a browsable and partly searchable form with links to the texts which have been annotated with it (for more about the treebanks themselves, see PDT, PCEDT). It is also available in the original and pdf formats in the new LINDAT repository.
Publications about PDT-Vallex
1. Urešová Zdeňka: PDT-Vallex - trochu jiný valenční slovník. In: Slovo – Tvorba – Dynamickosť. Na počesť Kláry Buzássyovej, Copyright © Veda, Bratislava, Slovakia, ISBN 978-80-224-1107-3, pp. 278-286, 2010
2. Urešová Zdeňka: Building the PDT-VALLEX valency lexicon. In: On-line Proceedings of the fifth Corpus Linguistics Conference, http://ucrel.lancs.ac.uk/publications/cl2009, University of Liverpool, UK. 2009
3. Urešová Zdeňka, Štěpánek Jan, Hajič Jan: PDT Vallex for PDT 2.0. Institute of Formal and Applied Linguistics MFF UK Prague, http://ufal.mff.cuni.cz/pdt2.0/visual-data/pdt-vallex/vallex.html, 2007 (now obsolete; see http://ufal.mff.cuni.cz/lindat/PDT-Vallex.html for the current PDT-Vallex version)
4. Hajič Jan, Urešová Zdeňka: Linguistic Annotation: from Links to Cross-Layer Lexicons. In: Proceedings of The Second Workshop on Treebanks and Linguistic Theories, Copyright © Vaxjo University Press, Vaxjo, Sweden, ISBN 91-7636-394-5, ISSN 1651-0267, pp. 69-80, Nov. 2003
5. Hajič Jan, Panevová Jarmila, Urešová Zdeňka, Bémová Alevtina, Kolářová Veronika, Pajas Petr: PDT-VALLEX: Creating a Large-coverage Valency Lexicon for Treebank Annotation. In: Proceedings of The Second Workshop on Treebanks and Linguistic Theories, Copyright © Vaxjo University Press, Vaxjo, Sweden, ISBN 91-7636-394-5, ISSN 1651-0267, pp. 57-68, Nov. 2003