Documentation

EngVallex is part of the PCEDT releases (PCEDT 2.0  and now their replacement, PCEDT 3.0). Information about the lexicon as well as annotation guidelines for using it within tectogrammatical annotation projects for English can be found in the following documents:

  • Valency and EngVallex lexicon
  • Tectogrammatical Annotation Guide for English (pdf)
  • Reference Manual for English Tectogrammatical Annotation can be found here (as a part of PCEDT 3.0 web)

EngVallex-to-PropBank mapping is described here. Related documentation:

  • Engvallex-to-Propbank mapping - the most frequent only (txt)
  • Engvallex-to-Propbank mapping - all mappings, without node ids (txt)
  • Engvallex-to-Propbank mapping - all mappings, with node ids (txt)

The following publications refer to EngVallex or the CzEngVallex and SynSemClass projects, which use EngVallex as its underlying lexicon:

Cinková Silvie: From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), ELRA, Genova, Italy, pp. 2170–2175, 2006. (pdf)

Urešová Zdeňka, Dušek Ondřej, Fučíková Eva, Hajič Jan, Šindlerová Jana: Bilingual English-Czech Valency Lexicon Linked to a Parallel Corpus. In: Proceedings of the The 9th Linguistic Annotation Workshop (LAW IX 2015) , Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-47-1, pp. 124-128, 2015. (pdf)

Urešová Zdeňka, Fučíková Eva, Šindlerová Jana: CzEngVallex: a bilingual Czech-English valency lexicon. In: The Prague Bulletin of Mathematical Linguistics, Vol. 105, Univerzita Karlova v Praze, Prague, Czech rep., ISSN 0032-6585, pp. 17-50, 2016. (pdf)

Urešová Zdeňka, Fučíková Eva, Hajičová Eva, Hajič Jan: SynSemClass Linked Lexicon: Mapping Synonymy between Languages. In: Proceedings of the 2020 Globalex Workshop on Linked Lexicography (LREC 2020), European Language Resources Association, Marseille, France, ISBN 979-10-95546-46-7, pp. 10-19, 2020. (pdf)

Urešová Z., Fučíková E., Hajičová E., Hajič J. (2020) Syntactic-Semantic Classes of Context-Sensitive Synonyms Based on a Bilingual Corpus. In: Vetulani Z., Paroubek P., Kubis M. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2017. Lecture Notes in Computer Science, vol 12598. Springer, Cham. https://doi.org/10.1007/978-3-030-66527-2_18

Annotation Principles Related to PropBank:

Bonial Claire, Babko-Malaya Olga, Choi Jinho D., Hwang Jena, Palmer Martha, Reese Nicholas: PropBank Annotation Guidelines. Version 3.0. Center for Computational Language and Education Research, Institute of Cognitive Science, University of Colorado at Boulder. 2010. (pdf)