DeriVallex 1.0 is a valency lexicon of automatically generated valency frames of Czech nominal and adjectival derivatives whose valency exhibits systematic correspondences with the valency of their base words. It contains 10,220 derivatives corresponding to 17,288 lexical units (i.e., individual senses): 3,134 nouns corresponding to 5,089 lexical units and 7,086 adjectives corresponding to 12,199 lexical units.
DeriVallex was created to provide information on the valency of nouns and adjectives, which is not sufficiently covered in existing lexical resources. By focusing on nominal and adjectival derivatives whose valency behavior is systematic with respect to their base words, it captures the productive and systemic core of the Czech lexicon and thus lays the foundation for the further extension of current lexical resources.
The following word-formation categories are covered: action nouns (e.g., dobytí města nepřáteli ‘conquering the city by enemies’), quality nouns (e.g., učitelova laskavost k dětem ‘the teacher’s kindness to children’), simultaneous action adjectives (e.g., lidé bojující proti bezpráví ‘people fighting against injustice’), anterior action adjectives (e.g., dluh narostlý na 400 milionů ‘a debt that has risen to 400 million’ and muži navrátivší se z války ‘men who have returned from the war’), passive action adjectives (e.g., úspory diktované Evropě konzervativní vládou ‘austerity measures dictated to Europe by a conservative government’), and potentiality adjectives (e.g., dužina oddělitelná od pecky ‘flesh separable from the pit’).
The lexicon was compiled using data from the following lexical resources: NomVallex 2.6, VALLEX 4.5, and DeriNet 2.3.
To satisfy different needs of potential users, the lexicon is distributed:
Václava Kettnerová, Jiří Mírovský, Veronika Kolářová and Michal Olbrich
The creation of the DeriVallex lexicon has been supported by the LINDAT/CLARIAH-CZ Research Infrastructure, which is supported by the Ministry of Education, Youth and Sports of the Czech Republic (Project No. LM2023062); it has also used data and tools provided by this project.
DeriVallex is publicly available under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license (CC BY-NC-SA 4.0). Its non-commercial use is conditional on appropriate citation:
Kettnerová, Václava and Mírovský, Jiří and Kolářová, Veronika and Olbrich, Michal. 2026. DeriVallex 1.0. LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL). http://hdl.handle.net/11234/1-6109.