Principal investigator (ÚFAL): 
Provider: 
Grant id: GA16-02196S
Duration: 2016-2018

NomVallex

Corpus-based Valency Lexicon of Czech Nouns

The project dealt with the lexicographic treatment of valency of Czech deverbal nouns.

The lexicon, called NomVallex I., is available in three formats:

An example of two related lexicon entries:

The main characteristics of the NomVallex I. lexicon can be summarized as follows:

  1. The lexicon captures valency of Czech deverbal nouns that belong, in at least one of their meanings, to one of the following semantic classes: Communication (e.g. dotaz ‘question’, dotazování (se) – dotázání (se) ‘asking’), Mental Action (e.g. plán ‘plan’, plánování ‘planning’) or Psych State (e.g. nenávist ‘hatred’, nenávidění ‘hating’). In total, the lexicon includes 505 lexical units in 248 lexemes (when aspectual counterparts, such as namítání [impf] – namítnutí [pf] ‘expressing objections’, are counted as individual lexical units, the number rises to 655 lexical units covering a total of 297 lemmas).
  2. The lexicon is created within the theoretical framework of Functional Generative Description. It is founded on data from the SYN series of corpora from the Czech National Corpus and the Araneum Bohemicum Maximum corpus.
  3. The lexicon follows in the footsteps of the VALLEX lexicon (Lopatková et al., 2016b); it adopts the VALLEX annotation scheme, and in relevant cases, deverbal nouns captured in NomVallex I. mirror the division of lexemes into lexical units and the assignment of lexical units to semantic classes of the base verbs captured in the VALLEX lexicon. (In its electronic version, NomVallex I. also provides links to the valency lexicon PDT-Vallex.)
  4. It captures all lexical meanings of the nouns, differentiating between basic “categorial” meanings, i.e. action (e.g. žádání (si) ‘asking’, dovtípení (se) ‘inferring’), abstract result of an action (e.g. žádost ‘request’), property/quality (e.g. důvtip ‘ingenuity’), material object (e.g. pohled ‘postcard’) and container/quantity (e.g. počet ‘number’).
  5. Considering morphosyntactic properties of the studied nouns, the lexicon differentiates three basic types of noun derivates, namely syntactic, syntacticolexical and lexical derivates. In order to compare the valency behaviour of different types of noun derivates, the lexicon includes both stem-nominals (derived from verbs by the suffixes -ní/-tí and containing a theme suffix, e.g. žádání (si) [impf] ‘asking’, navrhování [impf] – navrhnutí [pf1] – navržení [pf2] ‘suggesting/proposing’, namítání [impf] – namítnutí [pf] ‘expressing objections’) and root-nominals (derived from verbs by various suffixes, including the zero suffix, but not containing a theme suffix, e.g. žádost [no-aspect] ‘request’, návrh [no-aspect] ‘proposal’, námitka [no-aspect] ‘objection’).
  6. A noun was included in the lexicon if it matches the following criteria: its semantic class is Communication, Mental Action or Psych State; its categorial meaning is action or abstract result of an action; and it exhibits non-systemic valency behaviour (especially non-systemic forms of participants). When both a stem-nominal and a root-nominal derived from the same verb are available, both are included if at least one of them satisfies the above criteria in at least one of its senses.
  7. Valency properties are captured in the form of a valency frame (in which valency slots are specified by a functor and a list of morphemic forms), and examples which occurred in the corpus data. The lexicon aims to illustrate the full range of syntactic structures of noun phrases, and thus the syntactic behaviour of every lexical unit is exemplified with all combinations of its participants (in all forms specified in the valency frame) which were found in the corpus data.
  8. In accordance with valency lexicons VALLEX and PDT-Vallex, the NomVallex I. lexicon assumes that every lexical meaning (sense) is linked to a corresponding valency frame, and vice versa, a difference in valency frames proves a difference in meaning. Morphosyntactic properties are taken into account when there is ambiguity about whether separate senses should be distinguished or not.
  9. Along with the printed version, the NomVallex I. lexicon is also available in electronic form, both as publicly available web pages (https://ufal.mff.cuni.cz/nomvallex) and as machine-readable data suitable for further research into the valency of Czech deverbal nouns and for other NLP applications. The online version and an offline application allow users to formulate specific and complex queries based on a wide range of criteria, e.g. the type of derivation of the noun (stem vs. root nominals), its aspectual characteristics, categorial meaning, semantic class, types of its valency complementations and their morphemic forms (including their distribution depending on the type of the noun and/or the type of the complementation itself, individually and in combinations), and the relation of the noun to its base verb, including the differences in valency behaviour. The lexicon also quotes rich corpus evidence supporting the valency characteristics described in the given valency frames.
  10. A comparison of valency frames of nouns in the NomVallex I. lexicon and valency frames of their base verbs in VALLEX enables us to gain insight into systemic and non-systemic valency behaviour of Czech deverbal nouns.
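To make points 7, 9 and 10 more concrete, the following is a minimal Python sketch of how a lexical unit with a valency frame (slots given by a functor plus a list of morphemic forms) might be represented, filtered by criteria, and compared with the frame of a base verb. It is not the NomVallex data format or query interface: the class names, field names, the helpers `select` and `frame_difference`, the form labels and the sample frames are all hypothetical and were invented for illustration. Only the nouns plánování and plán and their semantic class Mental Action come from the description above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ValencySlot:
    functor: str   # e.g. "ACT" or "PAT"
    forms: tuple   # morphemic forms; the labels used below are placeholders


@dataclass(frozen=True)
class LexicalUnit:
    lemma: str
    derivation: str          # "stem" or "root" (hypothetical encoding)
    aspect: str              # "impf", "pf" or "no-aspect"
    semantic_class: str
    categorial_meaning: str
    frame: tuple             # tuple of ValencySlot


def select(units, **criteria):
    """Return the units whose attributes equal all of the given criteria."""
    return [u for u in units
            if all(getattr(u, name) == value for name, value in criteria.items())]


def frame_difference(noun_frame, verb_frame):
    """For each functor, list the forms found in only one of the two frames."""
    noun = {s.functor: set(s.forms) for s in noun_frame}
    verb = {s.functor: set(s.forms) for s in verb_frame}
    diff = {}
    for functor in sorted(noun.keys() | verb.keys()):
        noun_only = sorted(noun.get(functor, set()) - verb.get(functor, set()))
        verb_only = sorted(verb.get(functor, set()) - noun.get(functor, set()))
        if noun_only or verb_only:
            diff[functor] = {"noun_only": noun_only, "verb_only": verb_only}
    return diff


# Invented sample data: the frames, forms and categorial meanings are
# placeholders, not entries copied from NomVallex or VALLEX.
PLANOVANI = LexicalUnit(
    "plánování", "stem", "impf", "Mental Action", "action",
    (ValencySlot("ACT", ("genitive", "instrumental")),
     ValencySlot("PAT", ("genitive",))),
)
PLAN = LexicalUnit(
    "plán", "root", "no-aspect", "Mental Action", "abstract result",
    (ValencySlot("ACT", ("genitive",)),
     ValencySlot("PAT", ("na+accusative",))),
)
UNITS = [PLANOVANI, PLAN]
VERB_FRAME = (ValencySlot("ACT", ("nominative",)),
              ValencySlot("PAT", ("accusative",)))

if __name__ == "__main__":
    print([u.lemma for u in select(UNITS, derivation="stem")])
    print(frame_difference(PLANOVANI.frame, VERB_FRAME))
```

The difference returned by `frame_difference` is a plain set comparison. Deciding which of the noun-only forms count as systemic or non-systemic requires the linguistic rules of the underlying theory, which this sketch does not attempt to model.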

Publications

Kolářová Veronika, Vernerová Anna, Klímová Jana (2020): NomVallex I. Valenční slovník substantiv. Praha: Ústav formální a aplikované lingvistiky, ISBN: 978-80-88132-07-3.

Kolářová Veronika, Vernerová Anna, Klímová Jana (2020): NomVallex I. LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University, http://hdl.handle.net/11234/1-3420, https://ufal.mff.cuni.cz/nomvallex.

Kolářová Veronika, Klímová Jana, Vernerová Anna (2018): Valency Lexicon of Czech Nouns NomVallex: Starting Point and Goals. In: Slovanská lexikografie počátkem 21. století. Sborník příspěvků z mezinárodní konference. Praha: Slovanský ústav AV ČR, v.v.i., ISBN 978-80-86420-65-3, pp. 219-226.

Panevová Jarmila, Kolářová Veronika (2018): Aktant, nebo volné doplnění? (K netypickým formám ve valenčním poli substantiv). Prace Filologiczne, Vol. 72. Warszawa: Uniwersytet Warszawski, ISSN 0138-0567, pp. 275-284.

Kolářová Veronika, Vernerová Anna, Klímová Jana (2018): Předložková vyjádření adnominálních valenčních doplnění. Prace Filologiczne, Vol. 72. Warszawa: Uniwersytet Warszawski, ISSN 0138-0567, pp. 211-223.

Kettnerová Václava, Kolářová Veronika, Vernerová Anna (2017): Deverbal Nouns in Czech Light Verb Constructions. In: Mitkov R. (ed.), Computational and Corpus-Based Phraseology. EUROPHRAS 2017. Lecture Notes in Computer Science, Vol. 10596. Cham: Springer, pp. 205-219.

Kolářová Veronika, Vernerová Anna, Klímová Jana, Kolář Jan (2017): Possible but not probable: A quantitative analysis of valency behaviour of Czech nouns in the Prague Dependency Treebank. Jazykovedný časopis / Journal of Linguistics, Vol. 68, No. 2. SAP – Slovak Academic Press, ISSN 0021-5597, pp. 208-218.

Kolářová Veronika (2017): Valence českých deverbativních substantiv reprezentujících vybrané sémantické třídy. Prace Filologiczne, Vol. 70, Uniwersytet Warszawski, ISSN 0138-0567, pp. 287-303.

Panevová Jarmila (2017): Od valence slovesa k valenci substantiv a adjektiv. Prace Filologiczne, Vol. 70, Uniwersytet Warszawski, ISSN 0138-0567, pp. 59-71.

Kolářová Veronika, Klímová Jana, Vernerová Anna (2017): NomVallex: Valency Patterns of Semantically Classified Czech Nouns. Corpus Linguistics 2017 Conference, University of Birmingham, 25-28 July 2017.

Kolářová Veronika, Kolář Jan, Mikulová Marie (2017): Difference between Written and Spoken Czech: The Case of Verbal Nouns Denoting an Action. The Prague Bulletin of Mathematical Linguistics No. 107, pp. 19-38.

Klímová Jana, Kolářová Veronika, Vernerová Anna (2016): Towards a Corpus-based Valency Lexicon of Czech Nouns. In: GLOBALEX 2016: Lexicographic Resources for Human Language Technology. GLOBALEX workshop 2016, pp. 1-7.

Presentations

NomVallex: Valenční slovník českých substantiv založený na korpusu

Valenční slovník českých substantiv: Východiska a cíle

Related publications supported by other projects

Book:

Kolářová Veronika (2010): Valence deverbativních substantiv v češtině (na materiálu substantiv s dativní valencí). Praha: Karolinum.

Papers on particular phenomena related to valency of nouns:

Kolářová Veronika (2014): Nominalizované struktury se dvěma aktanty ve formě bezpředložkového genitivu. Naše řeč, Vol. 97, No. 4-5, Copyright © Ústav pro jazyk český Akademie věd České republiky, Praha, Czechia, ISSN 0027-8203, pp. 286-299.

Kolářová Veronika (2014): Preference v souvýskytu aktantů u českých substantiv mluvení. Korpus – gramatika – axiologie, Vol. 5, No. 10, Copyright © Gaudeamus, Hradec Králové, Czechia, ISSN 1804-137X, pp. 23-40.

Kolářová Veronika (2014): Special valency behavior of Czech deverbal nouns. In: Noun Valency, Copyright © John Benjamins Publishing Company, Amsterdam, The Netherlands, ISBN 9789027259233, pp. 19-60.

Kolářová Veronika (2014): Valence vybraných typů deverbativních substantiv ve valenčním slovníku PDT-Vallex. Technical report no. 2014/TR-2014-56, Copyright © ÚFAL MFF UK, ISSN 1214-5521, 34 pp.

Kolářová Veronika (2013): Adverbální předmětový genitiv a jeho protějšky v nominálních konstrukcích: Případ posesiva. In: Zborník Filozofickej fakulty Univerzity Komenského, Philologica LXXII, Slovo a tvar v štruktúre a komunikácii, Copyright © Univerzita Komenského, Bratislava, Slovakia, ISBN 978-80-223-3562-1, pp. 411-421.

Kolářová Veronika (2013): Agents Expressed by Prepositionless Instrumental Modifying Czech Nouns Derived from Intransitive Verbs. In: Proceedings of the Seventh International Conference Slovko 2013; Natural Language Processing, Corpus Linguistics, E-learning, Copyright © RAM-Verlag, Lüdenscheid, Germany, ISBN 978-3-942303-18-7, pp. 129-147.

Kolářová Veronika (2012): Valence dějových substantiv odvozených od sloves s předmětovým genitivem. In: Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český. Copyright © Karolinum, Praha, Czechia. ISBN 978-80-246-2121-0, pp. 609-614.

Panevová Jarmila (2000): Poznámky k valenci podstatných jmen. In: Čeština - univerzália a specifika 2. Sborník konference ve Šlapanicích u Brna, 17.-19.11.1999 (ed. Zdeňka Hladká, Petr Karlík), Copyright © Masarykova Univerzita v Brně, ISBN 80-210-2262-0, pp. 173-180.

Vernerová Anna (2011): Nominal Valency in Lexicons. In: WDS'11 Proceedings of Contributed Papers, Part I, Copyright © Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2, pp. 171-176.