Silvie Cinková

office: 423
office hours: upon agreement
email: cinkova@ufal.mff.cuni.cz
phone: +420 951 554 224
fax: +420 257 223 293
address: Malostranské náměstí 25, 118 00 Praha 1, Czech Republic

Main Research Interests

  • lexical semantics
  • knowledge representation
  • corpus linguistics
  • linguistic annotation
  • computational lexicography
  • Germanic languages (English, German, Swedish, and Icelandic)
  • analysis of multimodal documents

Projects

Manual annotations

As my first project at UFAL, I coordinated the manual deep-syntax ("tectogrammatical") annotation of the Prague English Dependency Treebank and, later, of the Prague DaTabase of Spoken English.

Recently, I coordinated and carried out the manual annotation of a sample of English verbs according to Corpus Pattern Analysis, to explore how high an interannotator agreement we could achieve with this approach. For more detail and further experiments with lexical semantics, see our Semantic Pattern Recognition project page or browse our sample directly.

In the CEMI project, I carried out pilot annotations and wrote the annotation instructions for the Image Text Understanding task.

Until 2015, I was in charge of the Czech-Swedish parallel corpus in the InterCorp project.

Rule-based automatic annotations

As part of my dissertation, I created a rule-based Swedish lemmatizer (not maintained since 2009) and word-sketch definitions to find verbs and their relevant noun collocates, including their modifiers and several other structures. These rules were later adopted in the Sketch Engine.

Currently I am working on a set of labels, which I call grammar structure tags, for English WSJ-style tagged and dependency-parsed trees. These labels draw on the part-of-speech tags as well as the dependency structures and provide a morphosyntactic analysis of complex forms of lexical verbs. They also partly abstract from the part-of-speech tags, grouping together words in particular clause positions that are likely to have the same syntactic function despite their different part-of-speech tags. In this respect, the grammar structure tags, which are inserted at the analytical (surface-syntax) layer, are quite similar to the grammatemes at the tectogrammatical (deep) annotation layer. For more detail, see the GRASS project page.
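To give a rough idea of how a label can be derived from part-of-speech tags and dependency structure together, here is a minimal Python sketch. It is not the GRASS implementation. The `Token` class, the function name `verb_group_label`, the label names (`modal`, `perfect`, `progressive`, `passive`, `simple`) and the assumption that auxiliaries attach directly to the lexical verb are all made up for illustration.

```python
from dataclasses import dataclass

# Surface forms of the two auxiliaries the sketch looks at (illustrative lists).
BE_FORMS = {"be", "am", "is", "are", "was", "were", "been", "being"}
HAVE_FORMS = {"have", "has", "had", "having"}


@dataclass
class Token:
    form: str
    tag: str   # Penn Treebank (WSJ-style) part-of-speech tag
    head: int  # 1-based position of the governing token, 0 for the root


def verb_group_label(sentence, verb_pos):
    """Return a hypothetical label for the lexical verb at 1-based verb_pos."""
    verb = sentence[verb_pos - 1]
    # Assumption: auxiliaries and modals are direct dependents of the lexical verb.
    # Limitation: a dependent clause headed by lexical "be"/"have" would be
    # mistaken for an auxiliary; a real rule set would need to rule that out.
    aux = [t for t in sentence
           if t.head == verb_pos and (t.tag == "MD" or t.tag.startswith("VB"))]
    forms = [t.form.lower() for t in aux]
    has_be = any(f in BE_FORMS for f in forms)
    has_have = any(f in HAVE_FORMS for f in forms)

    features = []
    if any(t.tag == "MD" for t in aux):
        features.append("modal")
    if has_have:
        features.append("perfect")
    if (verb.tag == "VBG" and has_be) or "being" in forms:
        features.append("progressive")
    if verb.tag == "VBN" and has_be:
        features.append("passive")
    return "+".join(features) if features else "simple"


# "The report has been written"
sentence = [
    Token("The", "DT", 2),
    Token("report", "NN", 5),
    Token("has", "VBZ", 5),
    Token("been", "VBN", 5),
    Token("written", "VBN", 0),
]
print(verb_group_label(sentence, 5))  # perfect+passive
```

The point of the sketch is only that neither source of information suffices alone: the tag `VBN` on "written" does not distinguish perfect from passive until the auxiliaries found through the dependency structure are taken into account.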

More linguistic information for distributional lexical analysis of English and Czech

A project starting in 2015 -- see http://ufal.mff.cuni.cz/silvie-cinkova/zellig-harris

Quantitative linguistics and R programming for linguists and students of humanities

see http://ufal.mff.cuni.cz/courses/npfl111-npfl112-statisticke_vyhodnocovani_jazykovych_dat_v_R

I fell for R in 2014. Because my purely scholarly background made me learn all this the hard way, I am a very empathetic teacher. :-)

Curriculum Vitae

Structured CV in Czech

Teaching

NPFL111-112 Statistické vyhodnocování jazykových dat v R (Statistical Evaluation of Linguistic Data in R)

Selected Bibliography

In fact, this bibliography is not selected -- it is a pregenerated overview from Biblio. 
  1. Daniel Zeman, Martin Popel, Milan Straka, Jan Hajič, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, Elena Badmaeva, Memduh Gökırmak, Anna Nedoluzhko, Silvie Cinková, Jan Hajič, jr., Jaroslava Hlaváčová, Václava Kettnerová, Zdeňka Urešová, Jenna Kanerva, Stina Ojala, Anna Missilä, Christopher Manning, Sebastian Schuster, Siva Reddy, Dima Taji, Nizar Habash, Herman Leung, Marie-Catherine de Marneffe, Manuela Sanguinetti, Maria Simi, Hiroshi Kanayama, Valeria de Paiva, Kira Droganova, Héctor Martínez Alonso, Çağrı Çöltekin, Umut Sulubacak, Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Georg Rehm, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Jesse Kirchner, Hector Fernandez Alcalde, Jana Strnadová, Esha Banerjee, Ruli Manurung, Antonio Stella, Atsuko Shimada, Sookyoung Kwak, Gustavo Mendonça, Tatiana Lando, Rattima Nitisaroj, Josie Li (2017): CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 1-19, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-70-8 (pdf, biblio, bibtex)
  2. Vít Baisa, Silvie Cinková, Ema Krejčová, Anna Vernerová (2016): VPS-GradeUp: Graded Decisions on Usage Patterns. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 823-827, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, biblio, batt1.pdf, obd, bibtex)
  3. Silvie Cinková (2016): WordSim353 for Czech. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 190-197, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  4. Silvie Cinková, Ema Krejčová, Anna Vernerová, Vít Baisa (2016): Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 848-854, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (pdf, biblio, batt1.pdf, obd, bibtex)
  5. Silvie Cinková, Ema Krejčová, Anna Vernerová, Vít Baisa (2016): What Do Graded Decisions Tell Us about Verb Uses. In: Proceedings of the XVII EURALEX International Congress: Lexicography and Linguistic Diversity, pp. 318-328, Tbilisi University Press, Tbilisi, Georgia, ISBN 978-9941-13-542-2 (pdf, biblio, batt1.pdf, obd, bibtex)
  6. Anna Nedoluzhko, Michal Novák, Silvie Cinková, Marie Mikulová, Jiří Mírovský (2016): Coreference in Prague Czech-English Dependency Treebank. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 169-176, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, biblio, batt1.pdf, obd, bibtex)
  7. Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Angelina Ivanova, Zdeňka Urešová (2016): Towards Comparability of Linguistic Graph Banks for Semantic Parsing. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 3991-3995, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, biblio, batt1.pdf, obd, bibtex)
  8. Vít Baisa, Jane Bradbury, Silvie Cinková, Ismail El Maarouf, Adam Kilgarriff, Octavian Popescu (2015): SemEval-2015 Task 15: A CPA dictionary-entry-building task. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 315-324, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-40-2 (url, biblio, batt1.pdf, obd, bibtex)
  9. Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Zdeňka Urešová (2015): SemEval 2015 Task 18: Broad-Coverage Semantic Dependency Parsing. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 915-926, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-40-2 (url, biblio, batt1.pdf, obd, bibtex)
  10. Silvie Cinková, Martin Holub, Ema Krejčová, Lenka Smejkalová (2013): Rule-Based Extraction of English Verb Collocates from a Dependency-Parsed Corpus. In: Proceedings of the Second International Conference on Dependency Linguistics, Depling 2013, pp. 60-67, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-240-5 (biblio, obd, bibtex)
  11. Ondřej Bojar, Mauro Cettolo, Silvie Cinková, Philipp Koehn, Miroslav Týnovský, Zdeněk Žabokrtský (2012): Scientific Report on Rich Tree-Based SMT (technical report). ÚFAL, Charles University (biblio, bibtex)
  12. Ondřej Bojar, Silvie Cinková, Jan Hajič, Barbora Hladká, Vladislav Kuboň, Jiří Mírovský, Jarmila Panevová, Nino Peterek, Johanka Spoustová, Zdeněk Žabokrtský (2012): The Czech Language in the Digital Age. ISBN 978-3-642-30705-8 (biblio, batt1.pdf, obd, bibtex)
  13. Silvie Cinková, Martin Holub, Vincent Kríž (2012): Optimizing semantic granularity for NLP - report on a lexicographic experiment. In: Proceedings of the 15th EURALEX International Congress, pp. 523-531, Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway, ISBN 978-82-303-2228-4 (biblio, batt1.pdf, obd, bibtex)
  14. Silvie Cinková, Martin Holub, Vincent Kríž (2012): Managing Uncertainty in Semantic Tagging. In: Proceedings of 13th Conference of the European Chapter of the Association for Computational Linguistics, pp. 840-850, Association for Computational Linguistics, Avignon, France, ISBN 978-1-937284-19-0 (pdf, biblio, obd, bibtex)
  15. Silvie Cinková, Martin Holub, Adam Rambousek, Lenka Smejkalová (2012): A database of semantic clusters of verb usages. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3176-3183, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (pdf, biblio, obd, bibtex)
  16. Silvie Cinková, Lenka Smejkalová, Anna Vernerová, Jonáš Thál, Martin Holub (2012): Maintaining consistency of monolingual verb entries with interannotator agreement. In: Nordiska studier i lexikografi - Rapport från Konferensen om lexikografi i Norden, pp. 169-180, Nordiska föreningen for lexikografi, Lund, Sweden, ISBN 978-91-85333-42-4 (biblio, batt1.pdf, obd, bibtex)
  17. Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, Zdeněk Žabokrtský (2012): Announcing Prague Czech-English Dependency Treebank 2.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3153-3160, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, biblio, batt1.pdf, obd, bibtex)
  18. Martin Holub, Vincent Kríž, Silvie Cinková, Eckhard Bick (2012): Tailored Feature Extraction for Lexical Disambiguation of English Verbs Based on Corpus Pattern Analysis. In: Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012), pp. 1195-1209, Coling 2012 Organizing Committee, Mumbai, India (biblio, batt1.pdf, obd, bibtex)
  19. Silvie Cinková (2010): Aim and result – A Swedish-Czech comparison of consecutive clauses. In: InterCorp: Exploring a Multilingual Corpus, pp. 70-82, Nakladatelství Lidové noviny, Praha, Czechia, ISBN 978-80-7422-042-5 (biblio, batt1.doc, obd)
  20. Silvie Cinková, Martin Holub, Pavel Rychlý, Lenka Smejkalová, Jana Šindlerová (2010): Can Corpus Pattern Analysis Be Used in NLP? In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 67-74, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (biblio, batt1.pdf, obd, bibtex)
  21. Silvie Cinková, Martin Holub, Lenka Smejkalová (2010): The Lexical Population of Semantic Types in Hanks’s PDEV. In: A Way with Words: Recent Advances in Lexical Theory and Analysis. A Festschrift for Patrick Hanks, pp. 199-214, Menha, Kampala, Uganda, ISBN 978-9970-101-01-6 (biblio, batt1.pdf, obd)
  22. Jan Ptáček, Pavel Ircing, Miroslav Spousta, Jan Romportl, Zdeněk Loose, Silvie Cinková, José Relaño Gil, Raúl Santos (2010): Integration of Speech and Text Processing Modules into a Real-Time Dialogue System. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 552-559, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (url, biblio, batt1.pdf, obd, bibtex)
  23. Silvie Cinková (2009): Words that Matter: Towards a Swedish-Czech Colligational Dictionary of Basic Verbs. UFAL, Malostranské nám. 25, 118 00 Praha 1, ISBN 978-80-904175-3-3 (pdf, biblio, batt1.pdf, batt2.pdf, bibtex)
  24. Silvie Cinková (2009): A Contrastive Lexical description of Basic Verbs. Examples from Swedish and Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 21-62 (pdf, biblio, obd, bibtex)
  25. Silvie Cinková (2009): Semantic Representation of Non-Sentential Utterances in Dialog. In: Proceedings of SRSL 2009, the 2nd Workshop on Semantic Representation of Spoken Language, pp. 26-33, Association for Computational Linguistics, Athina, Greece (url, biblio, bibtex)
  26. Silvie Cinková, Josef Toman, Jan Hajič, Kristýna Čermáková, Václav Klimeš, Lucie Mladová, Jana Šindlerová, Kristýna Tomšů, Zdeněk Žabokrtský (2009): Tectogrammatical Annotation of the Wall Street Journal. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 85-104 (biblio, batt1.pdf, obd, bibtex)
  27. Ondřej Bojar, Silvie Cinková, Jan Ptáček (2008): Towards English-to-Czech MT via Tectogrammatical Layer. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 90, pp. 57-68 (biblio, batt1.pdf, obd, bibtex)
  28. Silvie Cinková (2008): Lemmatisierung der verbalen Reflexivität im entstehenden Großen Deutsch-Tschechischen akademischen Wörterbuch. In: Beiträge zur bilingualen lexikographie, pp. 141-152, Univerzita Karlova v Praze, Praha, Czechia, ISBN 978-80-7308-217-8 (biblio, obd, bibtex)
  29. Silvie Cinková, Jan Hajič, Jan Ptáček (2008): An Annotation Scheme for Speech Reconstruction on a Dialog Corpus. In: Fourth International Workshop on Human-Computer Conversation, The Companions consortium, Bellagio, Italy (pdf, biblio, batt1.pdf, obd, bibtex)
  30. Silvie Cinková, Eva Hajičová, Jarmila Panevová, Petr Sgall (2008): Two Languages - One Annotation Scenario? Experience from the Prague Dependency Treebank. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 89, pp. 5-22 (biblio, batt1.pdf, obd, bibtex)
  31. Silvie Cinková, Eva Hajičová, Jarmila Panevová, Petr Sgall (2008): The Tectogrammatics of English: on Some Problematic Issues from the Viewpoint of the Prague Dependency Treebank. In: Resourceful Language Technology: Festschrift in Honor of Anna Sågvall Hein, pp. 33-48, Uppsala University, Humanistisk-samhällsvetenskapliga vetenskapsområdet, Faculty of Languages, Department of Linguistics and Philology, Uppsala, Sweden, ISBN 978-91-554-7226-9 (url, biblio, batt1.pdf, batt2.pdf, obd)
  32. Silvie Cinková, Marie Mikulová (2008): Speech reconstruction for the syntactic and semantic analysis of the NAP/AAA corpus (technical report). ÚFAL MFF UK, Prague, Czech Rep. (biblio, batt1.pdf, batt2.pdf, bibtex)
  33. Jan Hajič, Silvie Cinková, Marie Mikulová, Petr Pajas, Jan Ptáček, Josef Toman, Zdeňka Urešová (2008): PDTSL: An Annotated Resource For Speech Reconstruction. In: Proceedings of the 2008 IEEE Workshop on Spoken Language Technology, pp. 93-96, IEEE, Goa, India, ISBN 978-1-4244-3472-5 (biblio, obd, bibtex)
  34. Ondřej Bojar, Silvie Cinková, Jan Ptáček (2007): Towards English-to-Czech MT via Tectogrammatical Layer. In: Proceedings of the 6th International Workshop on Treebanks and Linguistic Theories (TLT 2007), NEALT Proceedings Series, ISSN 1736-6305, 1, pp. 7-18, North European Association for Language Technology, Bergen, Norway (url, biblio, batt1.pdf, obd, bibtex)
  35. Silvie Cinková (2007): “Movement towards Structure”: Foreign Learners, Language Patterns and Learners' Lexicons. In: Rapport fra konference om leksikografi i Norden, LexicoNordica, ISSN 0805-2735, 9, Nordisk Forening for Leksikografi, Akureyri, Iceland (biblio, batt1.pdf, obd, bibtex)
  36. Jana Šindlerová, Lucie Mladová, Josef Toman, Silvie Cinková (2007): An Application of the PDT-scheme to a Parallel Treebank. In: Proceedings of the 6th International Workshop on Treebanks and Linguistic Theories (TLT 2007), NEALT Proceedings Series, ISSN 1736-6305, 1, pp. 163-174, North European Association for Language Technology, Bergen, Norway (biblio, obd, bibtex)
  37. Silvie Cinková (2006): From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 2170-2175, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (biblio, batt1.pdf, obd, bibtex)
  38. Silvie Cinková, Jan Hajič, Marie Mikulová, Lucie Mladová, Anja Nedolužko, Petr Pajas, Jarmila Panevová, Jiří Semecký, Jana Šindlerová, Josef Toman, Zdeňka Urešová, Zdeněk Žabokrtský (2006): Annotation of English on the tectogrammatical level (technical report). (biblio, batt1.pdf, bibtex)
  39. Silvie Cinková, Petr Podveský, Pavel Pecina, Pavel Schlesinger (2006): Semi-automatic Building of Swedish Collocation Lexicon. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 1890-1893, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (biblio, obd, bibtex)
  40. Silvie Cinková, Jan Pomikálek (2006): LEMPAS: A Make-Do Lemmatizer for the Swedish PAROLE-Corpus. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 86, pp. 47-54 (biblio, batt1.pdf, obd, bibtex)
  41. Jiří Semecký, Silvie Cinková (2006): Constructing an English Valency Lexicon. In: Proceedings of Frontiers in Linguistically Annotated Corpora, pp. 94-97, The Association for Computational Linguistics, Sydney, Australia, ISBN 1-932432-78-7 (pdf, biblio, obd, bibtex)
  42. Silvie Cinková, Zdeněk Žabokrtský (2005): Swedish-Czech Combinatorial Valency Lexicon of Predicate Nouns: Describing Event Structure in Support Verb Constructions. In: Proceedings of the 8th International Conference on Computational Lexicography COMPLEX, pp. 50-59, Nyelvtudományi Intézet, Magyar Tudományos Akadémia, Budapest, Hungary, ISBN 963-9074-35-7 (biblio, batt1, batt2.pdf, bibtex)
  43. Silvie Cinková, Zdeněk Žabokrtský (2005): Treating support verb constructions in a lexicon: Swedish-Czech combinatorial valency lexicon of predicate nouns. In: Proceedings of Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes, pp. 22-27, Universität des Saarlandes, Saarbrücken, Germany (biblio, bibtex)
  44. Silvie Cinková (2004): Mats Wahlberg (ed.): Svenskt ortnamnslexikon (Swedish Dictionary of Place Names) (review). In: Acta Onomastica, ISSN 1211-4413, XLV, pp. 105-106 (biblio, bibtex)
  45. Silvie Cinková (2004): Extraction of Swedish Verb-Noun Collocations from a Large Msd-Annotated Corpus. (biblio, bibtex)
  46. Silvie Cinková (2004): Manuál pro tektogramatickou anotaci angličtiny [Manual for the Tectogrammatical Annotation of English] (technical report). ÚFAL/CKL MFF UK (biblio, bibtex)
  47. Silvie Cinková (2004): Ruslan Mitkov (ed.): The Oxford Handbook of Computational Linguistics (review). (biblio, bibtex)
  48. Silvie Cinková, Veronika Kolářová (2004): Nouns as Components of Support Verb Constructions in the Prague Dependency Treebank. In: Korpusy a korpusová lingvistika v zahraničí a na Slovensku (in press) (biblio, bibtex)
  49. Silvie Cinková (2003): Belegsuche bei der lexikographischen Bearbeitung von selten gebrauchtem Wortschatz. In: Das Wort. Germanistisches Jahrbuch 2003, pp. 353-365, Deutscher akademischer Austauschdienst, Moskva (pdf, biblio, batt1.pdf, bibtex)
  50. Silvie Cinková (2001): Sýnihefti sagnorðabókar: andere Betrachtungsweise der lexikographischen Bearbeitung der Verben. In: Germanistica Pragensia, ISSN 0567-8269, XVII, pp. 133-139, (in press) Lancaster, pp. 37-48 (biblio, bibtex)