Silvie Cinková
Education
- 2009: PhD at the Faculty of Arts and Philosophy.
Thesis defended on September 16, 2009. Annotation and link see below.
- 2001: Mgr. in German and Swedish Philology, Faculty of Arts and Philosophy
Research interests
- Germanic languages (German, Swedish, English, Icelandic), tectogrammatical description of English, valency, parallel corpora, collocation analysis, grammaticalization, lexicography
Tectogrammatical Representation of English
The first "serious" version of the annotation manual:
Cinková, Silvie; Hajič, Jan; Mikulová, Marie; Mladová, Lucie; Nedolužko, Anja; Pajas, Petr; Panevová, Jarmila; Semecký, Jiří; Šindlerová, Jana; Toman, Josef; Urešová, Zdeňka; Žabokrtský, Zdeněk (2006): Annotation of English on the tectogrammatical level. UFAL CKL Technical Report TR-2006-35. UFAL MFF UK. Supported by: LC536 (Integrované centrum počítačového zpracování přirozeného jazyka, 2005-2009, Hajič); 1ET201120505 (Od jazyka ke znalostem a sémantickému webu, 2005-2009, Hajič); MSM 0021620838 (Výzkumný záměr informační sekce MFF UK 2005 -2010); GA405/06/0589 (TR pro mluveny jazyk a strojovy preklad, 2006-2008, Hajic/Cinkova); GAUK 489/2004. 190 pages.
zipped pdf, approx. 5 MB
pdf,
approx. 10 MB. A browser-friendly (html) version can be viewed
here.
Speech reconstruction in English dialogs
Speech reconstruction is a pre-processing step for further analysis of spontaneous speech we do within the
Companions project. The annotation manual can be downloaded
here.
Swedish-Czech Parallel Corpus
Together with a group of students of Swedish at the Faculty of Arts I am building a manually aligned Swedish-Czech parallel corpus. The work is being financed by the joint project of the Czech Ministry of Education Nr. 0021620823/2005-2011(
The Czech National Corpus and Corpora of Other Languages). The corpus comprises approx. 2 million tokens at the moment. Translators, please feel free to donate your texts in electronic form. We will take care of all legal issues and we will give you access to the working version of the corpus.
PhD Thesis
My PhD thesis explores the secondary uses of Swedish basic verbs (e.g.
komma, ställa, ligga, ge, få) and attempts at creating a scheme for their comprehensive lexical description. Special attention is paid to light verb constructions. You can download the entire thesis
here (pdf, 2.5 MB) and the resulting data
here (zipped XML, CSS, DTD - 25 kb).
Publications
(see our publication database to get BibTex entries of publications released
until 2005 and
since 2006 )
Hajič Jan, Cinková Silvie, Čermáková Kristýna, Mladová Lucie, Nedolužko Anja, Pajas Petr, Semecký Jiří, Šindlerová Jana, Toman Josef, Tomšů Kristýna, Korvas Matěj, Rysová Magdaléna, Veselovská Kateřina, Žabokrtský Zdeněk: Prague English Dependency Treebank 1.0 , Software or data, Institute of Formal and Applied Linguistics, Charles University in Prague, Malostranské nám. 25, 118 00 Praha 1, ISBN 978-80-904175-0-2 , Jan 2009 link
Cinková Silvie, Toman Josef, Hajič Jan, Čermáková Kristýna, Klimeš Václav, Mladová Lucie, Šindlerová Jana, Tomšů Kristýna, Žabokrtský Zdeněk: Tectogrammatical Annotation of the Wall Street Journal, accepted for publication in Prague Bulletin of Mathematical Linguistics , No. 92, Univerzita Karlova, Prague, Czech Republic, ISSN 0032-6585, 2009
Hajič Jan, Cinková Silvie, Mičková Petra, Pajas Petr, Peterek Nino, Spousta Miroslav: Prague Dependency Treebank of Spoken Language - English ,
Software or data, Institute of Formal and Applied Linguistics, Charles University in Prague, Malostranské nám. 25, 118 00 Praha 1, Jan 2009 link
Cinková Silvie: Semantic Representation of Non-Sentential Utterances in Dialog,in
Proceedings of SRSL 2009, the 2nd Workshop on Semantic Representation of Spoken Language, Copyright © Association for Computational Linguistics, Athina, Greece, pp. 26-33, 2009
pdf
Bojar Ondřej, Cinková Silvie, Ptáček Jan: Towards English-to-Czech MT via Tectogrammatical Layer, in Prague Bulletin of Mathematical Linguistics, Vol. 90, Univerzita Karlova, ISSN 0032-6585, Dec 2008
Cinková Silvie, Hajič Jan, Ptáček Jan: An Annotation Scheme for Speech Reconstruction on a Dialog Corpus, in Fourth International Workshop on Human-Computer Conversation, Copyright © The Companions consortium, University of Sheffield, Bellagio, Italy, pp. 1-6, 2008
Cinková Silvie: Lemmatisierung der verbalen Reflexivität im entstehenden Großen Deutsch-Tschechischen akademischen Wörterbuch, in Beiträge zur bilingualen Lexikographie, Copyright © Charles University in Prague, Faculty of Philosophy and Arts, Praha, Czechia, ISBN 978-80-7308-217-8, pp. 141-152, 2008
Cinková Silvie, Hajičová Eva, Panevová Jarmila, Sgall Petr: The Tectogrammatics of English: on Some Problematic Issues from the Viewpoint of the Prague Dependency Treebank, in Resourceful Language Technology: Festschrift in Honor of Anna Sågvall Hein, Uppsala University, Humanistisk-samhällsvetenskapliga vetenskapsområdet, Faculty of Languages, Department of Linguistics and Philology, Uppsala, Sweden, ISBN 978-91-554-7226-9, pp. 33-48, 214 pp., 2008
Cinková Silvie, Hajičová Eva, Panevová Jarmila, Sgall Petr: Two Languages - One Annotation Scenario? Experience from the Prague Dependency Treebank,
in Prague Bulletin of Mathematical Linguistics, Vol. 89, Univerzita Karlova, ISSN 0032-6585, pp. 5-22, 2008
Cinková Silvie, Mikulová Marie: Speech reconstruction for the syntactic and semantic analysis of the NAP/AAA corpus, Tech. report no. 2008/TR-2008-37, ÚFAL MFF UK, Prague, Czech Rep., ISSN 1214-5521, 60 pp., Nov 2008
Hajič Jan, Cinková Silvie, Mikulová Marie, Pajas Petr, Ptáček Jan, Toman Josef, Urešová Zdeňka: PDTSL: An Annotated Resource For Speech Reconstruction,
in Proceedings of the 2008 IEEE Workshop on Spoken Language Technology, Copyright © IEEE, Goa, India, ISBN 978-1-4244-3472-5, 2008
Bojar Ondřej, Cinková Silvie, Ptáček Jan: Towards English-to-Czech MT via Tectogrammatical Layer, in NEALT Proceedings Series, Vol. 1, Proceedings of the 6th International Workshop on Treebanks and Linguistic Theories (TLT 2007), Copyright © North European Association for Language Technology, Bergen, Norway, ISSN 1736-6305, pp. 7-18, 2007
Cinková Silvie: “Movement towards Structure”: Foreign Learners, Language Patterns and Learners' Lexicons,in Rapport fra konference om leksikografi i Norden, Copyright © Nordisk Forening for Leksikografi, Akureyri, Iceland, 2007
Šindlerová Jana, Mladová Lucie, Toman Josef, Cinková Silvie: An Application of the PDT-scheme to a Parallel Treebank,in NEALT Proceedings Series, Vol. 1, Proceedings of the 6th International Workshop on Treebanks and Linguistic Theories (TLT 2007), Copyright © North European Association for Language Technology, Bergen, Norway, ISSN 1736-6305, pp. 163-174, 2007
Šindlerová Jana, Toman Josef, Cinková Silvie, Semecký Jiří: EngVallex 1.0,Software or data, Institute of Formal and Applied Linguistics MFF UK, 2007
Cinková, Silvie; Hajič, Jan; Mikulová, Marie; Mladová, Lucie; Nedolužko, Anja; Pajas, Petr; Panevová , Jarmila; Semecký, Jiří; Šindlerová, Jana; Toman, Josef; Urešová, Zdeňka; Žabokrtský, Zdeněk (2006): Annotation of English on the tectogrammatical level. UFAL Technical Report nr. 35. UFAL MFF UK.
Cinková, Silvie (2006): From PropBank to EngVALLEX: Adapting PropBank-Lexicon to the Valency Theory of Functional Generative Description. Proceedings of the fifth International conference on Language Resources and Evaluation (LREC 2006), Genova, Italy, May 2006.
Cinková, Silvie; Pecina, Pavel; Podveský, Petr; Schlesinger, Pavel (2006): Semi-automatic Building of Swedish Collocation Lexicon. Proceedings of the fifth International conference on Language Resources and Evaluation (LREC 2006), Genova, Italy, May 2006.
Cinková, Silvie; Pomikálek, Jan (2006): LEMPAS: A Make-Do Lemmatizer for the Swedish PAROLE-Corpus. Prague Bulletin of Mathematical Linguistics 86, pp. 47-54.
Semecký, Jiří; Cinková, Silvie (2006): Constructing an English Valency Lexicon In: Proceedings of Frontiers in Linguistically Annotated Corpora. The Association for Computational Linguistics. Sydney, Australia. pp. 111-113.
Cinková, Silvie; Kolářová, Veronika (2005): Nouns as Components of Support Verb Constructions in the Prague Dependency Treebank. In Insight into Slovak and Czech Corpus Linguistics (ed. Mária Šimková). Supported by LN00A063 and GA-UK 489/2004.
Cinková, Silvie; Žabokrtský, Zdeněk (2005): Swedish-Czech Combinatorial Valency Lexicon of Predicate Nouns: Describing Event Structure in Support Verb Constructions. In Proceedings of the 8th International Conference on Computational Lexicography COMPLEX, pp. 50-59 (eds. Ferenc Kiefer, Gábor Kiss, Júlia Pajzs), Budapest, Hungary, June 17-18. Supported by GA405/04/0243 and GA-UK 489/2004.
Cinková, Silvie; Žabokrtský, Zdeněk (2005): Treating support verb constructions in a lexicon: Swedish-Czech combinatorial valency lexicon of predicate nouns In Proceedings of Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes, pp. 22-27 (eds. Katrin Erk, Alissa Melinger, Sabine Schulte im Walde), Saarbrücken, Germany, Feb. 28 - March 1. Supported by GA405/04/0243 and GA-UK 489/2004.
Cinková, Silvie: Manuál pro tektogramatickou anotaci angličtiny. In: ÚFAL/CKL, ÚFAL/CKL MFF UK, Praha. 2004. pp. 2-172.
Cinková, Silvie (2004): Extraction of Swedish Verb-Noun Collocations from a Large Msd-Annotated Corpus. In: The Prague Bulletin of Mathematical Linguistics 82. 2004. pp. 99-102.
Cinková, Silvie (2004): Review - Mats Wahlberg (ed.): Svenskt ortnamnslexikon (Švédský slovník místních jmen) In: Acta Onomastica, Praha.
Cinková, Silvie (2004): Review - Ruslan Mitkov (ed.) The Oxford Handbook of Computational Linguistics In: The Prague Bulletin of Mathematical Linguistics 82, Praha. 2004. pp. 87-94.
Cinková, Silvie (2003): Belegsuche bei der lexikographischen Bearbeitung von selten gebrauchtem Wortschatz In: Das Wort. Germanistisches Jahrbuch 2003, eds. Vollstedt, Marina, Moskva. 2003. pp. 353-365.
Cinková, Silvie (2001): Sýnihefti sagnorđabókar: andere Betrachtungsweise der lexikographischen Bearbeitung der Verben. Acta Universitatis Carolinae - Philologica 2, XVII. 2001. pp. 133-139.
Talks, Presentations
Cinková Silvie: Semantic annotation of a dialog corpus, Contributed talk, American Association For Corpus Linguistics Conference, Brigham Young University, Brigham, UT, USA, Mar 2008
Cinková Silvie: Speech reconstruction in PDT (on NAP-AAA corpora), English specifics in TR structure, Coreference, Non-sentential utterances in dialog ,
Contributed talk, Edinburgh Semantic Workshop (Companions Quarterly Meeting), Napier University, Edinburgh, UK, Mar 2008