Biblio Static

Editable Biblio is accessible from the internal network (or through VPN): http://10.10.24.239:8080/biblio/?section=publications

Christopher Brückner, Pavel Pecina (2025): Towards Semantic Tagging of Segmented Holocaust Narratives. In: Compendium of Papers of the Prague Visual History and Digital Humanities Conference 2025, pp. 177-192, MatfyzPress, Praha, Czechia, ISBN 978-80-7378-523-9 (local PDF, local PDF, bibtex)
Anna Dvořáková, Jan Hajič, jr. (2025): Visualizing Gregorian Traditions: ChantMapper. In: Music Encoding Conference 2025, pp. 143-151, Knowledge Commons (url, bibtex)
Katharina Hämmerl, Tomasz Limisiewicz, Jindřich Libovický, Alexander Fraser (2025): Beyond Literal Token Overlap: Token Alignability for Multilinguality. In: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pp. 756-767, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-190-2 (url, bibtex)
Ondřej Hrách, Martin Richter, Rudolf Rosa, Tereza Hannemann (2025): Aignos učí školáky tvořit hry s pomocí umělé inteligence. In: Matfyz.cz, pp. 1-5 (url, bibtex)
Johannes Kiesel, Çağrı Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand De Longueville, Tomaž Erjavec, Nicolas Handke, Matyáš Kopp, Nikola Ljubešić, Katja Meden, Nailia Mirzhakhmedova, Vaidas Morkevičius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast, Benno Stein (2025): Overview of Touché 2025: Argumentation Systems. In: Advances in Information Retrieval, 47th European Conference on Information Retrieval, ECIR 2025, Part V, Lecture Notes in Computer Science, ISSN 0302-9743, 15576, pp. 459-466, Springer Nature Switzerland, Cham, Switzerland, ISBN 978-3-031-88719-2 (url, bibtex)
Markéta Lopatková, Eva Fučíková, Federica Gamba, Jan Hajič, Hana Hledíková, Marie Mikulová, Michal Novák, Jan Štěpánek, Daniel Zeman, Šárka Zikánová (2025): UMR 2.0 - Czech: Release Notes (technical report). (pdf, bibtex)
Marie Mikulová, Barbora Štěpánková, Jan Štěpánek (2025): From Form to Meaning: The Case of Particles within the Prague Dependency Treebank Annotation Scheme. In: The 31st International Conference on Computational Linguistics, Proceedings of the Main Conference, pp. 2163-2175, ICCL, Sheffield, UK (pdf, bibtex)
Jiří Novák, Rudolf Rosa (2025): Důležité je klást AI správné otázky. In: Forum, ISSN 1211-1724, vol. 1/2025, no. 69, pp. 12-15 (pdf, local PDF, bibtex)
Rudolf Rosa, David Mareček, Tomáš Musil, Michal Chudoba, Jakub Landsperský (2025): EduPo: Progress and Challenges of Automated Analysis and Generation of Czech Poetry. In: Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities, pp. 524-542, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-234-3 (url, bibtex)
Barbora Štěpánková, Michal Škrabal (2025): Czech Lexicography. In: Reference Module in Social Sciences, pp. 1-6, Elsevier, Amsterdam, ISBN 9780443157851 (url, bibtex)
Barbora Štěpánková, jr., Rudolf Rosa (2025): Song Lyrics Adaptations: Computational Interpretation of the Pentathlon Principle. In: Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities, pp. 117-128, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-234-3 (url, bibtex)
Dima Taji, Daniel Zeman (2025): Towards Generating Automatic Anaphora Annotations (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422 (url)
Zdeňka Urešová, Eva Fučíková, Cristina Fernández Alcaina, Jan Hajič (2025): Linking an Event-type Ontology to Morphosyntax of the Predicate-Argument Structure. In: Dictionaries: Journal of the Dictionary Society of North America, ISSN 0197-6745, vol. 46, no. 1, pp. 207-227 (url, local PDF, local PDF, bibtex)
Ondřej Vojtíšek, Karel Piorecký, Rudolf Rosa (2025): 08. O generování poezie AI – o projektu EduPo (Electronic). (url)
Ibrahim Sa'id Ahmad, Antonios Anastasopoulos, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, William Chen, Qianqian Dong, Marcello Federico, Barry Haddow, Dávid Javorský, Mateusz Krubiński, Tsz Kin Lam, Xutai Ma, Prashant Mathur, Evgeny Matusov, Chandresh Kumar Maurya, John McCrae, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, Atul Kr. Ojha, John Ortega, Sara Papi, Peter Polák, Pavel Pecina, Adam Pospíšil, Elizabeth Salesky, Nivedita Sethiya, Anoop Sarkar, Jiatong Shi, Claytone Sikasote, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Brian Thompson, Alex Waibel, Shinji Watanabe, Patrick Wilken, Petr Zemánek, Rodolfo Zevallos (2024): Findings of the IWSLT 2024 Evaluation Campaign. In: Proceedings of the 21st International Conference on Spoken Language Translation, pp. 1-11, Association for Computational Linguistics, Stroudsburg, USA, ISBN 979-8-89176-141-4 (url, bibtex)
Adnan Al Ali, Jindřich Libovický (2024): How Gender Interacts with Political Values: A Case Study on Czech BERT Models. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 3038-3045, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (local PDF, bibtex)
Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, Tobias Fink, Petra Galuščáková, Gabriela Gonzalez-Saez, Lorraine Goeuriot, David Iommi, Maria Liakata, Harish Tayyar Madabushi, Pablo Medina-Alias, Philippe Mulhem, Florina Piroi, Martin Popel, Arkaitz Zubiaga (2024): Extended overview of the CLEF 2024 LongEval Lab on Longitudinal Evaluation of Model Performance. In: Working Notes of CLEF 2024 - Conference and Labs of the Evaluation Forum, pp. 2267-2289, CEUR-WS, Aachen, Germany (pdf, local PDF, bibtex)
Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, Tobias Fink, Petra Galuščáková, Gabriela Gonzalez-Saez, Lorraine Goeuriot, David Iommi, Maria Liakata, Harish Tayyar Madabushi, Pablo Medina-Alias, Philippe Mulhem, Florina Piroi, Martin Popel, Arkaitz Zubiaga (2024): Overview of the CLEF 2024 LongEval Lab on Longitudinal Evaluation of Model Performance. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the Fifteenth International Conference of the CLEF Association (CLEF 2024), Lecture Notes in Computer Science, ISSN 0302-9743, 14959, pp. 208-230, Springer, Berlin, Germany, ISBN 978-3-031-71907-3 (url, bibtex)
Rabab Alkhalifa, Hsuvas Borkakoty, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, Tobias Fink, Gabriela Gonzalez-Saez, Petra Galuščáková, Lorraine Goeuriot, David Iommi, Maria Liakata, Harish Tayyar Madabushi, Pablo Medina-Alias, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, Arkaitz Zubiaga (2024): LongEval: Longitudinal Evaluation of Model Performance at CLEF 2024. In: Advances in Information Retrieval, 46th European Conference on Information Retrieval, ECIR 2024, Part VI, Lecture Notes in Computer Science, ISSN 0302-9743, 14613, pp. 60-66, Springer Nature Switzerland, Cham, Switzerland, ISBN 978-3-031-56071-2 (url, bibtex)
Mariia Anisimova, Šárka Zikánová (2024): Problematic cases of attitude annotation in diplomatic speeches. In: Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024), pp. 91-95, CEUR-WS.org, Košice, Slovakia (pdf, local PDF, bibtex)
Mariia Anisimova, Šárka Zikánová (2024): Attitudes in Diplomatic Speeches: Introducing the CoDipA UNSC 1.0. In: Proceedings of the Twentieth Workshop on Interoperable Semantic Annotation, pp. 17-26, The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Turin, Italy, ISBN 978-2-493814-32-6 (pdf, local PDF, bibtex)
Nikolay Arefyev, Pinzhen Chen, Ona De Gibert Bonet, Barry Haddow, Jindřich Helcl, Bhavitvya Malik, Gema Ramírez-Sánchez, Pavel Stepachev, Jörg Tiedemann, Dušan Variš, Jaume Zaragoza-Bernabeu (2024): HPLT’s First Release of Data and Models. In: Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), pp. 53-54, European Association for Machine Translation (EAMT), Sheffield, UK, ISBN 978-1-0686907-1-6 (url, bibtex)
Simone Balloccu, Ehud Reiter, Karen Jia-Hui Li, Rafael Sargsyan, Vivek Kumar, Diego Angelo Gaetano Reforgiato Recupero, Daniele Riboni, Ondřej Dušek (2024): Ask the experts: sourcing a high-quality nutrition counseling dataset through Human-AI collaboration. In: Findings of the Association for Computational Linguistics: EMNLP 2024, pp. 11519-11545, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-168-1 (url, bibtex)
Simone Balloccu, Patrícia Schmidtová, Mateusz Lango, Ondřej Dušek (2024): Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs. In: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, pp. 67-93, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-088-2 (url, bibtex)
Verginica Barbu Mititelu, Voula Giouli, Kilian Evang, Daniel Zeman, Petya Osenova, Carole Tiberius, Simon Krek, Stella Markantonatou, Ivelina Stoyanova, Ranka Stanković, Christian Chiarcos (2024): Multiword Expressions between the Corpus and the Lexicon: Universality, Idiosyncrasy, and the Lexicon-Corpus Interface. In: Proceedings of the LREC-COLING 2024 Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), pp. 147-153, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-20-3 (pdf, local PDF, local PDF, bibtex)
Soheila Behrooznia, Ebrahim Ansari, Zdeněk Žabokrtský (2024): Enhancing Turkish Word Segmentation: A Focus on Borrowed Words and Invalid Morpheme. In: Proceedings of the Seventh Workshop on Technologies for Machine Translation of Low-Resource Languages, pp. 85-93, Association for Computational Linguistics, Stroudsburg, USA, ISBN 979-8-89176-149-0 (pdf, bibtex)
Sunit Bhattacharya, Vilém Zouhar, Věra Kloudová, Ondřej Bojar (2024): Stroop Effect in Multi-Modal Sight Translation (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, pp. 1-5 (url, local PDF)
Julia Bonn, Matthew Buchholz, Jayeol Chun, Andrew Cowell, William Croft, Lukas Denk, Jens E L Van Gysel, Jan Hajič, Kenneth Lai, James Martin, Skatje Myers, Alexis Palmer, Martha Palmer, James Pustejovsky, Zdeňka Urešová, Nianwen Xue, Jin Zhao, Bennet Post, Kristine Stenzel, Haibo Sun, Rosa Vallejos, Sijia Ge (2024): Building a Broad Infrastructure for Uniform Meaning Representations. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 2537-2547, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, local PDF, bibtex)
Christopher Brückner, Leixin Zhang, Pavel Pecina (2024): Similarity-Based Cluster Merging for Semantic Change Modeling. In: Proceedings of the 5th Workshop on Computational Approaches to Historical Language Change, pp. 23-28, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-138-4 (url, local PDF, local PDF, bibtex)
Silvie Cinková (2024): Linguistic Factors in the Readability of Czech Administrative and Legal Texts. In: To Understand Is to Be Free. Interdisciplinary Aspects of Comprehensibility and Understanding, pp. 303-325, Praesens Verlag, Vienna, Austria, ISBN 9783706912143 (url, bibtex)
Silvie Cinková, Barbora Hladká, Jiří Mírovský, Sylvie Archaimbault (2024): Data Storytelling Around André Mazon’s Correspondence. In: Colloquia Humanistica, ISSN 2392-2419, 13, pp. 1-18 (url, bibtex)
Silvie Cinková, Petr Plecháč, Martin Popel (2024): Rhymes and Syntax: A Morpho-Syntactic Analysis of Czech Poetry. In: Primerjalna Književnost, ISSN 2591-1805, vol. 47, no. 2, pp. 65-88 (url, bibtex)
Çağrı Çöltekin, Matyáš Kopp, Katja Meden, Vaidas Morkevičius, Nikola Ljubešić, Tomaž Erjavec (2024): Multilingual Power and Ideology Identification in the Parliament: a Reference Dataset and Simple Baselines. In: Proceedings of the LREC 2024 ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora, pp. 94-100, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-24-1 (pdf, local PDF, bibtex)
Marie-Catherine de Marneffe, Joakim Nivre, Daniel Zeman (2024): Function Words in Universal Dependencies. In: Linguistic Analysis, ISSN 0098-9053, vol. 43, no. 3-4, pp. 549-588 (pdf, local PDF, local PDF, bibtex)
Kira Droganova, Daniel Zeman (2024): Towards a Unified Taxonomy of Deep Syntactic Relations. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 16412-16421, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (pdf, local PDF, bibtex)
Dominika Ďurišková, Daniela Jurášová, Matúš Žilinec, Eduard Šubert, Ondřej Bojar (2024): Khan Academy Corpus: A multilingual corpus of Khan Academy lectures. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 9743-9752, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, bibtex)
Vojtěch Dvořák, Jan Hajič, jr., Jiří Mayer (2024): Staff Layout Analysis Using the YOLO Platform. In: Proceedings of the 6th International Workshop on Reading Music Systems, pp. 18-22, University of Alicante, Alicante, Spain (url, bibtex)
Michelle Elizabeth, Ondřej Bojar (2024): Revamping the SLTev Tool for Evaluation of Spoken Language Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 121, pp. 5-14 (pdf, bibtex)
Tomaž Erjavec, Matyáš Kopp, Nikola Ljubešić, Taja Kuzman, Paul Rayson, Petya Osenova, Maciej Ogrodniczuk, Çağrı Çöltekin, Danijel Koržinek, Katja Meden, Jure Skubic, Peter Rupnik, Tommaso Agnoloni, José Aires, Starkaður Barkarson, Roberto Bartolini, Núria Bel, María Calzada Pérez, Roberts Darģis, Sascha Diwersy, Maria Gavriilidou, Ruben van Heusden, Mikel Iruskieta, Neeme Kahusk, Anna Kryvenko, Noémi Ligeti-Nagy, Carmen Magariños, Martin Mölder, Costanza Navarretta, Kiril Simov, Lars Magne Tungland, Jouni Tuominen, John Vidler, Adina Ioana Vladu, Tanja Wissik, Väinö Yrjänäinen, Darja Fišer (2024): ParlaMint II: advancing comparable parliamentary corpora across Europe (Electronic). In: Language Resources and Evaluation, ISSN 1574-020X (url, local PDF)
Eva Fučíková, Cristina Fernández-Alcaina, Jan Hajič, Zdeňka Urešová (2024): Textual Coverage of Eventive Entries in Lexical Semantic Resources. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 15835-15841, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, local PDF, bibtex)
Federica Gamba (2024): Predicate Sense Disambiguation for UMR Annotation of Latin: Challenges and Insights. In: Proceedings of the 1st Workshop on Machine Learning for Ancient Languages, pp. 19-29, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-144-5 (url, bibtex)
Federica Gamba, Marco Passarotti, Paolo Ruffolo (2024): Publishing the Dictionary of Medieval Latin in the Czech Lands as Linked Data in the LiLa Knowledge Base. In: Italian Journal of Computational Linguistics, ISSN 2499-4553, vol. 10, no. 1, pp. 95-116 (url, bibtex)
Federica Gamba, Abishek Stephen, Zdeněk Žabokrtský (2024): Universal Feature-based Morphological Trees. In: Proceedings of the LREC-COLING 2024 Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), pp. 125-137, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-20-3 (pdf, local PDF, bibtex)
Jan Hajič, Eva Fučíková, Markéta Lopatková, Zdeňka Urešová (2024): Mapping Czech Verbal Valency to PropBank Argument Labels. In: Proceedings of the Fifth International Workshop on Designing Meaning Representations (DMR 2024) @ LREC-COLING 2024, pp. 88-100, ELRA Language Resource Association, ISBN 978-2-493814-39-5 (url, local PDF, bibtex)
Eva Hajičová, Jarmila Panevová, Marie Mikulová, Jan Hajič (2024): Function Words in Praguian Functional Generative Description. In: Linguistic Analysis, ISSN 0098-9053, vol. 43, no. 3-4, pp. 465-512 (pdf, bibtex)
Katharina Hämmerl, Jindřich Libovický, Alexander Fraser (2024): Understanding Cross-Lingual Alignment—A Survey. In: Findings of the Association for Computational Linguistics: ACL 2024, pp. 10922-10943, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-099-8 (url, local PDF, local PDF, bibtex)
Katharina Hämmerl, Andrei-Alexandru Manea, Gianluca Vico, Jindřich Helcl, Jindřich Libovický (2024): CUNI and LMU Submission to the MRL 2024 Shared Task on Multi-lingual Multi-task Information Retrieval. In: Proceedings of the Fourth Workshop on Multilingual Representation Learning (MRL 2024), pp. 357-364, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-184-1 (url, bibtex)
Jindřich Helcl, Zdeněk Kasner, Ondřej Dušek, Tomasz Limisiewicz, Dominik Macháček, Tomáš Musil, Jindřich Libovický (2024): Teaching LLMs at Charles University: Assignments and Activities. In: The Sixth Workshop on Teaching NLP: Proceedings of the Workshop, pp. 69-72, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-134-6 (url, local PDF, local PDF, bibtex)
Jaroslava Hlaváčová (2024): Michal Škrabal, Zuzana Laubeová, Barbora Štěpánková (eds.): Korpusové přístupy k české diglosii (review). In: Korpus – gramatika – axiologie, ISSN 1804-137X, 29, pp. 57-60 (bibtex)
Hana Hledíková (2024): Investigating valency-changing prefixes in Czech and German using large syntactically annotated data. In: Proceedings of the Society for Computation in Linguistics 2024, pp. 293-296, University of Massachusetts Amherst Libraries, Amherst, Massachusetts (url, bibtex)
Hana Hledíková (2024): Konverze v češtině a angličtině: sémantické vztahy mezi substantivy a slovesy akce a pohybu. In: Česká slovotvorná koncepce v kontextu slovanské jazykovědy, pp. 237-250, Academia, Praha, Czechia, ISBN 978-80-200-3548-6 (bibtex)
Hana Hledíková, Magda Ševčíková (2024): Conversion in languages with different morphological structures: A semantic comparison of English and Czech. In: Morphology, ISSN 1871-5656, 34, pp. 73-102 (url, bibtex)
Miroslav Hrabal, Josef Jon, Martin Popel, Nam Hoang Luu, Danil Semin, Ondřej Bojar (2024): CUNI at WMT24 General Translation Task: LLMs, (Q)LoRA, CPO and Model Merging. In: Proceedings of the Ninth Conference on Machine Translation, pp. 232-246, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-179-7 (url, bibtex)
Michal Chudoba, Rudolf Rosa (2024): GPT Czech Poet: Generation of Czech Poetic Strophes with Language Models (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, arXiv:2407.12790 cs.CL, pp. 1-9 (url, local PDF)
Maarten Janssen (2024): UDMorph: Morphosyntactically Tagged UD Corpora. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 16933-16940, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (pdf, bibtex)
Maarten Janssen, Matyáš Kopp (2024): ParlaMint in TEITOK. In: Proceedings of the LREC 2024 ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora, pp. 121-126, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-24-1 (pdf, local PDF, bibtex)
Josef Jon, Ondřej Bojar (2024): GAATME: A Genetic Algorithm for Adversarial Translation Metrics Evaluation. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 7562-7569, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, bibtex)
Josef Jon, Ondřej Bojar (2024): An Analysis of Surprisal Uniformity in Machine and Human Translations. In: Proceedings of the 1st Workshop on Creative-text Translation and Technology, pp. 40-56, European Association for Machine Translation, Sheffield, UK, ISBN 9781068690730 (bibtex)
Zdeněk Kasner (2024): Data-to-Text Generation with Neural Language Models (PhD thesis). (url, bibtex)
Zdeněk Kasner, Ondřej Dušek (2024): Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 12045-12072, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-094-3 (url, bibtex)
Zdeněk Kasner, Ondřej Plátek, Patrícia Schmidtová, Simone Balloccu, Ondřej Dušek (2024): factgenie: A Framework for Span-based Evaluation of Generated Texts. In: Proceedings of the 17th International Natural Language Generation Conference: System Demonstrations, pp. 13-15, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-123-0 (url, bibtex)
Johannes Kiesel, Çağrı Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand De Longueville, Tomaž Erjavec, Nicolas Handke, Matyáš Kopp, Nikola Ljubešić, Katja Meden, Nailia Mirzhakhmedova, Vaidas Morkevičius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast, Benno Stein (2024): Overview of Touché 2024: Argumentation Systems. In: Working Notes of CLEF 2024 - Conference and Labs of the Evaluation Forum, pp. 3341-3366, CEUR-WS, Aachen, Germany (pdf, local PDF, bibtex)
Johannes Kiesel, Çağrı Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand De Longueville, Tomaž Erjavec, Nicolas Handke, Matyáš Kopp, Nikola Ljubešić, Katja Meden, Nailia Mirzhakhmedova, Vaidas Morkevičius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast, Benno Stein (2024): Overview of Touché 2024: Argumentation Systems. In: Advances in Information Retrieval, 46th European Conference on Information Retrieval, ECIR 2024, Part V, Lecture Notes in Computer Science, ISSN 0302-9743, 14612, pp. 466-473, Springer Nature Switzerland, Cham, Switzerland, ISBN 978-3-031-56069-9 (url, bibtex)
Johannes Kiesel, Çağrı Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand De Longueville, Tomaž Erjavec, Nicolas Handke, Matyáš Kopp, Nikola Ljubešić, Katja Meden, Nailia Mirzhakhmedova, Vaidas Morkevičius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast, Benno Stein (2024): Overview of Touché 2024: Argumentation Systems. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the Fifteenth International Conference of the CLEF Association (CLEF 2024), Lecture Notes in Computer Science, ISSN 0302-9743, 14959, pp. 308-332, Springer, Berlin, Germany, ISBN 978-3-031-71907-3 (url, local PDF, bibtex)
Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondřej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Marzena Karpinska, Philipp Koehn, Benjamin Marie, Christof Monz, Kenton Murray, Masaaki Nagata, Martin Popel, Maja Popović, Mariya Shmatova, Steinþór Steingrímsson, Vilém Zouhar (2024): Findings of the WMT24 General Machine Translation Shared Task: The LLM Era is Here but MT is Not Solved Yet. In: Proceedings of the Ninth Conference on Machine Translation, pp. 1-46, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-179-7 (pdf, bibtex)
Veronika Kolářová (2024): Active and passive syntax of Czech deverbal and deadjectival nouns. In: Lingua, ISSN 0024-3841, 307, pp. 1-27 (url, local PDF, bibtex)
Veronika Kolářová, Jiří Mírovský (2024): Looking for sense in nonsense: Valency of negative forms of nouns and adjectives in the NomVallex lexicon. In: Lexicography and Semantics. Proceedings of the XXI EURALEX International Congress, pp. 485-496, Institut za hrvatski jezik, Zagreb, Croatia, ISBN 978-953-7967-77-2 (url, local PDF, bibtex)
Veronika Kolářová, Jiří Mírovský (2024): Příbuzná deverbální a deadjektivní abstraktní substantiva ve valenčním slovníku NomVallex. In: Česká slovotvorná koncepce v kontextu slovanské jazykovědy. Monografie věnovaná 110. výročí narození a 20. výročí úmrtí Miloše Dokulila, pp. 263-274, Academia, Praha, Czechia, ISBN 978-80-200-3548-6 (bibtex)
Matěj Kripner (2024): Self-Supervised Summarization via Reinforcement Learning (master's thesis). (bibtex)
Mateusz Krubiński (2024): Multimodal Summarization (PhD thesis). (pdf, bibtex)
Mateusz Krubiński, Pavel Pecina (2024): Towards Unified Uni- and Multi-modal News Headline Generation. In: Findings of the Association for Computational Linguistics: EACL 2024, pp. 437-450, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-093-6 (pdf, bibtex)
Nalin Kumar, Ondřej Dušek (2024): LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systems. In: Findings of the Association for Computational Linguistics: NAACL 2024, pp. 727-735, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-119-3 (url, bibtex)
Marta Lango, Borys Naglik, Mateusz Lango, Iwo Naglik (2024): Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 12821-12828, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, bibtex)
Mateusz Lango, Patrícia Schmidtová, Simone Balloccu, Ondřej Dušek (2024): ReproHum #0043-4: Evaluating Summarization Models: investigating the impact of education and language proficiency on reproducibility. In: Fourth Workshop on Human Evaluation of NLP Systems (HumEval) @ LREC-COLING 2024, pp. 229-237, ELRA, Paris, France, ISBN 978-2-493814-41-8 (url, bibtex)
Vojtěch Lanz, Pavel Pecina (2024): Paragraph Retrieval for Enhanced Question Answering in Clinical Documents. In: Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, pp. 580-590, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-130-8 (url, bibtex)
Jindřich Libovický, Jindřich Helcl (2024): Lexically Grounded Subword Segmentation. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 7403-7420, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-164-3 (url, bibtex)
Tomasz Limisiewicz, David Mareček, Tomáš Musil (2024): Debiasing Algorithm through Model Adaptation. In: Proceedings of the 12th International Conference on Learning Representations, pp. 1-20, International Conference on Learning Representations (ICLR), Appleton, USA, ISBN 9781713898658 (url, bibtex)
Markéta Lopatková, Eva Fučíková, Federica Gamba, Jan Štěpánek, Daniel Zeman, Šárka Zikánová (2024): Towards a Conversion of the Prague Dependency Treebank Data to the Uniform Meaning Representation. In: Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024), pp. 62-76, CEUR-WS.org, Košice, Slovakia (url, local PDF, bibtex)
Xing Han Lu, Zdeněk Kasner, Siva Reddy (2024): WebLINX: Real-World Website Navigation with Multi-Turn Dialogue. In: Proceedings of the 41st International Conference on Machine Learning, pp. 1-50, Proceedings of Machine Learning Research (PMLR), San Diego, USA, ISBN 9798331302238 (url, bibtex)
Dominik Macháček (2024): Multi-Source Simultaneous Speech Translation (PhD thesis). (url, bibtex)
David Mareček, Marie Nováková, Klára Vosecká, Josef Doležal, Tomáš Musil, Rudolf Rosa (2024): Annotation and automated classification of dramatic situations. In: Computational Drama Analysis: Reflecting on Methods and Interpretations, pp. 107-122, De Gruyter, Berlin, Boston, ISBN 9783111071763 (url, bibtex)
Jiří Mayer, Milan Straka, Jan Hajič, jr., Pavel Pecina (2024): Practical End-to-End Optical Music Recognition for Pianoform Music. In: Document Analysis and Recognition -- ICDAR 2024, pp. 55-73, Springer International Publishing, Cham, Switzerland, ISBN 978-3-030-86333-3 (url, local PDF, bibtex)
Marie Mikulová (2024): Fine-grained Classification of Circumstantial Meanings within the Prague Dependency Treebank Annotation Scheme. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 7314-7323, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, bibtex)
Jiří Mírovský, Pavlína Synková, Lucie Poláková, Marie Paclíková (2024): Cost-Effective Discourse Annotation in the Prague Czech–English Dependency Treebank. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 4067-4077, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, local PDF, local PDF, bibtex)
Sourabrata Mukherjee, Atul Kr. Ojha, Akansha Bansal, Deepak Alok, John McCrae, Ondřej Dušek (2024): Multilingual Text Style Transfer: Datasets & Models for Indian Languages. In: Proceedings of the 17th International Natural Language Generation Conference, pp. 494-522, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-122-3 (url, bibtex)
Sourabrata Mukherjee, Atul Kr. Ojha, Ondřej Dušek (2024): Are Large Language Models Actually Good at Text Style Transfer? In: Proceedings of the 17th International Natural Language Generation Conference, pp. 523-539, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-122-3 (url, bibtex)
Tomáš Musil, David Mareček (2024): Exploring Interpretability of Independent Components of Word Embeddings with Automated Word Intruder Test. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 6922-6928, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (pdf, bibtex)
Iwo Naglik, Mateusz Lango (2024): ASTE-Transformer: Modelling Dependencies in Aspect-Sentiment Triplet Extraction. In: Findings of the Association for Computational Linguistics: EMNLP 2024, pp. 2324-2339, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-168-1 (url, bibtex)
Michal Novák, Barbora Dohnalová, Miloslav Konopík, Anna Nedoluzhko, Martin Popel, Ondřej Pražák, Jakub Sido, Milan Straka, Zdeněk Žabokrtský, Daniel Zeman (2024): Findings of the Third Shared Task on Multilingual Coreference Resolution. In: Proceedings of The Seventh Workshop on Computational Models of Reference, Anaphora and Coreference, pp. 78-96, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-171-1 (url, local PDF, bibtex)
Michal Novák, Peter Polák, Kateřina Rysová, Magdaléna Rysová, Ondřej Bojar (2024): Towards Automated Spoken Language Assessment: A Study of ASR Transcription of Examinations for Non-Native Speakers of Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 122, pp. 43-70 (pdf, local PDF, bibtex)
Adam Osuský, Dávid Javorský, Ondřej Bojar (2024): InsBERT: Word importance from artificial insertions. In: Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024), pp. 96-106, CEUR-WS.org, Košice, Slovakia (pdf, bibtex)
Jarmila Panevová (2024): Paradigmatické a syntagmatické vztahy u vybraných slovesných adjektiv. In: Česká slovotvorná koncepce v kontextu slovanské jazykovědy, pp. 147-154, Academia, Praha, Czechia, ISBN 978-80-200-3548-6 (bibtex)
Jarmila Panevová (2024): Vladimír Petkevič – mnohostranná osobnost ve vědě s četnými zájmy v životě (K sedmdesátým narozeninám). In: Korpus – gramatika – axiologie, ISSN 1804-137X, 30, pp. 69-70 (url, bibtex)
Jarmila Panevová, Adrian Barentsen (2024): Таксис в чешском языке. In: Таксис в славянских языках. Типологический анализ, pp. 642-704, Издательский Дом ЯСК, Moskva, Russia, ISBN 9785907498754 (url, bibtex)
Jarmila Panevová, Patrice Pognan (2024): Šmilauer – Tesnière – závislostní syntax. In: Acta Universitatis Carolinae Philologica, ISSN 0567-8269, 1, pp. 43-46 (url, bibtex)
Shantipriya Parida, Ondřej Bojar, Idris Abdulmumin, Shamsuddeen Hassan Muhammad, Ibrahim Sa'id Ahmad (2024): Findings of WMT2024 English-to-Low Resource Multimodal Translation Task. In: Proceedings of the Ninth Conference on Machine Translation, pp. 677-683, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-179-7 (url, bibtex)
Nataliia Petliak, Cristina Fernández-Alcaina, Eva Fučíková, Jan Hajič, Zdeňka Urešová (2024): Search tool for An Event-Type Ontology. In: Proceedings of the Twentieth Workshop on Interoperable Semantic Annotation, pp. 66-70, The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Turin, Italy, ISBN 978-2-493814-32-6 (url, bibtex)
Massimo Poesio, Maciej Ogrodniczuk, Vincent Ng, Sameer Pradhan, Juntao Yu, Nafise Sadat Moosavi, Silviu Paun, Amir Zeldes, Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský, Daniel Zeman (2024): Universal Anaphora: The First Three Years. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 17087-17100, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (pdf, local PDF, bibtex)
Lucie Poláková, Jiří Mírovský, Šárka Zikánová, Eva Hajičová (2024): Developing a Rhetorical Structure Theory Treebank for Czech. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 4802-4810, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, local PDF, bibtex)
Tomáš Polák (2024): Prediction of transformed time series (masters thesis). In: (url, bibtex)
Martin Popel, Lucie Poláková, Michal Novák, Jindřich Helcl, Jindřich Libovický, Pavel Straňák, Tomáš Krabač, Jaroslava Hlaváčová, Mariia Anisimova, Tereza Chlaňová (2024): Charles Translator: A Machine Translation System between Ukrainian and Czech. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 3038-3045, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (pdf, local PDF, bibtex)
Josef Psutka, Jan Hajič, Ondřej Dušek, Jan Černocký (2024): České řečové a jazykové technologie: ledoborec ve světovém moři umělé inteligence. In: Proč se nebát umělé inteligence? AI pohledem nejen českých odborníků, pp. 119-140, JOTA, Brno, Czechia, ISBN 978-80-7689-459-4 (bibtex)
Rudolf Rosa (2024): Jak funguje jazykový model. In: Gymnasion, ISSN 1214-603X, vol. 18/1, no. 34, pp. 68-73 (bibtex)
Philipp Rösch, Norbert Oswald, Michaela Geierhos, Jindřich Libovický (2024): Enhancing Conceptual Understanding in Multimodal Contrastive Learning through Hard Negative Samples. In: The 3rd Workshop on Advances in Language and Vision Research: Proceedings of the Workshop, pp. 102-115, Association for Computational Linguistics (ACL), Kerrville, TX, USA, ISBN 979-8-89176-153-7 (pdf, local PDF, local PDF, bibtex)
Agata Savary, Daniel Zeman, Verginica Barbu Mititelu, Anabela Barreiro, Olesea Caftanatov, Marie-Catherine de Marneffe, Kaja Dobrovoljc, Gülşen Cebiroğlu Eryiğit, Voula Giouli, Bruno Guillaume, Stella Markantonatou, Nurit Melnik, Joakim Nivre, Atul Kr. Ojha, Carlos Ramisch, Abigail Walsh, Beata Wójtowicz, Alina Wróblewska (2024): UniDive: A COST Action on Universality, Diversity and Idiosyncrasy in Language Technology. In: Proceedings of the LREC 2024 Workshop of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2024), pp. 372-382, European Language Resources Association (ELRA), Paris, France, ISBN 978-2-493814-29-6 (url, local PDF, bibtex)
Patrícia Schmidtová (2024): Faithfulness in Natural Language Generation. In: 20th Annual Meeting of the Young Researchers' Roundtable on Spoken Dialogue Systems, pp. 21-24, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-162-9 (url, bibtex)
Patrícia Schmidtová, Saad Mahamood, Simone Balloccu, Ondřej Dušek, Albert Gatt, Dimitra Gkatzia, David M. Howcroft, Ondřej Plátek, Adarsa Sivaprasad (2024): Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices. In: Proceedings of the 17th International Natural Language Generation Conference, pp. 557-583, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-122-3 (url, bibtex)
Tomáš Sourada, Jana Straková, Rudolf Rosa (2024): OOVs in the Spotlight: How to Inflect them?. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 12455-12466, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (pdf, bibtex)
Matthias Sperber, Ondřej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh, Marco Turchi (2024): Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 6484-6495, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, bibtex)
Abishek Stephen, Vojtěch John, Zdeněk Žabokrtský (2024): Unsupervised Extraction of Morphological Categories for Morphemes. In: 27th International Conference on Text, Speech and Dialogue, pp. 239-251, Springer, Cham, Switzerland, ISBN 978-3-031-70563-2 (url, bibtex)
Abishek Stephen, Daniel Zeman (2024): Light Verb Constructions in Universal Dependencies for South Asian Languages. In: Proceedings of the LREC-COLING 2024 Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), pp. 163-177, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-20-3 (pdf, local PDF, local PDF, bibtex)
Milan Straka (2024): CorPipe at CRAC 2024: Predicting Zero Mentions from Raw Text. In: Proceedings of The Seventh Workshop on Computational Models of Reference, Anaphora and Coreference, pp. 97-106, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-171-1 (url, local PDF, bibtex)
Milan Straka, Jana Straková (2024): Open-Source Web Service with Morphological Dictionary–Supplemented Deep Learning for Morphosyntactic Analysis of Czech. In: 27th International Conference on Text, Speech and Dialogue, pp. 279-290, Springer, Cham, Switzerland, ISBN 978-3-031-70563-2 (url, local PDF, bibtex)
Milan Straka, Jana Straková, Federica Gamba (2024): ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic Analysis of Latin. In: Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, pp. 207-214, ELRA and ICCL, Torino, Italia, ISBN 978-2-493814-46-3 (pdf, local PDF, bibtex)
Emil Svoboda (2024): Modelování kompozit pro vícejazyčné zdroje jazykových dat (PhD thesis). In: (bibtex)
Emil Svoboda, Magda Ševčíková (2024): PaReNT (Parent Retrieval Neural Tool): A Deep Dive into Word Formation Across Languages. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 12611-12621, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (pdf, bibtex)
Emil Svoboda, Magda Ševčíková (2024): Compounds in Universal Dependencies: A Survey in Five European Languages. In: Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, pp. 88-99, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-071-4 (pdf, bibtex)
Pavlína Synková, Jiří Mírovský, Lucie Poláková, Magdaléna Rysová (2024): Announcing the Prague Discourse Treebank 3.0. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 1270-1279, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, local PDF, bibtex)
Magda Ševčíková (2024): A paradigmatic account of word formation: Conversion between noun and verb in Czech (habilitation). In: (bibtex)
Magda Ševčíková (2024): Slovesa s cizími kořeny ve slovotvorném systému češtiny. In: Česká slovotvorná koncepce v kontextu slovanské jazykovědy, pp. 353-362, Academia, Praha, Czechia, ISBN 978-80-200-3548-6 (bibtex)
Radek Šimík, Olga Nádvorníková, Kateřina Chládková, Jan Chromý, Lucie Saicová Římalová, Magda Ševčíková (2024): V předvečer Bienále české lingvistiky 2024. In: Slovo a slovesnost, ISSN 0037-7031, vol. 85, no. 1, pp. 79-80 (url, bibtex)
Jan Štěpánek (2024): Supporting Universal Dependencies in Tree Editor TrEd. In: The Science Perl Journal, pp. 38-46, The Science Perl Journal, USA, ISBN 9798218984748 (local PDF, bibtex)
Barbora Štěpánková, Lucie Poláková, Jana Šindlerová, Michal Novák (2024): What Can Dictionaries Tell Us About Pragmatic Markers – Building the Lexicon of Epistemic and Evidential Markers in Czech. In: Lexicography and Semantics. Proceedings of the XXI EURALEX International Congress., pp. 728-741, Institut za hrvatski jezik, Zagreb, Croatia, ISBN 978-953-7967-77-2 (pdf, bibtex)
Vojtěch Vančura, Pavel Kordík, Milan Straka (2024): beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems. In: Proceedings of the 18th ACM Conference on Recommender Systems, pp. 1102-1107, Association for Computing Machinery, New York, NY, United States, ISBN 979-8-4007-0505-2 (url, local PDF, bibtex)
Josef Vonášek, Milan Straka, Rostislav Krč, Lenka Lasoňová, Ekaterina Egorova, Jana Straková, Jakub Náplava (2024): CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking. In: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1221-1231, Association for Computing Machinery, New York, NY, USA, ISBN 9798400704314 (url, local PDF, bibtex)
Hening Wang, Leixin Zhang, Ondřej Bojar (2024): Human and Machine: Language Processing in Translation Tasks. In: Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), pp. 243-250, Association for Computational Linguistics, Online (url, bibtex)
Jędrzej Warczyński, Mateusz Lango, Ondřej Dušek (2024): Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems. In: Proceedings of the 17th International Natural Language Generation Conference, pp. 622-630, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-122-3 (url, bibtex)
Adam Wojciechowski, Mateusz Lango, Ondřej Dušek (2024): Faithful and plausible natural language explanations for image classification: a pipeline approach. In: Findings of the Association for Computational Linguistics: EMNLP 2024, pp. 2340-2351, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-168-1 (url, bibtex)
Uladzislau Yorsh, Martin Holeňa, Ondřej Bojar, David Herel (2024): On Difficulties of Attention Factorization through Shared Memory. In: The Second Tiny Papers Track at ICLR 2024, pp. 1-8, OpenReview.net (bibtex)
Frances Yung, Merel Scholman, Šárka Zikánová, Vera Demberg (2024): DiscoGeM 2.0: A Parallel Corpus of English, German, French and Czech Implicit Discourse Relations. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 4940-4956, European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4 (url, local PDF, bibtex)
Patrik Zavoral, Dušan Variš, Ondřej Bojar (2024): Adversarial Testing as a Tool for Interpretability: Length-based Overfitting of Elementary Functions in Transformers (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, pp. 1-9 (url)
Leixin Zhang, David Burian, Vojtěch John, Ondřej Bojar (2024): Unveiling Semantic Information in Sentence Embeddings. In: Proceedings of the Fifth International Workshop on Designing Meaning Representations (DMR 2024) @ LREC-COLING 2024, pp. 39-47, ELRA Language Resource Association, ISBN 978-2-493814-39-5 (url, bibtex)
Šárka Zikánová (2024): Text Structure and Its Ambiguities: Corpus Annotation as a Helpful Guide. In: Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024), pp. 2-12, CEUR-WS.org, Košice, Slovakia (pdf, local PDF, bibtex)
Vilém Zouhar, Ondřej Bojar (2024): Quality and Quantity of Machine Translation References for Automatic Metrics. In: Fourth Workshop on Human Evaluation of NLP Systems (HumEval) @ LREC-COLING 2024, pp. 1-11, ELRA, Paris, France, ISBN 978-2-493814-41-8 (url, bibtex)
Vilém Zouhar, Věra Kloudová, Martin Popel, Ondřej Bojar (2024): Evaluating Optimal Reference Translations. In: Natural Language Processing, ISSN 2977-0424, 2024, pp. 1-24 (url, bibtex)
Milind Agarwal, Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polák, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibel, Mingxuan Wang, Shinji Watanabe, Rodolfo Zevallos (2023): Findings of the IWSLT 2023 Evaluation Campaign. In: Proceedings of the 20th International Conference on Spoken Language Translation, pp. 1-61, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-959429-84-5 (url, bibtex)
Rabab Alkhalifa, Iman Bilal, Hsuvas Borkakoty, Jose Camacho-Collados, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, Gabriela Gonzalez-Saez, Petra Galuščáková, Lorraine Goeuriot, Elena Kochkina, Maria Liakata, Daniel Loureiro, Harish Tayyar Madabushi, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, Arkaitz Zubiaga (2023): LongEval: Longitudinal Evaluation of Model Performance at CLEF 2023. In: Advances in Information Retrieval. 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part III, Lecture Notes in Computer Science, ISSN 0302-9743, 13982, pp. 499-505, Springer Nature Switzerland, Cham, Switzerland, ISBN 978-3-031-28240-9 (bibtex)
Rabab Alkhalifa, Iman Bilal, Hsuvas Borkakoty, Jose Camacho-Collados, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, Gabriela Gonzalez-Saez, Petra Galuščáková, Lorraine Goeuriot, Elena Kochkina, Maria Liakata, Daniel Loureiro, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, Harish Tayyar Madabushi, Arkaitz Zubiaga (2023): Extended Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance. In: Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), pp. 2181-2203, CEUR-WS, Aachen, Germany (pdf, bibtex)
Rabab Alkhalifa, Iman Bilal, Hsuvas Borkakoty, Jose Camacho-Collados, Romain Deveaud, Alaa El-Ebshihy, Luis Espinosa-Anke, Gabriela Gonzalez-Saez, Petra Galuščáková, Lorraine Goeuriot, Elena Kochkina, Maria Liakata, Daniel Loureiro, Philippe Mulhem, Florina Piroi, Martin Popel, Christophe Servan, Harish Tayyar Madabushi, Arkaitz Zubiaga (2023): Overview of the CLEF-2023 LongEval Lab on Longitudinal Evaluation of Model Performance. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. 14th International Conference of the CLEF Association (CLEF 2023), Lecture Notes in Computer Science, ISSN 0302-9743, 14163, pp. 440-458, Springer, Berlin, Germany, ISBN 978-3-031-42447-2 (url, local PDF, bibtex)
Diego Alves, Božo Bekavac, Daniel Zeman, Marko Tadić (2023): Corpus-based Syntactic Typological Methods for Dependency Parsing Improvement. In: Proceedings of SIGTYP, pp. 76-88, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-56-2 (url, local PDF, bibtex)
Diego Alves, Božo Bekavac, Daniel Zeman, Marko Tadić (2023): Analysis of Corpus-based Word-Order Typological Methods. In: Proceedings of the Sixth Workshop on Universal Dependencies (UDW, GURT/SyntaxFest 2023), pp. 36-46, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-34-0 (pdf, local PDF, bibtex)
Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Jackie Cheung, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Albert Gatt, Dimitra Gkatzia, Javier González Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubička, Huiyuan Lai, Chris van der Lee, Emiel van Miltenburg, Yiru Li, Saad Mahamood, Margot Mieskes, Malvina Nissim, Natalie Parde, Ondřej Plátek, Verena Rieser, Pablo Mosteiro Romero, Joel Tetreault, Antonio Toral, Xiaojun Wan, Leo Wanner, Lewis Watson, Diyi Yang (2023): Missing information, unresponsive authors, experimental flaws: The impossibility of assessing the reproducibility of previous human evaluations in NLP. In: The Fourth Workshop on Insights from Negative Results in NLP: Proceedings of the Workshop, pp. 1-10, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-49-4 (url, bibtex)
Sunit Bhattacharya, Ondřej Bojar (2023): Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, pp. 120-126 (pdf)
Olivier Bonami, Lukáš Kyjánek, Marine Wauquier (2023): Assessing the Featural Organisation of Paradigms with Distributional Methods. In: Proceedings of the Society for Computation in Linguistics 2023, pp. 310-320, Association for Computational Linguistics, Amherst, Massachusetts (url, bibtex)
Julia Bonn, Andrew Cowell, Jan Hajič, Alexis Palmer, Martha Palmer, James Pustejovsky, Zdeňka Urešová, Shira Wein, Nianwen Xue, Jin Zhou (2023): UMR annotation of multiword expressions. In: Proceedings of the 4th International Workshop on Designing Meaning Representations, pp. 99-109, ACL, Stroudsburg, PA, USA (url, bibtex)
Julia Bonn, Skatje Myers, Jens E. L. van Gysel, Lukas Denk, Meagan Vigus, Jin Zhou, Andrew Cowell, William Croft, Jan Hajič, James Martin, Alexis Palmer, Martha Palmer, James Pustejovsky, Zdeňka Urešová, Rosa Vallejos, Nianwen Xue (2023): Mapping AMR to UMR: Resources for Adapting Existing Corpora for Cross-Lingual Compatibility. In: Proceedings of the 21st International Workshop on Treebanks and Linguistic Theories, pp. 74-95, Association for Computational Linguistics, Washington, D.C., USA, ISBN 978-1-959429-33-3 (url, local PDF, attachment2, bibtex)
Christopher Brückner (2023): Multi-Feature Clustering of Search Results (masters thesis). In: (bibtex)
Silvie Cinková, Julie Birkholz, Ingo Börner, Tess Dejaeghere, Serge Heiden, Maarten Janssen, Michal Křen, Alvaro Perez Pozo (2023): CLS INFRA D8.1 Report of the tools for the basic Natural Language Processing (NLP) tasks in the CLS context (technical report). In: (url, bibtex)
Silvie Cinková, Václav Cvrček, Maarten Janssen, Michal Křen (2023): How Corpus Analysis Helps Operationalize Research Questions and Entices Literary Scholars to Learn Programming. In: Digital Humanities 2023: Book of Abstracts, pp. 323-325, Centre for Information Modelling - Austrian Centre for Digital Humanities, Graz, Austria (url, bibtex)
Emma Daly, Jane Dunne, Federico Gaspari, Teresa Lynn, Natalia Resende, Andy Way, Maria Giagkou, Stelios Piperidis, Tereza Vojtěchová, Jan Hajič, Annika Grützner-Zahn, Stefanie Hegele, Katrin Marheinecke, Georg Rehm (2023): Results of the Forward-looking Community-wide Consultation. In: European Language Equality - A Strategic Agenda for Digital Language Equality, pp. 245-262, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-28819-7 (url, bibtex)
Koenraad De Smedt, Iulianna van der Lek, Henk van den Heuvel, Antonio Balvet, Maarten Janssen, Silvie Cinková, Amelia Sanz, Stavros Assimakopoulos, Louis ten Bosch (2023): CLARIN in Training and Education. In: Selected papers from the CLARIN Annual Conference 2022, pp. 34-50, Linköping Electronic Conference Proceedings, Linköping, Sweden, ISBN 978-91-8075-254-1 (url, bibtex)
Ana Díaz-Negrillo, Cristina Fernández Alcaina (2023): A corpus-based study of the semantic distribution of denominal verb formation in English. In: SKASE Journal of Theoretical Linguistics, ISSN 1336-782X, vol. 20, no. 4, pp. 2-19 (pdf, bibtex)
Kira Droganova, Daniel Zeman (2023): A Unified Taxonomy of Deep Syntactic Relations (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, 2303.12220 (url, local PDF)
Tomaž Erjavec, Matyáš Kopp, Katja Meden (2023): TEI and Git in ParlaMint: Collaborative Development of Language Resources. In: Selected papers from the CLARIN Annual Conference 2022, pp. 44-56, Linköping Electronic Conference Proceedings, Linköping, Sweden, ISBN 978-91-8075-254-1 (url, bibtex)
Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darģis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fišer (2023): The ParlaMint corpora of parliamentary proceedings. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 57, no. 1, pp. 415-448 (url, local PDF, bibtex)
Cristina Fernández Alcaina, Eva Fučíková, Jan Hajič, Zdeňka Urešová (2023): Spanish Synonyms as Part of a Multilingual Event-Type Ontology. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 74, no. 1, pp. 153-162 (pdf, local PDF, local PDF, bibtex)
Cristina Fernández-Alcaina, Eva Fučíková, Jan Hajič, Zdeňka Urešová (2023): Spanish Verbal Synonyms in the SynSemClass Ontology. In: Proceedings of the 21st International Workshop on Treebanks and Linguistic Theories, pp. 11-20, Association for Computational Linguistics, Washington, D.C., USA, ISBN 978-1-959429-33-3 (url, local PDF, attachment2, bibtex)
Eva Fučíková, Jan Hajič, Zdeňka Urešová (2023): Corpus-Based Multilingual Event-type Ontology: annotation tools and principles. In: Proceedings of the 21st International Workshop on Treebanks and Linguistic Theories, pp. 1-10, Association for Computational Linguistics, Washington, D.C., USA, ISBN 978-1-959429-33-3 (url, attachment, bibtex)
Petra Galuščáková, Romain Deveaud, Gabriela Gonzalez-Saez, Philippe Mulhem, Lorraine Goeuriot, Florina Piroi, Martin Popel (2023): LongEval-Retrieval: French-English Dynamic Test Collection for Continuous Web Search Evaluation. In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3089-3094, Association for Computing Machinery, New York, NY, USA, ISBN 978-1-4503-9408-6 (url, local PDF, bibtex)
Federica Gamba, Marco Passarotti, Paolo Ruffolo (2023): Linking the Dictionary of Medieval Latin in the Czech Lands to the LiLa Knowledge Base. In: Proceedings of the Ninth Italian Conference on Computational Linguistics, pp. 1-8, CEUR Workshop Proceedings, Venice, Italy (pdf, bibtex)
Federica Gamba, Daniel Zeman (2023): Latin Morphology through the Centuries: Ensuring Consistency for Better Language Processing. In: Proceedings of the Ancient Language Processing Workshop, pp. 59-67, INCOMA, Varna, Bulgaria, ISBN 978-954-452-087-8 (pdf, local PDF, local PDF, bibtex)
Federica Gamba, Daniel Zeman (2023): Universalising Latin Universal Dependencies: a harmonisation of Latin treebanks in UD. In: Proceedings of the Sixth Workshop on Universal Dependencies (UDW, GURT/SyntaxFest 2023), pp. 7-16, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-34-0 (pdf, local PDF, local PDF, bibtex)
Tirthankar Ghosal, Ondřej Bojar, Marie Hledíková, Tom Kocmi, Anna Nedoluzhko (2023): Overview of the Second Shared Task on Automatic Minuting (AutoMin) at INLG 2023. In: Proceedings of the 16th International Natural Language Generation Conference, pp. 138-167, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-001-1 (bibtex)
Jan Hajič, Maria Giagkou, Stelios Piperidis, Georg Rehm, Natalia Resende (2023): Consulting the Community: How to Reach Digital Language Equality in Europe by 2030?. In: European Language Equality - A Strategic Agenda for Digital Language Equality, pp. 229-244, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-28819-7 (url, bibtex)
Jan Hajič, jr., Petr Žabička, Jan Rychtář, Jiří Mayer, Martina Dvořáková, Filip Jebavý, Markéta Vlková, Pavel Pecina (2023): The OmniOMR Project. In: Proceedings of the 5th International Workshop on Reading Music Systems, pp. 12-14, University of Alicante, Alicante, Spain (url, bibtex)
Katharina Hämmerl, Björn Dieseroth, Patrick Schramowski, Jindřich Libovický, Constantin A. Rothkopf, Alexander Fraser, Kristian Kersting (2023): Speaking Multiple Languages Affects the Moral Bias of Language Models. In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 2137-2156, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-62-3 (url, bibtex)
Katharina Hämmerl, Alina Fastowski, Jindřich Libovický, Alexander Fraser (2023): Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity. In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 7023-7037, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-62-3 (url, bibtex)
Michael Hanna, Roberto Zamparelli, David Mareček (2023): The Functional Relevance of Probed Information: A Case Study. In: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pp. 835-848, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 978-1-959429-44-9 (url, bibtex)
Jonáš Havelka, Jiří Mayer, Pavel Pecina (2023): Symbol Generation via Autoencoders for Handwritten Music Synthesis. In: Proceedings of the 5th International Workshop on Reading Music Systems, pp. 20-24, University of Alicante, Alicante, Spain (url, bibtex)
Alžběta Havlová, Rudolf Rosa, Tomáš Studeník (2023): Před pár lety byla umělá inteligence obskurní disciplínou. Dnes umí vygenerovat pohádku i divadelní hru (Electronic). (url)
Alžběta Havlová, Rudolf Rosa, Tomáš Studeník (2023): V umění je potřeba inovace. Umělá inteligence čerpá jen z toho, co už bylo, říká počítačový lingvista (Electronic). (url)
Alžběta Havlová, Rudolf Rosa, Tomáš Studeník (2023): Lidé se často bojí umělé inteligence špatně, čekají, kdy roboti získají nadvládu a všechny nás pozabíjejí, říká výzkumník (Electronic). (url)
Alžběta Havlová, Rudolf Rosa, Tomáš Studeník (2023): Je dobré se umělé inteligence bát, ale je nutné se bát správným způsobem, říká počítačový výzkumník (Electronic). (url)
Jindřich Helcl, Jindřich Libovický (2023): CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task Information Retrieval. In: Proceedings of the 2nd Workshop on Multi-lingual Representation Learning (MRL), pp. 302-309, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-056-1 (pdf, local PDF, local PDF, bibtex)
Jaroslava Hlaváčová (2023): Language Report Czech. In: European Language Equality - A Strategic Agenda for Digital Language Equality, pp. 115-118, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-28819-7 (url, bibtex)
Martin Holub, Patrícia Martinková (2023): Supervised Machine Learning for Text Analysis in R (review). In: Journal of the American Statistical Association, ISSN 1537-274X, pp. 2207-2209 (url, bibtex)
Vojtěch Hudeček, Ondřej Dušek (2023): Are Large Language Models All You Need for Task-Oriented Dialogue?. In: Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL), pp. 216-228, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-028-8 (url, bibtex)
Bar Iluz, Tomasz Limisiewicz, Gabriel Stanovsky, David Mareček (2023): Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation. In: Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 885-896, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-014-1 (pdf, local PDF, bibtex)
Maarten Janssen (2023): Dynamically Chaining APIs: from Dracor to TEITOK. In: CLARIN Annual Conference Proceedings 2023, pp. 116-119, CLARIN ERIC, Leuven, Belgium (bibtex)
Dávid Javorský, Ondřej Bojar, François Yvon (2023): Assessing Word Importance Using Models Trained for Semantic Tasks. In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 8846-8856, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-62-3 (pdf, bibtex)
Vojtěch John, Magda Ševčíková, Zdeněk Žabokrtský (2023): Identification of root morphs in morphologically segmented data. In: The Fourth International Workshop on Resources and Tools for Derivational Morphology, pp. 23-32, Croatian Language Technology Society, Zagreb, Croatia, ISBN 978-953-55375-5-7 (pdf, bibtex)
Vojtěch John, Zdeněk Žabokrtský (2023): The Unbearable Lightness of Morph Classification. In: 26th International Conference, TSD 2023, pp. 105-115, Springer, Cham, Switzerland, ISBN 978-3-031-40497-9 (url, bibtex)
Josef Jon, Ondřej Bojar (2023): Character-level NMT and language similarity. In: Proceedings of Machine Translation Summit XIX vol. 1: Research Track, pp. 360-371, Asia-Pacific Association for Machine Translation (AAMT), Kyoto, Japan, ISBN 978-4-9913461-0-1 (pdf, bibtex)
Josef Jon, Ondřej Bojar (2023): Breeding Machine Translations: Evolutionary approach to survive and thrive in the world of automated evaluation. In: Proceedings of 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 2191-2212, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-72-2 (url, bibtex)
Josef Jon, Martin Popel, Ondřej Bojar (2023): CUNI at WMT23 General Translation Task: MT and a Genetic Algorithm. In: Proceedings of the Eighth Conference on Machine Translation, pp. 119-127, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-041-7 (pdf, bibtex)
Josef Jon, Dušan Variš, Michal Novák, Joao Paulo Aires, Ondřej Bojar (2023): Negative Lexical Constraints in Neural Machine Translation. In: Proceedings of Machine Translation Summit XIX vol. 1: Research Track, pp. 372-384, Asia-Pacific Association for Machine Translation (AAMT), Kyoto, Japan, ISBN 978-4-9913461-0-1 (pdf, bibtex)
Lukáš Kačena, Jana Hamrlová, Jan Hajič (2023): Open Calls and Pilot Projects. In: European Language Grid - A Language Technology Platform for Multilingual Europe, pp. 257-270, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-17258-8 (url, local PDF, bibtex)
Zdeněk Kasner, Ekaterina Garanina, Ondřej Plátek, Ondřej Dušek (2023): TabGenie: A Toolkit for Table-to-Text Generation. In: Proceedings of 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pp. 444-455, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-00-6 (url, bibtex)
Zdeněk Kasner, Ioannis Konstas, Ondřej Dušek (2023): Mind the Labels: Describing Relations in Knowledge Graphs With Pretrained Models. In: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pp. 2398-2415, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 978-1-959429-44-9 (url, bibtex)
Václava Kettnerová (2023): Valency structure of complex predicates with Light Verbs, The case of Czech. In: Light Verb Constructions as Complex Verbs, pp. 19-43, De Gruyter Mouton, Berlin, Germany, ISBN 9783110747997 (bibtex)
Václava Kettnerová, Veronika Kolářová (2023): K reciprocitě adjektiv v češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 84, no. 3, pp. 179-200 (url, local PDF, bibtex)
Václava Kettnerová, Veronika Kolářová, Marie Mikulová, Magda Ševčíková (2023): K narozeninám Jarmily Panevové. In: Slovo a slovesnost, ISSN 0037-7031, vol. 84, no. 4, pp. 334-337 (url, local PDF, bibtex)
Kristýna Klesnilová, Michelle Elizabeth (2023): Team Synapse @ AutoMin 2023: Leveraging BART-Based Models for Automatic Meeting Minuting. In: Proceedings of the 16th International Natural Language Generation Conference: Generation Challenges, pp. 108-113, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-003-5 (url, bibtex)
Věra Kloudová, David Mraček, Ondřej Bojar, Martin Popel (2023): Možnosti a meze tvorby tzv. optimálních referenčních překladů: po stopách „překladatelštiny“ v profesionálních překladech zpravodajských textů. In: Slovo a slovesnost, ISSN 0037-7031, vol. 84, no. 2, pp. 122-156 (url, bibtex)
František Kmječ, Ondřej Bojar (2023): Team Iterate @ AutoMin 2023 - Experiments with Iterative Minuting. In: Proceedings of the 16th International Natural Language Generation Conference: Generation Challenges, pp. 114-120, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-003-5 (url, bibtex)
Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondřej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Philipp Koehn, Benjamin Marie, Christof Monz, Makoto Morishita, Kenton Murray, Masaaki Nagata, Toshiaki Nakazawa, Martin Popel, Maja Popović, Mariya Shmatova (2023): Findings of the 2023 Conference on Machine Translation (WMT23): LLMs Are Here but Not Quite There Yet. In: Proceedings of the Eighth Conference on Machine Translation, pp. 1-42, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-041-7 (url, bibtex)
Veronika Kolářová, Václava Kettnerová, Jiří Mírovský (2023): Through Derivational Relations to Valency of Non-verbal Predicates in the NomVallex Lexicon. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 74, no. 1, pp. 182-192 (local PDF, bibtex)
David Košťák, Rudolf Rosa (2023): AI: When a Robot Writes a Play at Švanda Theatre. In: The Days After: Ethical Dilemmas of Industrial and Post-Industrial Society in 20th and 21st Century Theatre in the Light of Karel Čapek’s Plays R.U.R. and The White Plague, pp. 162-163, Institut umění – Divadelní ústav, Praha, Czechia, ISBN 978-80-7008-470-0 (bibtex)
Mateusz Krubiński (2023): Basic Arithmetic Properties in the Space of Language Model Prompts (Electronic). (pdf)
Mateusz Krubiński, Pavel Pecina (2023): MLASK: Multimodal Summarization of Video-based News Articles. In: Findings of the Association for Computational Linguistics: EACL 2023, pp. 910-924, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-47-0 (pdf, bibtex)
Mateusz Krubiński, Hashem Sellat, Shadi Saleh, Adam Pospíšil, Petr Zemánek, Pavel Pecina (2023): Multi-Parallel Corpus of North Levantine Arabic. In: Proceedings of ArabicNLP 2023, pp. 411-417, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-27-2 (pdf, bibtex)
Anna Kryvenko, Matyáš Kopp (2023): Workflow and Metadata Challenges in the ParlaMint Project: Insights from Building the ParlaMint-UA Corpus. In: CLARIN Annual Conference Proceedings 2023, pp. 67-70, CLARIN ERIC, Leuven, Belgium (url, bibtex)
Nalin Kumar, Saad Obaid ul Islam, Ondřej Dušek (2023): Better Translation + Split and Generate for Multilingual RDF-to-Text (WebNLG 2023). In: Proceedings of the Workshop on Multimodal, Multilingual Natural Language Generation and Multilingual WebNLG Challenge (MM-NLG 2023), pp. 73-79, Association for Computational Linguistics, Stroudsburg, PA, USA (url, bibtex)
Ivana Kvapilíková, Ondřej Bojar (2023): Low-Resource Machine Translation Systems for Indic Languages. In: Proceedings of the Eighth Conference on Machine Translation, pp. 954-958, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-041-7 (bibtex)
Ivana Kvapilíková, Ondřej Bojar (2023): Boosting Unsupervised Machine Translation with Pseudo-Parallel Data. In: Proceedings of Machine Translation Summit XIX vol. 1: Research Track, pp. 135-147, Asia-Pacific Association for Machine Translation (AAMT), Kyoto, Japan, ISBN 978-4-9913461-0-1 (bibtex)
Hynek Kydlíček, Jindřich Libovický (2023): A Dataset and Strong Baselines for Classification of Czech News Texts. In: 26th International Conference, TSD 2023, pp. 33-44, Springer, Cham, Switzerland, ISBN 978-3-031-40497-9 (url, bibtex)
Penny Labropoulou, Stelios Piperidis, Miltos Deligiannis, Leon Voukoutis, Maria Giagkou, Ondřej Košarko, Jan Hajič, Georg Rehm (2023): Interoperable Metadata Bridges to the wider Language Technology Ecosystem. In: European Language Grid - A Language Technology Platform for Multilingual Europe, pp. 107-130, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-17258-8 (url, local PDF, bibtex)
Mateusz Lango, Ondřej Dušek (2023): Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation. In: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 2853-2862, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-060-8 (url, bibtex)
Vojtěch Lanz (2023): Unsupervised segmentation of Gregorian chant melodies for exploring chant modality (master's thesis). In: (url, bibtex)
Vojtěch Lanz, Jan Hajič, jr. (2023): Text boundaries do not provide a better segmentation of Gregorian antiphons. In: Proceedings of the 10th International Conference on Digital Libraries for Musicology, pp. 72-76, Association for Computing Machinery, New York, United States, ISBN 979-8-4007-0833-6 (url, bibtex)
Jindřich Libovický (2023): Is a Prestigious Job the same as a Prestigious Country? A Case Study on Multilingual Sentence Embeddings and European Countries. In: Findings of the Association for Computational Linguistics: EMNLP 2023, pp. 1000-1010, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-71-1 (pdf, local PDF, bibtex)
Tomasz Limisiewicz (2023): ÚFAL Submission for SIGTYP Supervised Cognate Detection Task. In: Proceedings of SIGTYP, pp. 132-136, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-56-2 (url, bibtex)
Tomasz Limisiewicz, Jiří Balhar, David Mareček (2023): Tokenization Impacts Multilingual Language Modeling: Assessing Vocabulary Allocation and Overlap Across Languages. In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 5661-5681, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-62-3 (url, bibtex)
Tomasz Limisiewicz, Dan Malkin, Gabriel Stanovsky (2023): You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models. In: Proceedings of SIGTYP, pp. 1-11, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-56-2 (url, bibtex)
Markéta Lopatková, Jaroslava Hlaváčová, Jiří Mírovský (2023): Linking Two Lexical Resources: VALLEX and MorfFlex Lexicons. In: Proceedings of the 23rd Conference Information Technologies – Applications and Theory (ITAT 2023), pp. 131-138, 23rd Conference on Information Technologies – Applications and Theory, Košice, Slovakia (url, bibtex)
Markéta Lopatková, Václava Kettnerová (2023): Proč má ježek bodliny přilepené k tělu, ale nemá tělo přilepené k bodlinám? K charakteristice inherentně recipročních predikátů. In: Vzťahy v jazyku – jazyk vo vzťahoch, pp. 35-45, Vydavateľstvo Prešovskej univerzity v Prešove, Prešov, Slovakia, ISBN 978-80-555-3107-6 (bibtex)
Markéta Lopatková, Václava Kettnerová (2023): Ještě k modelování reciprocity v teoretickém popisu češtiny. In: Naše řeč, ISSN 0027-8203, vol. 106, no. 3, pp. 165-176 (url, bibtex)
Markéta Lopatková, Václava Kettnerová (2023): Inherently Reciprocal Predicates - Do They Exist At All?. In: Proceedings of the 23rd Conference Information Technologies – Applications and Theory (ITAT 2023), pp. 102-109, 23rd Conference on Information Technologies – Applications and Theory, Košice, Slovakia (url, bibtex)
Dominik Macháček, Ondřej Bojar, Raj Dabre (2023): MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation. In: Proceedings of the 20th International Conference on Spoken Language Translation, pp. 169-179, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-84-5 (pdf, local PDF, bibtex)
Dominik Macháček, Raj Dabre, Ondřej Bojar (2023): Turning Whisper into Real-Time Transcription System. In: Proceedings of the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 13th International Joint Conference on Natural Language Processing: System Demonstrations, pp. 17-24, Asian Federation of Natural Language Processing, Bali, Indonesia (pdf, bibtex)
Dominik Macháček, Peter Polák, Ondřej Bojar, Raj Dabre (2023): Robustness of Multi-Source MT to Transcription Errors. In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 3707-3723, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-62-3 (pdf, bibtex)
Marie Mikulová (2023): Expressing Measure in Czech (Corpus-based Study). In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 74, no. 1, pp. 108-118 (pdf, bibtex)
Victor Mireles, Stephanie Billib, Artem Revenko, Stephan Jänicke, Frank Uiterwaal, Pavel Pecina (2023): Exploratory Analysis of the Applicability of Formalised Knowledge to Personal Experience Narration. In: Data Science—Analytics and Applications. iDSC 2023, pp. 75-80, Springer, Cham, ISBN 978-3-031-42171-6 (pdf, bibtex)
Jiří Mírovský, Magdaléna Rysová, Pavlína Synková, Lucie Poláková (2023): Prague to Penn Discourse Transformation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 120, pp. 5-30 (pdf, local PDF, bibtex)
Sourabrata Mukherjee (2023): Sourabrata Mukherjee: Position Paper on Stylized Dialog Response Generation. In: 19th Workshop on Spoken Dialogue Systems for PhDs, PostDocs & New Researchers, pp. 44-46, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-25-5 (bibtex)
Sourabrata Mukherjee, Akansha Bansal, Pritha Majumdar, Atul Kr. Ojha, Ondřej Dušek (2023): Low-Resource Text Style Transfer for Bangla: Data & Models. In: Proceedings of the First Workshop on Bangla Language Processing, pp. 34-47, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-058-5 (url, bibtex)
Sourabrata Mukherjee, Akansha Bansal, Atul Kr. Ojha, Ondřej Dušek (2023): Text Detoxification as Style Transfer in English and Hindi. In: Proceedings of the 20th International Conference on Natural Language Processing (ICON), pp. 133-144, NLP Association of India (NLPA), Goa, India (url, bibtex)
Sourabrata Mukherjee, Ondřej Dušek (2023): Leveraging Low-resource Parallel Data for Text Style Transfer. In: Proceedings of the 16th International Natural Language Generation Conference, pp. 388-395, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-001-1 (url, bibtex)
Sourabrata Mukherjee, Vojtěch Hudeček, Ondřej Dušek (2023): Polite Chatbot: A Text Style Transfer Application. In: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 87-93, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-48-7 (url, bibtex)
Sourabrata Mukherjee, Atul Kr. Ojha, Ondřej Dušek (2023): UFAL-ULD at BLP-2023 Task 1: Violence Detection in Bangla Text. In: Proceedings of the First Workshop on Bangla Language Processing, pp. 220-224, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-058-5 (url, bibtex)
Sourabrata Mukherjee, Atul Kr. Ojha, Ondřej Dušek (2023): UFAL-ULD at BLP-2023 Task 2 Sentiment Classification in Bangla Text. In: Proceedings of the First Workshop on Bangla Language Processing, pp. 336-339, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-058-5 (url, bibtex)
Tomáš Musil, Klára Vosecká, Rudolf Rosa (2023): Performing AI-Generated Theater Plays. In: Choreomata: Performance and Performativity after AI, pp. 390-400, CRC Press, Boca Raton, FL, USA, ISBN 9781032319919 (url, bibtex)
Toshiaki Nakazawa, Kazutaka Kinugawa, Hideya Mino, Isao Goto, Raj Dabre, Shohei Higashiyama, Shantipriya Parida, Makoto Morishita, Ondřej Bojar, Akiko Eriguchi, Yusuke Oda, Chenhui Chu, Sadao Kurohashi (2023): Overview of the 10th Workshop on Asian Translation. In: Proceedings of the 10th Workshop on Asian Translation, pp. 1-28, International Conference on Computational Linguistics, Macau, China (bibtex)
Kristýna Neumannová, Ondřej Bojar (2023): The Role of Compounds in Human vs. Machine Translation Quality. In: Proceedings of Machine Translation Summit XIX vol. 1: Research Track, pp. 248-260, Asia-Pacific Association for Machine Translation (AAMT), Kyoto, Japan, ISBN 978-4-9913461-0-1 (pdf, bibtex)
Saad Obaid ul Islam, Iza Škrjanec, Ondřej Dušek, Vera Demberg (2023): Tackling Hallucinations in Neural Chart Summarization. In: Proceedings of the 16th International Natural Language Generation Conference, pp. 414-423, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-001-1 (url, bibtex)
Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, Katja Meden, Taja Kuzman (2023): The ParlaMint Project: Ever-growing Family of Comparable and Interoperable Parliamentary Corpora. In: CLARIN Annual Conference Proceedings 2023, pp. 62-66, CLARIN ERIC, Leuven, Belgium (url, bibtex)
Kristýna Onderková, Matthias Nickles (2023): Exploring Abductive Reasoning in Language Models for Data-to-Text Generation. In: 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS), pp. 1-4, IEEE, New York City, U.S., ISBN 979-8-3503-6021-9 (url, bibtex)
Jarmila Panevová (2023): Rozpor mezi formou a funkcí: specifika infinitivu ve vybraných valenčních pozicích. In: Slovo a slovesnost, ISSN 0037-7031, vol. 84, no. 4, pp. 263-272 (url, bibtex)
Shantipriya Parida, Ondřej Bojar (2023): HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language. In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 10162-10183, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-62-3 (bibtex)
Andrej Perković, Jernej Vičič, Dávid Javorský, Ondřej Bojar (2023): Shortening of the results of machine translation using paraphrasing dataset. In: Proceedings of the 23rd Conference Information Technologies – Applications and Theory (ITAT 2023), pp. 121-130, 23rd Conference on Information Technologies – Applications and Theory, Košice, Slovakia (pdf, bibtex)
Ondřej Plátek, Ondřej Dušek (2023): MooseNet: A Trainable Metric for Synthesized Speech with a PLDA Module. In: 12th ISCA Speech Synthesis Workshop, pp. 48-54, International Speech Communication Association, Baixas, France (url, bibtex)
Ondřej Plátek, Vojtěch Hudeček, Patrícia Schmidtová, Mateusz Lango, Ondřej Dušek (2023): Three Ways of Using Large Language Models to Evaluate Chat. In: Proceedings of The Eleventh Dialog System Technology Challenge, pp. 113-122, Association for Computational Linguistics, Stroudsburg, PA, USA (url, bibtex)
Ondřej Plátek, Mateusz Lango, Ondřej Dušek (2023): With a Little Help from the Authors: Reproducing Human Evaluation of an MT Error Detector. In: The 3rd Workshop on Human Evaluation of NLP Systems (HumEval’23), pp. 145-152, Association for Computational Linguistics, Varna, Bulgaria, ISBN 978-954-452-088-5 (url, bibtex)
Lucie Poláková, Jiří Mírovský (2023): Connectives with both Arguments External: A Survey on Czech. In: 20th International Conference on Intelligent Text Processing and Computational Linguistics, Lecture Notes in Computer Science, ISSN 0302-9743, 13451, pp. 61-72, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-031-24336-3 (url, bibtex)
Peter Polák (2023): Long-form Simultaneous Speech Translation: Thesis Proposal. In: Proceedings of the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 13th International Joint Conference on Natural Language Processing: Student Research Workshop, pp. 64-74, Association for Computational Linguistics, Stroudsburg, PA, USA (url, local PDF, bibtex)
Peter Polák, Danni Liu, Ngoc-Quan Pham, Jan Niehues, Alex Waibel, Ondřej Bojar (2023): Towards Efficient Simultaneous Speech Translation: CUNI-KIT System for Simultaneous Track at IWSLT 2023. In: Proceedings of the 20th International Conference on Spoken Language Translation, pp. 389-396, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-84-5 (url, bibtex)
Peter Polák, Brian Yan, Shinji Watanabe, Alex Waibel, Ondřej Bojar (2023): Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff. In: Proceedings of the 24th Annual Conference of the International Speech Communication Association, pp. 3979-3983, International Speech Communication Association, Baixas, France (url, bibtex)
Jakub Raczyński, Mateusz Lango, Jerzy Stefanowski (2023): The Problem of Coherence in Natural Language Explanations of Recommendations. In: 26th European Conference on Artificial Intelligence ECAI 2023, pp. 1922-1929, IOS Press BV, Amsterdam, Netherlands, ISBN 978-1-64368-436-9 (url, bibtex)
Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiljevs, Gerhard Backfried, Katja Prinz, José Manuel Gómez-Pérez, Ulrich Germann (2023): Sustaining the European Language Grid: Towards the ELG Legal Entity. In: European Language Grid - A Language Technology Platform for Multilingual Europe, pp. 233-256, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-17258-8 (url, bibtex)
Ian Roberts, Andres Garcia-Silva, Cristian Berrìo Aroca, José Manuel Gómez-Pérez, Miroslav Jánoší, Dimitris Galanis, Rémi Callizano, Andis Lagzdiņš, Milan Straka, Ulrich Germann (2023): Language Technology Tools and Services. In: European Language Grid: A Language Technology Platform for Multilingual Europe, pp. 131-150, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-17257-1 (url, bibtex)
Rudolf Rosa, Daniel Hrbek (2023): AI: When a Robot Writes a Play. In: Theatre About Science. Theory and Practice, pp. 197-205, Imprensa da Universidade de Coimbra, Coimbra, Portugal, ISBN 978-989-26-2506-5 (url, bibtex)
Victor Schetinger, Dafne Reis Pedroso da Silva, Sara Di Bartolomeo, Edirlei Soares de Lima, Christofer Meinecke, Rudolf Rosa (2023): Macunaíma, papagaio IA, resolve crimes em Praga: Rumo à visualização de padrões em narrativas de modelos de IA generativos. In: Revista GEMInIS, ISSN 2179-1465, vol. 14, no. 3, pp. 21-37 (url, local PDF, bibtex)
Victor Schetinger, Sara Di Bartolomeo, Edirlei Soares de Lima, Christofer Meinecke, Rudolf Rosa (2023): n Walks in the Fictional Woods. In: Proceedings of alt.VIS 2023, pp. 1-8, University of Chicago, Chicago, IL, USA (url, bibtex)
Patrícia Schmidtová (2023): Semantic Accuracy in Natural Language Generation: A Thesis Proposal. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), pp. 352-361, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-69-2 (url, bibtex)
Inguna Skadina, Andrejs Vasiljevs, Marcis Pinnis, Aivars Bērziņš, Nora Aranberri, Joachim van den Bogoaert, Sally O’Connor, Mercedes García-Martínez, Iakes Goenaga, Jan Hajič, Manuel Herranz, Christian Lieske, Martin Popel, Maja Popović, Sheila Castilho, Federico Gaspari, Rudolf Rosa, Riccardo Superbo, Andy Way (2023): Deep Dive Machine Translation. In: European Language Equality - A Strategic Agenda for Digital Language Equality, pp. 263-288, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-28819-7 (url, bibtex)
Marcin Skowron, Gerhard Backfried, Eva Navas, Aivars Bērziņš, Joachim van den Bogoaert, Franciska de Jong, Andrea DeMarco, Inma Hernáez, Marek Kováč, Peter Polák, Johan Rohdin, Michael Rosner, Jon Sanchez, Ibon Saratxaga, Petr Schwarz (2023): Deep Dive Speech Technology. In: European Language Equality - A Strategic Agenda for Digital Language Equality, pp. 289-312, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-031-28819-7 (url, bibtex)
Jakub Sláma, Barbora Štěpánková (2023): Postavení ambipozic v češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 84, no. 2, pp. 91-121 (url, bibtex)
Abishek Stephen, Daniel Zeman (2023): Universal Dependencies for Malayalam. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 120, pp. 31-46 (pdf, local PDF, bibtex)
Abishek Stephen, Zdeněk Žabokrtský (2023): Understanding Borrowing through Derivational Morphology: A Case Study of Czech Verbs. In: The Fourth International Workshop on Resources and Tools for Derivational Morphology, pp. 49-59, Croatian Language Technology Society, Zagreb, Croatia, ISBN 978-953-55375-5-7 (pdf, bibtex)
Milan Straka (2023): ÚFAL CorPipe at CRAC 2023: Larger Context Improves Multilingual Coreference Resolution. In: Proceedings of the CRAC 2023 Shared Task on Multilingual Coreference Resolution, pp. 41-51, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-02-5 (url, local PDF, bibtex)
Jana Straková, Eva Fučíková, Jan Hajič, Zdeňka Urešová (2023): Extending an Event-type Ontology: Adding Verbs and Classes using Fine-tuned LLMs Suggestions. In: Proceedings of the 17th Linguistic Annotation Workshop, pp. 85-95, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-83-8 (url, bibtex)
Jana Šamánková, Silvie Cinková, Adam Herma (2023): Co je srozumitelný úřední text a proč má smysl psát srozumitelně. Jak psát srozumitelné úřední texty. Příručka srozumitelného psaní pro úředníky. In: Jak psát srozumitelné úřední texty. Příručka srozumitelného psaní pro úředníky, pp. 7-15, Veřejný ochránce práv, Brno, Czech Republic, ISBN 978-80-7631-088-9 (pdf, bibtex)
Magda Ševčíková, Hana Hledíková, Lukáš Kyjánek, Anna Staňková (2023): Semantics of noun/verb conversion in Czech: lessons learned from corpus data annotation. In: SKASE Journal of Theoretical Linguistics, ISSN 1336-782X, vol. 20, no. 4, pp. 74-92 (pdf, bibtex)
Jana Šindlerová, Barbora Štěpánková, Ingrid Lindgren Andrén (2023): Epistemická částice zřejmě pohledem paralelního korpusu. In: Korpus – gramatika – axiologie, ISSN 1804-137X, 27, pp. 37-52 (url, bibtex)
Barbora Štěpánková, Jana Šindlerová, Lucie Poláková (2023): The Epistemic Marker určitě in the Light of Corpus Data. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 74, no. 1, pp. 130-139 (pdf, bibtex)
Simone Tedeschi, Johan Bos, Thierry Declerck, Jan Hajič, Daniel Hershcovich, Eduard Hovy, Alexander Koller, Simon Krek, Steven Schockaert, Rico Sennrich, Ekaterina Shutova, Roberto Navigli (2023): What’s the Meaning of Superhuman Performance in Today’s NLU?. In: Proceedings of 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 12471-12491, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-72-2 (url, local PDF, bibtex)
František Trebuňa, Ondřej Dušek (2023): VisuaLLM: Easy Web-based Visualization for Neural Language Generation. In: Proceedings of the 16th International Natural Language Generation Conference: System Demonstrations, pp. 6-8, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-002-8 (url, bibtex)
František Trebuňa, Kristína Szabová, Ondřej Bojar (2023): Searching for Reasons of Transformers’ Success: Memorization vs Generalization. In: 26th International Conference, TSD 2023, pp. 25-32, Springer, Cham, Switzerland, ISBN 978-3-031-40497-9 (url, bibtex)
Iryna Tryhubyshyn, Aleš Tamchyna, Ondřej Bojar (2023): Bad MT Systems are Good for Quality Estimation. In: Proceedings of Machine Translation Summit XIX vol. 1: Research Track, pp. 200-208, Asia-Pacific Association for Machine Translation (AAMT), Kyoto, Japan, ISBN 978-4-9913461-0-1 (url, bibtex)
Zdeňka Urešová, Cristina Fernández-Alcaina, Eva Fučíková, Jan Hajič (2023): SynSemClass Czech and English Annotation Guidelines (technical report). In: (pdf, local PDF, bibtex)
Emiel van Miltenburg, Miruna Clinciu, Ondřej Dušek, Dimitra Gkatzia, Stephanie Inglis, Leo Leppänen, Saad Mahamood, Stephanie Schoch, Craig Thomson, Luou Wen (2023): Barriers and enabling factors for error analysis in NLG research. In: Northern European Journal of Language Technology, ISSN 2000-1533, vol. 9, no. 1, pp. 1-22 (url, bibtex)
Dušan Variš (2023): Learning capabilities in Transformer Neural Networks (PhD thesis). In: (url, bibtex)
Jonáš Vidra, Zdeněk Žabokrtský (2023): Transferring Word-Formation Networks Between Languages. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 120, pp. 47-71 (url, bibtex)
Mateusz Woźny, Mateusz Lango (2023): Generating clickbait spoilers with an ensemble of large language models. In: Proceedings of the 16th International Natural Language Generation Conference, pp. 431-436, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-001-1 (url, bibtex)
Zixiu Wu, Simone Balloccu, Ehud Reiter, Rim Helaoui, Diego Angelo Gaetano Reforgiato Recupero, Daniele Riboni (2023): Are Experts Needed? On Human Evaluation of Counselling Reflection Generation. In: Proceedings of 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 6906-6930, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-72-2 (url, bibtex)
Nianwen Xue, Julia Bonn, Jan Hajič (2023): Meaning Representations for Natural Languages: Design, Models and Applications (LectureNotes). (url)
Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polák, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe (2023): ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit. In: Proceedings of 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pp. 400-411, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-00-6 (url, bibtex)
Juntao Yu, Michal Novák, Abdulrahman Aloraini, Nafise Sadat Moosavi, Silviu Paun, Sameer Pradhan, Massimo Poesio (2023): The Universal Anaphora Scorer 2.0. In: Proceedings of the 15th International Conference on Computational Semantics, pp. 183-194, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-74-6 (url, bibtex)
Daniel Zeman (2023): Cross-Language Harmonization of Linguistic Resources (habilitation). In: (pdf, local PDF, bibtex)
Daniel Zeman, Pavel Kosek, Martin Březina, Jiří Pergler (2023): Morphosyntactic Annotation in Universal Dependencies for Old Czech. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 74, no. 1, pp. 214-222 (pdf, local PDF, local PDF, bibtex)
Zdeněk Žabokrtský, Miloslav Konopík, Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk, Martin Popel, Ondřej Pražák, Jakub Sido, Daniel Zeman (2023): Findings of the Second Shared Task on Multilingual Coreference Resolution. In: Proceedings of the CRAC 2023 Shared Task on Multilingual Coreference Resolution, pp. 1-18, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-02-5 (pdf, local PDF, bibtex)
Idris Abdulmumin, Satya Ranjan Dash, Musa Abdullahi Dawud, Shantipriya Parida, Shamsuddeen Hassan Muhammad, Ibrahim Sa'id Ahmad, Subhadarshi Panda, Ondřej Bojar, Bashir Shehu Galadanci, Bello Shehu Bello (2022): Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 6471-6479, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (url, local PDF, bibtex)
Itziar Aldabe, Jane Dunne, Aritz Farwell, Owen Gallagher, Federico Gaspari, Maria Giagkou, Jan Hajič, Jens Peter Kückens, Teresa Lynn, Georg Rehm, German Rigau, Katrin Marheinecke, Stelios Piperidis, Natalia Resende, Tereza Vojtěchová, Andy Way (2022): Overview of the ELE Project. In: Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, pp. 353-354, European Association for Machine Translation, Ghent, Belgium, ISBN 9789464597622 (url, bibtex)
Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Věra Kloudová, Surafel Melaku Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Yun Tang, Matthias Sperber, Sebastian Stuker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alex Waibel, Changhan Wang, Shinji Watanabe (2022): Findings of the IWSLT 2022 Evaluation Campaign. In: Proceedings of the 19th International Conference on Spoken Language Translation, pp. 98-157, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-41-4 (url, local PDF, bibtex)
Mariia Anisimova, Šárka Zikánová (2022): Attitude in diplomatic speeches: a pilot study. In: Proceedings of the 22nd Conference Information Technologies – Applications and Theory (ITAT 2022), pp. 151-158, 22nd Conference on Information Technologies – Applications and Theory, Košice, Slovakia (url, local PDF, bibtex)
Michal Auersperger, Pavel Pecina (2022): Defending Compositionality in Emergent Languages. In: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, pp. 285-291, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-73-5 (pdf, local PDF, bibtex)
Niyati Bafna, Josef van Genabith, Cristina España-Bonet, Zdeněk Žabokrtský (2022): Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum. In: Proceedings of the 26th Conference on Computational Natural Language Learning, pp. 110-131, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-07-4 (bibtex)
Niyati Bafna, Martin Vastl, Ondřej Bojar (2022): Constrained Decoding for Technical Term Retention in English-Hindi MT. In: Proceedings of ICON 2021: 18th International Conference on Natural Language Processing, pp. 1-6, NLP Association India, Centre for Natural Language Processing, Department of Computer Science and Engineering, Silchar, India (local PDF, bibtex)
Niyati Bafna, Zdeněk Žabokrtský (2022): Subword-based Cross-lingual Transfer of Embeddings from Hindi to Marathi and Nepali. In: 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pp. 61-71, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-82-7 (pdf, bibtex)
Gábor Baranyi, Bruno Carlos Dos Santos Melício, Zsófia Gaál, Levente Hajder, András Simonyi, Dániel Sindely, Joul Skaf, Ondřej Dušek, Tomáš Nekvinda, András Lőrincz (2022): AI Technologies for Machine Supervision and Help in a Rehabilitation Scenario. In: Multimodal Technologies and Interaction, ISSN 2414-4088, vol. 6, no. 7, pp. 48-73 (url, bibtex)
Khuyagbaatar Batsuren, Gábor Bella, Aryaman Arora, Viktor Martinovic, Kyle Gorman, Zdeněk Žabokrtský, Amarsanaa Ganbold, Šárka Dohnalová, Magda Ševčíková, Kateřina Pelegrinová, Fausto Giunchiglia, Ryan Cotterell, Ekaterina Vylomova (2022): The SIGMORPHON 2022 Shared Task on Morpheme Segmentation. In: 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pp. 103-116, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-82-7 (pdf, bibtex)
Rachel Bawden, Ondřej Bojar, Rajen Chatterjee, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Rebecca Knowles, Tom Kocmi, Philipp Koehn, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri, Michal Novák, Martin Popel, Maja Popović, Mariya Shmatova, Marco Turchi (2022): Findings of the 2022 Conference on Machine Translation (WMT22). In: Proceedings of the Seventh Conference on Machine Translation, pp. 1-34, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, local PDF, bibtex)
Sunit Bhattacharya, Rishu Kumar, Ondřej Bojar (2022): Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Model. In: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, pp. 130-135, Association for Computational Linguistics, Stroudsburg, PA, USA (local PDF, bibtex)
Sunit Bhattacharya, Vilém Zouhar, Ondřej Bojar (2022): Sentence Ambiguity, Grammaticality and Complexity Probes. In: Proceedings of the 5th Workshop on Analyzing and Interpreting Neural Networks for NLP, pp. 1-11, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, local PDF, bibtex)
Lukáš Burget, Ondřej Bojar (2022): Průběžná zpráva NEUREM3 (technical report). In: (pdf, bibtex)
Silvie Cinková (2022): Jak se na srozumitelnost dívá věda? Srozumitelnost v zrcadle psychologie, lingvistiky a matematiky (LectureNotes). (url)
Silvie Cinková, Jan Škvrňák, Michael Škvrňák (2022): Výuka digitálních humanitních věd na českých veřejných vysokých školách podle latentní sémantické analýzy. In: Digitální obrat v českých humanitních a sociálních vědách, pp. 367-407, Karolinum, Prague, Czech Republic, ISBN 978-80-246-5193-4 (bibtex)
Satya Ranjan Dash, Shantipriya Parida, Esau Villatoro Tello, Biswaranjan Acharya, Ondřej Bojar (2022): Natural Language Processing In Healthcare, A Special Focus on Low Resource Languages. In: , ISBN 9780367685393 (bibtex)
Ondřej Dušek (2022): Problémy dnešních generátorů jazyka. In: Vesmír, ISSN 0042-4544, 101, pp. 554-555 (url, bibtex)
Tomaž Erjavec, Matyáš Kopp (2022): TEI and Git in ParlaMint: Collaborative Development of Language Resources. In: CLARIN Annual Conference Proceedings 2022, pp. 57-60, CLARIN ERIC, Praha, Czechia (url, bibtex)
Cristina Fernández Alcaina, Eva Fučíková, Zdeňka Urešová (2022): Annotation guidelines for Spanish verbal synonyms in the SynSemClass lexicon (technical report). In: (url, local PDF, bibtex)
Federica Gamba, Francesca Frontini, Daan Broeder, Monica Monachini (2022): Language Technologies for the Creation of Multilingual Terminologies. Lessons Learned from the SSHOC Project. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 154-163, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, bibtex)
Muskan Garg, Seema Wazarkar, Muskaan Singh, Ondřej Bojar (2022): Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 6837-6847, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (url, local PDF, bibtex)
Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir R. Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, Genta Indra Winata, Hendrik Strobelt, Hiroaki Hayashi, Jekaterina Novikova, Jenna Kanerva, Jenny Chim, Jiawei Zhou, Jordan Clive, Joshua Maynez, Joao Sedoc, Juraj Juraska, Kaustubh D. Dhole, Khyati Rangavi Chandu, Laura Perez-Beltrachini, Leonardo F. R. Ribeiro, Lewis Tunstall, Li Zhang, Mahima Pushkarna, Mathias Creutz, Michael White, Mihir Kale, Moussa Kamal Eddine, Nico Daheim, Nishant Subramani, Ondřej Dušek, Paul Pu Liang, Pawan Sasanka Ammanamanchi, Qi Zhu, Ratish Puduppully, Reno Kriz, Rifat Shahriyar, Ronald Cardenas, Saad Mahamood, Salomey Osei, Samuel Cahyawijaya, Sanja Štajner, Sebastien Montella, Shailza Jolly, Simon Mille, Tahmid Hasan, Tianhao Shen, Tosin Adewumi, Vikas Raunak, Vipul Raheja, Vitaly Nikolaev, Vivian Tsai, Yacine Jernite, Ying Xu, Yisi Sang, Yixin Liu, Yufang Hou (2022): GEMv2: Multilingual NLG Benchmarking in a Single Line of Code (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, 2206.11249, pp. 1-16 (url)
Tirthankar Ghosal, Tanik Saikh, Tameesh Biswas, Asif Ekbal, Pushpak Bhattacharyya (2022): Novelty Detection: A Perspective from Natural Language Processing. In: Computational Linguistics, ISSN 1530-9312, vol. 48, no. 1, pp. 77-117 (url, bibtex)
Barry Haddow, Rachel Bawden, Antonio Valerio Miceli Barone, Jindřich Helcl, Alexandra Birch (2022): Survey of Low-Resource Machine Translation. In: Computational Linguistics, ISSN 1530-9312, vol. 48, no. 3, pp. 673-732 (pdf, bibtex)
Jan Hajič, Eva Hajičová, Barbora Hladká, Ondřej Košarko, Jozef Mišutka, Pavel Straňák (2022): LINDAT/CLARIAH-CZ: Where We Are and Where We Go. In: CLARIN: The Infrastructure for Language Resources, pp. 61-82, De Gruyter, Berlin, Boston, ISBN 978-3-11-076734-6 (bibtex)
Eva Hajičová (2022): Patrice Pognan (ne)vypočítatelný. In: Des langues calculables à l'homme incalculable. Hommage à Patrice Pognan, pp. 7-8, PLIDAM, Paris, France, ISBN 9782813004260 (bibtex)
Eva Hajičová (2022): Cesta od lingvistické teorie k anotovanému korpusu a zpátky. In: Človek a jeho jazyk 5. Povaha jazyka a jej poznávanie, sborník věnovaný 100. výročí narození prof. PhDr. Jána Horeckého, DrSc., pp. 35-47, Slovenská akadémia vied, Bratislava, Slovakia, ISBN 978-80-224-1977-2 (bibtex)
Eva Hajičová, Jan Hajič, Barbora Hladká, Jiří Mírovský, Lucie Poláková, Kateřina Rysová, Magdaléna Rysová, Pavel Straňák, Barbora Štěpánková, Šárka Zikánová (2022): Corpus Annotation as a Feasible and Scientifically Beneficial Task. In: CLARIN: The Infrastructure for Language Resources, pp. 613-646, Walter de Gruyter GmbH, Berlin/Boston, Mannheim, Germany, ISBN 978-3-11-076734-6 (url, bibtex)
Eva Hajičová, Marie Mikulová (2022): Information structure in a formal description of language as reflected in an annotated corpus of Czech. In: Lifetime Linguistic Inspirations. To Igor Mel’čuk from Colleagues and Friends for his 90th Birthday, pp. 187-200, Peter Lang, Berlin, ISBN 978-3-631-89042-4 (bibtex)
Eva Hajičová, Marie Mikulová, Barbora Štěpánková, Jiří Mírovský (2022): Advantages of a complex multilayer annotation scheme: The case of the Prague Dependency Treebank. In: Proceedings of The 16th Linguistic Annotation Workshop (LAW-XVI) within LREC2022, pp. 70-78, European Language Resources Association, Marseille, France, ISBN 978-2-493814-08-1 (url, bibtex)
Katharina Hämmerl, Jindřich Libovický, Alexander Fraser (2022): Combining Static and Contextualised Multilingual Embeddings. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 2316-2329, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-25-4 (url, local PDF, local PDF, bibtex)
Jindřich Helcl (2022): CUNI Non-Autoregressive System for the WMT 22 Efficient Translation Shared Task. In: Proceedings of the Seventh Conference on Machine Translation, pp. 668-670, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, bibtex)
Jindřich Helcl, Barry Haddow, Alexandra Birch (2022): Non-Autoregressive Machine Translation: It's Not as Fast as it Seems. In: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1780-1790, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-71-1 (local PDF, bibtex)
David Herel, Dominika Zogatová, Matěj Kripner, Tomáš Mikolov (2022): Emergence of Novelty in Evolutionary Algorithms. In: Proceedings of the ALIFE 2022: The 2022 Conference on Artificial Life, pp. 146-154, MIT Press, Cambridge, MA, USA (pdf, bibtex)
Barbora Hladká, Jiří Mírovský, Matyáš Kopp, Václav Moravec (2022): Annotating Attribution in Czech News Server Articles. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1817-1823, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, local PDF, bibtex)
Jaroslava Hlaváčová, Lukáš Kyjánek, Magda Ševčíková (2022): Global Variants in the Czech Language. In: Proceedings of the 22nd Conference Information Technologies – Applications and Theory (ITAT 2022), pp. 122-129, 22nd Conference on Information Technologies – Applications and Theory, Košice, Slovakia (pdf, bibtex)
Hana Hledíková (2022): Conversion in English and Czech: a corpus study of semantic relations between nouns and verbs (masters thesis). In: (url, bibtex)
Christian Huber, Rishu Kumar, Ondřej Bojar, Alex Waibel (2022): Short-Term Word-Learning in a Dynamically Changing Environment (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, pp. 1-4 (url)
Vojtěch Hudeček, Ondřej Dušek (2022): Learning Interpretable Latent Dialogue Actions With Less Supervision. In: Proceedings of The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, pp. 1-13, Association for Computational Linguistics, Stroudsburg, PA, USA (url, bibtex)
Vojtěch Hudeček, Léon-Paul Schaub, Daniel Štancl, Patrick Paroubek, Ondřej Dušek (2022): DIASER: A Unifying View On Task-oriented Dialogue Annotation. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1286-1296, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (url, bibtex)
Rudali Huidrom, Ondřej Dušek, Zdeněk Kasner, Thiago Castro Ferreira, Anya Belz (2022): Two Reproductions of a Human-Assessed Comparative Evaluation of a Semantic Error Detection System. In: Proceedings of the 15th International Conference on Natural Language Generation: Generation Challenges, pp. 52-61, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-60-5 (url, local PDF, bibtex)
Dávid Javorský, Dominik Macháček, Ondřej Bojar (2022): Continuous Rating as Reliable Human Evaluation of Simultaneous Speech Translation. In: Proceedings of the Seventh Conference on Machine Translation, pp. 154-164, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, local PDF, bibtex)
Josef Jon, Martin Popel, Ondřej Bojar (2022): CUNI-Bergamot Submission at WMT22 General Task. In: Proceedings of the Seventh Conference on Machine Translation, pp. 280-289, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, local PDF, bibtex)
Pavel Kasík, Jindřich Libovický, Jindřich Helcl, Michal Novák (2022): Český překladač se naučil ukrajinsky rychle. Jen někdy plete jména měst. In: Seznam Zprávy, pp. 1-2 (url, bibtex)
Zdeněk Kasner, Ondřej Dušek (2022): Neural Pipeline for Zero-Shot Data-to-Text Generation. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: ACL 2022, pp. 3914-3932, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-21-6 (url, bibtex)
Aleksei Kelli, Krister Lindén, Paweł Kamocki, Kadri Vider, Penny Labropoulou, Ramūnas Birštonas, Vadim Mantrov, Vanessa Hannesschläger, Ricardo Del Grata, Age Värv, Gaabriel Tavits, Andres Vutt, Ester Hoorn, Jan Hajič, Arvi Tavast (2022): The Interaction of Personal Data, Intellectual Property and Freedom of Expression in the Context of Language Research. In: Selected papers from the CLARIN AC 2021, pp. 76-87, Linköping Electronic Conference Proceedings, Linköping, Sweden, ISBN 978-91-7929-444-1 (bibtex)
Václava Kettnerová, Markéta Lopatková, Anna Vernerová (2022): Reflexives as Part of Verb Lexemes in the VALLEX Lexicon. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 119, pp. 37-66 (pdf, local PDF, bibtex)
Veronika Kolářová, Anna Vernerová (2022): NomVallex: A Valency Lexicon of Czech Nouns and Adjectives. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1344-1352, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (url, local PDF, bibtex)
Mateusz Krubiński, Pavel Pecina (2022): From COMET to COMES – Can Summary Evaluation Benefit from Translation Evaluation? In: Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems, pp. 21-31, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, local PDF, bibtex)
Nalin Kumar, Ondřej Bojar (2022): Genre Transfer in NMT: Creating Synthetic Spoken Parallel Sentences using Written Parallel Data. In: 19th International Conference on Natural Language Processing, pp. 224-233, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-38-8 (url, local PDF, bibtex)
Rishu Kumar, Rudolf Rosa (2022): TEAM UFAL @ CreativeSumm 2022: BART and SamSum based few-shot approach for creative Summarization. In: Proceedings of The Workshop on Automatic Summarization for Creative Writing, pp. 24-28, Association for Computational Linguistics, Stroudsburg, PA, USA (url, local PDF, bibtex)
Ivana Kvapilíková, Ondřej Bojar (2022): CUNI Submission to MT4All Shared Task. In: Proceedings of the LREC 2022 Workshop of the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages (SIGUL 2022), pp. 78-82, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-91-7 (bibtex)
Lukáš Kyjánek (2022): Web-based Annotation Interface for Derivational Morphology. In: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, pp. 10-16, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-74-2 (pdf, bibtex)
Lukáš Kyjánek, Olga Lyashevskaya, Anna Nedoluzhko, Daniil Vodolazsky, Zdeněk Žabokrtský (2022): Constructing a Lexical Resource of Russian Derivational Morphology. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 2788-2797, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, bibtex)
Wen Lai, Jindřich Libovický, Alexander Fraser (2022): Improving Both Domain Robustness and Domain Adaptability in Machine Translation. In: The 29th International Conference on Computational Linguistics, Proceedings of the Main Conference, pp. 5191-5204, ICCL, Sheffield, UK (url, local PDF, local PDF, bibtex)
Oliver Lemon, Dilek Hakkani-Tur, Junyi Jessy Li, Arash Ashrafzadeh, Daniel Hernández Garcia, Malihe Alikhani, David Vandyke, Ondřej Dušek (2022): Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue. In: Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-66-7 (url)
Jindřich Libovický, Alexander Fraser (2022): Neural String Edit Distance. In: Proceedings of the Sixth Workshop on Structured Prediction for NLP, pp. 52-66, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-955917-51-3 (url, local PDF, local PDF, bibtex)
Jindřich Libovický, Helmut Schmid, Alexander Fraser (2022): Why don’t people use character-level machine translation? In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 2470-2485, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-25-4 (url, local PDF, local PDF, local PDF, bibtex)
Tomasz Limisiewicz, Dan Malkin, Gabriel Stanovsky (2022): You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422 (pdf)
Tomasz Limisiewicz, David Mareček (2022): Don’t Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information. In: Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pp. 17-29, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-68-1 (pdf, bibtex)
Dan Malkin, Tomasz Limisiewicz, Gabriel Stanovsky (2022): A Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the Linguistic Blood Bank. In: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 4903-4915, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-71-1 (pdf, bibtex)
Andrei-Alexandru Manea (2022): Identification of plausible and incoherent instructions (masters thesis). In: (url, bibtex)
Jiří Mayer (2022): Semi-supervised learning in Optical Music Recognition (masters thesis). In: (url, bibtex)
Jiří Mayer, Pavel Pecina (2022): Obstacles with Synthesizing Training Data for OMR. In: Proceedings of the 4th International Workshop on Reading Music Systems, pp. 15-19, University of Alicante, Alicante, Spain (url, local PDF, bibtex)
Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková, Jan Hajič (2022): Quality and Efficiency of Manual Annotation: Pre-annotation Bias. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 2909-2918, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (url, local PDF, bibtex)
Jakub Mlynář, Jiří Kocián, Karin Hofmeisterová (2022): How “Tools” Produce “Data”: Searching in a Large Digital Corpus of Audiovisual Holocaust Testimonies. In: Jewish Studies in the Digital Age, pp. 65-88, De Gruyter Oldenbourg, Berlin, Germany, ISBN 9783110744828 (url, bibtex)
Sourabrata Mukherjee, Zdeněk Kasner, Ondřej Dušek (2022): Balancing the Style-Content Trade-Off in Sentiment Transfer Using Polarity-Aware Denoising. In: 25th International Conference on Text, Speech and Dialogue, pp. 172-186, Springer, Cham, Switzerland, ISBN 978-3-031-16269-5 (url, bibtex)
Toshiaki Nakazawa, Hideya Mino, Isao Goto, Raj Dabre, Shohei Higashiyama, Shantipriya Parida, Anoop Kunchukuttan, Makoto Morishita, Ondřej Bojar, Chenhui Chu, Kaori Abe, Yusuke Oda, Sadao Kurohashi (2022): Overview of the 9th Workshop on Asian Translation. In: Proceedings of the 9th Workshop on Asian Translation, pp. 1-36, International Conference on Computational Linguistics, Gyeongju, Korea (url, bibtex)
Jakub Náplava, Milan Straka, Jana Straková, Alexandr Rosen (2022): Czech Grammar Error Correction with a Large and Diverse Corpus. In: Transactions of the Association for Computational Linguistics, ISSN 2307-387X, 10, pp. 452-467 (url, local PDF, bibtex)
Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský, Amir Zeldes, Daniel Zeman (2022): CorefUD 1.0: Coreference Meets Universal Dependencies. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 4859-4872, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, bibtex)
Anna Nedoluzhko, Muskaan Singh, Marie Hledíková, Tirthankar Ghosal, Ondřej Bojar (2022): ELITR Minuting Corpus: A Novel Dataset for Automatic Minuting from Multi-Party Meetings in English and Czech. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 3174-3182, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, local PDF, bibtex)
Tomáš Nekvinda, Ondřej Dušek (2022): AARGH! End-to-end Retrieval-Generation for Task-Oriented Dialog. In: Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 283-297, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-66-7 (url, bibtex)
Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, Katja Meden (2022): ParlaMint II: The Show Must Go On. In: Proceedings of the LREC 2022 ParlaCLARIN III Workshop on Creating, Enriching and Using Parliamentary Corpora, pp. 1-6, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-85-6 (pdf, local PDF, local PDF, bibtex)
Jarmila Panevová, Marie Mikulová (2022): Synonymie a homonymie v gramatice. In: Človek a jeho jazyk 5. Povaha jazyka a jej poznávanie, pp. 59-67, Veda, vydavateľstvo SAV, Bratislava, Slovakia, ISBN 978-80-224-1977-2 (bibtex)
Gustavo Penha, Svitlana Vakulenko, Ondřej Dušek, Leigh Clark, Vaishali Pal, Vaibhav Adlakha (2022): The Seventh Workshop on Search-Oriented Conversational Artificial Intelligence (SCAI'22). In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3466-3469, Association for Computing Machinery, New York, NY, USA, ISBN 978-1-4503-8732-3 (url, bibtex)
Petr Pohůdka, Rudolf Rosa (2022): Rudolf Rosa: Dokázali jsme, že doba, kdy AI zvládne napsat divadelní hru, je velmi blízko (Electronic). (url)
Petr Pohůdka, Rudolf Rosa (2022): Rudolf Rosa: We have proved that AI is almost advanced enough to write a theater play (Electronic). (url)
Lucie Poláková (2022): Globální koherence českých textů a možnosti jejího korpusového zpracování. Zpráva o aktuálním projektu Ústavu formální a aplikované lingvistiky MFF UK. In: Jazykovědné aktuality, ISSN 1212-5326, vol. LIX, no. 1-2, pp. 45-50 (url, bibtex)
Peter Polák, Ngoc-Quan Ngoc, Tuan-Nam Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, Ondřej Bojar, Alex Waibel (2022): CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022. In: Proceedings of the 19th International Conference on Spoken Language Translation, pp. 277-285, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-955917-41-4 (url, local PDF, bibtex)
Peter Polák, Muskaan Singh, Anna Nedoluzhko, Ondřej Bojar (2022): ALIGNMEET: A Comprehensive Tool for Meeting Annotation, Alignment, and Evaluation. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1771-1779, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, local PDF, bibtex)
Martin Popel, Jindřich Libovický, Jindřich Helcl (2022): CUNI Systems for the WMT 22 Czech-Ukrainian Translation Task. In: Proceedings of the Seventh Conference on Machine Translation, pp. 352-357, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, local PDF, bibtex)
Borek Požár, Klára Tauchmanová, Kristýna Neumannová, Ivana Kvapilíková, Ondřej Bojar (2022): CUNI Submission to the BUCC 2022 Shared Task on Bilingual Term Alignment. In: Proceedings of the LREC 2022 15th Workshop on Building and Using Comparable Corpora, pp. 43-49, European Language Resources Association, Paris, France, ISBN 979-10-95546-94-8 (local PDF, bibtex)
Rudolf Rosa (2022): 70/70: Rudolf Rosa (Electronic). In: Matfyz.cz (url)
Rudolf Rosa, Patrícia Schmidtová, Ondřej Dušek, Tomáš Musil, David Mareček, Saad Obaid ul Islam, Marie Nováková, Klára Vosecká, Josef Doležal (2022): GPT-2-based Human-in-the-loop Theatre Play Script Generation. In: Proceedings of the 4th Workshop of Narrative Understanding, pp. 29-37, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-85-8 (url, local PDF, bibtex)
Rudolf Rosa, Patrícia Schmidtová, Alisa Zakhtarenko, Ondřej Dušek, Tomáš Musil, David Mareček, Saad Obaid ul Islam, Marie Nováková, Klára Vosecká, Daniel Hrbek, David Košťák (2022): THEaiTRobot: An Interactive Tool for Generating Theatre Play Scripts. In: Proceedings of the 15th International Conference on Natural Language Generation: System Demonstrations, pp. 10-13, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-60-5 (url, local PDF, bibtex)
Philipp Rösch, Jindřich Libovický (2022): Probing the Role of Positional Information in Vision-Language Models. In: Findings of the Association for Computational Linguistics: NAACL 2022, pp. 1031-1041, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-76-6 (url, local PDF, local PDF, bibtex)
Kateřina Rysová, Magdaléna Rysová, Eva Hajičová (2022): L’utilisation des conjonctions comme outil de cohésion textuelle dans le tchèque de locuteurs non-natifs. In: Écho des études romanes, ISSN 1801-0865, vol. 18, no. 1, pp. 67-80 (url, local PDF, bibtex)
Kirill Semenov, Ondřej Bojar (2022): Automated Evaluation Metric for Terminology Consistency in MT. In: Proceedings of the Seventh Conference on Machine Translation, pp. 1-6, Association for Computational Linguistics, Stroudsburg, PA, USA (bibtex)
Sukanta Sen, Ondřej Bojar, Barry Haddow (2022): Simultaneous Translation for Unsegmented Input: A Sliding Window Approach (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, pp. 1-8 (url)
Kartik Shinde, Tirthankar Ghosal, Ondřej Bojar (2022): Automatic minuting: A pipeline method for generating minutes from multi-party meeting proceedings. In: Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, pp. 1-12, ACL, Stroudsburg PA 18360, USA (url, local PDF, bibtex)
Patrícia Schmidtová, Dávid Javorský, Christián Mikláš, Tomáš Musil, Rudolf Rosa, Ondřej Dušek (2022): DialogueScript: Using Dialogue Agents to Produce a Script (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, 2206.08425, pp. 1-5 (url)
Patrícia Schmidtová, Rudolf Rosa, David Košťák, Tomáš Studeník, Daniel Hrbek, Tomáš Musil, Josef Doležal, Ondřej Dušek, David Mareček, Klára Vosecká, Marie Nováková, Petr Žabka, Alisa Zakhtarenko, Dominik Jurko, Martina Kinská, Tom Kocmi, Ondřej Bojar (2022): THEaiTRE: Generating Theatre Play Scripts using Artificial Intelligence. In: , ISBN 978-80-88132-14-1 (url, bibtex)
Radek Skarnitzl, Hana Hledíková (2022): Prosodic Phrasing of Good Speakers in English and Czech. In: Frontiers in Psychology, ISSN 1664-1078, 13, pp. 857647-857647 (url, bibtex)
Milan Straka, Jana Straková (2022): ÚFAL CorPipe at CRAC 2022: Effectivity of Multilingual Models for Coreference Resolution. In: Proceedings of the CRAC 2022 Shared Task on Multilingual Coreference Resolution, pp. 28-37, Association for Computational Linguistics, Gyeongju, Korea (url, local PDF, bibtex)
Emil Svoboda, Tomáš Bořil, Jan Rusz, Tereza Tykalová, Dana Horáková, Charles R. G. Guttmann, Krastan B. Blagoev, Hiroto Hatabu, Vladimir I. Valtchinov (2022): Assessing clinical utility of Machine Learning and Artificial Intelligence approaches to analyze speech recordings in Multiple Sclerosis: A Pilot Study. In: Computers in Biology and Medicine, ISSN 0010-4825, 148, pp. 1-10 (bibtex)
Emil Svoboda, Magda Ševčíková (2022): Word Formation Analyzer for Czech: Automatic Parent Retrieval and Classification of Word Formation Processes. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 118, pp. 55-73 (pdf, bibtex)
Magda Ševčíková (2022): Action meanings in noun/verb conversion: Native and foreign word-formation in Czech. In: Linguistica Pragensia, ISSN 0862-8432, vol. 32, no. 2, pp. 173-197 (pdf, bibtex)
Magda Ševčíková, Hana Hledíková (2022): Paradigms in English and Czech noun/verb conversion: A contrastive study of lexemes with borrowed roots. In: Paradigms in Word Formation: Theory and Applications, pp. 181-214, John Benjamins Publishing Company, Amsterdam, The Netherlands, ISBN 9789027211583 (bibtex)
Michal Škrabal, Zuzana Laubeová, Barbora Štěpánková (2022): Kontrastivní analýza frekvenčních špiček psaného a mluveného lexika. In: Korpusové přístupy k české diglosii, pp. 32-92, Nakladatelství Lidové noviny, Praha, Czech Republic, ISBN 978-80-7422-944-2 (bibtex)
Adam Šmelko, Martin Kruliš, Miroslav Kratochvíl, Jiří Klepl, Jiří Mayer, Petr Šimůnek (2022): Astute Approach to Handling Memory Layouts of Regular Data Structures. In: 22nd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2022, Copenhagen, Denmark (Online), 10 - 12 October, 2022, Lecture Notes in Computer Science, ISSN 0302-9743, 13777, pp. 507-528, Springer International Publishing, Cham, Switzerland, ISBN 978-3-031-22676-2 (url, bibtex)
Barbora Štěpánková (2022): K diglosii v českých výkladových slovnících. In: Korpusové přístupy k české diglosii, pp. 93-116, Nakladatelství Lidové noviny, Praha, Czech Republic, ISBN 978-80-7422-944-2 (bibtex)
1.0 THEaiTRobot, David Košťák, Daniel Hrbek, Rudolf Rosa, Ondřej Dušek (2022): Úryvek z divadelní hry AI: Když robot píše hru. In: Academix revue, ISSN 2788-094X, 4, pp. 46-49 (bibtex)
2.0 THEaiTRobot, Josef Doležal, Klára Vosecká, Tomáš Musil, David Mareček, Rudolf Rosa (2022): Permeation (technical report). In: (pdf, bibtex)
Zdeňka Urešová, Karolina Zaczynska, Peter Bourgonje, Eva Fučíková, Georg Rehm, Jan Hajič (2022): Making a Semantic Event-type Ontology Multilingual. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1332-1334, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, local PDF, bibtex)
Šárka Zikánová, Jiří Mírovský, Lucie Poláková (2022): Structuration globale du texte: une étude de corpus. In: Écho des études romanes, ISSN 1801-0865, 18, pp. 99-115 (local PDF, bibtex)
Zdeněk Žabokrtský, Niyati Bafna, Jan Bodnár, Lukáš Kyjánek, Emil Svoboda, Magda Ševčíková, Jonáš Vidra (2022): Towards Universal Segmentations: UniSegments 1.0. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1137-1149, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, bibtex)
Zdeněk Žabokrtský, Miloslav Konopík, Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk, Martin Popel, Ondřej Pražák, Jakub Sido, Daniel Zeman, Yilun Zhu (2022): Findings of the Shared Task on Multilingual Coreference Resolution. In: Proceedings of the CRAC 2022 Shared Task on Multilingual Coreference Resolution, pp. 1-17, Association for Computational Linguistics, Gyeongju, Korea (url, local PDF, local PDF, bibtex)
Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondřej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-Jussà, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher M. Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri (2021): Findings of the 2021 Conference on Machine Translation (WMT21). In: Proceedings of the Sixth Conference on Machine Translation, pp. 1-88, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (pdf, local PDF, bibtex)
Khalid Al-Khatib, Tirthankar Ghosal, Yufang Hou, Anita De-Waard, Dayne Freitag (2021): Argument Mining for Scholarly Document Processing: Taking Stock and Looking Ahead. In: Proceedings of the Second Workshop on Scholarly Document Processing, pp. 56-65, ACL, 209 N. Eighth Street, Stroudsburg PA 18360, USA (local PDF, bibtex)
Antonios Anastasopoulos, Ondřej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alex Waibel, Changhan Wang, Matthew Wiesner (2021): Findings of the IWSLT 2021 Evaluation Campaign. In: Proceedings of the 18th International Conference on Spoken Language Translation, pp. 1-29, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-954085-74-9 (url, local PDF, bibtex)
Mariia Anisimova (2021): An Introductory Overview of Evaluating Facts and Attitudes in Diplomatic Discourse. In: ITAT 2021 2nd Workshop on Automata, Formal and Natural Languages – WAFNL 2021, pp. 1-4, Faculty of Mathematics and Physics, Praha, Czechia (pdf, bibtex)
Ebrahim Ansari, Ondřej Bojar, Barry Haddow, Mohammad Mahmoudi (2021): SLTev: Comprehensive Evaluation of Spoken Language Translation. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pp. 71-79, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-954085-05-3 (url, local PDF, local PDF, bibtex)
Hardik Arora, Tirthankar Ghosal, Sandeep Kumar, Suraj Singh Patwal, Phil Gooch (2021): INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers. In: Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), pp. 502-510, ACL, 209 N. Eighth Street, Stroudsburg PA 18360, USA (local PDF, bibtex)
Michal Auersperger, Pavel Pecina (2021): Solving SCAN Tasks with Data Augmentation and Input Embeddings. In: Proceedings of the Recent Advances in Natural Language Processing, pp. 86-91, INCOMA Ltd., Shoumen, Bulgaria, ISBN 978-954-452-072-4 (pdf, local PDF, bibtex)
Iz Beltagy, Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Keith Hall, Drahomira Hermannova, Petr Knoth, Kyle Lo, Philipp Mayr, Robert Patton, Michal Shmueli-Scheuer, Anita De-Waard, Kuansan Wang, Lucy Lu Wang (2021): Overview of the Second Workshop on Scholarly Document Processing. In: Proceedings of the Second Workshop on Scholarly Document Processing, pp. 159-165, ACL, 209 N. Eighth Street, Stroudsburg PA 18360, USA (local PDF, bibtex)
Klára Bendová (2021): Using a Parallel Corpus to Adapt the Flesch Reading Ease Formula to Czech. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, 2, pp. 477-487 (pdf, bibtex)
Klára Bendová, Silvie Cinková (2021): Adaptation of Classic Readability Metrics to Czech. In: 24th International Conference on Text, Speech and Dialogue, pp. 159-171, Springer, Cham, Switzerland, ISBN 978-3-030-83526-2 (local PDF, bibtex)
Aakash Bhatnagar, Nidhir Bhavsar, Muskaan Singh, Tirthankar Ghosal (2021): CUNI-NU System at Biocreative VII Track 5: Multi-label Topic Classification of COVID-19 Articles using Dual Attention with SPECTRE. In: Proceedings of the BioCreative VII Challenge Evaluation Workshop, pp. 283-288, University of Delaware, Delaware, US, ISBN 978-0-578-32368-8 (local PDF, bibtex)
Alexandra Birch, Barry Haddow, Antonio Valerio Miceli Barone, Jindřich Helcl, Jonas Waldendorf, Felipe Sánchez-Martínez, Mikel L. Forcada, Víctor M. Sánchez-Cartagena, Juan Antonio Pérez-Ortiz, Miquel Esplà-Gomis, Wilker Aziz, Lina Murady, Sevi Sariisik, Peggy van der Kreeft, Kay MacQuarrie (2021): Surprise Language Challenge: Developing a Neural Machine Translation System between Pashto and English in Two Months. In: Proceedings of Machine Translation Summit XVIII: Research Track, pp. 92-102, Association for Machine Translation in the Americas, Stroudsburg, PA, USA (bibtex)
Ondřej Bojar, Dominik Macháček, Sangeet Sagar, Otakar Smrž, Jonáš Kratochvíl, Peter Polák, Ebrahim Ansari, Mohammad Mahmoudi, Rishu Kumar, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian Stüker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams (2021): ELITR Multilingual Live Subtitling: Demo and Strategy. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pp. 271-277, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-954085-05-3 (bibtex)
Ondřej Bojar, Vojtěch Srdečný, Rishu Kumar, Otakar Smrž, Felix Schneider, Barry Haddow, Phil Williams, Chiara Canton (2021): Operating a Complex SLT System with Speakers and Human Interpreters. In: Proceedings of Machine Translation Summit XVIII 1st Workshop on Automatic Spoken Language Translation in Real-World Settings, pp. 23-34, Association for Machine Translation in the Americas, Stroudsburg, PA, USA (pdf, bibtex)
Gosse Bouma, Djamé Seddah, Daniel Zeman (2021): From Raw Text to Enhanced Universal Dependencies: the Parsing Shared Task at IWPT 2021. In: Proceedings of the 17th International Conference on Parsing Technologies and the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies, pp. 146-157, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-80-0 (url, local PDF, bibtex)
Peter Bourgonje, Karolina Zaczynska, Julián Moreno-Schneider, Georg Rehm, Zdeňka Urešová, Jan Hajič (2021): SynSemClass for German: Extending a Multilingual Verb Lexicon. In: 2nd International Conference on Digital Curation Technologies, pp. 1-11, CEUR-WS, Aachen, Germany (pdf, local PDF, local PDF, bibtex)
Silvie Cinková, Camille Latimier (2021): Easy Language in Czechia. In: Handbook of Easy Languages in Europe, pp. 119-148, Frank & Timme GmbH Verlag für wissenschaftliche Literatur, Berlin, Germany, ISBN 978-3-7329-0771-7 (url, bibtex)
Marie-Catherine de Marneffe, Christopher Manning, Joakim Nivre, Daniel Zeman (2021): Universal Dependencies. In: Computational Linguistics, ISSN 1530-9312, vol. 47, no. 2, pp. 255-308 (url, local PDF, bibtex)
Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Andrej Pančur, Nikola Ljubešić, Tommaso Agnoloni, Starkaður Barkarson, María Calzada Pérez, Çağrı Çöltekin, Matthew Coole, Roberts Darģis, Luciana de Macedo, Jesse de Does, Katrien Depuydt, Sascha Diwersy, Dorte Haltrup Hansen, Matyáš Kopp, Tomas Krilavičius, Giancarlo Luxardo, Maarten Marx, Vaidas Morkevičius, Costanza Navarretta, Paul Rayson, Orsolya Ring, Michał Rudolf, Kiril Simov, Steinþór Steingrímsson, István Üveges, Ruben van Heusden, Giulia Venturi (2021): ParlaMint: Comparable Corpora of European Parliamentary Data. In: CLARIN Annual Conference Proceedings 2021, pp. 20-25, CLARIN ERIC, Utrecht, The Netherlands (url, bibtex)
Markus Freitag, Ricardo Rei, Nitika Mathur, Chi-kiu Lo, Craig Stewart, George Foster, Alon Lavie, Ondřej Bojar (2021): Results of the WMT21 Metrics Shared Task: Evaluating Metrics with Expert-based Human Evaluations on TED and News Domain. In: Proceedings of the Sixth Conference on Machine Translation, pp. 733-774, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (url, local PDF, bibtex)
Francesca Frontini, Federica Gamba, Monica Monachini, Daan Broeder, Kea Tijdens, Irena Vipavc Brvar (2021): D3.9 Report on Ontology and Vocabulary Collection and Publication (technical report). In: (url, bibtex)
Federica Gamba, Marco Passarotti, Paolo Ruffolo (2021): More Data and New Tools. Advances in Parsing the Index Thomisticus Treebank. In: Proceedings of the Conference on Computational Humanities Research 2021, pp. 108-122, CEUR Workshop Proceedings (CEUR-WS.org) (pdf, bibtex)
Petr Gebauer, Ondřej Bojar, Vojtěch Švandelík, Martin Popel (2021): CUNI Systems in WMT21: Revisiting Backtranslation Techniques for English-Czech NMT. In: Proceedings of the Sixth Conference on Machine Translation, pp. 123-129, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (url, local PDF, bibtex)
Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Kyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D. Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Mihir Kale, Dhruv Kumar, Faisal Ladhak, Aman Madaan, Mounica Maddela, Khyati Mahajan, Saad Mahamood, Bodhisattwa Prasad Majumder, Pedro Henrique Martins, Angelina McMillan-Major, Simon Mille, Emiel van Miltenburg, Moin Nadeem, Shashi Narayan, Vitaly Nikolaev, Rubungo Andre Niyongabo, Salomey Osei, Ankur Parikh, Laura Perez-Beltrachini, Niranjan Ramesh Rao, Vikas Raunak, Juan Diego Rodriguez, Sashank Santhanam, Joao Sedoc, Thibault Sellam, Samira Shaikh, Anastasia Shimorina, Marco Antonio Sobrevilla Cabezudo, Hendrik Strobelt, Nishant Subramani, Wei Xu, Diyi Yang, Akhila Yerukola, Jiawei Zhou (2021): The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics. In: Proceedings of the 1st Workshop on Natural Language Generation, Evaluation, and Metrics (GEM 2021), pp. 96-120, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-67-1 (url, bibtex)
Tirthankar Ghosal, Muskaan Singh (2021): Towards Finding a Research Lineage Leveraging on Identification of Significant Citations. In: Proceedings of the 84th Annual Meeting of the Association for Information Science and Technology, pp. 1-5, Wiley, New Jersey, United States (bibtex)
Tirthankar Ghosal, Muskaan Singh, Anna Nedoluzhko, Ondřej Bojar (2021): Report on the SIGDial 2021 Special Session on Summarization of Dialogues and Multi-Party Meetings (SummDial). In: ACM SIGIR Forum, ISSN 0163-5840, December 2021, pp. 1-17 (pdf, bibtex)
Tirthankar Ghosal, Piyush Tiwari, Robert Patton, Christopher Stahl (2021): Towards Establishing a Research Lineage via Identification of Significant Citations. In: Quantitative Science Studies, ISSN 2641-3337, vol. 2, no. 4, pp. 1-19 (bibtex)
Eva Hajičová, Jiří Mírovský, Barbora Štěpánková (2021): Několik poznámek ke slovosledu a aktuálnímu členění ve světle anglicko-českého paralelního korpusu. In: Lingvistika - korpus - empirie, pp. 51-62, Ústav pro jazyk český, Prague, Czech Republic, ISBN 978-80-88211-13-6 (local PDF, bibtex)
Michael Hanna, Ondřej Bojar (2021): A Fine-Grained Analysis of BERTScore. In: Proceedings of the Sixth Conference on Machine Translation, pp. 507-517, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (url, local PDF, bibtex)
Michael Hanna, David Mareček (2021): Analyzing BERT’s Knowledge of Hypernymy via Prompting. In: Proceedings of the 4th Workshop on Analyzing and Interpreting Neural Networks for NLP, pp. 275-282, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-06-3 (pdf, bibtex)
Jaroslava Hlaváčová (2021): Artificial homonymy. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 72, no. 2, pp. 330-341 (bibtex)
Jaroslava Hlaváčová, Marie Mikulová, Barbora Štěpánková (2021): Konzistence morfologického slovníku MorfFlex. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 72, no. 4, pp. 855-861 (pdf, bibtex)
Petra Hoffmannová, Jiří Kocián (2021): Historical Sources of Giant Mountains – Solution for Digital Archives. In: Journal of Diplomatic and Social Studies, ISSN 2570-9844, vol. 4, no. 1, pp. 33-50 (url, bibtex)
Daniel Hrbek, THEaiTRobot 1.0, Tomáš Studeník, David Košťák, Martina Kinská, Rudolf Rosa, Ondřej Dušek, Tom Kocmi, David Mareček, Tomáš Musil, Patrícia Schmidtová, Dominik Jurko, Ondřej Bojar, Klára Vosecká, Josef Doležal, Marie Nováková, Petr Žabka (2021): AI: Když robot píše hru (online premiéra divadelní hry) (Electronic). (url)
David Hrbek, Tomáš Studeník, Rudolf Rosa, Ondřej Dušek, Daniel Hrbek, David Košťák, Jan Romportl (2021): Ta otázka, ta zvědavost, ta provokace. In: Taneční zóna, ISSN 1213-3450, vol. 25, no. 1/2021, pp. 12-25 (bibtex)
Vojtěch Hudeček, Ondřej Dušek, Zhou Yu (2021): Discovering Dialogue Slots with Weak Supervision. In: Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 2430-2442, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-52-7 (url, bibtex)
Pinzhen Chen, Jindřich Helcl, Ulrich Germann, Laurie Burchell, Nikolay Bogoychev, Antonio Valerio Miceli Barone, Jonas Waldendorf, Alexandra Birch, Kenneth Heafield (2021): The University of Edinburgh’s English-German and English-Hausa Submissions to the WMT21 News Translation Task. In: Proceedings of the Sixth Conference on Machine Translation, pp. 104-109, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (bibtex)
Jan Chromý, Silvie Cinková, Jana Šamánková (2021): Srozumitelnost českého odborného a úředního textu - proč se jí zabývat a jak ji měřit. In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, 1, pp. 35-52 (pdf, bibtex)
Md Mahfuz ibn Alam, Ivana Kvapilíková, Antonios Anastasopoulos, Laurent Besacier, Georgiana Dinu, Marcello Federico, Matthias Gallé, Philipp Koehn, Vassilina Nikoulina, Kweon Woo Jung (2021): Findings of the WMT Shared Task on Machine Translation Using Terminologies. In: Proceedings of the Sixth Conference on Machine Translation, pp. 652-663, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (bibtex)
Ladislav Janovec, Kateřina Rysová (2021): 47. ročník Olympiády v českém jazyce aneb poprvé on-line. In: Český jazyk a literatura, ISSN 0009-0786, vol. 72, no. 1, pp. 1-6 (local PDF, bibtex)
Maarten Janssen (2021): A Corpus with Wavesurfer and TEI: Speech and Video in TEITOK. In: 24th International Conference on Text, Speech and Dialogue, pp. 261-268, Springer, Cham, Switzerland, ISBN 978-3-030-83526-2 (url, bibtex)
Maarten Janssen (2021): Integrating TEITOK and KonText/PMLTQ at LINDAT. In: Selected Papers from the CLARIN Annual Conference 2020, pp. 104-110, Linköping University Electronic Press, Linköpings universitet, Linköping, Sweden, ISBN 978-91-7929-609-4 (url, bibtex)
Maarten Janssen (2021): UDWiki: guided creation and exploitation of UD treebanks. In: Proceedings of the Fifth Workshop on Universal Dependencies (UDW, SyntaxFest 2021), pp. 84-95, Association for Computational Linguistics, Sofia, Bulgaria, ISBN 978-1-955917-17-9 (url, bibtex)
Josef Jon, João Paulo de Souza Aires, Dušan Variš, Ondřej Bojar (2021): End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages. In: Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 4019-4033, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-52-7 (url, local PDF, bibtex)
Josef Jon, Michal Novák, João Paulo de Souza Aires, Dušan Variš, Ondřej Bojar (2021): CUNI systems for WMT21: Terminology translation Shared Task. In: Proceedings of the Sixth Conference on Machine Translation, pp. 828-834, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (url, local PDF, bibtex)
Josef Jon, Michal Novák, João Paulo de Souza Aires, Dušan Variš, Ondřej Bojar (2021): CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task. In: Proceedings of the Sixth Conference on Machine Translation, pp. 354-361, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (url, local PDF, bibtex)
Zdeněk Kasner, Simon Mille, Ondřej Dušek (2021): Text-in-Context: Token-Level Error Detection for Table-to-Text Generation. In: Proceedings of the 14th International Conference on Natural Language Generation (INLG 2021), pp. 259-265, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-51-0 (pdf, bibtex)
Václava Kettnerová (2021): Optional valency complementations in Czech light verb constructions. In: Linguistica Pragensia, ISSN 0862-8432, vol. 31, no. 1, pp. 7-27 (url, local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková, Anna Vernerová (2021): Reflexives in the VALLEX Lexicon: Syntactic Reflexivity and Reciprocity. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 117, pp. 27-60 (pdf, bibtex)
Martina Kinská, David Košťák, Ondřej Dušek, Rudolf Rosa (2021): AI: Když robot píše hru (divadelní program). In: , ISBN 00-0000-000-0 (bibtex)
Věra Kloudová, Ondřej Bojar, Martin Popel (2021): Detecting Post-edited References and Their Effect on Human Evaluation. In: Proceedings of the Workshop on Human Evaluation of NLP Systems (HumEval), pp. 114-119, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-954085-10-7 (pdf, local PDF, bibtex)
Tom Kocmi, Dominik Macháček, Ondřej Bojar (2021): The Reality of Multi-Lingual Machine Translation. In: , ISBN 978-80-88132-11-0 (pdf, local PDF, bibtex)
Guneet Singh Kohli, Prabsimran Kaur, Muskaan Singh, Tirthankar Ghosal, Prashant Singh Rana (2021): ARGUABLY @ AI Debater-NLPCC 2021 Task 3: Argument Pair Extraction from Peer Review and Rebuttals. In: Proceedings of the 10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021, pp. 1-13, Springer, Cham, Switzerland, ISBN 978-3-030-88483-3 (bibtex)
Veronika Kolářová (2021): Genitiv adnominální v češtině: vývoj a současný stav (review). In: Jazykovědné aktuality, ISSN 1212-5326, vol. 58, no. 3-4, pp. 89-93 (local PDF, bibtex)
Veronika Kolářová, Anna Vernerová, Jana Klímová (2021): Systemic and Non-systemic Valency Behavior of Czech Deverbal Adjectives. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 72, no. 2, pp. 371-382 (local PDF, bibtex)
Matyáš Kopp, Vladislav Stankov, Jan Oldřich Krůza, Pavel Straňák, Ondřej Bojar (2021): ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata. In: 24th International Conference on Text, Speech and Dialogue, pp. 293-304, Springer, Cham, Switzerland, ISBN 978-3-030-83526-2 (pdf, local PDF, bibtex)
Mateusz Krubiński, Erfan Ghadery, Pavel Pecina, Marie-Francine Moens (2021): Just Ask! Evaluating Machine Translation by Asking and Answering Questions. In: Proceedings of the Sixth Conference on Machine Translation, pp. 495-506, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (pdf, bibtex)
Mateusz Krubiński, Erfan Ghadery, Pavel Pecina, Marie-Francine Moens (2021): MTEQA at WMT21 Metrics Shared Task. In: Proceedings of the Sixth Conference on Machine Translation, pp. 1024-1029, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (pdf, bibtex)
Jonáš Kulhánek, Vojtěch Hudeček, Tomáš Nekvinda, Ondřej Dušek (2021): AuGPT: Auxiliary Tasks and Data Augmentation for End-To-End Dialogue with Pre-Trained Language Models. In: 3rd Workshop on NLP for Conversational AI, pp. 198-210, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-86-2 (url, bibtex)
Jonáš Kulhánek, Vojtěch Hudeček, Tomáš Nekvinda, Ondřej Dušek (2021): AuGPT: Dialogue with Pre-trained Language Models and Data Augmentation. In: DSTC9 Workshop @ AAAI-21, pp. 1-9, DSTC9 Organizing Committee, Online (url, bibtex)
Rina Kumari, Nischal Ashok, Tirthankar Ghosal, Asif Ekbal (2021): What the fake? Probing misinformation detection standing on the shoulder of novelty and emotion. In: Information Processing and Management, ISSN 0306-4573, vol. 59, no. 1, pp. 1-18 (bibtex)
Rina Kumari, Nischal Ashok, Tirthankar Ghosal, Asif Ekbal (2021): Misinformation detection using multitask learning with mutual learning for novelty detection and emotion recognition. In: Information Processing and Management, ISSN 0306-4573, vol. 58, no. 5, pp. 1-15 (bibtex)
Rina Kumari, Nischal Ashok, Tirthankar Ghosal, Asif Ekbal (2021): A Multitask Learning Approach for Fake News Detection: Novelty, Emotion, and Sentiment Lend a Helping Hand. In: 2021 International Joint Conference on Neural Networks (IJCNN) Proceedings, pp. 1-8, IEEE, New York City, US (bibtex)
Ivana Kvapilíková, Ondřej Bojar (2021): Machine Translation of Covid-19 Information Resources via Multilingual Transfer. In: ITAT 2021 2nd Workshop on Automata, Formal and Natural Languages – WAFNL 2021, pp. 176-181, Faculty of Mathematics and Physics, Praha, Czechia (pdf, local PDF, bibtex)
Wen Lai, Jindřich Libovický, Alexander Fraser (2021): The LMU Munich System for the WMT 2021 Large-Scale Multilingual Machine Translation Shared Task. In: Proceedings of the Sixth Conference on Machine Translation, pp. 417-422, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (pdf, local PDF, local PDF, bibtex)
Mateusz Lango, Zdeněk Žabokrtský, Magda Ševčíková (2021): Semi-Automatic Construction of Word-Formation Networks. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 55, no. 1, pp. 3-32 (url, bibtex)
Jindřich Libovický, Alexander Fraser (2021): The LMU Munich Systems for the WMT21 Unsupervised and Very Low-Resource Translation Task. In: Proceedings of the Sixth Conference on Machine Translation, pp. 994-999, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (pdf, local PDF, local PDF, bibtex)
Jindřich Libovický, Alexander Fraser (2021): Findings of the WMT 2021 Shared Tasks in Unsupervised MT and Very Low Resource Supervised MT. In: Proceedings of the Sixth Conference on Machine Translation, pp. 731-737, Association for Computational Linguistics, Online, ISBN 978-1-954085-94-7 (pdf, local PDF, local PDF, bibtex)
Tomasz Limisiewicz, David Mareček (2021): Examining Cross-lingual Contextual Embeddings with Orthogonal Structural Probes. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4589-4598, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-09-4 (pdf, bibtex)
Tomasz Limisiewicz, David Mareček (2021): Introducing Orthogonal Constraint in Structural Probes. In: Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 428-442, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-52-7 (pdf, bibtex)
Mike Lloyd, Jakub Mlynář (2021): Hand-ling 'road rage': Embodiment in conflict on the move. In: Social Interaction. Video-Based Studies of Human Sociality, ISSN 2446-3620, vol. 4, no. 4, pp. 1-35 (url, bibtex)
Markéta Lopatková, Václava Kettnerová, Anna Vernerová, Eduard Bejček, Zdeněk Žabokrtský (2021): Valenční slovník českých sloves VALLEX (technical report). In: (pdf, bibtex)
Markéta Lopatková, Jarmila Panevová (2021): Valenční slovník VALLEX a jeho praktické využití. In: Des langues calculables à l’homme incalculable, pp. 117-126, Editions des archives contemporaines, Paris, France, ISBN 9782813004260 (url, bibtex)
Dominik Macháček, Matúš Žilinec, Ondřej Bojar (2021): Lost in Interpreting: Speech Translation from Source or Interpreter?. In: Proceedings of INTERSPEECH 2021, pp. 2376-2380, ISCA, Baixas, France (pdf, local PDF, bibtex)
Jiří Mayer, Pavel Pecina (2021): Synthesizing Training Data for Handwritten Music Recognition. In: Document Analysis and Recognition – ICDAR 2021, Lecture Notes in Computer Science, ISSN 0302-9743, 12823, pp. 626-641, Springer International Publishing, Cham, Switzerland, ISBN 978-3-030-86333-3 (pdf, bibtex)
Marie Mikulová, Jarmila Panevová (2021): Formy a funkce okolnostních určení v češtině. Určení prostorová a časová. In: , ISBN 978-80-88132-13-4 (url, bibtex)
Jiří Mírovský, Lucie Poláková (2021): Sense Prediction for Explicit Discourse Relations with BERT. In: Proceedings of Sixth International Congress on Information and Communication Technology (ICICT), pp. 835-842, Springer, Singapore, ISBN 978-981-16-1781-2 (url, bibtex)
Jiří Mírovský, Pavlína Synková, Lucie Poláková (2021): Extending Coverage of a Lexicon of Discourse Connectives Using Annotation Projection. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 117, pp. 5-26 (pdf, local PDF, bibtex)
Jakub Mlynář (2021): "Getting on the page": The practical accord of material resources in educational interaction. In: Ethnographic Studies, ISSN 1366-4964, 18, pp. 145-172 (url, bibtex)
Jakub Mlynář (2021): Rewatching a video clip in classroom work with digital oral history. In: Bulletin Suisse de linguistique appliquée, ISSN 1023-2044, vol. 2021, no. Special issue, pp. 57-76 (pdf, bibtex)
Tomáš Musil (2021): Representations of Meaning in Neural Networks for NLP: a Thesis Proposal. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 24-31, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-954085-50-3 (pdf, bibtex)
Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohashi (2021): Overview of the 8th Workshop on Asian Translation. In: Proceedings of the 8th Workshop on Asian Translation, pp. 1-45, Association for Computational Linguistics, Stroudsburg, USA (url, local PDF, bibtex)
Jakub Náplava, Martin Popel, Milan Straka, Jana Straková (2021): Understanding Model Robustness to User-generated Noisy Texts. In: Proceedings of the 7th Workshop on Noisy User-generated Text (W-NUT 2021), pp. 340-350, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-90-9 (url, local PDF, bibtex)
Jakub Náplava, Milan Straka, Jana Straková (2021): Diacritics Restoration using BERT with Analysis on Czech language. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 116, pp. 27-42 (pdf, local PDF, bibtex)
Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský, Daniel Zeman (2021): Is one head enough? Mention heads in coreference annotations compared with UD-style heads. In: Proceedings of the Sixth International Conference on Dependency Linguistics (Depling, SyntaxFest 2021), pp. 101-114, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-14-8 (pdf, local PDF, bibtex)
Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský, Daniel Zeman (2021): Coreference meets Universal Dependencies – a pilot experiment on harmonizing coreference datasets for 11 languages (technical report). In: (pdf, local PDF, bibtex)
Tomáš Nekvinda, Ondřej Dušek (2021): Shades of BLEU, Flavours of Success: The Case of MultiWOZ. In: Proceedings of the 1st Workshop on Natural Language Generation, Evaluation, and Metrics (GEM 2021), pp. 34-46, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-67-1 (url, bibtex)
Jarmila Panevová (2021): K vybraným typům deverbativních adjektiv v češtině (ve srovnání s ruštinou). In: Зборник Матице српске за славистику / Zbornik Matice srpske za slavistiku, ISSN 0352-5007, 100, pp. 205-218 (pdf, bibtex)
Jarmila Panevová, Magda Ševčíková (2021): Sonda do české slovotvorby: Substantiva utvořená od sloves nebo slovesa utvořená od substantiv?. In: Des langues calculables à l’homme incalculable, pp. 127-135, Editions des archives contemporaines, Paris, France, ISBN 9782813004260 (url, bibtex)
Shantipriya Parida, Subhadarshi Panda, Ketan Kotwal, Amulya Ratna Dash, Satya Ranjan Dash, Yashvardhan Sharma, Petr Motlíček, Ondřej Bojar (2021): NLPHut’s Participation at WAT2021. In: Proceedings of the 8th Workshop on Asian Translation, pp. 146-154, Association for Computational Linguistics, Stroudsburg, USA (pdf, bibtex)
Lucie Poláková, Jiří Mírovský, Šárka Zikánová, Eva Hajičová (2021): Discourse Relations and Connectives in Higher Text Structure. In: Dialogue and Discourse, ISSN 2152-9620, vol. 12, no. 2, pp. 1-37 (url, bibtex)
Lucie Poláková, Pavlína Synková (2021): Pragmatické aspekty v popisu textové koherence. In: Naše řeč, ISSN 0027-8203, vol. 104, no. 4, pp. 225-242 (url, bibtex)
Peter Polák, Ondřej Bojar (2021): Coarse-To-Fine And Cross-Lingual ASR Transfer. In: ITAT 2021 2nd Workshop on Automata, Formal and Natural Languages – WAFNL 2021, pp. 154-160, Faculty of Mathematics and Physics, Praha, Czechia (pdf, local PDF, bibtex)
Peter Polák, Muskaan Singh, Ondřej Bojar (2021): Explainable Quality Estimation: CUNI Eval4NLP Submission. In: Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, pp. 250-255, Association for Computational Linguistics, Stroudsburg, PA, USA (pdf, local PDF, bibtex)
Martin Popel, Zdeněk Žabokrtský, Anna Nedoluzhko, Michal Novák, Daniel Zeman (2021): Do UD Trees Match Mention Spans in Coreference Annotations?. In: Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 3570-3576, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-10-0 (url, local PDF, bibtex)
Georg Rehm, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Victoria Arranz, Andrejs Vasiljevs, Gerhard Backfried, José Manuel Gómez-Pérez, Ulrich Germann, Chris Callison-Burch, Ronald Feldstein, Stefanie Hegele, Florian Kintzel, Katrin Marheinecke, Julian Moreno-Schneider, Dimitris Galanis, Penny Labropoulou, Miltos Deligiannis, Katerina Gkirtzou, Athanasia Kolovou, Dimitris Gkoumas, Leon Voukoutis, Ian Roberts, Jana Hamrlová, Dušan Variš, Lukáš Kačena, Khalid Choukri, Valérie Mapelli, Mickaël Rigault, Julija Melnika, Miro Jánošík, Katja Prinz, Andres Garcia-Silva, Cristian Berrio, Ondřej Klejch, Steve Renals (2021): European Language Grid: A Joint Platform for the European Language Technology Community. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pp. 221-230, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-954085-05-3 (pdf, bibtex)
Rudolf Rosa (2021): Technická zpráva o vývoji projektu THEaiTRE v roce 2020 (technical report). In: (pdf, local PDF, bibtex)
Rudolf Rosa, Tomáš Musil, Ondřej Dušek, Dominik Jurko, Patrícia Schmidtová, David Mareček, Ondřej Bojar, Tom Kocmi, Daniel Hrbek, David Košťák, Martina Kinská, Marie Nováková, Josef Doležal, Klára Vosecká, Tomáš Studeník, Petr Žabka (2021): When a Robot Writes a Play: Automatically Generating a Theatre Play Script. In: Proceedings of the ALIFE 2021: The 2021 Conference on Artificial Life, pp. 565-567, MIT Press, Cambridge, MA, USA (url, local PDF, local PDF, local ZIP, bibtex)
Rudolf Rosa, Tomáš Musil, Ondřej Dušek, Dominik Jurko, Patrícia Schmidtová, David Mareček, Ondřej Bojar, Tom Kocmi, Daniel Hrbek, David Košťák, Martina Kinská, Marie Nováková, Josef Doležal, Klára Vosecká, Tomáš Studeník, Petr Žabka (2021): THEaiTRE 1.0: Interactive Generation of Theatre Play Scripts. In: Proceedings of the Text2Story’21 Workshop, pp. 71-76, RWTH Aachen University, Aachen, Germany (pdf, local PDF, local ZIP, local PDF, bibtex)
Magdaléna Rysová, Kateřina Rysová (2021): Primární a sekundární diskurzní konektory. In: Slovo a slovesnost, ISSN 0037-7031, vol. 82, no. 3, pp. 179-208 (bibtex)
Shadi Saleh, Hadi Abdi Khojasteh, Hashem Sellat, Pavel Pecina (2021): CUNI-MTIR at COVID-19 MLIA @ Eval Task 2 (Electronic). (pdf)
Shadi Saleh, Hashem Sellat, Hadi Abdi Khojasteh, Pavel Pecina (2021): CUNI-MTIR at COVID-19 MLIA @ Eval Task 3 (Electronic). (pdf)
David Samuel, Milan Straka (2021): ÚFAL at MultiLexNorm 2021: Improving Multilingual Lexical Normalization by Fine-tuning ByT5. In: Proceedings of the 7th Workshop on Noisy User-generated Text (W-NUT 2021), pp. 483-492, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-90-9 (url, local PDF, bibtex)
Arghyadeep Sen, Shantipriya Parida, Ketan Kotwal, Subhadarshi Panda, Ondřej Bojar, Satya Ranjan Dash (2021): Bengali Visual Genome: A Multimodal Dataset for Machine Translation and Image Captioning. In: 9th International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA 2021), pp. 63-70, Springer Nature Singapore, Singapore, ISBN 978-981-16-6624-7 (local PDF, bibtex)
Léon-Paul Schaub, Vojtěch Hudeček, Daniel Štancl, Ondřej Dušek, Patrick Paroubek (2021): Defining And Detecting Inconsistent System Behavior in Task-oriented Dialogues. In: Proceedings of the 28th Conference on Natural Language Processing and the 23rd Meeting of Computer Science Student Researchers for NLP, pp. 142-152, Association pour le Traitement Automatique des Langues, Paris, France (url, bibtex)
Muskaan Singh, Tirthankar Ghosal, Ondřej Bojar (2021): An Empirical Performance Analysis of State-of-the-Art Summarization Models for Automatic Minuting. In: Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, pp. 50-60, ACL, 209 N. Eighth Street, Stroudsburg PA 18360, USA (url, bibtex)
Milan Straka, Jakub Náplava, Jana Straková (2021): Character Transformations for Non-Autoregressive GEC Tagging. In: Proceedings of the 7th Workshop on Noisy User-generated Text (W-NUT 2021), pp. 417-422, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-90-9 (url, local PDF, bibtex)
Milan Straka, Jakub Náplava, Jana Straková, David Samuel (2021): RobeCzech: Czech RoBERTa, a Monolingual Contextualized Language Representation Model. In: 24th International Conference on Text, Speech and Dialogue, pp. 197-209, Springer, Cham, Switzerland, ISBN 978-3-030-83526-2 (url, local PDF, bibtex)
Emil Svoboda, Magda Ševčíková (2021): Splitting and Identifying Czech Compounds: A Pilot Study. In: Proceedings of the Third International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2021), pp. 129-138, ATILF, Nancy, France, ISBN 978-2-9580006-0-8 (pdf, bibtex)
Magda Ševčíková (2021): Action nouns vs. nouns as bases for denominal verbs in Czech: A case study on directionality in derivation. In: Word Structure, ISSN 1750-1245, vol. 14, no. 1, pp. 97-128 (url, bibtex)
Magda Ševčíková (2021): Bezpříponová substantiva a vyjadřování vidového protikladu u příbuzných sloves. In: Slovo a slovesnost, ISSN 0037-7031, vol. 82, no. 4, pp. 263-288 (bibtex)
Jana Šindlerová, Barbora Štěpánková (2021): Between Adverbs and Particles: A Corpus Study of Selected Intensifiers. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 72, no. 2, pp. 444-453 (url, bibtex)
Barbora Štěpánková, Marie Mikulová (2021): Capturing Numerals and Pronouns at the Morphological Layer in the Prague Dependency Treebanks of Czech. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 72, no. 2, pp. 454-464 (url, bibtex)
THEaiTRobot 1.0, David Košťák, Daniel Hrbek, Rudolf Rosa, Ondřej Dušek (2021): AI: When a Robot Writes a Play (technical report). In: (pdf, local ZIP, local PDF, bibtex)
Francis Tyers, Ekaterina Vylomova, Daniel Zeman, Tim Zingler (2021): Working Group 1 (What counts as a word?) in Timothy Baldwin, William Croft, Joakim Nivre, and Agata Savary (eds.): Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics (Report from Dagstuhl Seminar 21351) (technical report). In: , pp. 102-106 (url, bibtex)
Zdeňka Urešová, Eva Fučíková, Jan Hajič, Karolina Zaczynska (2021): Annotation guidelines for German verbal synonyms included in SynSemClass Lexicon (technical report). In: (pdf, bibtex)
Svitlana Vakulenko, Ondřej Dušek, Leigh Clark (2021): Report on the 6th Workshop on Search-Oriented Conversational AI (SCAI 2021). In: ACM SIGIR Forum, ISSN 0163-5840, vol. 55, no. 2, pp. 1-14 (url, bibtex)
Jens E. L. van Gysel, Meagan Vigus, Jayeol Chun, Kenneth Lai, Sarah Moeller, Jiarui Yao, Tim O'Gorman, James Cowell, William Croft, Chu-Ren Huang, Jan Hajič, James Martin, Stephan Oepen, Martha Palmer, James Pustejovsky, Rosa Vallejos (2021): Designing a Uniform Meaning Representation for Natural Language Processing. In: KI - Künstliche Intelligenz, ISSN 1610-1987, vol. 35, no. 2, pp. 1-18 (url, bibtex)
Emiel van Miltenburg, Miruna Clinciu, Ondřej Dušek, Dimitra Gkatzia, Stephanie Inglis, Leo Leppänen, Saad Mahamood, Emma Manning, Stephanie Schoch, Craig Thomson, Luou Wen (2021): Underreporting of errors in NLG output, and what to do about it. In: Proceedings of the 14th International Conference on Natural Language Generation (INLG 2021), pp. 140-153, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-51-0 (url, bibtex)
Kamal Kaushik Varanasi, Tirthankar Ghosal, Valia Kordoni (2021): Additional Context Helps! Leveraging Cited Paper Information To Improve Citation Classification. In: Proceedings of the 18th International Conference on Scientometrics and Informetrics (ISSI 2021), pp. 1187-1192, International Society for Scientometrics and Informetrics (I.S.S.I.), Leuven, Belgium, ISBN 9789080328228 (local PDF, bibtex)
Kamal Kaushik Varanasi, Tirthankar Ghosal, Piyush Tiwari, Muskaan Singh (2021): IITP-CUNI@3C: Supervised Approaches for Citation Classification (Task A) and Citation Significance Detection (Task B). In: Proceedings of the Second Workshop on Scholarly Document Processing, pp. 140-145, ACL, 209 N. Eighth Street, Stroudsburg PA 18360, USA (local PDF, bibtex)
Dušan Variš, Ondřej Bojar (2021): Sequence Length is a Domain: Length-based Overfitting in Transformer Models. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 8246-8257, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-09-4 (pdf, local PDF, local PDF, local PDF, bibtex)
Jonáš Vidra, Zdeněk Žabokrtský (2021): Transferring Word-Formation Networks Between Languages. In: Proceedings of the Third International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2021), pp. 135-144, ATILF, Nancy, France, ISBN 978-2-9580006-0-8 (pdf, bibtex)
Leo Wanner, Matthias Klusch, Athanasios Mavropoulos, Emmanuel Jamin, Victor Martin Puchades, Gerard Casamayor, Jan Černocký, Steffi Davey, Mónica Domínguez, Ekaterina Egorova, Jens Grivolla, Gloria Elena Jaramillo Rojas, Anastasios Karakostas, Dimos Ntioudis, Pavel Pecina, Oleksander Sobko, Stefanos Vrochidis, Lena Wertmann (2021): Towards a Versatile Intelligent Conversational Agent as Personal Assistant for Migrants. In: Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. The PAAMS Collection, Lecture Notes in Computer Science, ISSN 0302-9743, 12946, pp. 316-327, Springer, Cham, Switzerland, ISBN 978-3-030-85739-4 (pdf, bibtex)
Xinnuo Xu, Ondřej Dušek, Shashi Narayan, Verena Rieser, Ioannis Konstas (2021): MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization. In: Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 1541-1552, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-10-0 (url, bibtex)
Xinnuo Xu, Ondřej Dušek, Verena Rieser, Ioannis Konstas (2021): AggGen: Ordering and Aggregating while Generating. In: Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 1419-1434, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-52-7 (url, bibtex)
Daniel Zeman (2021): Enhanced Universal Dependencies: The Current State and Outlook. In: Proceedings of the International Conference «Corpus Linguistics-2021» / Труды международной конференции «Корпусная лингвистика-2021», pp. 9-17, Скифия-принт, Sankt-Peterburg, Russia, ISBN 978-5-98620-557-1 (url, local PDF, local PDF, bibtex)
Daniel Zeman (2021): Date and Time in Universal Dependencies. In: Proceedings of the Fifth Workshop on Universal Dependencies (UDW, SyntaxFest 2021), pp. 173-193, Association for Computational Linguistics, Sofia, Bulgaria, ISBN 978-1-955917-17-9 (pdf, local PDF, bibtex)
Šárka Zikánová (2021): Implicitní diskurzní vztahy v češtině. In: , ISBN 9788088132127 (bibtex)
Vilém Zouhar (2021): Sampling and Filtering of Neural Machine Translation Distillation Data. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 1-8, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-954085-50-3 (pdf, bibtex)
Vilém Zouhar, Michal Novák, Matúš Žilinec, Ondřej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya (2021): Backtranslation Feedback Improves User Confidence in MT, Not Quality. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 151-161, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-46-6 (url, local PDF, bibtex)
Vilém Zouhar, Aleš Tamchyna, Martin Popel, Ondřej Bojar (2021): Neural Machine Translation Quality and Post-Editing Performance. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 10204-10214, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-09-4 (pdf, local PDF, bibtex)
Alireza Abbas Alipour, Ebrahim Ansari (2020): An Advanced Profile Hidden Markov Model for Malware Detection. In: Intelligent Data Analysis, ISSN 1088-467X, vol. 24, no. 4, pp. 759-778 (url, bibtex)
Hadi Abdi Khojasteh, Ebrahim Ansari, Mahdi Bohlouli (2020): LSCP: Enhanced Large Scale Colloquial Persian Language Understanding. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 6323-6327, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Akshay Aggarwal, Daniel Zeman (2020): Estimating POS Annotation Consistency of Different Treebanks in a Language. In: Proceedings of the 19th International Workshop on Treebanks and Linguistic Theories, pp. 93-110, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany, ISBN 978-1-952148-01-9 (pdf, local PDF, bibtex)
Abhishek Agrawal, Rudolf Rosa (2020): Eyes on the Parse: Using Gaze Features in Syntactic Parsing. In: Proceedings of the Second Workshop on Beyond Vision and LANguage: inTEgrating Real-world kNowledge (LANTERN), pp. 1-16, Association for Computational Linguistics, Barcelona, Spain, ISBN 978-1-952148-51-4 (url, local PDF, bibtex)
Ika Alfina, Daniel Zeman, Arawinda Dinakaramani, Indra Budi, Heru Suhartanto (2020): Selecting the Universal Dependencies Morphological Features for Indonesian Dependency Treebank. In: Proceedings of the International Conference on Asian Language Processing (IALP 2020), pp. 104-109, Chinese and Oriental Languages Information Processing Society, Kuala Lumpur, Malaysia, ISBN 978-1-7281-7689-5 (pdf, local PDF, local PDF, bibtex)
Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ondřej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alex Waibel, Changhan Wang (2020): Findings of the IWSLT 2020 Evaluation Campaign. In: Proceedings of the 17th International Conference on Spoken Language Translation, pp. 1-34, Association for Computational Linguistics, Online, ISBN 978-1-952148-07-1 (pdf, local PDF, bibtex)
Petra Barančíková, Ondřej Bojar (2020): Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces. In: 23rd International Conference on Text, Speech and Dialogue, pp. 135-143, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (local PDF, bibtex)
Petra Barančíková, Ondřej Bojar (2020): COSTRA 1.0: A Dataset of Complex Sentence Transformations. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 3535-3541, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Loïc Barrault, Magdalena Biesialska, Ondřej Bojar, Marta R. Costa-Jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubešić, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri (2020): Findings of the 2020 Conference on Machine Translation (WMT20). In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 1-55, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, local PDF, bibtex)
Jan Bodnár, Zdeněk Žabokrtský, Magda Ševčíková (2020): Semi-supervised Induction of Morpheme Boundaries in Czech Using a Word-Formation Network. In: 23rd International Conference on Text, Speech and Dialogue, pp. 189-196, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (url, bibtex)
Ondřej Bojar, Dominik Macháček, Sangeet Sagar, Otakar Smrž, Jonáš Kratochvíl, Ebrahim Ansari, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian Stüker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams (2020): ELITR: European Live Translator. In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (2020), pp. 463-464, European Association for Machine Translation, Lisboa, Portugal, ISBN 978-989-33-0589-8 (url, bibtex)
Gosse Bouma, Djamé Seddah, Daniel Zeman (2020): Overview of the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies. In: Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, pp. 151-161, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-11-8 (url, local PDF, bibtex)
Jorge Calvo-Zaragoza, Jan Hajič, jr., Alexander Pacha (2020): Understanding Optical Music Recognition. In: ACM Computing Surveys, ISSN 0360-0300, vol. 53, no. 4, pp. 1-35 (url, bibtex)
Silvie Cinková, Jan Rybicki (2020): Stylometry in a Bilingual Setup. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 977-984, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (pdf, bibtex)
Erion Çano, Ondřej Bojar (2020): How Many Pages? Paper Length Prediction from the Metadata. In: 4th International Conference on Natural Language Processing and Information Retrieval, pp. 91-95, ACM, New York, USA, ISBN 978-1-4503-7760-7 (url, local PDF, bibtex)
Erion Çano, Ondřej Bojar (2020): Automating Text Naturalness Evaluation of NLG Systems (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422 (url)
Erion Çano, Ondřej Bojar (2020): Human or Machine: Automating Human Likeliness Evaluation of NLG Texts (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422 (url)
Erion Çano, Ondřej Bojar (2020): Two Huge Title and Keyword Generation Corpora of Research Articles. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 6663-6671, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Micha Theo Neri de Rijk, David Mareček (2020): Using Word Embeddings and Collocations for Modelling Word Associations. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 114, pp. 35-57 (pdf, bibtex)
Veronika Dostálová, Rudolf Rosa, Daniel Hrbek, Tomáš Studeník (2020): Dočkáme se digitálního Shakespeara? AI jako autor divadelní hry. In: TA.DI, 11/2020, pp. 28-31 (url, local PDF, bibtex)
Ondřej Dušek, Zdeněk Kasner (2020): Evaluating Semantic Accuracy of Data-to-Text Generation with Natural Language Inference. In: Proceedings of the 13th International Conference on Natural Language Generation (INLG 2020), pp. 131-137, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-54-5 (url, bibtex)
Ondřej Dušek, Jekaterina Novikova, Verena Rieser (2020): Evaluating the state-of-the-art of End-to-End Natural Language Generation: The E2E NLG challenge. In: Computer Speech and Language, ISSN 0885-2308, 59, pp. 123-156 (url, bibtex)
Maria Eskevich, Franciska de Jong, Alexander König, Darja Fišer, Dieter Van Uytvanck, Tero Aalto, Lars Borin, Olga Gerassimenko, Jan Hajič, Henk van den Heuvel, Neeme Kahusk, Krista Liin, Martin Matthiesen, Stelios Piperidis, Kadri Vider (2020): CLARIN: Distributed Language Resources and Technology in a European Infrastructure. In: Proceedings of LT4HALA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages, pp. 28-34, European Language Resources Association (ELRA), Marseille, France, ISBN 979-10-95546-53-5 (url, bibtex)
Ingrid Fadelli, Rudolf Rosa (2020): THEaiTRE: A theatre play written entirely by machines (Electronic). (url)
Dario Franceschini, Chiara Canton, Ivan Simonini, Armin Schweinfurth, Adelheid Glott, Sebastian Stüker, Thai-Son Nguyen, Felix Schneider, Thanh-Le Ha, Alex Waibel, Barry Haddow, Phil Williams, Rico Sennrich, Ondřej Bojar, Sangeet Sagar, Dominik Macháček, Otakar Smrž (2020): Removing European Language Barriers with Innovative Machine Translation Technology. In: Proceedings of the 1st International Workshop on Language Technology Platforms, pp. 44-49, ELRA, Paris, France, ISBN 979-10-95546-64-1 (url, local PDF, bibtex)
Ulrich Germann, Roman Grundkiewicz, Martin Popel, Radina Dobreva, Nikolay Bogoychev, Kenneth Heafield (2020): Speed-optimized, Compact Student Models that Distill Knowledge from a Larger Teacher Model: the UEDIN-CUNI Submission to the WMT 2020 News Translation Task. In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 191-196, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Hamid Haghdoost, Ebrahim Ansari, Zdeněk Žabokrtský, Mahshid Nikravesh, Mohammad Mahmoudi (2020): Morphological Networks for Persian and Turkish: What Can Be Induced from Morpheme Segmentation? In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 115, pp. 105-127 (pdf, bibtex)
Jan Hajič, Eduard Bejček, Jaroslava Hlaváčová, Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková (2020): Prague Dependency Treebank - Consolidated 1.0. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 5208-5218, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Eva Hajičová (2020): K otázce tzv. kulisy ve světle paralelního korpusu. In: Jak je důležité míti styl, pp. 155-164, NLN - Nakladatelství Lidové noviny, Praha, Czechia, ISBN 978-80-7422-767-7 (bibtex)
Eva Hajičová (2020): Read, if you want to be read by others. In: Linguistica Pragensia, ISSN 0862-8432, vol. 30, no. 1, pp. 103-105 (url, bibtex)
Eva Hajičová, Jiří Mírovský, Barbora Štěpánková (2020): Focalizers and Discourse Relations. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 115, pp. 187-197 (url, local PDF, bibtex)
Barbora Hladká, Matyáš Kopp, Pavel Straňák (2020): Compiling Czech Parliamentary Stenographic Protocols into a Corpus. In: Proceedings of the LREC 2020 Workshop on Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse (ParlaCLARIN II), pp. 18-22, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-47-4 (url, local PDF, bibtex)
Karolína Houžvičková Šolcová, Rudolf Rosa, Daniel Hrbek (2020): R.U.R. v dobách umělé inteligence: Divadelní hru k 100 letům Čapkova díla píše robot z Matfyzu (Electronic). (url)
Karolína Houžvičková Šolcová, Rudolf Rosa, Daniel Hrbek (2020): Umělá inteligence píše divadelní hru (Electronic). (url)
Ọlájídé Ishola, Daniel Zeman (2020): Yorùbá Dependency Treebank (YTB). In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 5180-5188, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Zdeněk Kasner, Ondřej Dušek (2020): Data-to-Text Generation with Iterative Text Editing. In: Proceedings of the 13th International Conference on Natural Language Generation (INLG 2020), pp. 60-67, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-54-5 (url, bibtex)
Zdeněk Kasner, Ondřej Dušek (2020): Train Hard, Finetune Easy: Multilingual Denoising for RDF-to-Text Generation. In: Proceedings of the 3rd International Workshop on Natural Language Generation from the Semantic Web (WebNLG+), pp. 171-176, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-59-0 (url, bibtex)
Aleksei Kelli, Arvi Tavast, Krister Lindén, Kadri Vider, Ramūnas Birštonas, Penny Labropoulou, Irene Kull, Gaabriel Tavits, Age Värv, Pavel Straňák, Jan Hajič (2020): The Impact of Copyright and Personal Data Laws on the Creation and Use of Language Models. In: Linköping Electronic Conference Proceedings, ISSN 1650-3740, vol. 172, no. 8, pp. 53-65 (url, local PDF, bibtex)
Václava Kettnerová (2020): Derived Lexical Reciprocal Verbs in Czech. In: Prace Filologiczne, ISSN 0138-0567, vol. 75, no. 1, pp. 215-240 (local PDF, bibtex)
Václava Kettnerová, Veronika Kolářová (2020): Valence českých verbálních jmen v nominálních konstrukcích a ve verbonominálních predikátech s kategoriálním slovesem. In: Prace Filologiczne, ISSN 0138-0567, vol. 75, no. 1, pp. 241-262 (bibtex)
Václava Kettnerová, Markéta Lopatková (2020): Ke způsobům vyjádření vzájemnosti v češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 81, no. 4, pp. 243-268 (local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková (2020): Reciprocity in Czech Light Verb Constructions: The Dependency Perspective. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 71, no. 1, pp. 41-68 (pdf, local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková, Anna Vernerová, Petra Barančíková (2020): Towards a Semi-Automatic Detection of Reflexive and Reciprocal Constructions and Their Representation in a Valency Lexicon. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 3136-3144, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Tom Kocmi, Ondřej Bojar (2020): Efficiently Reusing Old Models Across Languages via Transfer Learning. In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation (2020), pp. 1-10, European Association for Machine Translation, Lisboa, Portugal, ISBN 978-989-33-0589-8 (bibtex)
Tom Kocmi, Tomasz Limisiewicz, Gabriel Stanovsky (2020): Gender Coreference and Bias Evaluation at WMT 2020. In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 357-364, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Tom Kocmi, Martin Popel, Ondřej Bojar (2020): Announcing CzEng 2.0 Parallel Corpus with over 2 Gigawords (technical report). In: , pp. 1-6 (pdf, bibtex)
Veronika Kolářová (2020): Vztah afirmativní a negované formy adjektiv a substantiv z hlediska jejich valence. In: Prace Filologiczne, ISSN 0138-0567, vol. 75, no. 1, pp. 293-312 (bibtex)
Veronika Kolářová, Magda Ševčíková (2020): Substantiva tvořená od sloves se supletivními kořeny: Korpusová studie deverbativ s kořeny klád a lož. In: Lingvistika - korpus - empirie, pp. 165-177, Ústav pro jazyk český AV ČR, Praha, Czechia, ISBN 978-80-88211-13-6 (bibtex)
Veronika Kolářová, Anna Vernerová, Jana Klímová (2020): NomVallex I. Valenční slovník substantiv. In: , ISBN 978-80-88132-07-3 (bibtex)
Jonáš Kratochvíl, Peter Polák, Ondřej Bojar (2020): Large Corpus of Czech Parliament Plenary Hearings. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 6363-6367, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Jan Oldřich Krůza (2020): Czech parliament meeting recordings as ASR training data. In: Proceedings of the 2020 Federated Conference on Computer Science and Information Systems, pp. 185-188, IEEE, Piscataway, New Jersey, United States, ISBN 978-83-955416-7-4 (url, bibtex)
Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre, Ondřej Bojar (2020): Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pp. 255-262, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-03-3 (url, local PDF, bibtex)
Ivana Kvapilíková, Tom Kocmi, Ondřej Bojar (2020): CUNI Systems for the Unsupervised and Very Low Resource Translation Task in WMT20. In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 1123-1128, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, local PDF, bibtex)
Lukáš Kyjánek, Zdeněk Žabokrtský, Magda Ševčíková, Jonáš Vidra (2020): Universal Derivations 1.0, A Growing Collection of Harmonised Word-Formation Resources. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, vol. 115, no. 2, pp. 5-30 (pdf, bibtex)
Jindřich Libovický, Alexander Fraser (2020): Towards Reasonably-Sized Character-Level Transformer NMT by Finetuning Subword Systems. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 2572-2579, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-60-6 (bibtex)
Jindřich Libovický, Viktor Hangya, Helmut Schmid, Alexander Fraser (2020): The LMU Munich System for the WMT20 Very Low Resource Supervised MT Task. In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 1102-1109, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Jindřich Libovický, Zdeněk Kasner, Jindřich Helcl, Ondřej Dušek (2020): Expand and Filter: CUNI and LMU Systems for the WNGT 2020 Duolingo Shared Task. In: Proceedings of the Fourth Workshop on Neural Generation and Translation, pp. 153-160, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-17-0 (url, local PDF, bibtex)
Jindřich Libovický, Rudolf Rosa, Alexander Fraser (2020): On the Language Neutrality of Pre-trained Multilingual Representations. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 1663-1674, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-90-3 (url, local PDF, bibtex)
Tomasz Limisiewicz, David Mareček (2020): Syntax Representation in Word Embeddings and Neural Networks – A Survey. In: Proceedings of the 20th Conference Information Technologies - Applications and Theory (ITAT 2020), pp. 38-48, Tomáš Horváth, Košice, Slovakia (pdf, bibtex)
Tomasz Limisiewicz, Rudolf Rosa, David Mareček (2020): Universal Dependencies according to BERT: both more specific and more general. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 2710-2722, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-90-3 (url, bibtex)
David Lukeš, Rudolf Rosa (2020): V4PY: An Introduction to Python for Linguists (Electronic). (url)
Kateřina Macková, Milan Straka (2020): Reading Comprehension in Czech via Machine Translation and Cross-lingual Transfer. In: 23rd International Conference on Text, Speech and Dialogue, pp. 171-179, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (url, local PDF, bibtex)
Dominik Macháček, Ondřej Bojar (2020): Presenting Simultaneous Translation in Limited Space. In: Proceedings of the 20th Conference Information Technologies - Applications and Theory (ITAT 2020), pp. 32-37, Tomáš Horváth, Košice, Slovakia (pdf, bibtex)
Dominik Macháček, Jonáš Kratochvíl, Sangeet Sagar, Matúš Žilinec, Ondřej Bojar, Thai-Son Nguyen, Felix Schneider, Philip Williams, Yuekun Yao (2020): ELITR Non-Native Speech Translation at IWSLT 2020. In: Proceedings of the 17th International Conference on Spoken Language Translation, pp. 200-208, Association for Computational Linguistics, Online, ISBN 978-1-952148-07-1 (pdf, local PDF, bibtex)
David Mareček, Hande Celikkanat, Miikka Silfverberg, Vinit Ravishankar, Jörg Tiedemann (2020): Are Multilingual Neural Machine Translation Models Better at Capturing Linguistic Features? In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 115, pp. 143-162 (pdf, bibtex)
David Mareček, Jindřich Libovický, Tomáš Musil, Rudolf Rosa, Tomasz Limisiewicz (2020): Hidden in the Layers: Interpretation of Neural Networks for Natural Language Processing. In: , ISBN 978-80-88132-10-3 (url, bibtex)
Nitika Mathur, Johnny Tian-Zheng Wei, Markus Freitag, Qingsong Ma, Ondřej Bojar (2020): Results of the WMT20 Metrics Shared Task. In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 688-725, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, local PDF, bibtex)
Marie Mikulová, Jan Hajič, Jiří Hana, Hana Hanová, Jaroslava Hlaváčová, Emil Jeřábek, Barbora Štěpánková, Barbora Vidová Hladká, Daniel Zeman (2020): Manual for Morphological Annotation, Revision for the Prague Dependency Treebank - Consolidated 2020 release (technical report). In: (pdf, bibtex)
Marie Mikulová, Jarmila Panevová (2020): Vyjadřování prostorových určení v textu psaném a mluveném (Případová studie). In: Jak je důležité míti styl. Pocta Janě Hoffmannové, pp. 193-206, Lidové noviny, Praha, Czechia, ISBN 978-80-7422-767-7 (bibtex)
Jiří Mírovský, Lucie Poláková, Pavlína Synková (2020): CzeDLex 0.6 and its Representation in the PML-TQ. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 1128-1134, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi (2020): Overview of the 7th Workshop on Asian Translation. In: Proceedings of the 7th Workshop on Asian Translation (WAT2020), pp. 1-44, Association for Computational Linguistics, Stroudsburg, USA (url, local PDF, bibtex)
Minoo Nassajian, Ehsan Doostmohammadi, Adel Rahimi (2020): Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT. In: Proceedings of the Second Workshop on Beyond Vision and LANguage: inTEgrating Real-world kNowledge (LANTERN), pp. 4612-4618, Association for Computational Linguistics, Barcelona, Spain, ISBN 978-1-952148-51-4 (url, bibtex)
Minoo Nassajian, Ehsan Doostmohammadi, Adel Rahimi (2020): Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 961-971, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-90-3 (pdf, bibtex)
Tomáš Nekvinda, Ondřej Dušek (2020): One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech. In: Proceedings of the 21st Annual Conference of the International Speech Communication Association, pp. 2972-2976, International Speech Communication Association, Baixas, France (url, bibtex)
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Jan Hajič, Christopher Manning, Sampo Pyysalo, Sebastian Schuster, Francis Tyers, Daniel Zeman (2020): Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 4034-4043, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Ondřej Novotný, Rudolf Rosa, Tomáš Studeník (2020): Scénář: Robot. Ve Švandově divadle píše hru k výročí R.U.R. umělá inteligence. In: Hospodářské noviny IHNED, ISSN 1213-7693, pp. 1-3 (url, local PDF, bibtex)
Stephan Oepen, Omri Abend, Lasha Abzianidze, Johan Bos, Jan Hajič, Daniel Hershcovich, Bin Li, Tim O'Gorman, Nianwen Xue, Daniel Zeman (2020): MRP 2020: The Second Shared Task on Cross-Framework and Cross-Lingual Meaning Representation Parsing. In: Proceedings of the CoNLL 2020 Shared Task: Cross-Framework Meaning Representation Parsing, pp. 1-22, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-64-4 (url, local PDF, local PDF, bibtex)
Atul Kumar Ojha, Daniel Zeman (2020): Universal Dependency Treebanks for Low-Resource Indian Languages: The Case of Bhojpuri. In: Proceedings of the LREC 2020 WILDRE5 – 5th Workshop on Indian Language Data: Resources and Evaluation, pp. 33-38, European Language Resources Association, Paris, France, ISBN 979-10-95546-67-2 (url, bibtex)
Jarmila Panevová (2020): Světla Čmejrková – František Daneš: Jejich setkávání na poli gramatiky. In: Jazykovědné aktuality, ISSN 1212-5326, vol. 57, no. 1-2, pp. 35-38 (url, bibtex)
Jarmila Panevová, Eva Hajičová, Václava Kettnerová, Veronika Kolářová, Markéta Lopatková, Marie Mikulová, Magda Ševčíková (2020): Funkční generativní popis – rámec pro konzistentní popis gramatiky. In: Naše řeč, ISSN 0027-8203, vol. 103, no. 1-2, pp. 55-78 (bibtex)
Jarmila Panevová, Markéta Lopatková, Václava Kettnerová (2020): Reciproka ve slovníku a v syntaxi. In: Lingvistika -- korpus -- empirie, pp. 63-70, Ústav pro jazyk český AV ČR, Praha, Czechia, ISBN 978-80-88211-13-6 (bibtex)
Jarmila Panevová, Marie Mikulová (2020): Subcategorization of Adverbials (The Case of Temporal Meanings). In: Korpus – gramatika – axiologie, ISSN 1804-137X, 22, pp. 16-30 (bibtex)
Shantipriya Parida, Satya Ranjan Dash, Ondřej Bojar, Petr Motlíček, Priyanka Pattnaik, Debasish Kumar Mallick (2020): OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation. In: Proceedings of the LREC 2020 WILDRE5 – 5th Workshop on Indian Language Data: Resources and Evaluation, pp. 14-19, European Language Resources Association, Paris, France, ISBN 979-10-95546-67-2 (local PDF, bibtex)
Shantipriya Parida, Petr Motlíček, Amulya Ratna Dash, Satya Ranjan Dash, Debasish Kumar Mallick, Satya Prakash Biswal, Priyanka Pattnaik, Biranchi Narayan Nayak, Ondřej Bojar (2020): ODIANLP’s Participation in WAT2020. In: Proceedings of the 7th Workshop on Asian Translation (WAT2020), pp. 103-108, Association for Computational Linguistics, Stroudsburg, USA (url, local PDF, bibtex)
Lucie Poláková, Jiří Mírovský (2020): Mining Local Discourse Annotation for Features of Global Discourse Structure. In: 23rd International Conference on Text, Speech and Dialogue, pp. 50-60, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (url, bibtex)
Lucie Poláková, Kateřina Rysová, Magdaléna Rysová, Jiří Mírovský (2020): GeCzLex: Lexicon of Czech and German Anaphoric Connectives. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 1082-1089, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Peter Polák, Sangeet Sagar, Dominik Macháček, Ondřej Bojar (2020): CUNI Neural ASR with Phoneme-Level Intermediate Step for Non-Native SLT at IWSLT 2020. In: Proceedings of the 17th International Conference on Spoken Language Translation, pp. 191-199, Association for Computational Linguistics, Online, ISBN 978-1-952148-07-1 (url, local PDF, bibtex)
Martin Popel (2020): CUNI English-Czech and English-Polish Systems in WMT20: Robust Document-Level Training. In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 269-273, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Martin Popel, Marketa Tomkova, Jakub Tomek, Łukasz Kaiser, Jakob Uszkoreit, Ondřej Bojar, Zdeněk Žabokrtský (2020): Transforming machine translation: a deep learning system reaches news translation quality comparable to human professionals. In: Nature Communications, ISSN 2041-1723, vol. 11, no. 4381, pp. 1-15 (url, local PDF, bibtex)
Georg Rehm, Maria Berger, Jan Hajič, Stefanie Hegele, Florian Kintzel, Katrin Marheinecke, Stelios Piperidis, Miltos Deligiannis, Dimitris Galanis, Katerina Gkirtzou, Penny Labropoulou, Kalina Bontcheva, Dominic Jones, Ela Elsholz, Ian Roberts, Jana Hamrlová, Lukáš Kačena, Khalid Choukri, Victoria Arranz, Andrejs Vasiljevs, Katya Aplonova, Julija Melnika, Gerhard Backfried, Miro Jánošík, Katja Prinz, Christoph Prinz, Andres Garcia-Silva, Cristian Berrio, Ulrich Germann, Steve Renals, Ondřej Klejch, José Manuel Gómez-Pérez (2020): European Language Grid: An Overview. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 3366-3380, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, bibtex)
Georg Rehm, Kalina Bontcheva, Khalid Choukri, Jan Hajič, Stelios Piperidis, Andrejs Vasiljevs (2020): Proceedings of the 1st International Workshop on Language Technology Platforms (IWLTP 2020, co-located with LREC 2020). In: Proceedings of the 1st International Workshop on Language Technology Platforms, ELRA, Paris, France, ISBN 979-10-95546-64-1 (pdf)
Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiljevs, Gerhard Backfried, Christoph Prinz, José Manuel Gómez Pérez, Luc Meertens, Paul Lukowicz, Josef Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim Köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriute, Núria Bel, António Branco, Gerhard Budin, Walter Daelemans, Koenraad De Smedt, Radovan Garabík, Maria Gavrilidou, Dagmar Gromann, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Jan Odijk, Maciej Ogrodniczuk, Eiríkur Rögnvaldsson, Michael Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Marko Tadić, Dan Tufiş, Tamás Váradi, Kadri Vider, Andy Way, François Yvon (2020): The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 3322-3332, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, bibtex)
Rudolf Rosa (2020): THEaiTRE: Umělá inteligence píše divadelní hru. In: Rozhledy matematicko-fyzikální, ISSN 0035-9343, vol. 95, no. 4, pp. 42-50 (pdf, bibtex)
Rudolf Rosa (2020): Deliverable D7.2 Report on NLP Technologies Workshop at EUROSAI Congress (technical report). In: (bibtex)
Rudolf Rosa, Ondřej Dušek, Tom Kocmi, David Mareček, Tomáš Musil, Patrícia Schmidtová, Dominik Jurko, Ondřej Bojar, Daniel Hrbek, David Košťák, Martina Kinská, Josef Doležal, Klára Vosecká (2020): THEaiTRE: Artificial Intelligence to Write a Theatre Play. In: Proceedings of AI4Narratives — Workshop on Artificial Intelligence for Narratives, pp. 9-13, RWTH Aachen University, Aachen, Germany (pdf, local ZIP, local PDF, local PDF, bibtex)
Rudolf Rosa, Tomáš Musil, David Mareček (2020): Measuring Memorization Effect in Word-Level Neural Networks Probing. In: 23rd International Conference on Text, Speech and Dialogue, pp. 180-188, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (url, local PDF, bibtex)
Alexandr Rosen, Jirka Hana, Barbora Hladká, Tomáš Jelínek, Svatava Škodová, Barbora Štindlová (2020): Compiling and annotating a learner corpus for a morphologically rich language – CzeSL, a corpus of non-native Czech. In: , ISBN 978-80-246-4759-3 (bibtex)
Kateřina Rysová (2020): Odborný seminář Jazyk, text, dialog – vzpomínka na Světlu Čmejrkovou a Františka Daneše. In: Český jazyk a literatura, ISSN 0009-0786, vol. 70, no. 4, pp. 203-203 (local PDF, bibtex)
Shadi Saleh, Pavel Pecina (2020): Document Translation vs. Query Translation for Cross-Lingual Information Retrieval in the Medical Domain. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 6849-6860, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-25-5 (pdf, local PDF, bibtex)
David Samuel, Milan Straka (2020): ÚFAL at MRP 2020: Permutation-invariant Semantic Parsing in PERIN. In: Proceedings of the CoNLL 2020 Shared Task: Cross-Framework Meaning Representation Parsing, pp. 53-64, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-64-4 (url, local PDF, bibtex)
Lucia Specia, Loïc Barrault, Ozan Caglayan, Amanda Duarte, Desmond Elliott, Spandana Gella, Nils Holzenberger, Chiraag Lala, Sun Jae Lee, Jindřich Libovický, Pranava Madhyastha, Florian Metze, Karl Mulligan, Alissa Ostapenko, Shruti Palaskar, Ramon Sanabria, Josiah Wang, Raman Arora (2020): Grounded Sequence to Sequence Transduction. In: IEEE Journal on Selected Topics in Signal Processing, ISSN 1932-4553, vol. 14, no. 3, pp. 577-591 (url, local PDF, bibtex)
Milan Straka, Jana Straková (2020): UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings. In: Proceedings of LT4HALA 2020 - 1st Workshop on Language Technologies for Historical and Ancient Languages, pp. 124-129, European Language Resources Association (ELRA), Marseille, France, ISBN 979-10-95546-53-5 (url, local PDF, bibtex)
Magda Ševčíková (2020): The suffixes -ismus and -ita in nouns in Czech. In: The Interaction of Borrowing and Word Formation, pp. 162-195, Edinburgh University Press, Edinburgh, Great Britain, ISBN 9781474448208 (bibtex)
Barbora Štěpánková (2020): K možnostem zachycení pragmatické složky významu v jednojazyčném výkladovém slovníku (na příkladu hesel z oblasti etnografie a antropologie). In: Naše řeč, ISSN 0027-8203, vol. 103, no. 5, pp. 430-446 (bibtex)
Barbora Štěpánková, Marie Mikulová, Jan Hajič (2020): The MorfFlex Dictionary of Czech as a Source of Linguistic Data. In: Proceedings of XIX EURALEX Congress: Lexicography for Inclusion, pp. 387-392, Democritus University of Thrace, Alexandroupolis, Greece, ISBN 978-618-85138-1-5 (url, bibtex)
Marsida Toska, Joakim Nivre, Daniel Zeman (2020): Universal Dependencies for Albanian. In: Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020), pp. 178-188, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-48-4 (url, local PDF, bibtex)
Martin Uhlíř, Rudolf Rosa, Tomáš Musil, David Košťák (2020): Ze života robotů. In: Respekt, ISSN 1801-1446, 46/2020, pp. 52-55 (url, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2020): Syntactic-Semantic Classes of Context-Sensitive Synonyms Based on a Bilingual Corpus. In: Human Language Technology. Challenges for Computer Science and Linguistics. 8th Language and Technology Conference, LTC 2017, Revised Selected Papers, pp. 242-255, Springer International Publishing, Cham, Switzerland, ISBN 978-3-030-66527-2 (pdf, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2020): SynSemClass Linked Lexicon: Mapping Synonymy between Languages. In: Proceedings of the 2020 Globalex Workshop on Linked Lexicography (LREC 2020), pp. 10-19, European Language Resources Association, Marseille, France, ISBN 979-10-95546-46-7 (url, local PDF, bibtex)
Jan Vainer, Ondřej Dušek (2020): SpeedySpeech: Efficient Neural Speech Synthesis. In: Proceedings of the 21st Annual Conference of the International Speech Communication Association, pp. 3575-3579, International Speech Communication Association, Baixas, France (url, bibtex)
Dušan Variš, Satoshi Nakamura, Katsuhito Sudoh (2020): Image Captioning with Visual Object Representations Grounded in the Textual Modality (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422 (url, local PDF)
Martin Vastl, Daniel Zeman, Rudolf Rosa (2020): Predicting Typological Features in WALS using Language Embeddings and Conditional Probabilities: ÚFAL Submission to the SIGTYP 2020 Shared Task. In: Proceedings of the Second Workshop on Computational Research in Linguistic Typology, pp. 29-35, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-73-6 (url, local PDF, bibtex)
Jonathan Verner, Anna Vernerová (2020): PyVallex: A Processing System for Valency Lexicon Data. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 7187-7193, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, bibtex)
Jonáš Vidra, Zdeněk Žabokrtský (2020): Next Step in Online Querying and Visualization of Word-Formation Networks. In: 23rd International Conference on Text, Speech and Dialogue, pp. 144-152, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (url, bibtex)
Xinnuo Xu, Ondřej Dušek, Ioannis Konstas, Verena Rieser (2020): Fact-based Content Weighting for Evaluating Abstractive Summarisation. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5071-5081, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-25-5 (url, bibtex)
Daniel Zeman, Jan Hajič (2020): FGD at MRP 2020: Prague Tectogrammatical Graphs. In: Proceedings of the CoNLL 2020 Shared Task: Cross-Framework Meaning Representation Parsing, pp. 33-39, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-64-4 (url, local PDF, local PDF, bibtex)
Vilém Zouhar, Ondřej Bojar (2020): Outbound Translation User Interface Ptakopet: A Pilot Study. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 6967-6975, European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4 (url, local PDF, bibtex)
Vilém Zouhar, Michal Novák (2020): Extending Ptakopět for Machine Translation User Interaction Experiments. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 115, pp. 129-142 (pdf, local PDF, bibtex)
Vilém Zouhar, Tereza Vojtěchová, Ondřej Bojar (2020): WMT20 Document-Level Markable Error Exploration. In: Fifth Conference on Machine Translation - Proceedings of the Conference, pp. 371-380, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (url, local PDF, bibtex)
Zdeněk Žabokrtský, Daniel Zeman, Magda Ševčíková (2020): Sentence Meaning Representations across Languages: What Can We Learn from Existing Frameworks? In: Computational Linguistics, ISSN 1530-9312, vol. 46, no. 3, pp. 605-665 (url, local PDF, bibtex)
Hamed Alavi, Himanshu Verma, Jakub Mlynář, Denis Lalanne (2019): On the Temporality of Adaptive Built Environments. In: People, Personal Data and the Built Environment, pp. 13-40, Springer Nature, Cham, Switzerland, ISBN 978-3-319-70874-4 (url, bibtex)
Ebrahim Ansari, Zdeněk Žabokrtský, Mohammad Mahmoudi, Hamid Haghdoost, Jonáš Vidra (2019): Supervised Morphological Segmentation Using Rich Annotated Lexicon. In: International Conference "Recent Advances in Natural Language Processing", pp. 52-61, INCOMA Ltd., Varna, Bulgaria, ISBN 978-954-452-055-7 (pdf, bibtex)
Petra Barančíková, Ondřej Bojar (2019): In Search for Linear Relations in Sentence Embedding Spaces. In: Proceedings of the 19th Conference ITAT 2019: Slovenskočeský NLP workshop (SloNLP 2019), pp. 125-132, CreateSpace Independent Publishing Platform, Košice, Slovakia (local PDF, bibtex)
Loïc Barrault, Ondřej Bojar, Marta R. Costa-Jussà, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, Christof Monz, Mathias Müller, Santanu Pal, Matt Post, Marcos Zampieri (2019): Findings of the 2019 Conference on Machine Translation (WMT19). In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 1-61, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (url, bibtex)
Ondřej Bojar, Raffaella Bernardi, Bonnie L. Webber (2019): Representation of sentence meaning (A JNLE Special Issue). In: Natural Language Engineering, ISSN 1351-3249, vol. 25, no. 4, pp. 427-432 (pdf, local PDF, bibtex)
Ronald Cardenas, Claudia Borg, Daniel Zeman (2019): CUNI–Malta system at CoNLL–SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context: Operation-based word formation. In: Proceedings of the 16th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pp. 104-112, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-36-9 (url, local PDF, bibtex)
Silvie Cinková, Anežka Náhlíková (2019): Žánrová preference v narativech dětí mladšího školního věku. In: Slovo a slovesnost, ISSN 0037-7031, vol. 80, no. 3, pp. 192-214 (local PDF, bibtex)
Ludivine Crible, Ágnes Abuczki, Nijolė Burkšaitienė, Péter Furkó, Anna Nedoluzhko, Giedre Valunaite Oleskeviciene, Sigita Rackevičienė, Šárka Zikánová (2019): Functions and translations of underspecified discourse markers in TED Talks: a parallel corpus study on five languages. In: Journal of Pragmatics, ISSN 0378-2166, 142, pp. 139-155 (local PDF, bibtex)
Erion Çano, Ondřej Bojar (2019): Keyphrase Generation: A Multi-Aspect Survey. In: Proceedings of the 25th Conference of Open Innovations Association FRUCT 2019, pp. 85-94, Finnish-Russian University Cooperation in Telecommunications, Helsinki, Finland, ISBN 978-952-69244-0-3 (pdf, bibtex)
Erion Çano, Ondřej Bojar (2019): Keyphrase Generation: A Text Summarization Struggle. In: The 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 666-672, NAACL-HLT 2019, Minneapolis, MN, USA, ISBN 978-1-950737-13-0 (url, bibtex)
Erion Çano, Ondřej Bojar (2019): Sentiment Analysis of Czech Texts: An Algorithmic Survey. In: Proceedings of the 11th International Conference on Agents and Artificial Intelligence, pp. 973-979, SCITEPRESS Digital Library, Setúbal, Portugal, ISBN 978-989-758-350-6 (url, bibtex)
Erion Çano, Ondřej Bojar (2019): Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study. In: Proceedings of the 12th International Conference on Natural Language Generation (INLG 2019), pp. 229-239, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-94-9 (url, bibtex)
Kira Droganova, Andrey Kutuzov, Nikita Mediankin, Daniel Zeman (2019): ÚFAL–Oslo at MRP 2019: Garage Sale Semantic Parsing. In: Proceedings of the CoNLL 2019 Shared Task: Cross-Framework Meaning Representation Parsing, pp. 158-165, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-60-4 (pdf, local PDF, bibtex)
Kira Droganova, Daniel Zeman (2019): Towards Deep Universal Dependencies. In: Proceedings of the Fifth International Conference on Dependency Linguistics (Depling, Syntaxfest 2019), pp. 144-152, Association for Computational Linguistics, Paris, France, ISBN 978-1-950737-63-5 (pdf, local PDF, local PDF, bibtex)
Ondřej Dušek, David M. Howcroft, Verena Rieser (2019): Semantic Noise Matters for Neural Natural Language Generation. In: Proceedings of the 12th International Conference on Natural Language Generation (INLG 2019), pp. 421-426, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-94-9 (url, local PDF, bibtex)
Ondřej Dušek, Filip Jurčíček (2019): Neural Generation for Czech: Data and Baselines. In: Proceedings of the 12th International Conference on Natural Language Generation (INLG 2019), pp. 563-574, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-94-9 (url, bibtex)
Ondřej Dušek, Karin Sevegnani, Ioannis Konstas, Verena Rieser (2019): Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking). In: Proceedings of the 12th International Conference on Natural Language Generation (INLG 2019), pp. 369-376, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-94-9 (url, bibtex)
Meisyarah Dwiastuti (2019): English-Indonesian Neural Machine Translation for Spoken Language Domains. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pp. 309-314, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-47-5 (pdf, bibtex)
Hamid Haghdoost, Ebrahim Ansari, Zdeněk Žabokrtský, Mahshid Nikravesh (2019): Building a Morphological Network for Persian on Top of a Morpheme-Segmented Lexicon. In: Proceedings of the Second International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2019), pp. 91-100, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-08-0 (pdf, bibtex)
Jan Hajič, Eva Hajičová, Zdeňka Urešová (2019): A Multilingual Crossover between Types of Resources. In: Slavonic Natural Language Processing in the 21st Century, pp. 64-78, Tribun EU, Brno, Czechia, ISBN 9788026315452 (local DOCX, bibtex)
Eva Hajičová (2019): A Plea for Information Structure as a Part of Meaning Representation. In: Proceedings of the Workshop on Designing Meaning Representations, pp. 66-72, The Association for Computational Linguistics, Stroudsburg, PA 18360, USA, ISBN 978-1-950737-45-1 (pdf, local PDF, local PDF, bibtex)
Eva Hajičová (2019): MICHAEL ALEXANDER KIRKWOOD HALLIDAY (1925–2018) (obituary). In: Linguistica Pragensia, ISSN 0862-8432, vol. 29, no. 2, pp. 243-245 (url, local DOCX, bibtex)
Eva Hajičová (2019): Již Čapkovi roboti byli obdařeni řečí. In: 70 let podivné vědy. Rozhovory s našimi kybernetiky, pp. 15-20, Česká technika - nakladatelství ČVUT, Praha, Czechia, ISBN 978-80-01-06667-6 (bibtex)
Eva Hajičová, Jiří Mírovský (2019): Různé pojetí dichotomie v informační struktuře věty ve světle anotovaného korpusu. In: Svět podle Grepla, pp. 23-34, Host - vydavatelství, s.r.o., Brno, Czech Republic, ISBN 978-80-7577-810-9 (local PDF, bibtex)
Eva Hajičová, Jiří Mírovský, Kateřina Rysová (2019): Ordering of Adverbials of Time and Place in Grammars and in an Annotated English–Czech Parallel Corpus. In: Proceedings of the 18th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2019), pp. 51-60, Association for Computational Linguistics, Paris, France, ISBN 978-1-950737-64-2 (pdf, local PDF, bibtex)
Eva Hajičová, Jarmila Panevová (2019): Odešel významný český lingvista Petr Sgall. In: Slovo a slovesnost, ISSN 0037-7031, 80, pp. 346-348 (url, bibtex)
Jindřich Helcl, Jindřich Libovický, Martin Popel (2019): CUNI System for the WMT19 Robustness Task. In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 738-742, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (url, local PDF, local PDF, bibtex)
Jaroslava Hlaváčová (2019): Aggregates and Variants in two Czech morphological approaches. In: Proceedings of the 19th Conference ITAT 2019: Slovenskočeský NLP workshop (SloNLP 2019), pp. 120-124, CreateSpace Independent Publishing Platform, Košice, Slovakia (bibtex)
Jaroslava Hlaváčová, Marie Mikulová, Barbora Štěpánková, Jan Hajič (2019): Modifications of the Czech morphological dictionary for consistent corpus annotation. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 70, no. 2, pp. 380-389 (url, bibtex)
Maarten Janssen, Adriane Boyd, Alexandr Rosen, Elena Volodina, Egon Stemle, Nives Mikelić (2019): Working together towards an ideal infrastructure for language learner corpora. In: Widening the Scope of Learner Corpus Research : Selected Papers from the Fourth Learner Corpus Research Conference, pp. 427-468, Presses universitaires de Louvain, Louvain, Belgium, ISBN 9782875588685 (bibtex)
Simon Keizer, Ondřej Dušek, Xingkun Liu, Verena Rieser (2019): User Evaluation of a Multi-dimensional Statistical Dialogue System. In: Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, pp. 392-398, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-61-1 (url, bibtex)
Aleksei Kelli, Krister Lindén, Kadri Vider, Paweł Kamocki, Ramūnas Birštonas, Silvia Calamai, Penny Labropoulou, Maria Gavrilidou, Pavel Straňák (2019): Processing personal data without the consent of the data subject for the development and use of language resources. In: Linköping Electronic Conference Proceedings, ISSN 1650-3740, vol. 159, no. 8, pp. 72-82 (url, bibtex)
Václava Kettnerová, Markéta Lopatková (2019): Towards Reciprocal Deverbal Nouns in Czech: From Reciprocal Verbs to Reciprocal Nouns. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 70, no. 2, pp. 434-443 (local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková (2019): Reflexives in Czech from a Dependency Perspective. In: Proceedings of the Fifth International Conference on Dependency Linguistics (Depling, Syntaxfest 2019), pp. 14-25, Association for Computational Linguistics, Paris, France, ISBN 978-1-950737-63-5 (pdf, local PDF, bibtex)
Tom Kocmi (2019): Exploring Benefits of Transfer Learning in Neural Machine Translation (PhD thesis). In: (url, bibtex)
Tom Kocmi, Ondřej Bojar (2019): CUNI Submission for Low-Resource Languages in WMT News 2019. In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 234-240, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (pdf, bibtex)
Veronika Kolářová, Anna Vernerová, Jonathan Verner (2019): Non-systemic valency behavior of Czech deverbal nouns based on the NomVallex lexicon. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 70, no. 2, pp. 424-433 (url, bibtex)
Daniel Kondratyuk, Ronald Cardenas, Ondřej Bojar (2019): Replacing Linguists with Dummies: A Serious Need for Trivial Baselines in Multi-Task Neural Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 113, pp. 31-40 (pdf, bibtex)
Daniel Kondratyuk, Milan Straka (2019): 75 Languages, 1 Model: Parsing Universal Dependencies Universally. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2779-2795, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-90-1 (url, local PDF, bibtex)
Ivana Kvapilíková, Dominik Macháček, Ondřej Bojar (2019): CUNI Systems for the Unsupervised News Translation Task in WMT 2019. In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 241-248, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (pdf, bibtex)
Lukáš Kyjánek, Zdeněk Žabokrtský, Magda Ševčíková, Jonáš Vidra (2019): Universal Derivations Kickoff: A Collection of Harmonized Derivational Resources for Eleven Languages. In: Proceedings of the Second International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2019), pp. 101-110, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-08-0 (pdf, bibtex)
Jindřich Libovický (2019): Multimodality in Machine Translation (PhD thesis). In: (pdf, local PDF, local PDF, bibtex)
Jindřich Libovický (2019): Neuronové sítě a automatický překlad. In: Rozhledy matematicko-fyzikální, ISSN 0035-9343, vol. 94, no. 4, pp. 30-40 (url, bibtex)
Jindřich Libovický, Pranava Madhyastha (2019): Probing Representations Learned by Multimodal Recurrent and Transformer Models (Electronic). (url)
Jindřich Libovický, Rudolf Rosa, Alexander Fraser (2019): How Language-Neutral is Multilingual BERT? (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, vol. arXiv:1911.03310 [cs.CL], no. arXiv:1911.03310 [cs.CL], pp. 1-6 (url, local PDF)
Dominik Macháček, Jonáš Kratochvíl, Tereza Vojtěchová, Ondřej Bojar (2019): A Speech Test Set of Practice Business Presentations with Additional Relevant Texts. In: Statistical Language and Speech Processing, pp. 151-161, Springer Nature Switzerland AG, Cham, Switzerland, ISBN 978-3-030-31371-5 (url, bibtex)
Qingsong Ma, Johnny Tian-Zheng Wei, Ondřej Bojar, Yvette Graham (2019): Results of the WMT19 Metrics Shared Task: Segment-Level and Strong MT Systems Pose Big Challenges. In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 62-90, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (url, bibtex)
David Mareček, Rudolf Rosa (2019): From Balustrades to Pierre Vinken: Looking for Syntax in Transformer Self-Attentions. In: The BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL 2019, pp. 263-275, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-30-7 (url, local PDF, local PDF, bibtex)
Marie Mikulová, Veronika Kolářová, Jarmila Panevová, Eva Hajičová (2019): Delimiting Adverbial Meanings. A corpus-based comparative study on Czech spatial prepositions and their English equivalents. In: Proceedings of the Fifth International Conference on Dependency Linguistics (Depling, Syntaxfest 2019), pp. 153-159, Association for Computational Linguistics, Paris, France, ISBN 978-1-950737-63-5 (pdf, local PDF, bibtex)
Marie Mikulová, Jarmila Panevová (2019): Subkategorizace adverbiálních významů (hranice mezi obsahem a významem). In: Korpus – gramatika – axiologie, ISSN 1804-137X, 20, pp. 33-46 (bibtex)
Behzad Moradi, Ebrahim Ansari, Zdeněk Žabokrtský (2019): Unsupervised Word Sense Disambiguation Using Word Embeddings. In: Proceedings of the 25th Conference of Open Innovations Association FRUCT 2019, pp. 228-233, Finnish-Russian University Cooperation in Telecommunications, Helsinki, Finland, ISBN 978-952-69244-0-3 (pdf, bibtex)
Tomáš Musil (2019): Examining Structure of Word Embeddings with PCA. In: Proceedings of the 22nd International Conference on Text, Speech and Dialogue - TSD 2019, Lecture Notes in Computer Science, ISSN 0302-9743, 11697, pp. 211-223, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-030-27946-2 (bibtex)
Tomáš Musil, Jonáš Vidra, David Mareček (2019): Derivational Morphological Relations in Word Embeddings. In: The BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL 2019, pp. 173-180, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-30-7 (url, bibtex)
Toshiaki Nakazawa, Nobushige Doi, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi (2019): Overview of the 6th Workshop on Asian Translation. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 1-35, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-90-1 (pdf, bibtex)
Jakub Náplava, Milan Straka (2019): Grammatical Error Correction in Low-Resource Scenarios. In: Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019), pp. 346-356, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-84-0 (url, local PDF, bibtex)
Jakub Náplava, Milan Straka (2019): CUNI System for the Building Educational Applications 2019 Shared Task: Grammatical Error Correction. In: Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, pp. 183-190, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-34-5 (url, local PDF, bibtex)
Minoo Nassajian, Ehsan Doostmohammadi (2019): Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts. In: Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects, pp. 188-193, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-11-6 (url, bibtex)
Anna Nedoluzhko, Ondřej Bojar (2019): Towards Automatic Minuting of Meetings. In: Proceedings of the 19th Conference ITAT 2019: Slovenskočeský NLP workshop (SloNLP 2019), pp. 112-119, CreateSpace Independent Publishing Platform, Košice, Slovakia (url, local PDF, bibtex)
Michal Novák, Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová (2019): Exploiting Large Unlabeled Data in Automatic Evaluation of Coherence in Czech. In: Proceedings of the 22nd International Conference on Text, Speech and Dialogue - TSD 2019, Lecture Notes in Computer Science, ISSN 0302-9743, 11697, pp. 197-210, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-030-27946-2 (url, bibtex)
Stephan Oepen, Omri Abend, Jan Hajič, Daniel Hershcovich, Marco Kuhlmann, Nianwen Xue, Jayeol Chun, Milan Straka, Zdeňka Urešová, Tim O'Gorman (2019): MRP 2019: Cross-Framework Meaning Representation Parsing. In: Proceedings of the CoNLL 2019 Shared Task: Cross-Framework Meaning Representation Parsing, pp. 1-27, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-60-4 (url, local PDF, local PDF, bibtex)
Giedre Valunaite Oleskeviciene, Sigita Rackevičienė, Jolita Šliogerienė, Nijolė Burkšaitienė, Viktorija Mažeikienė, Liudmila Mockienė, Péter Furkó, Ágnes Abuczki, Šárka Zikánová (2019): Fuzzy Boundaries in the Different Functions and Translations of the Discourse Marker and. In: Fuzzy Boundaries in Discourse Studies, pp. 1-13, Palgrave Macmillan, Great Britain, ISBN 978-3-030-27572-3 (local PDF, bibtex)
Klára Osolsobě, Jaroslava Hlaváčová (2019): NovaMorf: konec dlouhého období konvergencí a divergencí ve zpracování české morfologie. In: Slavonic Natural Language Processing in the 21st Century, pp. 93-99, Tribun EU, Brno, Czech Republic, ISBN 978-80-263-1545-2 (bibtex)
Shruti Palaskar, Jindřich Libovický, Spandana Gella, Florian Metze (2019): Multimodal Abstractive Summarization for How2 Videos. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 6587-6596, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-48-2 (url, local PDF, local PDF, bibtex)
Jarmila Panevová (2019): Studie z české morfologie a syntaxe (Vybrané stati). In: , ISBN 978-80-246-4388-5 (bibtex)
Jarmila Panevová, Marie Mikulová (2019): Úvahy nad příslovečnými určeními a jejich klasifikací. In: Svět podle Grepla, pp. 139-149, Host, Brno, Czechia, ISBN 978-80-7577-810-9 (bibtex)
Shantipriya Parida, Ondřej Bojar, Satya Ranjan Dash (2019): OdiEnCorp: Odia-English and Odia-Only Corpus for Machine Translation. In: Proceedings of the Third International Conference on Smart Computing and Informatics, Volume 1, pp. 495-504, Springer, Singapore, ISBN 978-981-13-9282-5 (url, bibtex)
Shantipriya Parida, Ondřej Bojar, Satya Ranjan Dash (2019): Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine Translation. In: Computación y Sistemas, ISSN 1405-5546, vol. 23, no. 4, pp. 1499-1505 (url, bibtex)
Shantipriya Parida, Petr Motlíček, Ondřej Bojar (2019): Idiap NMT System for WAT 2019 Multi-Modal Translation Task. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 175-180, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-90-1 (pdf, bibtex)
Vladimír Petkevič, Jaroslava Hlaváčová, Klára Osolsobě, Martin Svášek, Josef Šimandl (2019): Parts of Speech in NovaMorf, a New Morphological Annotation of Czech. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 70, no. 2, pp. 358-369 (bibtex)
Thuong-Hai Pham, Dominik Macháček, Ondřej Bojar (2019): Promoting the Knowledge of Source Syntax in Transformer NMT Is Not Needed. In: Computación y Sistemas, ISSN 1405-5546, vol. 23, no. 3, pp. 923-934 (url, bibtex)
Lucie Poláková, Jiří Mírovský (2019): Anaphoric Connectives and Long-Distance Discourse Relations in Czech. In: Computación y Sistemas, ISSN 1405-5546, vol. 23, no. 3, pp. 711-717 (url, local PDF, bibtex)
Maria Ponomareva, Kira Droganova, Ivan Smurov, Tatiana Shavrina (2019): AGRR 2019: Corpus for Gapping Resolution in Russian. In: Balto-Slavic Natural Language Processing, pp. 35-43, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-41-3 (url, local PDF, bibtex)
Martin Popel, Christian Federmann (2019): Domain Adaptation of Document-Level NMT in IWSLT19. In: Proceedings of the 16th International Workshop on Spoken Language Translation, pp. 1-7, Karlsruhe Institute of Technology, Karlsruhe, Germany (url, bibtex)
Martin Popel, Dominik Macháček, Michal Auersperger, Ondřej Bojar, Pavel Pecina (2019): English-Czech Systems in WMT19: Document-Level Transformer. In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 342-348, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (pdf, local PDF, bibtex)
Rudolf Rosa, Zdeněk Žabokrtský (2019): Attempting to separate inflection and derivation using vector space representations. In: Proceedings of the Second International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2019), pp. 61-70, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-08-0 (url, local PDF, local PDF, local PDF, bibtex)
Rudolf Rosa, Zdeněk Žabokrtský (2019): Unsupervised Lemmatization as Embeddings-Based Word Clustering (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, arXiv:1908.08528 [cs.CL], pp. 1-5 (url, local PDF)
Kateřina Rysová (2019): Jubilejní 45. ročník Olympiády v českém jazyce. In: Český jazyk a literatura, ISSN 0009-0786, vol. 70, no. 1, pp. 1-7 (local PDF, bibtex)
Kateřina Rysová, Magdaléna Rysová (2019): Anaphoric Connective Database. In: Proceedings of the 11th Annual International Conference on Education and New Learning Technologies (EDULEARN 2019), pp. 6692-6700, IATED Academy, Palma, Spain, ISBN 978-84-09-12031-4 (url, bibtex)
Kateřina Rysová, Magdaléna Rysová, Tomáš Musil, Lucie Poláková, Ondřej Bojar (2019): A Test Suite and Manual Evaluation of Document-Level NMT at WMT19. In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 455-463, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (url, local PDF, bibtex)
Kateřina Rysová, Magdaléna Rysová, Michal Novák, Jiří Mírovský, Eva Hajičová (2019): EVALD – a Pioneer Application for Automated Essay Scoring in Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 113, pp. 9-30 (url, local PDF, bibtex)
Magdaléna Rysová (2019): Jana Hoffmannová, Jiří Homoláč, Kamila Mrázková (eds.): Syntax mluvené češtiny, Praha: Academia, 2019 (review). In: Korpus – gramatika – axiologie, ISSN 1804-137X, pp. 76-80 (pdf, local PDF, bibtex)
Magdaléna Rysová, Kateřina Rysová, Jiří Mírovský, Michal Novák (2019): Coherence Errors in Learners’ Essays and a Possibility of Their Improvement through EVALD (Automated Evaluator of Discourse). In: Proceedings of the 11th Annual International Conference on Education and New Learning Technologies (EDULEARN 2019), pp. 6761-6768, IATED Academy, Palma, Spain, ISBN 978-84-09-12031-4 (url, bibtex)
Shadi Saleh, Pavel Pecina (2019): Term Selection for Query Expansion in Medical Cross-Lingual Information Retrieval. In: Advances in Information Retrieval; 41st European Conference on IR Research, ECIR 2019, Lecture Notes in Computer Science, ISSN 0302-9743, 1, pp. 507-522, Springer International Publishing, Berlin, Germany, ISBN 978-3-030-15719-7 (url, local PDF, bibtex)
Shadi Saleh, Pavel Pecina (2019): An Extended CLEF eHealth Test Collection for Cross-lingual Information Retrieval in the Medical Domain. In: Advances in Information Retrieval; 41st European Conference on IR Research, ECIR 2019, Lecture Notes in Computer Science, ISSN 0302-9743, 1, pp. 188-195, Springer International Publishing, Berlin, Germany, ISBN 978-3-030-15719-7 (url, local PDF, bibtex)
Jakub Sláma, Barbora Štěpánková (2019): On the Valency of Various Types of Adverbs and Its Lexicographic Description. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 70, no. 2, pp. 158-169 (url, bibtex)
Ivan Smurov, Maria Ponomareva, Tatiana Shavrina, Kira Droganova (2019): AGRR-2019: Automatic Gapping Resolution for Russian. In: Computational Linguistics and Intellectual Technologies, pp. 600-614, nakl. RGGU, Moscow, Russia (pdf, local PDF, bibtex)
Milan Straka, Jana Straková (2019): ÚFAL MRPipe at MRP 2019: UDPipe Goes Semantic in the Meaning Representation Parsing Shared Task. In: Proceedings of the CoNLL 2019 Shared Task: Cross-Framework Meaning Representation Parsing, pp. 127-137, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-60-4 (url, local PDF, bibtex)
Milan Straka, Jana Straková, Jan Hajič (2019): Czech Text Processing with Contextual Embeddings: POS Tagging, Lemmatization, Parsing and NER. In: Proceedings of the 22nd International Conference on Text, Speech and Dialogue - TSD 2019, Lecture Notes in Computer Science, ISSN 0302-9743, 11697, pp. 137-150, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-030-27946-2 (url, local PDF, bibtex)
Milan Straka, Jana Straková, Jan Hajič (2019): Evaluating Contextualized Embeddings on 54 Languages in POS Tagging, Lemmatization and Dependency Parsing (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, 1904.02099 (url, local PDF)
Milan Straka, Jana Straková, Jan Hajič (2019): UDPipe at SIGMORPHON 2019: Contextualized Embeddings, Regularization with Morphological Categories, Corpora Merging. In: Proceedings of the 16th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pp. 95-103, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-36-9 (pdf, local PDF, bibtex)
Jana Straková, Milan Straka, Jan Hajič (2019): Neural Architectures for Nested NER through Linearization. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5326-5331, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-48-2 (pdf, local PDF, bibtex)
Jana Straková, Milan Straka, Jan Hajič, Martin Popel (2019): Hluboké učení v automatické analýze českého textu. In: Slovo a slovesnost, ISSN 0037-7031, vol. 80, no. 4, pp. 306-327 (bibtex)
Pavel Straňák, Ondřej Košarko, Jozef Mišutka (2019): CLARIN-DSpace repository at LINDAT/CLARIN: LINDAT/CLARIN FAIR repository for language data. In: the grey Journal – International Journal on Grey Literature, ISSN 1574-1796, 16, pp. 52-61 (url, bibtex)
Magda Ševčíková, Lukáš Kyjánek (2019): Introducing Semantic Labels into the DeriNet Network. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 70, no. 2, pp. 412-423 (bibtex)
Svatava Škodová, Kateřina Rysová, Magdaléna Rysová (2019): Comparison of Automatic and Human Evaluation of L2 Texts in Czech. In: Journal of Slavic Languages, ISSN 1226-2323, vol. 24, no. 1, pp. 93-101 (bibtex)
Francis M. Tyers, Jonathan North Washington, Darya Kavitskaya, Memduh Gökırmak, Nick Howell, Remziye Berberova (2019): A Biscriptual Morphological Transducer for Crimean Tatar. In: Proceedings of the 3rd Workshop on Computational Methods for Endangered Languages: Vol. 1 Papers, pp. 74-80, University of Hawai'i, Honolulu, Hawai'i, USA (url, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová (2019): CzEngClass: Contextually-based Synonymy and Valency of Verbs in a Bilingual Setting (technical report). In: (pdf, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2019): Meaning and Semantic Roles in CzEngClass Lexicon. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 70, no. 2, pp. 403-411 (url, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2019): Parallel Dependency Treebank Annotated with Interlinked Verbal Synonym Classes and Roles. In: Proceedings of the 18th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2019), pp. 38-50, Association for Computational Linguistics, Paris, France, ISBN 978-1-950737-64-2 (pdf, local PDF, bibtex)
Dušan Variš, Ondřej Bojar (2019): Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pp. 130-135, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-47-5 (pdf, local PDF, local PDF, bibtex)
Anna Vernerová (2019): Lexicographic treatment of the valency aspects of verbal diatheses (PhD thesis). In: (pdf, local PDF, bibtex)
Jonáš Vidra, Zdeněk Žabokrtský, Magda Ševčíková, Lukáš Kyjánek (2019): DeriNet 2.0: Towards an All-in-One Word-Formation Resource. In: Proceedings of the Second International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2019), pp. 81-89, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-08-0 (pdf, bibtex)
Tereza Vojtěchová, Michal Novák, Miloš Klouček, Ondřej Bojar (2019): SAO WMT19 Test Suite: Machine Translation of Audit Reports. In: Fourth Conference on Machine Translation - Proceedings of the Conference, pp. 680-692, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-27-7 (url, bibtex)
Šárka Zikánová, Jiří Mírovský, Pavlína Synková (2019): Explicit and Implicit Discourse Relations in the Prague Discourse Treebank. In: Proceedings of the 22nd International Conference on Text, Speech and Dialogue - TSD 2019, Lecture Notes in Computer Science, ISSN 0302-9743, 11697, pp. 236-248, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-030-27946-2 (url, bibtex)
Ágnes Abuczki, Nijolė Burkšaitienė, Ludivine Crible, Péter Furkó, Giedre Valunaite Oleskeviciene, Sigita Rackevičienė, Šárka Zikánová (2018): Translation of "and" in a parallel TED Talk corpus of English, Czech, Hungarian, Lithuanian and French: functions and omissions. In: TextLink – Structuring Discourse in Multilingual Europe – Final Action Conference, pp. 1-4, University of Toulouse, Toulouse, France (pdf, bibtex)
Ahmad Aghaebrahimian (2018): Deep Neural Networks at the Service of Multilingual Parallel Sentence Extraction. In: Proceedings of The 27th International Conference on Computational Linguistics, pp. 1372-1383, ICCL, Sheffield, GB, ISBN 978-4-87974-703-7 (url, bibtex)
Ahmad Aghaebrahimian (2018): Deep Multi-Lingual Cross Sentence Alignment. In: Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, pp. 1-4, The Hong Kong Polytechnic University, Hong Kong, China (bibtex)
Ahmad Aghaebrahimian (2018): Linguistically-based Deep Unstructured Question Answering. In: Proceedings of CoNLL 2018: The SIGNLL Conference on Computational Natural Language Learning, pp. 433-443, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-72-8 (url, bibtex)
Petra Barančíková, Václava Kettnerová (2018): Paraphrases of verbal multiword expressions: the case of Czech light verbs and idioms. In: Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop, pp. 35-59, Language Science Press, Berlin, Germany, ISBN 978-3-96110-123-8 (url, bibtex)
Petr Bělohlávek, Ondřej Plátek, Zdeněk Žabokrtský, Milan Straka (2018): Using Adversarial Examples in Natural Language Processing. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 3693-3700, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Ondřej Bojar, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Christof Monz (2018): Findings of the 2018 Conference on Machine Translation (WMT18). In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 272-307, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Ondřej Bojar, Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová (2018): EvalD Reference-Less Discourse Evaluation for WMT18. In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 545-549, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (url, local PDF, bibtex)
Gosse Bouma, Jan Hajič, Joakim Nivre, Per Erik Solberg, Lilja Øvrelid (2018): Expletives in Universal Dependency Treebanks. In: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), pp. 18-26, Association for Computational Linguistics, Bruxelles, Belgium, ISBN 978-1-948087-78-0 (url, bibtex)
Franck Burlot, Yves Scherrer, Vinit Ravishankar, Ondřej Bojar, Stig-Arne Grönroos, Maarit Koponen, Tommi Nieminen, François Yvon (2018): The WMT’18 Morpheval test suites for English-Czech, English-German, English-Finnish and Turkish-English. In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 550-564, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Jorge Calvo-Zaragoza, Jan Hajič, jr., Alexander Pacha (2018): Discussion Group Summary: Optical Music Recognition. In: Graphics Recognition. Current Trends and Evolutions. 12th IAPR International Workshop, GREC 2017, Kyoto, Japan, November 9-10, 2017, Revised Selected Papers, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 11009, no. 1, pp. 152-157, Springer International Publishing, Basel, Switzerland, ISBN 978-3-030-02284-6 (url, bibtex)
Ronald Cardenas, Daniel Zeman (2018): A Morphological Analyzer for Shipibo-Konibo. In: Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology, pp. 131-139, Association for Computational Linguistics, Bruxelles, Belgium, ISBN 978-1-948087-76-6 (url, local PDF, bibtex)
Flavio Massimiliano Cecchini, Marco Passarotti, Paola Marongiu, Daniel Zeman (2018): Challenges in Converting the Index Thomisticus Treebank into Universal Dependencies. In: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), pp. 27-36, Association for Computational Linguistics, Bruxelles, Belgium, ISBN 978-1-948087-78-0 (url, local PDF, bibtex)
Ondřej Cífka, Ondřej Bojar (2018): Are BLEU and Meaning Representation in Opposition? In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1362-1371, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-32-2 (url, bibtex)
Silvie Cinková, Ondřej Bojar (2018): Testsuite on Czech–English Grammatical Contrasts. In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 565-575, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Radek Čech, Jiří Milička, Ján Mačutek, Michaela Koščová, Markéta Lopatková (2018): Quantitative Analysis of Syntactic Dependency in Czech. In: Quantitative Analysis of Dependency Structures, pp. 53-70, De Gruyter, Berlin, Boston, ISBN 978-3-11-057356-5 (bibtex)
Laurence Danlos, Kateřina Rysová, Magdaléna Rysová, Manfred Stede (2018): Primary and secondary discourse connectives: definitions and lexicons. In: Dialogue and Discourse, ISSN 2152-9620, vol. 9, no. 1, pp. 50-78 (url, local PDF, bibtex)
Matthias Dorfer, Jan Hajič, jr., Andreas Arzt, Harald Frostel, Gerhard Widmer (2018): Learning Audio–Sheet Music Correspondences for Cross-Modal Retrieval and Piece Identification. In: Transactions of the International Society for Music Information Retrieval, ISSN 2514-3298, vol. 1, no. 1, pp. 22-33 (url, bibtex)
Kira Droganova, Filip Ginter, Jenna Kanerva, Daniel Zeman (2018): Mind the Gap: Data Enrichment in Dependency Parsing of Elliptical Constructions. In: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), pp. 47-54, Association for Computational Linguistics, Bruxelles, Belgium, ISBN 978-1-948087-78-0 (url, local PDF, bibtex)
Kira Droganova, Olga Lyashevskaya (2018): Cross-Tagset Parsing Evaluation for Russian. In: Digital Transformation and Global Society, pp. 380-390, Springer International Publishing, Cham, ISBN 978-3-030-02842-8 (bibtex)
Kira Droganova, Olga Lyashevskaya, Daniel Zeman (2018): Data Conversion and Consistency of Monolingual Corpora: Russian UD Treebanks. In: Proceedings of the 17th International Workshop on Treebanks and Linguistic Theories (TLT 2018), pp. 53-66, Linköping University Electronic Press, Linköping, Sweden, ISBN 978-91-7685-137-1 (pdf, local PDF, bibtex)
Kira Droganova, Daniel Zeman, Jenna Kanerva, Filip Ginter (2018): Parse Me if You Can: Artificial Treebanks for Parsing Experiments on Elliptical Constructions. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1845-1852, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Puneet Dwivedi, Daniel Zeman (2018): The Forest Lion and the Bull: Morphosyntactic Annotation of the Panchatantra. In: Computación y Sistemas, ISSN 1405-5546, vol. 22, no. 4, pp. 1377-1384 (url, local PDF, bibtex)
Jan Hajič, jr., Matthias Dorfer, Gerhard Widmer, Pavel Pecina (2018): Towards Full-Pipeline Handwritten OMR with Musical Symbol Detection by U-Nets. In: Proceedings of the 19th Conference of the International Society for Music Information Retrieval, pp. 225-232, International Society for Music Information Retrieval, New York, NY, USA, ISBN 978-2-9540351-2-3 (pdf, bibtex)
Jan Hajič, jr., Marta Kolárová, Alexander Pacha, Jorge Calvo-Zaragoza (2018): How current optical music recognition systems are becoming useful for digital libraries. In: Proceedings of the 5th International Conference on Digital Libraries for Musicology, pp. 57-61, ACM, New York, NY, USA, ISBN 978-1-4503-6522-2 (url, bibtex)
Jan Hajič, jr., Alexander Pacha, Jorge Calvo-Zaragoza (2018): Optical Music Recognition for Dummies (Electronic). (url)
Eva Hajičová (2018): What We Can Learn From J.-M. Zemb About Negation and Information Structure: A View From Prague. In: Diskursgrammatik – Grammaire du discours Hommage à Jean-Marie Zemb, pp. 43-55, Peter Lang, Berlin, ISBN 978-3-631-77674-2 (url, bibtex)
Eva Hajičová, Jiří Mírovský (2018): Discourse Coherence Through the Lens of an Annotated Text Corpus: A Case Study. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1637-1642, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Eva Hajičová, Jiří Mírovský (2018): Identification of Thematic Discourse Relations on the Data from an Annotated Corpus of Czech. In: TextLink – Structuring Discourse in Multilingual Europe – Final Action Conference, pp. 56-63, University of Toulouse, Toulouse, France (pdf, local PDF, bibtex)
Jirka Hana, Barbora Hladká (2018): Universal Dependencies and Non-Native Czech. In: Proceedings of the 17th International Workshop on Treebanks and Linguistic Theories (TLT 2018), pp. 105-114, Linköping University Electronic Press, Linköping, Sweden, ISBN 978-91-7685-137-1 (pdf, bibtex)
Jirka Hana, Barbora Hladká (2018): Syntactic annotation of a second-language learner corpus (Electronic). (url)
Jindřich Helcl, Jindřich Libovický, Tom Kocmi, Tomáš Musil, Ondřej Cífka, Dušan Variš, Ondřej Bojar (2018): Neural Monkey: The Current State and Beyond. In: The 13th Conference of The Association for Machine Translation in the Americas, Vol. 1: MT Researchers’ Track, pp. 168-176, The Association for Machine Translation in the Americas, Stroudsburg, PA, USA (url, local PDF, local PDF, bibtex)
Jindřich Helcl, Jindřich Libovický, Dušan Variš (2018): CUNI System for the WMT18 Multimodal Translation Task. In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 622-629, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, local PDF, local PDF, bibtex)
Erhard Hinrichs, Nancy Ide, James Pustejovsky, Jan Hajič, Marie Hinrichs, Mohammad Fazleh Elahi, Keith Suderman, Marc Verhagen, Kyeongmin Rim, Pavel Straňák, Jozef Mišutka (2018): Bridging the LAPPS Grid and CLARIN. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1-10, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Jaroslava Hlaváčová (2018): Prefixal Morphemes of Czech Verbs. In: Proceedings of the 21st International Conference on Text, Speech and Dialogue—TSD 2018, pp. 50-57, Springer-Verlag, Cham, Switzerland, ISBN 978-3-030-00794-2 (bibtex)
Jaroslava Hlaváčová (2018): Využití velkých korpusů pro morfematickou analýzu českých slovesných předpon. In: ARANEA 2018 Web Corpora as a Language Training Tool; Les corpus web comme instrument de formation linguistique, pp. 40-48, Comenius University in Bratislava, Faculty of Arts, Bratislava, Slovakia, ISBN 978-80-223-4597-2 (local Gzipped PDF, bibtex)
Maarten Janssen (2018): Adding Words to Manuscripts: From PagesXML to TEITOK. In: TPDL 2018: Digital Libraries for Open Knowledge, pp. 152-157, Springer International Publishing, Universidade do Porto, ISBN 978-3-030-00066-0 (url, bibtex)
Aleksei Kelli, Krister Lindén, Kadri Vider, Penny Labropoulou, Erik Ketzan, Paweł Kamocki, Pavel Straňák (2018): Implementation of an Open Science Policy in the context of management of CLARIN language resources: a need for changes? In: Linköping Electronic Conference Proceedings, ISSN 1650-3740, vol. 9, no. 147, pp. 102-111 (url, local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková (2018): Lexicographic Potential of the Syntactic Properties of Verbs: The Case of Reciprocity in Czech. In: XVIII EURALEX International Congress, Lexicography in Global Contexts, pp. 685-698, Ljubljana University Press, Faculty of Arts, Ljubljana, Slovenia, ISBN 978-961-06-0096-1 (url, bibtex)
Václava Kettnerová, Markéta Lopatková (2018): Mezi reflexivitou a reciprocitou: Poznámky k reflexivním a recipročním konstrukcím vybraných českých sloves. In: Prace Filologiczne, ISSN 0138-0567, LXXII, pp. 131-145 (bibtex)
Václava Kettnerová, Markéta Lopatková, Eduard Bejček, Petra Barančíková (2018): Enriching VALLEX with Light Verbs: From Theory to Data and Back Again. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 111, pp. 29-56 (url, local PDF, bibtex)
Tom Kocmi, Ondřej Bojar (2018): Trivial Transfer Learning for Low-Resource Neural Machine Translation. In: Proceedings of the Third Conference on Machine Translation, Volume 1: Research Papers, pp. 244-252, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (url, local PDF, bibtex)
Tom Kocmi, Shantipriya Parida, Ondřej Bojar (2018): CUNI NMT System for WAT 2018 Translation Tasks. In: Proceedings of the 5th Workshop on Asian Translation (WAT2018), pp. 1-7, Asian Federation of Natural Language Processing, Hong Kong, China (url, bibtex)
Tom Kocmi, Roman Sudarikov, Ondřej Bojar (2018): CUNI Submissions in WMT18. In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 435-441, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Tom Kocmi, Dušan Variš, Ondřej Bojar (2018): CUNI Basque-to-English Submission in IWSLT18. In: Proceedings of the International Workshop on Spoken Language Translation, pp. 142-146, Karlsruhe Institute of Technology, Karlsruhe, Germany (pdf, bibtex)
Veronika Kolářová, Jana Klímová, Anna Vernerová (2018): Valency Lexicon of Czech Nouns NomVallex: Starting Point and Goals. In: Slovanská lexikografie počátkem 21. století. Sborník příspěvků z mezinárodní konference., pp. 219-226, Slovanský ústav AV ČR, v.v.i., Praha, Czechia, ISBN 978-80-86420-65-3 (bibtex)
Veronika Kolářová, Anna Vernerová, Jana Klímová (2018): Předložková vyjádření adnominálních valenčních doplnění. In: Prace Filologiczne, ISSN 0138-0567, 72, pp. 211-223 (bibtex)
Daniel Kondratyuk, Tomáš Gavenčiak, Milan Straka, Jan Hajič (2018): LemmaTag: Jointly Tagging and Lemmatizing for Morphologically Rich Languages with BRNNs. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing EMNLP 2018, pp. 4921-4928, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-84-1 (url, local PDF, bibtex)
Vincent Kríž, Barbora Hladká (2018): Czech Legal Text Treebank 2.0. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 4501-4505, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (bibtex)
Oldřich Krůza (2018): Phonetic Transcription by Untrained Annotators. In: Proceedings of the 18th conference ITAT 2018: Slovenskočeský NLP workshop (SloNLP 2018), pp. 35-40, CreateSpace Independent Publishing Platform, Košice, Slovakia, ISBN 978-1727267198 (url, bibtex)
Oldřich Krůza, Vladislav Kuboň (2018): Second-Generation Web Interface to Correcting ASR Output. In: Proceedings of the Future Technologies Conference (FTC) 2018, pp. 749-762, Springer-Verlag, Cham, Switzerland, ISBN 978-3-030-02685-1 (bibtex)
David Kuboň, Eleni Metheniti, Barbora Hladká (2018): Politician -- An Imitation Game. In: Internet Science, pp. 201-212, Springer International Publishing, Cham, Switzerland, ISBN 978-3-319-77546-3 (bibtex)
Vladislav Kuboň, Markéta Lopatková, Jiří Mírovský (2018): Analysis of Word Order in Multiple Treebanks. In: 17th International Conference on Intelligent Text Processing and Computational Linguistics, pp. 345-355, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-319-75476-5 (bibtex)
Lukáš Kyjánek (2018): Morphological Resources of Derivational Word-Formation Relations (technical report). In: (pdf, bibtex)
Mateusz Lango, Magda Ševčíková, Zdeněk Žabokrtský (2018): Semi-Automatic Construction of Word-Formation Networks (for Polish and Spanish). In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1853-1860, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (pdf, bibtex)
Jindřich Libovický, Thomas Brovelli (Meyer), Bruno Cartoni (2018): Machine Translation Evaluation beyond the Sentence Level. In: Proceedings of the 21st Annual Conference of the European Association for Machine Translation (2018), pp. 179-188, European Association for Machine Translation, Allschwil, Switzerland, ISBN 978-84-09-01901-4 (pdf, local PDF, local PNG, bibtex)
Jindřich Libovický, Jindřich Helcl (2018): End-to-End Non-Autoregressive Neural Machine Translation with Connectionist Temporal Classification. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing EMNLP 2018, pp. 3016-3021, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-84-1 (url, local PDF, local PDF, bibtex)
Jindřich Libovický, Jindřich Helcl, David Mareček (2018): Input Combination Strategies for Multi-Source Transformer Decoder. In: Proceedings of the Third Conference on Machine Translation, Volume 1: Research Papers, pp. 253-260, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (url, local PDF, local PDF, bibtex)
Jindřich Libovický, Shruti Palaskar, Spandana Gella, Florian Metze (2018): Multimodal Abstractive Summarization for Open-Domain Videos. In: Visually Grounded Interaction and Language (ViGIL), pp. 1-8, Neural Information Processing Systems (NIPS) Foundation, La Jolla, CA, USA (pdf, local PDF, local PDF, local PDF, bibtex)
Jindřich Libovický, Rudolf Rosa, Jindřich Helcl, Martin Popel (2018): Solving Three Czech NLP Tasks End-to-End with Neural Models. In: Proceedings of the 18th conference ITAT 2018: Slovenskočeský NLP workshop (SloNLP 2018), pp. 138-143, CreateSpace Independent Publishing Platform, Košice, Slovakia, ISBN 978-1727267198 (pdf, local PDF, local PDF, bibtex)
Dominik Macháček, Jonáš Vidra, Ondřej Bojar (2018): Morphological and Language-Agnostic Word Segmentation for NMT. In: Proceedings of the 21st International Conference on Text, Speech and Dialogue—TSD 2018, pp. 277-284, Springer-Verlag, Cham, Switzerland, ISBN 978-3-030-00794-2 (url, bibtex)
Qingsong Ma, Ondřej Bojar, Yvette Graham (2018): Results of the WMT18 Metrics Shared Task: Both characters and embeddings achieve good performance. In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 682-701, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
David Mareček, Rudolf Rosa (2018): Extracting Syntactic Trees from Transformer Encoder Self-Attentions. In: Proceedings of the First Workshop on Analyzing and Interpreting Neural Networks for NLP, pp. 347-349, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-71-1 (url, local PDF, local PDF, bibtex)
Sonja Marković, Daniel Zeman (2018): Reflexives in Universal Dependencies. In: Proceedings of the 17th International Workshop on Treebanks and Linguistic Theories (TLT 2018), pp. 131-146, Linköping University Electronic Press, Linköping, Sweden, ISBN 978-91-7685-137-1 (pdf, local PDF, local PDF, bibtex)
Marie Mikulová, Eduard Bejček (2018): ForFun 1.0: Prague Database of Forms and Functions -- An Invaluable Resource for Linguistic Research. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1-8, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Marie Mikulová, Eduard Bejček, Eva Hajičová, Jarmila Panevová (2018): Search for the Relation of Form and Function Using the ForFun Database. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 110, pp. 71-84 (pdf, local PDF, bibtex)
Marie Mikulová, Eduard Bejček, Jarmila Panevová (2018): What Can We Find Out about Time and Space in the ForFun Database?. In: Proceedings of the Second Workshop on Corpus-Based Research in the Humanities CRH-2, pp. 133-142, Dept. of Geoinformation, TU Wien, Wien, Austria, ISBN 978-3-901716-43-0 (bibtex)
Jakub Mlynář (2018): „To jsme všechny měly ten pocit – musíme se toho zbavit, musíme na to zapomenout“: Reflexe připomínání holocaustu v rozhovorech s česko-slovenskými přeživšími. In: "Nechtění" spoluobčané: Skupiny obyvatel perzekvovaných či marginalizovaných z politických, národnostních, náboženských i jiných důvodů v letech 1945-1989, pp. 116-130, Ústav pro studium totalitních režimů / Technická univerzita v Liberci, Praha / Liberec, Czech Republic, ISBN 978-80-88292-06-7 (bibtex)
Jakub Mlynář, Hamed Alavi, Himanshu Verma, Lorenzo Cantoni (2018): Towards a Sociological Conception of Artificial Intelligence. In: Artificial General Intelligence, pp. 130-139, Springer Nature, Cham, Switzerland, ISBN 978-3-319-97675-4 (url, bibtex)
Jakub Náplava, Milan Straka, Pavel Straňák, Jan Hajič (2018): Diacritics Restoration Using Neural Networks. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1-10, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Anna Nedoluzhko, Ekaterina Lapshinova-Koltunski (2018): Pronominal Adverbs in German and their Equivalents in English, Czech and Russian: Evidence from the Parallel Corpus. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, vol. 2018, no. 17, pp. 512-521 (bibtex)
Anna Nedoluzhko, Ekaterina Lapshinova-Koltunski (2018): Correlating DRDs with other types of discourse phenomena: Cross-linguistic analysis of the interplay between DRDs, coreference and bridging. In: TextLink – Structuring Discourse in Multilingual Europe – Final Action Conference, pp. 83-89, University of Toulouse, Toulouse, France (pdf, bibtex)
Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk (2018): Analysis of coreferential expressions in PAWS. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, vol. 2018, no. 17, pp. 512-521 (pdf, bibtex)
Anna Nedoluzhko, Michal Novák, Maciej Ogrodniczuk (2018): PAWS: A Multi-lingual Parallel Treebank with Anaphoric Relations. In: Proceedings of the First Workshop on Computational Models of Reference, Anaphora and Coreference, pp. 68-76, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-13-1 (url, bibtex)
Michal Novák (2018): Coreference from the Cross-lingual Perspective. In: , ISBN 978-80-88132-06-6 (bibtex)
Michal Novák (2018): Coreference from the Cross-lingual Perspective (PhD thesis). In: (url, local PDF, bibtex)
Michal Novák (2018): A Study on Bilingually Informed Coreference Resolution. In: Proceedings of the 18th conference ITAT 2018: Slovenskočeský NLP workshop (SloNLP 2018), pp. 130-137, CreateSpace Independent Publishing Platform, Košice, Slovakia, ISBN 978-1727267198 (pdf, bibtex)
Michal Novák (2018): A Fine-grained Large-scale Analysis of Coreference Projection. In: Proceedings of the First Workshop on Computational Models of Reference, Anaphora and Coreference, pp. 77-86, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-13-1 (url, bibtex)
Michal Novák, Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová (2018): Topic–Focus Articulation: A Third Pillar of Automatic Evaluation of Text Coherence. In: Advances in Computational Intelligence (LNAI 11289): 17th Mexican International Conference on Artificial Intelligence, MICAI 2018, Proceedings, Part II, pp. 92-105, Springer, Switzerland, ISBN 978-3-030-04497-8 (url, bibtex)
Alexander Pacha, Jan Hajič, jr., Jorge Calvo-Zaragoza (2018): A Baseline for General Music Object Detection with Deep Learning. In: Applied Sciences, ISSN 2076-3417, vol. 8, no. 9, pp. 1488-1488 (url, bibtex)
Jarmila Panevová (2018): Konkurence předložkových pádů u jádra lokálních určení v češtině. In: Sens i konwencje w języku, pp. 253-264, Wydawnictwo Naukowe UMK, Toruń, Poland, ISBN 978-83-231-4081-8 (local PDF, bibtex)
Jarmila Panevová (2018): Diateze a reciprocita (na materiálu češtiny). In: Славистика, ISSN 1450-5061, vol. 22, no. 1, pp. 124-129 (bibtex)
Jarmila Panevová, Veronika Kolářová (2018): Aktant, nebo volné doplnění? (K netypickým formám ve valenčním poli substantiv). In: Prace Filologiczne, ISSN 0138-0567, 72, pp. 275-284 (bibtex)
Shantipriya Parida, Ondřej Bojar (2018): Translating Short Segments with NMT: A Case Study in English-to-Hindi. In: Proceedings of the 21st Annual Conference of the European Association for Machine Translation (2018), pp. 1-392, European Association for Machine Translation, Allschwil, Switzerland, ISBN 978-84-09-01901-4 (url, local PDF, local PDF, bibtex)
Martin Popel (2018): CUNI Transformer Neural MT System for WMT18. In: Proceedings of the Third Conference on Machine Translation, Volume 2: Shared Tasks, pp. 486-491, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (pdf, bibtex)
Martin Popel (2018): Machine Translation Using Syntactic Analysis (PhD thesis). In: (pdf, bibtex)
Martin Popel, Ondřej Bojar (2018): Training Tips for the Transformer Model. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 110, pp. 43-70 (pdf, bibtex)
Rudolf Rosa (2018): Discovering the structure of natural language sentences by semi-supervised methods (PhD thesis). In: (url, local PDF, local PDF, local PDF, local PDF, bibtex)
Rudolf Rosa, Petra Barančíková (2018): Slovakoczech NLP workshop (SloNLP 2018) (ProceedingsPart). In: Proceedings of the 18th conference ITAT 2018: Slovenskočeský NLP workshop (SloNLP 2018), pp. 125-143, CreateSpace Independent Publishing Platform, Košice, Slovakia, ISBN 978-1727267198 (url)
Rudolf Rosa, David Mareček (2018): CUNI x-ling: Parsing under-resourced languages in CoNLL 2018 UD Shared Task. In: Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 187-196, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-82-7 (pdf, local PDF, local PDF, bibtex)
Kateřina Rysová (2018): Olympiáda v českém jazyce: 44. ročník. In: Český jazyk a literatura, ISSN 0009-0786, vol. 69, no. 2, pp. 53-59 (bibtex)
Kateřina Rysová, Magdaléna Rysová (2018): Discourse Connectives and Reference. In: TextLink – Structuring Discourse in Multilingual Europe – Final Action Conference, pp. 122-128, University of Toulouse, Toulouse, France (pdf, local PDF, bibtex)
Kateřina Rysová, Magdaléna Rysová (2018): The Correlation between Discourse-Anaphoric Devices and an Overall Communicative Competence in Learners’ Essays. In: EDULEARN18 Proceedings, pp. 2144-2154, IATED Academy, Valencia, Spain, ISBN 978-84-09-02709-5 (url, bibtex)
Magdaléna Rysová (2018): Diskurzní konektory v češtině: Od centra k periferii. In: , ISBN 978-80-88132-05-9 (url, bibtex)
Magdaléna Rysová, Lucie Poláková, Jiří Mírovský, Pavlína Synková (2018): Describing CzeDLex – a Lexicon of Czech Discourse Connectives. In: TextLink – Structuring Discourse in Multilingual Europe – Final Action Conference, pp. 129-135, University of Toulouse, Toulouse, France (pdf, local PDF, bibtex)
Magdaléna Rysová, Kateřina Rysová (2018): Primary and secondary discourse connectives: Constraints and preferences. In: Journal of Pragmatics, ISSN 0378-2166, 130, pp. 16-32 (url, local PDF, bibtex)
Magdaléna Rysová, Kateřina Rysová, Jiří Mírovský, Michal Novák (2018): Practicing Students’ Writing Skills through eLearning: Automated Evaluation of Text Coherence in Czech. In: EDULEARN18 Proceedings, pp. 1963-1970, IATED Academy, Valencia, Spain, ISBN 978-84-09-02709-5 (url, bibtex)
Shadi Saleh, Pavel Pecina (2018): CUNI team: CLEF eHealth Consumer Health Search Task 2018. In: Working Notes of CLEF 2018 - Conference and Labs of the Evaluation Forum, pp. 1-11, CEUR-WS, Aachen, Germany (bibtex)
Agata Savary, Marie Candito, Verginica Barbu Mititelu, Eduard Bejček, Fabienne Cap, Slavomír Čéplö, Silvio Ricardo Cordeiro, Gülşen Cebiroğlu Eryiğit, Voula Giouli, Maarten van Gompel, Yaakov Ha-Cohen Kerner, Jolanta Kovalevskaitė, Simon Krek, Chaya Liebeskind, Johanna Monti, Carla Parra Escartín, Lonneke van der Plas, Behrang QasemiZadeh, Carlos Ramisch, Federico Sangati, Ivelina Stoyanova, Veronika Vincze (2018): The PARSEME multilingual corpus of verbal multiword expressions. In: Phraseology and Multiword Expressions, ISSN 2625-3127, vol. 1, no. 3, pp. 87-147 (url, bibtex)
Milan Straka (2018): UDPipe 2.0 Prototype at CoNLL 2018 UD Shared Task. In: Proceedings of CoNLL 2018: The SIGNLL Conference on Computational Natural Language Learning, pp. 197-207, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-72-8 (pdf, local PDF, bibtex)
Milan Straka, Nikita Mediankin, Tom Kocmi, Zdeněk Žabokrtský, Vojtěch Hudeček, Jan Hajič (2018): SumeCzech: Large Czech News-Based Summarization Dataset. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 3488-3495, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Magda Ševčíková (2018): Modelling Morphographemic Alternations in Derivation of Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 110, pp. 7-42 (pdf, bibtex)
Magda Ševčíková, Jarmila Panevová (2018): Derivation of Czech verbs and the category of aspect. In: Linguistica Copernicana, ISSN 2080-1068, vol. 2018, no. 15, pp. 79-93 (bibtex)
Magda Ševčíková, Jarmila Panevová (2018): Hranice flektivní a derivační morfologie: Případ předpony po- u českých sloves. In: Slovo a slovesnost, ISSN 0037-7031, vol. 79, no. 3, pp. 171-198 (bibtex)
Jana Šindlerová, Vladislav Kuboň, Aleš Tamchyna, Kateřina Veselovská (2018): Alternace konstrukcí s aktorem a instrumentem v paralelním česko-anglickém závislostním korpusu. In: Slovo a slovesnost, ISSN 0037-7031, vol. 79, no. 1, pp. 27-46 (local DOC, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2018): Defining Verbal Synonyms: between Syntax and Semantics. In: Proceedings of the 17th International Workshop on Treebanks and Linguistic Theories (TLT 2018), pp. 75-90, Linköping University Electronic Press, Linköping, Sweden, ISBN 978-91-7685-137-1 (pdf, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2018): Synonymy in Bilingual Context: The CzEngClass Lexicon. In: Proceedings of The 27th International Conference on Computational Linguistics, pp. 2456-2469, ICCL, Sheffield, GB, ISBN 978-4-87974-703-7 (url, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2018): Creating a Verb Synonym Lexicon Based on a Parallel Corpus. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 1432-1437, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (pdf, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2018): Tools for Building an Interlinked Multilingual Synonym Lexicon Network. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 850-856, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (url, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2018): A Cross-Lingual Synonym Classes Lexicon. In: Prace Filologiczne, ISSN 0138-0567, LXXII, pp. 405-418 (local PDF, bibtex)
Dušan Variš, Natalia Klyueva (2018): Improving a Neural-based Tagger for Multiword Expression Identification. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), pp. 2526-2532, European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9 (local PDF, local PDF, bibtex)
Jonáš Vidra (2018): Morphological segmentation of Czech Words (master's thesis). In: (url, local PDF, bibtex)
Daniel Zeman (2018): The World of Tokens, Tags and Trees. In: , ISBN 978-80-88132-09-7 (pdf, local PDF, bibtex)
Daniel Zeman, Jan Hajič, Martin Popel, Martin Potthast, Milan Straka, Filip Ginter, Joakim Nivre, Slav Petrov (2018): CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. In: Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 1-21, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-82-7 (pdf, local PDF, bibtex)
Mostafa Abdou, Vladan Glončák, Ondřej Bojar (2017): Variable Mini-Batch Sizing and Pre-Trained Embeddings. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 680-686, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (pdf, bibtex)
Ahmad Aghaebrahimian (2017): Constrained Deep Answer Sentence Selection. In: 20th International Conference, TSD 2017 Prague, Czech Republic, August 27–31, 2017 Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 10415, pp. 57-65, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-64205-5 (bibtex)
Ahmad Aghaebrahimian (2017): Quora Question Answer Dataset. In: 20th International Conference, TSD 2017 Prague, Czech Republic, August 27–31, 2017 Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 10415, pp. 66-73, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-64205-5 (bibtex)
Ahmad Aghaebrahimian (2017): Hybrid Deep Open-Domain Question Answering. In: Proceedings of 8th Language and Technology Conference, pp. 163-167, Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu, Poznań, Poland, ISBN 978-83-64864-94-0 (bibtex)
Petra Barančíková, Václava Kettnerová (2017): ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs. In: Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pp. 1-10, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-945626-48-7 (url, local PDF, local PDF, bibtex)
Eduard Bejček, Jan Hajič, Pavel Straňák, Zdeňka Urešová (2017): Extracting Verbal Multiword Data from Rich Treebank Annotation. In: Proceedings of the 15th International Workshop on Treebanks and Linguistic Theories (TLT 15), pp. 13-24, Indiana University, Bloomington, Bloomington, IN, USA (pdf, local PDF, local PDF, bibtex)
Eduard Bejček, Eva Hajičová, Marie Mikulová, Jarmila Panevová (2017): The Relation of Form and Function in Linguistic Theory and in a Multi-layer Treebank. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, pp. 56-63, Univerzita Karlova, Praha, Czechia, ISBN 978-80-88132-04-2 (pdf, local PDF, bibtex)
Ondřej Bojar, Yvette Graham, Amir Kamran (2017): Results of the WMT17 Metrics Shared Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 489-513, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (pdf, bibtex)
Ondřej Bojar, Jindřich Helcl, Tom Kocmi, Jindřich Libovický, Tomáš Musil (2017): Results of the WMT17 Neural MT Training Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 525-533, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (url, local PDF, bibtex)
Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Shujian Huang, Matthias Huck, Philipp Koehn, Qun Liu, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Raphael Rubino, Lucia Specia, Marco Turchi (2017): Findings of the 2017 Conference on Machine Translation (WMT17). In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 169-214, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (pdf, bibtex)
Ondřej Bojar, Tom Kocmi, David Mareček, Roman Sudarikov, Dušan Variš (2017): CUNI Submission in WMT17: Chimera Goes Neural. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 248-256, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (url, bibtex)
Silvie Cinková, Zdeněk Hlávka (2017): Modeling Semantic Distance in the Pattern Dictionary of English Verbs. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 68, no. 2, pp. 122-135 (url, local PDF, bibtex)
Silvie Cinková, Anna Vernerová (2017): Are Annotators' Word-Sense Disambiguation Decisions Affected by Textual Entailment between Lexicon Glosses?. In: Proceedings of the 17th Conference on Information Technologies - Applications and Theory (ITAT 2017), pp. 5-14, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (pdf, local PDF, bibtex)
Matthias Dorfer, Jan Hajič, jr., Gerhard Widmer (2017): On the Potential of Fully Convolutional Neural Networks for Musical Symbol Detection. In: Proceedings of the 12th IAPR International Workshop on Graphics Recognition, pp. 53-54, IEEE Computer Society, New York, USA, ISBN 978-1-5386-3586-5 (url, bibtex)
Kira Droganova, Daniel Zeman (2017): Elliptic Constructions: Spotting Patterns in UD Treebanks. In: NoDaLiDa 2017 Workshop on Universal Dependencies, pp. 48-57, Göteborgs universitet, Göteborg, Sweden, ISBN 978-91-7685-501-0 (pdf, local PDF, bibtex)
Petra Galuščáková, Michal Batko, Jan Čech, Jiří Matas, David Novák, Pavel Pecina (2017): Visual Descriptors in Methods for Video Hyperlinking. In: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, pp. 294-300, ACM, New York, NY, USA, ISBN 978-1-4503-4701-3 (url, bibtex)
Jan Hajič, Eva Hajičová, Marie Mikulová, Jiří Mírovský (2017): Prague Dependency Treebank. In: Handbook on Linguistic Annotation, pp. 555-594, Springer Verlag, Berlin, Germany, ISBN 978-94-024-0879-9 (url, bibtex)
Jan Hajič, jr., Pavel Pecina (2017): The MUSCIMA++ Dataset for Handwritten Optical Music Recognition. In: 14th International Conference on Document Analysis and Recognition, ICDAR 2017, Kyoto, Japan, November 13 - 15, 2017, pp. 39-46, IEEE Computer Society, New York, USA, ISBN 978-1-5386-3586-5 (url, bibtex)
Jan Hajič, jr., Pavel Pecina (2017): Groundtruthing (not only) Music Notation with MUSCIMarker: a Practical Overview. In: Proceedings of the 12th IAPR International Workshop on Graphics Recognition, pp. 47-48, IEEE Computer Society, New York, USA, ISBN 978-1-5386-3586-5 (url, bibtex)
Jan Hajič, jr., Pavel Pecina (2017): How to Exploit Music Notation Syntax for OMR?. In: Proceedings of the 12th IAPR International Workshop on Graphics Recognition, pp. 55-56, IEEE Computer Society, New York, USA, ISBN 978-1-5386-3586-5 (url, bibtex)
Eva Hajičová (2017): Syntax-Semantics Interface. In: , ISBN 978-80-246-3714-3 (url, bibtex)
Eva Hajičová (2017): A Glimpse Under the Surface: Language Understanding May Need Deep Syntactic Structure. In: 20th International Conference, TSD 2017 Prague, Czech Republic, August 27–31, 2017 Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 10415, pp. 3-7, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-64205-5 (url, bibtex)
Eva Hajičová (2017): Theme. In: Oxford Research Encyclopedias: Linguistics, pp. 1-10, Oxford University Press, Oxford, United Kingdom, ISBN 9780199384655 (url, bibtex)
Jindřich Helcl, Jindřich Libovický (2017): CUNI System for the WMT17 Multimodal Translation Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 450-457, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (url, local PDF, bibtex)
Jindřich Helcl, Jindřich Libovický (2017): Neural Monkey: An Open-source Tool for Sequence Learning. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 107, pp. 5-17 (pdf, local PDF, bibtex)
Barbora Hladká, Jiří Hana (2017): Parsing Writings of Non-Native Czech. In: Proceedings of the 4th Workshop on NLP Techniques for Educational Applications, pp. 12-16, Asian Federation of Natural Language Processing, Taipei, Taiwan, ISBN 978-1-948087-08-7 (bibtex)
Jaroslava Hlaváčová (2017): Golden Rule of Morphology and Variants of Wordforms. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 68, no. 2, pp. 136-144 (url, local PDF, bibtex)
Matthias Huck, Aleš Tamchyna, Ondřej Bojar, Alexander Fraser (2017): Producing Unseen Morphological Variants in Statistical Machine Translation. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers, pp. 369-375, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-35-7 (url, bibtex)
Pavel Ircing, Jan Švec, Zbyněk Zajíc, Barbora Hladká, Martin Holub (2017): Combining Textual and Speech Features in the NLI Task Using State-of-the-Art Machine Learning Techniques. In: The 12th Workshop on Innovative Use of NLP for Building Educational Applications, pp. 198-209, The Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-00-5 (pdf, bibtex)
Václava Kettnerová (2017): Syntaktická struktura komplexních predikátů v češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 78, no. 1, pp. 3-24 (local PDF, bibtex)
Václava Kettnerová, Veronika Kolářová, Anna Vernerová (2017): Deverbal Nouns in Czech Light Verb Constructions. In: Computational and Corpus-Based Phraseology. Second International Conference, Europhras 2017. London, UK, November 13–14, 2017., Lecture Notes in Computer Science, ISSN 0302-9743, 10596, pp. 205-219, Springer, Cham, Switzerland, ISBN 978-3-319-69804-5 (local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková (2017): Ke koreferenci u komplexních predikátů s kategoriálním slovesem. In: Korpus – gramatika – axiologie, ISSN 1804-137X, 16, pp. 3-26 (local PDF, local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková (2017): Complex Predicates with Light Verbs in VALLEX: From Formal Model to Lexicographic Description. In: Proceedings of the 17th Conference on Information Technologies - Applications and Theory (ITAT 2017), pp. 15-22, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (local PDF, bibtex)
Natalia Klyueva, Antoine Doucet, Milan Straka (2017): Neural Networks for Multi-Word Expression Detection. In: Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pp. 60-65, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-945626-48-7 (pdf, local PDF, bibtex)
Natalia Klyueva, Anna Vernerová, Behrang QasemiZadeh (2017): Querying Multiword Expressions Annotation with NoSke. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, pp. 73-79, Univerzita Karlova, Praha, Czechia, ISBN 978-80-88132-04-2 (url, local PDF, local PDF, local PDF, bibtex)
Tom Kocmi, Ondřej Bojar (2017): An Exploration of Word Embedding Initialization in Deep-Learning Tasks. In: Proceedings of the 14th International Conference on Natural Language Processing, pp. 56-64, NLP Association of India, Kolkata, India (bibtex)
Tom Kocmi, Ondřej Bojar (2017): Curriculum Learning and Minibatch Bucketing in Neural Machine Translation. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, pp. 379-386, INCOMA Ltd., Šumen, Bulgaria, ISBN 978-954-452-048-9 (url, bibtex)
Tom Kocmi, Ondřej Bojar (2017): LanideNN: Multilingual Language Identification on Character Window. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers, pp. 927-936, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-35-7 (url, bibtex)
Tom Kocmi, Dušan Variš, Ondřej Bojar (2017): CUNI NMT System for WAT 2017 Translation Tasks. In: Proceedings of the 4th Workshop on Asian Translation (WAT2017), pp. 154-159, Asian Federation of Natural Language Processing, Taipei, Taiwan, ISBN 978-1-948087-06-3 (bibtex)
Veronika Kolářová (2017): Valence českých deverbativních substantiv reprezentujících vybrané sémantické třídy. In: Prace Filologiczne, ISSN 0138-0567, 70, pp. 287-303 (bibtex)
Veronika Kolářová, Jan Kolář, Marie Mikulová (2017): Difference between Written and Spoken Czech: The Case of Verbal Nouns Denoting an Action. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 107, pp. 19-38 (pdf, local PDF, bibtex)
Veronika Kolářová, Anna Vernerová, Jana Klímová, Jan Kolář (2017): Possible but not probable: A quantitative analysis of valency behaviour of Czech nouns in the Prague Dependency Treebank. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 68, no. 2, pp. 208-218 (url, local PDF, bibtex)
Jindřich Libovický, Jindřich Helcl (2017): Attention Strategies for Multi-Source Sequence-to-Sequence Learning. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 196-202, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-76-0 (url, local PDF, local PDF, bibtex)
David Mareček, Ondřej Bojar, Ondřej Hübsch, Rudolf Rosa, Dušan Variš (2017): CUNI Experiments for WMT17 Metrics Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 604-611, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (url, bibtex)
Antonio Valerio Miceli Barone, Jindřich Helcl, Rico Sennrich, Barry Haddow, Alexandra Birch (2017): Deep Architectures for Neural Machine Translation. In: Proceedings of the Second Conference on Machine Translation, Volume 1: Research Papers, pp. 99-107, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (url, bibtex)
Marie Mikulová, Eduard Bejček, Veronika Kolářová, Jarmila Panevová (2017): Subcategorization of Adverbial Meanings Based On Corpus Data. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 68, no. 2, pp. 268-277 (pdf, local PDF, bibtex)
Marie Mikulová, Jiří Mírovský, Anna Nedoluzhko, Petr Pajas, Jan Štěpánek, Jan Hajič (2017): PDTSC 2.0 - Spoken Corpus with Rich Multi-layer Structural Annotation. In: 20th International Conference, TSD 2017 Prague, Czech Republic, August 27–31, 2017 Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 10415, pp. 129-137, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-64205-5 (bibtex)
Jiří Mírovský (2017): Petr Pořízka: Tvorba korpusů a vytěžování jazykových dat: metody, modely, nástroje. (review). In: Slovo a slovesnost, ISSN 0037-7031, vol. 78, no. 4, pp. 349-352 (url, bibtex)
Jiří Mírovský, Pavlína Synková, Magdaléna Rysová, Lucie Poláková (2017): CzeDLex – A Lexicon of Czech Discourse Connectives. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 109, pp. 61-91 (url, local PDF, bibtex)
Jakub Mlynář (2017): Analysing Oral Histories: Social Roles and Narrative Self-Regulation in Holocaust Survivors’ Testimonies (Electronic). (url)
Jakub Mlynář, Jiří Kocián, Karolína Bukovská, Lenka Chudomelová (2017): Sociologie (a) orální historie: Archiv vizuální historie USC Shoah Foundation dostupný v Centru vizuální historie Malach při Univerzitě Karlově. In: Naše společnost, ISSN 1214-438X, vol. 15, no. 2, pp. 52-55 (bibtex)
Joakim Nivre, Daniel Zeman, Filip Ginter, Francis Tyers (2017): EACL tutorial on Universal Dependencies (LectureNotes). (url, local PDF, local PDF)
Michal Novák (2017): Coreference Resolution System Not Only for Czech. In: Proceedings of the 17th conference ITAT 2017: Slovenskočeský NLP workshop (SloNLP 2017), pp. 193-200, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (pdf, bibtex)
Michal Novák, Anna Nedoluzhko, Zdeněk Žabokrtský (2017): Projection-based Coreference Resolution Using Deep Syntax. In: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), pp. 56-64, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-945626-46-3 (pdf, bibtex)
Michal Novák, Kateřina Rysová, Magdaléna Rysová, Jiří Mírovský (2017): Incorporating Coreference to Automatic Evaluation of Coherence in Essays. In: Statistical Language and Speech Processing, pp. 58-69, Springer International Publishing, Cham, Switzerland, ISBN 978-3-319-68455-0 (pdf, local PDF, bibtex)
Christophe Onambélé, Matyáš Kopp, Marco Passarotti, Jiří Mírovský (2017): Converting Latin Treebank Data into an SQL Database for Query Purposes. In: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, pp. 117-122, ACM, New York, NY, USA, ISBN 978-1-4503-5265-9 (bibtex)
Klára Osolsobě, Jaroslava Hlaváčová, Vladimír Petkevič, Martin Svášek, Josef Šimandl (2017): Nová automatická morfologická analýza češtiny. In: Naše řeč, ISSN 0027-8203, vol. 100, no. 4/2017, pp. 225-234 (bibtex)
Jarmila Panevová (2017): Od valence slovesa k valenci substantiv a adjektiv. In: Prace Filologiczne, ISSN 0138-0567, 70, pp. 59-71 (local PDF, bibtex)
Jarmila Panevová (2017): Grammaticalization and Lexicalization in the Slavic Languages (review). In: Slavia, ISSN 0037-6736, vol. 86, no. 2-3, pp. 296-300 (bibtex)
Jarmila Panevová (2017): K marginálním infinitivním konstrukcím (zejména v češtině). In: Slavia, ISSN 0037-6736, vol. 86, no. 2-3, pp. 219-229 (local PDF, bibtex)
Jan-Thorsten Peter, Hermann Ney, Ondřej Bojar, Ngoc-Quan Pham, Jan Niehues, Alex Waibel, Franck Burlot, François Yvon, Marcis Pinnis, Valters Sics, Joost Bastings, Miguel Rios, Wilker Aziz, Phil Williams, Frédéric Blain, Lucia Specia (2017): The QT21 Combined Machine Translation System for English to Latvian. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 348-357, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (pdf, bibtex)
Bedřich Pišl, David Mareček (2017): Communication with Robots using Multilayer Recurrent Networks. In: Proceedings of the First Workshop on Language Grounding for Robotics, pp. 44-48, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-64-7 (pdf, bibtex)
Petr Plecháč, Jaroslava Hlaváčová, Kristýna Merthová, Robert Kolár (2017): Distribuce předpon v českém sylabotónickém trocheji. In: Slovo a slovesnost, ISSN 0037-7031, 78, pp. 322-332 (local PDF, bibtex)
Patrice Pognan (2017): Système linguistique et calculabilité des langues slaves de l'Ouest (Nord et Sud): approches d'une complexité comparée. In: Études de linguistique appliquée. Revue de didactologie des langues-cultures et de lexiculturologie, ISSN 0071-190X, vol. 56, no. 185, pp. 35-50 (bibtex)
Lucie Poláková, Jiří Mírovský, Pavlína Synková (2017): Signalling Implicit Relations: A PDTB - RST Comparison. In: Dialogue and Discourse, ISSN 2152-9620, vol. 8, no. 2/2017, pp. 225-248 (url, bibtex)
Martin Popel, Zdeněk Žabokrtský, Martin Vojtek (2017): Udapi: Universal API for Universal Dependencies. In: NoDaLiDa 2017 Workshop on Universal Dependencies, pp. 96-101, Göteborgs universitet, Göteborg, Sweden, ISBN 978-91-7685-501-0 (pdf, bibtex)
Adam Przepiórkowski, Jan Hajič, Elżbieta Hajnicz, Zdeňka Urešová (2017): Phraseology in two Slavic valency dictionaries: limitations and perspectives. In: International Journal of Lexicography, ISSN 0950-3846, vol. 30, no. 1, pp. 1-38 (bibtex)
Vinit Ravishankar (2017): A Universal Dependencies Treebank for Marathi. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, pp. 190-200, Univerzita Karlova, Praha, Czechia, ISBN 978-80-88132-04-2 (pdf, bibtex)
Matiss Rikters, Ondřej Bojar (2017): Paying Attention to Multi-Word Expressions in Neural Machine Translation. In: Proceedings of MT Summit XVI, vol. 1: Research Track, pp. 86-95, IAMT, Nagoya, Japan (url, bibtex)
Matiss Rikters, Mark Fishel, Ondřej Bojar (2017): Visualizing Neural Machine Translation Attention and Confidence. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 109, pp. 39-50 (pdf, bibtex)
Rudolf Rosa (2017): MonoTrans: Statistical Machine Translation from Monolingual Data. In: Proceedings of the 17th conference ITAT 2017: Slovenskočeský NLP workshop (SloNLP 2017), pp. 201-208, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (pdf, local PDF, local PDF, local PDF, bibtex)
Rudolf Rosa, Petra Barančíková (2017): Slovakoczech NLP workshop (SloNLP 2017) (ProceedingsPart). In: Proceedings of the 17th Conference on Information Technologies - Applications and Theory (ITAT 2017), pp. 175-208, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (url)
Rudolf Rosa, Daniel Zeman, David Mareček, Zdeněk Žabokrtský (2017): Slavic Forest, Norwegian Wood. In: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial4), pp. 210-219, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-43-2 (pdf, local PDF, local PDF, bibtex)
Rudolf Rosa, Zdeněk Žabokrtský (2017): Error Analysis of Cross-lingual Tagging and Parsing. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, pp. 106-118, Univerzita Karlova, Praha, Czechia, ISBN 978-80-88132-04-2 (pdf, local PDF, local PDF, bibtex)
Kateřina Rysová (2017): Possibilities of Text Coherence Analysis in the Prague Dependency Treebank. In: New perspectives on cohesion and coherence: Implications for translation, pp. 35-48, Language Science Press, Berlin, Germany, ISBN 978-3-946234-72-2 (url, bibtex)
Kateřina Rysová, Karel Oliva (2017): Proběhl 43. ročník Olympiády v českém jazyce. In: Český jazyk a literatura, ISSN 0009-0786, vol. 68, no. 2, pp. 53-57 (pdf, bibtex)
Kateřina Rysová, Magdaléna Rysová, Jiří Mírovský, Michal Novák (2017): Introducing EVALD – Software Applications for Automatic Evaluation of Discourse in Czech. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, pp. 634-641, INCOMA Ltd., Šumen, Bulgaria, ISBN 978-954-452-048-9 (pdf, bibtex)
Magdaléna Rysová (2017): Discourse Connectives: From Historical Origin to Present-Day Development. In: New perspectives on cohesion and coherence: Implications for translation, pp. 11-34, Language Science Press, Berlin, Germany, ISBN 978-3-946234-72-2 (url, bibtex)
Shadi Saleh, Pavel Pecina (2017): Task3 Patient-Centred Information Retrieval: Team CUNI. In: CLEF 2017 - 8th Conference and Labs of the Evaluation Forum, Lecture Notes in Computer Science, ISSN 0302-9743, pp. 1-7, Springer, Berlin, Germany (bibtex)
Alexey Sorokin, Tatiana Shavrina, Olga Lyashevskaya, Victor Bocharov, Svetlana Alexeeva, Kira Droganova, Alena Fenogenova, Dmitry Granovsky (2017): MorphoRuEval-2017: an evaluation track for the automatic morphological analysis methods for Russian. In: Computational Linguistics and Intellectual Technologies, pp. 311-328, nakl. RGGU, Moscow, Russia (pdf, bibtex)
Milan Straka, Jana Straková (2017): Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe. In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 88-99, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-70-8 (pdf, local PDF, bibtex)
Milan Straka, Jana Straková, Jan Hajič (2017): Prague at EPE 2017: The UDPipe System. In: Proceedings of the 2017 Shared Task on Extrinsic Parser Evaluation at the Fourth International Conference on Dependency Linguistics and the 15th International Conference on Parsing Technologies, pp. 65-74, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-945626-74-6 (pdf, local PDF, bibtex)
Jana Straková (2017): Neural Network Based Named Entity Recognition (PhD thesis). In: (pdf, local PDF, local PDF, bibtex)
Jana Straková, Milan Straka, Magda Ševčíková, Zdeněk Žabokrtský (2017): Czech Named Entity Corpus. In: Handbook of Linguistic Annotation, pp. 855-873, Springer Netherlands, Netherlands, ISBN 978-94-024-0879-9 (bibtex)
Pavlína Synková, Magdaléna Rysová, Lucie Poláková, Jiří Mírovský (2017): Extracting a Lexicon of Discourse Connectives in Czech from an Annotated Corpus. In: Proceedings of the 31st Pacific Asia Conference on Language, Information and Computation, pp. 232-240, National University of Philippines, Cebu, Philippines (url, local PDF, bibtex)
Magda Ševčíková, Adéla Kalužová, Zdeněk Žabokrtský (2017): Identification of aspectual pairs of verbs derived by suffixation in the lexical database DeriNet. In: Proceedings of the Workshop on Resources and Tools for Derivational Morphology (DeriMo), pp. 105-116, EDUCatt, Milano, Italy, ISBN 978-88-9335-225-3 (pdf, bibtex)
Josef Šimandl, Jana Klímová, Klára Osolsobě, François Esvan, František Štícha, Renata Novotná (2017): Slovník afixů užívaných v češtině. In: , ISBN 978-80-246-3544-6 (url, bibtex)
Jana Šindlerová, Barbora Štěpánková (2017): Lingvistické opodstatnění konstruktivistické didaktiky při výuce slovních druhů. In: Didaktické studie, ISSN 1804-1221, vol. 9, no. 2, pp. 102-113 (bibtex)
Jana Šindlerová, Aleš Tamchyna (2017): Emotions Translated: Enhancing a Subjectivity Lexicon Using a Parallel Valency Lexicon. In: Language use and linguistic structure. Proceedings of the Olomouc Linguistics Colloquium 2016, pp. 317-330, Palacký University, Olomouc, Czechia, ISBN 978-80-244-5173-2 (pdf, local DOCX, bibtex)
Dima Taji, Nizar Habash, Daniel Zeman (2017): Universal Dependencies for Arabic. In: Proceedings of the Third Arabic Natural Language Processing Workshop (WANLP), pp. 166-176, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-44-9 (pdf, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová (2017): CzEngClass – Towards a Lexicon of verb Synonyms with Valency linked to Semantic Roles. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 68, no. 2, pp. 364-371 (url, bibtex)
Zdeňka Urešová, Eva Fučíková, Eva Hajičová, Jan Hajič (2017): Syntactic-Semantic Classes of Context-Sensitive Synonyms Based on a Bilingual Corpus. In: Proceedings of 8th Language and Technology Conference, pp. 201-205, Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu, Poznań, Poland, ISBN 978-83-64864-94-0 (pdf, local PDF, bibtex)
Dušan Variš, Ondřej Bojar (2017): CUNI System for WMT17 Automatic Post-Editing Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 661-666, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (pdf, bibtex)
Kateřina Veselovská (2017): Sentiment Analysis in Czech. In: , ISBN 978-80-88132-03-5 (bibtex)
Jernej Vičič, Vladislav Kuboň, Petr Homola (2017): Česílko Goes Open-source. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 107, pp. 57-66 (pdf, bibtex)
Jonáš Vidra, Zdeněk Žabokrtský (2017): Online Software Components for Accessing Derivational Networks. In: Proceedings of the Workshop on Resources and Tools for Derivational Morphology (DeriMo), pp. 129-139, EDUCatt, Milano, Italy, ISBN 978-88-9335-225-3 (pdf, bibtex)
Miroslav Vodolán, Filip Jurčíček (2017): Denotation Extraction for Interactive Learning in Dialogue Systems. In: IEEE ASRU '17: Proc. IEEE Automatic Speech Recognition and Understanding, pp. 490-496, IEEE, Phoenix, AZ, USA, ISBN 978-1-5090-4787-1 (bibtex)
Miroslav Vodolán, Rudolf Kadlec, Jan Kleindienst (2017): Hybrid Dialog State Tracker with ASR Features. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, volume 2: Short Papers, pp. 205-210, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-35-7 (bibtex)
Antonio Jimeno Yepes, Aurelie Névéol, Mariana Neves, Karin Verspoor, Ondřej Bojar, Arthur Boyer, Cristian Grozea, Barry Haddow, Madeleine Kittner, Yvonne Lichtblau, Pavel Pecina, Roland Roller, Rudolf Rosa, Amy Siu, Philippe Thomas, Saskia Trescher (2017): Findings of the WMT 2017 Biomedical Translation Shared Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 234-247, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (pdf, bibtex)
Daniel Zeman (2017): Slovak Dependency Treebank in Universal Dependencies. In: Jazykovedný časopis / Journal of Linguistics, ISSN 0021-5597, vol. 68, no. 2, pp. 385-395 (url, local ODT, local PDF, local PDF, bibtex)
Daniel Zeman (2017): Core Arguments in Universal Dependencies. In: Proceedings of the Fourth International Conference on Dependency Linguistics (Depling 2017), September 18-20, 2017, Università di Pisa, Italy, pp. 287-296, Linköping University Electronic Press, Linköping, Sweden, ISBN 978-91-7685-467-9 (url, local PDF, bibtex)
Daniel Zeman, Martin Popel, Milan Straka, Jan Hajič, Joakim Nivre, Filip Ginter, Juhani Luotolahti, Sampo Pyysalo, Slav Petrov, Martin Potthast, Francis Tyers, Elena Badmaeva, Memduh Gökırmak, Anna Nedoluzhko, Silvie Cinková, Jan Hajič, jr., Jaroslava Hlaváčová, Václava Kettnerová, Zdeňka Urešová, Jenna Kanerva, Stina Ojala, Anna Missilä, Christopher Manning, Sebastian Schuster, Siva Reddy, Dima Taji, Nizar Habash, Herman Leung, Marie-Catherine de Marneffe, Manuela Sanguinetti, Maria Simi, Hiroshi Kanayama, Valeria de Paiva, Kira Droganova, Héctor Martínez Alonso, Çağrı Çöltekin, Umut Sulubacak, Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Georg Rehm, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Jesse Kirchner, Hector Fernandez Alcalde, Jana Strnadová, Esha Banerjee, Ruli Manurung, Antonio Stella, Atsuko Shimada, Sookyoung Kwak, Gustavo Mendonça, Tatiana Lando, Rattima Nitisaroj, Josie Li (2017): CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 1-19, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-70-8 (pdf, local PDF, bibtex)
Šárka Zikánová, Eva Hajičová (2017): Pražský workshop Discourse Relations in Multilingual Context (Kontakt II). In: Naše řeč, ISSN 0027-8203, vol. 100, no. 2, pp. 102-106 (bibtex)
Amal Abdelsalam, Ondřej Bojar (2016): Bilingual Embeddings and Word Alignments for Translation Quality Estimation. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 764-771, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Ahmad Aghaebrahimian, Filip Jurčíček (2016): Constraint-Based Open Question Answering via Knowledge Graph Search. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 28-36, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (pdf, bibtex)
Ahmad Aghaebrahimian, Filip Jurčíček (2016): Open-domain Factoid Question Answering via Knowledge Graph Search. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human-Computer Question Answering Workshop, pp. 22-28, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-96-9 (url, bibtex)
Gabriel Altman, Jan Andres, Johan van der Auwera, Jarmila Bachmannová, Jan Balhar, Aleš Bičan, Lenka Bičanová, Jana Bílková, Petr Biskup, Ondřej Bláha, Izabela Blaszczyk, Ondřej Bojar, Tomáš Bořil, Máša Bořkovcová, Ivana Bozděchová, Pavel Caha, Václav Cvrček, Radek Čech, Marie Čechová, František Čermák, David S. Danaher, František Daneš, Jaroslav David, Mojmír Dočekal, Jakub Dotlačil, Vít Dovalil, Věra Dvořák, Eva Eckertová, Viktor Elšík, Joseph Emonds, Adolf Erhart, François Esvan, Dan Faltýnek, Masako Fidler, Alena Andrlová Fidlerová, Zbyněk Fišer, Eva Flanderková, Mirjam Fried, Markus Giger, Miroslav Grepl, Jan Hajič, Eva Hajičová, Ernst Hansack, Björn Hansen, Radoslav Harman, Milan Harvalík, Martin Havlík, Eva Havlová, Elke Hentschel, Milada Hirschová, Zdeňka Hladká, Jana Hoffmannová, Jiří Homoláč, Milada Homolková, Tomáš Hoskovec, Jan Hric, Jaroslav Hubáček, Jan Chloupek, Leonid L. Iomdin, Pavel Ircing, Laura Janda, Ilona Janyšková, Milan Jelínek, Tomáš Jelínek, Lucie Jílková, Filip Jurčíček, Michal Jurka, Petr Karlík, Petr Karlík mladší, Helena Karlíková, Stanislava Kloferová, Martina Kloudová, Miroslava Knappová, Robert Kolár, Ivana Kolářová, Marie Kopřivová, Jan Kořenský, Pavel Kosek, Peter Kosta, Michaela Koščová, Jiří Koten, Ondřej Koupil, Michal Kovář, Michala Králíková, Marie Krappmann, Jiří Kraus, Marie Krčmová, Susan Kresin, Michal Křen, Michal Křístek, Pavel Kubaník, Miroslav Kubát, Tomáš Kubík, Vladislav Kuboň, Ivona Kučerová, Natalia Levshina, Alena Macurová, Ján Mačutek, Jarosław Malicki, Petr Mareš, Olga Martincová, Jiří Marvan, Jindřich Matoušek, Barbara Mertins, Roland Meyer, Krzysztof Migdalski, Eva Minářová, Kamila Mrázková, Iveta Mrázová, Richard Müller, Olga Müllerová, Mira Nábělková, Olga Navrátilová, Iva Nebeská, Anna Nedoluzhko, Marek Nekula, Zuzana Nevěřilová, Stefan Michael Newerkla, Mark Newson, Pavel Novák, Renata Novotná, Norbert Nübler, Radek Ocelák, Karel Oliva, Ivo Osolsobě, Klára Osolsobě, Ludmila Pacnerová, Karel Pala, Zdena Palková, Jarmila Panevová, Pavel Pecina, Jaroslav Peregrin, Anna Maria Perissutti, Ondřej Pešek, Vladimír Petkevič, Petr Plecháč, Jana Pleskalová, Jan Radimský, Paul Rastall, Alexandr Rosen, Zdenka Rusínová, Lucie Saicová Římalová, Tamah Sherman, Tobias Scheer, Boris Skalka, Radek Skarnitzl, Marián Sloboda, Olga Stehlíková, Hana Strachoňová, Jana Straková, Roman Sukač, Zbyněk Sviták, Aleš Svoboda, Josef Syka, Ondřej Šefčík, Radek Šimík, Hana Gruet Škrabalová, Dušan Šlosar, Rudolf Šrámek, Jan Štěpán, František Štícha, Michaela Tabakovičová, Knut Tarald Taraldsen, Lucie Taraldsen Medová, Jiří Trávníček, Vladimír Trpka, Jana Marie Tušková, Ludmila Uhlířová, Lenka Uličná, Oldřich Uličný, Jana Valdrová, Irena Vaňková, Ivo Vasiljev, Radoslav Večerka, Jarmil Vepřek, Ljuba Veselinova, Kateřina Veselovská, Ludmila Veselovská, Jan Volín, Taťána Vykypělová, Roland Wagner, James Wilson, Uliana Yazhinova, Daniel Zeman, Jiří Zeman, Šárka Zikánová, Markéta Ziková, Petr Zima, Ilse Zimmermann, Zdeněk Žabokrtský, Stanislav Žaža (2016): Nový encyklopedický slovník češtiny. In: , ISBN 978-80-7422-480-5 (url, bibtex)
Nora Aranberri, Eleftherios Avramidis, Aljoscha Burchardt, Ondřej Klejch, Martin Popel, Maja Popović (2016): Tools and Guidelines for Principled Machine Translation Development. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 1877-1882, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (pdf, local PDF, bibtex)
Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan, Josef Genabith (2016): Arabic Spelling Error Detection and Correction. In: Natural Language Engineering, ISSN 1351-3249, vol. 22, no. 5, pp. 751-773 (bibtex)
Eleftherios Avramidis, Vivien Macketanz, Aljoscha Burchardt, Jindřich Helcl, Hans Uszkoreit (2016): Deeper Machine Translation and Evaluation for German. In: Proceedings of the 2nd Deep Machine Translation Workshop, pp. 29-38, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-02-8 (bibtex)
Vít Baisa, Silvie Cinková, Ema Krejčová, Anna Vernerová (2016): VPS-GradeUp: Graded Decisions on Usage Patterns. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 823-827, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
Roman Barták, Vladislav Kuboň (2016): Using a Grammar Checker to Validate Compliance of Processes with Workflow Models. In: Proceedings of the 15th Mexican International Conference on Artificial Intelligence, pp. 1-16, Springer, Heidelberg (bibtex)
Roman Barták, Vladislav Kuboň (2016): On Similarities Between Workflow Verification and Grammar Checking. In: Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, pp. 585-590, AAAI Press, Palo Alto, CA, USA, ISBN 978-1-57735-756-8 (bibtex)
Ondřej Bojar, Ondřej Cífka, Jindřich Helcl, Tom Kocmi, Roman Sudarikov (2016): UFAL Submissions to the IWSLT 2016 MT Track. In: Proceedings of the ninth International Workshop on Spoken Language Translation (IWSLT), pp. 1-8, Karlsruhe Institute of Technology (pdf, bibtex)
Ondřej Bojar, Filip Děchtěrenko, Maria Zelenina (2016): A Pilot Eye-Tracking Study of WMT-Style Ranking Evaluation. In: Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem, pp. 20-26, LREC, Portorož, Slovenia (bibtex)
Ondřej Bojar, Ondřej Dušek, Tom Kocmi, Jindřich Libovický, Michal Novák, Martin Popel, Roman Sudarikov, Dušan Variš (2016): CzEng 1.6: Enlarged Czech-English Parallel Corpus with Processing Tools Dockered. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 231-238, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (url, bibtex)
Ondřej Bojar, Christian Federmann, Barry Haddow, Philipp Koehn, Matt Post, Lucia Specia (2016): Ten Years of WMT Evaluation Campaigns: Lessons Learnt. In: Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem, pp. 27-34, LREC, Portorož, Slovenia (bibtex)
Ondřej Bojar, Yvette Graham, Amir Kamran, Miloš Stanojević (2016): Results of the WMT16 Metrics Shared Task. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 199-231, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurelie Névéol, Mariana Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri (2016): Findings of the 2016 Conference on Machine Translation (WMT16). In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 131-198, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Fabienne Braune, Alexander Fraser, Hal Daumé III, Aleš Tamchyna (2016): A Framework for Discriminative Rule Selection in Hierarchical Moses. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 92-101, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (bibtex)
Silvie Cinková (2016): WordSim353 for Czech. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 190-197, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (url, local PDF, local PDF, bibtex)
Silvie Cinková, Ema Krejčová, Anna Vernerová, Vít Baisa (2016): Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 848-854, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
Silvie Cinková, Ema Krejčová, Anna Vernerová, Vít Baisa (2016): What Do Graded Decisions Tell Us about Verb Uses. In: Proceedings of the XVII EURALEX International Congress: Lexicography and Linguistic Diversity, pp. 318-328, Tbilisi University Press, Tbilisi, Georgia, ISBN 978-9941-13-542-2 (pdf, local PDF, bibtex)
Kira Droganova, Nikita Mediankin (2016): NLP Pipeline for Russian: an Easy-to-Use Web Application for Morphological and Syntactic Annotation (Electronic). (pdf, local PDF, local PNG)
Kira Droganova, Daniel Zeman (2016): Conversion of SynTagRus (the Russian dependency treebank) to Universal Dependencies (technical report). In: (pdf, bibtex)
Ondřej Dušek, Filip Jurčíček (2016): A Context-aware Natural Language Generator for Dialogue Systems. In: Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 185-190, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-23-4 (pdf, bibtex)
Ondřej Dušek, Filip Jurčíček (2016): Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 45-51, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-01-2 (pdf, bibtex)
Ondřej Dušek, Filip Jurčíček (2016): A Context-aware Natural Language Generation Dataset for Dialogue Systems. In: Workshop on Collecting and Generating Resources for Chatbots and Conversational Agents - Development and Evaluation, pp. 6-9, European Language Resources Association, Paris, France (pdf, local PDF, bibtex)
Eva Fučíková, Jan Hajič, Zdeňka Urešová (2016): Enriching a Valency Lexicon by Deverbative Nouns. In: Proceedings of the Workshop on Grammar and Lexicon: Interactions and Interfaces (GramLex), pp. 71-80, The COLING 2016 Organizing Committee, Ōsaka, Japan, ISBN 978-4-87974-706-8 (bibtex)
Eva Fučíková, Jan Hajič, Zdeňka Urešová (2016): Joint search in a bilingual valency lexicon and an annotated corpus. In: Proceedings of Coling 2016 (Demo papers), pp. 40-44, ICCL, Sheffield, GB, ISBN 978-4-87974-703-7 (bibtex)
Petra Galuščáková, Michal Batko, Martin Kruliš, Jakub Lokoč, David Novák, Pavel Pecina (2016): CUNI at TRECVID 2015 Video Hyperlinking Task. In: 2015 TREC Video Retrieval Evaluation Notebook Papers and Slides, pp. 1-7, NIST, Gaithersburg, MD, USA (url, bibtex)
Petra Galuščáková, Shadi Saleh, Pavel Pecina (2016): SHAMUS: UFAL Search and Hyperlinking Multimedia System. In: Advances in Information Retrieval, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 9626, no. 1, pp. 853-856, Springer International Publishing, Cham, Switzerland, ISBN 978-3-319-30670-4 (url, bibtex)
Rosa Gaudio, Gorka Labaka, Eneko Agirre, Petya Osenova, Kiril Simov, Martin Popel, Dieke Oele, Gertjan van Noord, Luís Gomes, João António Rodrigues, Steven Neale, João Ricardo Silva, Andreia Querido, Nuno Rendeiro, António Branco (2016): SMT and Hybrid systems of the QTLeap project in the WMT16 IT-task. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 435-441, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Nathan David Green, Zdeněk Žabokrtský (2016): Creating Hybrid Dependency Parsers for Syntax-Based MT. In: Hybrid Approaches to Machine Translation, pp. 161-190, Springer International Publishing, Switzerland, ISBN 978-3-319-21310-1 (url, bibtex)
Jan Hajič, Eva Fučíková, Jana Šindlerová, Zdeňka Urešová (2016): Verb Argument Pairing in Czech-English Parallel Treebank. In: GLOBALEX 2016: Lexicographic Resources for Human Language Technology, pp. 16-23, GLOBALEX workshop 2016 (url, bibtex)
Jan Hajič, Eva Hajičová, Jiří Mírovský, Jarmila Panevová (2016): Linguistically Annotated Corpus as an Invaluable Resource for Advancements in Linguistic Research: A Case Study. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 106, pp. 69-124 (url, local PDF, bibtex)
Jan Hajič, jr., Jiří Novotný, Pavel Pecina, Jaroslav Pokorný (2016): Further Steps towards a Standard Testbed for Optical Music Recognition. In: Proceedings of the 17th International Society for Music Information Retrieval Conference, pp. 157-163, New York University, New York, NY, USA, ISBN 978-0-692-75506-8 (pdf, bibtex)
Jaroslava Hlaváčová (2016): Homonymy and Polysemy in the Czech Morphological Dictionary. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 109-116, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (local PDF, bibtex)
Bushra Jawaid, Amir Kamran, Ondřej Bojar (2016): Enriching Source for English-to-Urdu Machine Translation. In: Proceedings of the 6th Workshop on South and Southeast Asian NLP, pp. 54-63, International Committee for Computational Linguistics, Ōsaka, Japan (bibtex)
Bushra Jawaid, Amir Kamran, Miloš Stanojević, Ondřej Bojar (2016): Results of the WMT16 Tuning Shared Task. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 232-238, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Paweł Kamocki, Pavel Straňák, Michal Sedlák (2016): The Public License Selector: Making Open Licensing Easier. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 1-10, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (pdf, local PDF, bibtex)
Václava Kettnerová (2016): Syntaktická struktura komplexních predikátů a její popis ve valenčním slovníku. In: Výzkum slovesné valence ve slovanských zemích, pp. 181-204, Slovanský ústav AV ČR, v.v.i., Prague, Czech Republic, ISBN 978-80-86420-60-8 (local PDF, bibtex)
Václava Kettnerová, Petra Barančíková, Markéta Lopatková (2016): Lexicographic Description of Complex Predicates in Czech: Between Lexicon and Grammar. In: Proceedings of the XVII EURALEX International Congress: Lexicography and Linguistic Diversity, pp. 893-904, Tbilisi University Press, Tbilisi, Georgia, ISBN 978-9941-13-542-2 (local PDF, local PDF, bibtex)
Václava Kettnerová, Eduard Bejček (2016): Distribution of Valency Complements in Czech Complex Predicates: Between Verb and Noun. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 515-521, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (local PDF, local PDF, bibtex)
Jana Klímová, Veronika Kolářová, Anna Vernerová (2016): Towards a Corpus-based Valency Lexicon of Czech Nouns. In: GLOBALEX 2016: Lexicographic Resources for Human Language Technology, pp. 1-7, GLOBALEX workshop 2016 (pdf, local PDF, bibtex)
Natalia Klyueva, Vladislav Kuboň (2016): Incorporation of a valency lexicon into a TectoMT pipeline. In: Proceedings of the 2nd Deep Machine Translation Workshop, pp. 47-53, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-02-8 (pdf, local PDF, local PDF, bibtex)
Natalia Klyueva, Pavel Straňák (2016): Improving Corpus Search via Parsing. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 2862-2866, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (pdf, local PDF, bibtex)
Tom Kocmi, Ondřej Bojar (2016): SubGram: Extending Skip-gram Word Representation with Substrings. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 182-189, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (url, bibtex)
Viktor Kocúr, Ondřej Bojar (2016): Particle Swarm Optimization Submission for WMT16 Tuning Task. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 518-524, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Martin Komenda, Matěj Karolyi, Andrea Pokorná, Martin Víta, Vincent Kríž (2016): Automatic Keyword Extraction from Medical and Healthcare Curriculum. In: Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, pp. 287-290, Institute of Electrical and Electronics Engineers, New York City, NY, USA, ISBN 978-83-60810-90-3 (bibtex)
Vincent Kríž, Barbora Hladká (2016): Improving Dependency Parsing Using Sentence Clause Charts. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics – Student Research Workshop, pp. 86-92, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-02-9 (pdf, bibtex)
Vincent Kríž, Barbora Hladká, Zdeňka Urešová (2016): Czech Legal Text Treebank 1.0. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 2387-2392, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (pdf, local PDF, bibtex)
David Kuboň, Barbora Hladká (2016): Politician. In: Workshop on Collecting and Generating Resources for Chatbots and Conversational Agents - Development and Evaluation, pp. 43-44, European Language Resources Association, Paris, France (pdf, bibtex)
Vladislav Kuboň, Markéta Lopatková, Tomáš Hercig (2016): Searching for a Measure of Word Order Freedom. In: Proceedings of the 16th ITAT Conference Information Technologies - Applications and Theory, pp. 11-17, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (pdf, local PDF, bibtex)
Ekaterina Lapshinova-Koltunski, Anna Nedoluzhko, Kerstin Anna Kunz (2016): An interoperable approach to the analysis of discourse. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 991-997, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, local PDF, bibtex)
Ekaterina Lapshinova-Koltunski, Anna Nedoluzhko, Kerstin Anna Kunz (2016): From monolingual annotations towards cross-lingual resources: An interoperable approach to the analysis of discourse. In: TextLink – Structuring Discourse in Multilingual Europe Second Action Conference, pp. 74-78, Károli Gáspár University of the Reformed Church in Hungary, Budapest, Hungary, ISBN 9789633185636 (pdf, local PDF, local PDF, bibtex)
Jindřich Libovický (2016): Neural Scoring Function for MST Parser. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 694-698, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (bibtex)
Jindřich Libovický, Jindřich Helcl, Marek Tlustý, Pavel Pecina, Ondřej Bojar (2016): CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 646-654, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (url, bibtex)
Jindřich Libovický, Pavel Pecina (2016): A Dataset and Evaluation Metric for Coherent Text Recognition from Scene Images. In: Multimodal Corpora: Computer vision and language processing, pp. 33-36, European Language Resources Association, Paris, France (pdf, bibtex)
Markéta Lopatková, Václava Kettnerová (2016): Alternations: From Lexicon to Grammar And Back Again. In: Proceedings of the Workshop on Grammar and Lexicon: Interactions and Interfaces (GramLex), pp. 18-27, The COLING 2016 Organizing Committee, Ōsaka, Japan, ISBN 978-4-87974-706-8 (local PDF, bibtex)
Markéta Lopatková, Václava Kettnerová, Eduard Bejček, Anna Vernerová, Zdeněk Žabokrtský (2016): Valenční slovník českých sloves VALLEX. In: , ISBN 978-80-246-3542-2 (bibtex)
Markéta Lopatková, Anna Vernerová, Václava Kettnerová (2016): Diateze ve Valenčním slovníku českých sloves VALLEX. In: Výzkum slovesné valence ve slovanských zemích, pp. 149-168, Slovanský ústav AV ČR, Prague, Czech Republic, ISBN 978-80-86420-60-8 (local PDF, bibtex)
Olga Lyashevskaya, Kira Droganova, Daniel Zeman, Maria Alexeeva, Tatiana Gavrilova, Nina Mustafina, Elena Shakurova (2016): Universal Dependencies for Russian: A New Syntactic Dependencies Tagset (Electronic). (pdf)
David Mareček (2016): Delexicalized and Minimally Supervised Parsing on Universal Dependencies. In: Statistical Language and Speech Processing, pp. 30-42, Springer International Publishing, Cham, Switzerland, ISBN 978-3-319-45924-0 (local PDF, bibtex)
David Mareček (2016): Twelve Years of Unsupervised Dependency Parsing. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 56-62, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (pdf, local PDF, bibtex)
David Mareček (2016): Merged bilingual trees based on Universal Dependencies in Machine Translation. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 333-338, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, local PDF, local PDF, bibtex)
David Mareček, Zdeněk Žabokrtský (2016): Gibbs Sampling Segmentation of Parallel Dependency Trees for Tree-Based Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 105, pp. 101-110 (pdf, local PDF, bibtex)
Héctor Martínez Alonso, Daniel Zeman (2016): Universal Dependencies for the AnCora treebanks. In: Procesamiento del Lenguaje Natural, ISSN 1135-5948, 57, pp. 91-98 (url, local PDF, bibtex)
Nikita Mediankin (2016): ConFarm: Extracting Surface Representations of Verb and Noun Constructions from Dependency Annotated Corpora of Russian. In: Proceedings of Coling 2016 (Demo papers), pp. 238-242, ICCL, Sheffield, GB, ISBN 978-4-87974-703-7 (bibtex)
Nikita Mediankin, Kira Droganova (2016): Building NLP Pipeline for Russian with a Handful of Linguistic Knowledge. In: Proceedings of the Workshop on Computational Linguistics and Language Science, pp. 48-56, CEUR-WS, Aachen, Germany (bibtex)
Jiří Mírovský, Lucie Poláková, Jan Štěpánek (2016): Searching in the Penn Discourse Treebank Using the PML-Tree Query. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 1762-1769, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
Jiří Mírovský, Pavlína Synková, Magdaléna Rysová, Lucie Poláková (2016): Designing CzeDLex – A Lexicon of Czech Discourse Connectives. In: Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, pp. 449-457, Kyung Hee University, Seoul, Korea, ISBN 978-89-6817-428-5 (pdf, local PDF, bibtex)
Jozef Mišutka, Ondřej Košarko, Amir Kamran (2016): SHORTREF.ORG Making URLs Easy-to-Cite. In: Workshop on Research Results Reproducibility and Resources Citation in Science and Technology of Language, pp. 21-23, European Language Resources Association, Paris, France (pdf, bibtex)
Jakub Mlynář (2016): Minulost jako vzpomínka a vyprávění: Archiv vizuální historie USC Shoah Foundation v kontextu orální historie. In: Návraty: Poválečná rekonstrukce židovských komunit v zemích středovýchodní, jihovýchodní a východní Evropy, pp. 11-23, Karolinum, Prague, Czech Republic, ISBN 9788024632711 (bibtex)
Anna Nedoluzhko (2016): A new look at possessive reflexivization: A comparative study between Czech and Russian. In: Proceedings of the Workshop on Grammar and Lexicon: Interactions and Interfaces (GramLex), pp. 110-119, The COLING 2016 Organizing Committee, Ōsaka, Japan, ISBN 978-4-87974-706-8 (bibtex)
Anna Nedoluzhko, Ekaterina Lapshinova-Koltunski (2016): Abstract Coreference in a Multilingual Perspective: a View on Czech and German. In: Coreference Resolution Beyond OntoNotes co-located with NAACL 2016, pp. 47-52, The Association for Computational Linguistics, San Diego, USA, ISBN 978-1-941643-90-7 (local PDF, local PDF, bibtex)
Anna Nedoluzhko, Ekaterina Lapshinova-Koltunski, Kerstin Anna Kunz (2016): Contrasting Coreference in Czech and German: from Different Frameworks to Joint Results. In: Computational Linguistics and Intellectual Technologies, pp. 1-14, nakl. RGGU, Moscow, Russia (local PDF, local PDF, bibtex)
Anna Nedoluzhko, Michal Novák, Silvie Cinková, Marie Mikulová, Jiří Mírovský (2016): Coreference in Prague Czech-English Dependency Treebank. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 169-176, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
Anna Nedoluzhko, Anna Schwarz (Khoroshkina), Michal Novák (2016): Possessives in Parallel English-Czech-Russian Texts. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, 15, pp. 483-497 (pdf, local PDF, bibtex)
Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajič, Christopher Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman (2016): Universal Dependencies v1: A Multilingual Treebank Collection. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 1659-1666, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
Michal Novák (2016): Pronoun Prediction with Linguistic Features and Example Weighing. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 602-608, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Angelina Ivanova, Zdeňka Urešová (2016): Towards Comparability of Linguistic Graph Banks for Semantic Parsing. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 3991-3995, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
Michal Olbrich (2016): Interogační jednotka pro monitorování dřevostaveb (masters thesis). In: (bibtex)
Arantxa Otegi, Nora Aranberri, António Branco, Jan Hajič, Steven Neale, Petya Osenova, Rita Pereira, Martin Popel, João Ricardo Silva, Kiril Simov, Eneko Agirre (2016): QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 3023-3030, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (bibtex)
Jarmila Panevová (2016): In favour of the Argument-Adjunct Distinction (from the Perspective of FGD). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 106, pp. 21-30 (url, bibtex)
Jarmila Panevová (2016): Valence v gramatice, valence ve slovníku. In: Výzkum slovesné valence ve slovanských zemích, pp. 13-25, Slovanský ústav AV ČR, v.v.i., Praha, Czech Republic, ISBN 978-80-86420-60-8 (bibtex)
Jarmila Panevová (2016): Syntax Vladimíra Šmilauera včera a dnes. In: Jazykovědné aktuality, ISSN 1212-5326, vol. 53, no. 1 and 2, pp. 30-35 (bibtex)
Jan-Thorsten Peter, Tamer Alkhouli, Hermann Ney, Matthias Huck, Fabienne Braune, Alexander Fraser, Aleš Tamchyna, Ondřej Bojar, Barry Haddow, Rico Sennrich, Frédéric Blain, Lucia Specia, Jan Niehues, Alex Waibel, Alexandre Allauzen, Lauriane Aufrant, Franck Burlot, Elena Knyazeva, Thomas Lavergne, François Yvon, Stella Frank, Marcis Pinnis (2016): The QT21/HimL Combined Machine Translation System. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 344-355, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Ondřej Plátek, Petr Bělohlávek, Vojtěch Hudeček, Filip Jurčíček (2016): Recurrent Neural Networks for Dialogue State Tracking. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 63-67, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (bibtex)
Ondřej Plátek, Filip Jurčíček (2016): A Dataset of Operator-client Dialogues Aligned with Database Queries for End-to-end Training. In: Intelligent Virtual Agents, pp. 1-10, Springer, Los Angeles, CA, USA, ISBN 978-3-319-47664-3 (pdf, bibtex)
Martin Popel, Roman Sudarikov, Ondřej Bojar, Rudolf Rosa, Jan Hajič (2016): TectoMT – a deep-linguistic core of the combined Chimera MT system. In: Baltic Journal of Modern Computing, ISSN 2255-8942, vol. 4, no. 2, pp. 377-377 (pdf, local PDF, local PDF, local PDF, bibtex)
Adam Przepiórkowski, Jan Hajič, Elżbieta Hajnicz, Zdeňka Urešová (2016): Phraseology in two Slavic valency dictionaries: limitations and perspectives. In: International Journal of Lexicography, ISSN 0950-3846, vol. 30, no. 1, pp. 1-38 (url, bibtex)
Katrin Přikrylová, Vladislav Kuboň, Kateřina Veselovská (2016): Logical vs. Natural Language Conjunctions in Czech: A Comparative Study. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 68-73, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (bibtex)
Katrin Přikrylová, Vladislav Kuboň, Kateřina Veselovská (2016): The Role of Conjunctions in Adjective Polarity Analysis in Czech. In: Computación y Sistemas, ISSN 1405-5546, vol. 20, no. 3, pp. 377-386 (bibtex)
Anna Roitberg, Anna Nedoluzhko (2016): Bridging Corpus for Russian in comparison with Czech. In: Coreference Resolution Beyond OntoNotes co-located with NAACL 2016, pp. 59-66, The Association for Computational Linguistics, San Diego, USA, ISBN 978-1-941643-90-7 (local PDF, bibtex)
Rudolf Rosa (2016): Czechizator. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 74-79, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (pdf, local PDF, local PDF, bibtex)
Rudolf Rosa, Petra Barančíková (2016): Slovakoczech NLP workshop (SloNLP 2016) (ProceedingsPart). In: Proceedings of the 16th ITAT Conference Information Technologies - Applications and Theory, pp. 35-89, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (url)
Rudolf Rosa, Martin Popel, Ondřej Bojar, David Mareček, Ondřej Dušek (2016): Moses & Treex Hybrid MT Systems Bestiary. In: Proceedings of the 2nd Deep Machine Translation Workshop, pp. 1-10, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-02-8 (url, local PDF, local PDF, bibtex)
Rudolf Rosa, Roman Sudarikov, Michal Novák, Martin Popel, Ondřej Bojar (2016): Dictionary-based Domain Adaptation of MT Systems without Retraining. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 449-455, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Victoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejček, Agata Savary, Petya Osenova (2016): MWEs in Treebanks: From Survey to Guidelines. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 2323-2330, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (local PDF, bibtex)
Kateřina Rysová, Karel Oliva (2016): 42. ročník Olympiády v českém jazyce. In: Český jazyk a literatura, ISSN 0009-0786, vol. 67, no. 2, pp. 53-58 (local PDF, bibtex)
Kateřina Rysová, Magdaléna Rysová (2016): Koreference a elipsa v Pražském závislostním korpusu. In: Korpus – gramatika – axiologie, ISSN 1804-137X, vol. 7, no. 13, pp. 35-47 (local PDF, bibtex)
Kateřina Rysová, Magdaléna Rysová, Jiří Mírovský (2016): Automatic evaluation of surface coherence in L2 texts in Czech. In: Proceedings of the 28th Conference on Computational Linguistics and Speech Processing ROCLING XXVIII (2016), pp. 214-228, The Association for Computational Linguistics and Chinese Language Processing (ACLCLP), Taipei, Taiwan, ISBN 978-957-30792-9-3 (pdf, local PDF, bibtex)
Shadi Saleh, Pavel Pecina (2016): Reranking Hypotheses of Machine-Translated Queries for Cross-Lingual Information Retrieval. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction 7th International Conference of the CLEF Association, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 9822, no. 9822, pp. 54-66, Springer, Berlin, Germany, ISBN 978-3-319-44563-2 (bibtex)
Shadi Saleh, Pavel Pecina (2016): Task3 Patient-Centred Information Retrieval: Team CUNI. In: CLEF 2016 Working Notes, pp. 123-129, CEUR-WS (bibtex)
Shadi Saleh, Pavel Pecina (2016): Adapting SMT Query Translation Reranker to New Languages in Cross-Lingual Information Retrieval. In: Medical Information Retrieval (MedIR) Workshop at the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1-4, ACM, Pisa, Italy (pdf, bibtex)
Ineke Schuurman, Menzo Windhouwer, Oddrun Ohren, Daniel Zeman (2016): CLARIN Concept Registry: The New Semantic Registry. In: Selected Papers from the CLARIN Annual Conference 2015, October 14–16, 2015, Wrocław, Poland, pp. 62-70, Linköping University Electronic Press, Linköpings universitet, Linköping, Sweden, ISBN 978-91-7685-765-6 (url, local PDF, bibtex)
Milan Straka, Jan Hajič, Jana Straková (2016): UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 4290-4297, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (pdf, local PDF, bibtex)
Jana Straková, Milan Straka, Jan Hajič (2016): Neural Networks for Featureless Named Entity Recognition in Czech. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 173-181, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (url, local PDF, bibtex)
Roman Sudarikov, Ondřej Dušek, Martin Holub, Ondřej Bojar, Vincent Kríž (2016): Verb Sense Disambiguation in Machine Translation. In: Sixth Workshop on Hybrid Approaches to Translation (HyTra-6), pp. 42-50, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-4-87974-713-6 (pdf, bibtex)
Roman Sudarikov, Martin Popel, Ondřej Bojar, Aljoscha Burchardt, Ondřej Klejch (2016): Using MT-ComparEval. In: Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem, pp. 76-82, LREC, Portorož, Slovenia (pdf, bibtex)
Magda Ševčíková, Adéla Limburská (2016): Enrichment of a lexical network of Czech derived words by exploiting inflectional paradigms. In: Book of Abstracts. 49th Annual Meeting of the Societas Linguistica Europaea, pp. 714-715, University of Naples Federico II, Naples, Italy (pdf, bibtex)
Magda Ševčíková, Zdeněk Žabokrtský, Jonáš Vidra, Milan Straka (2016): Lexikální síť DeriNet: elektronický zdroj pro výzkum derivace v češtině. In: Časopis pro moderní filologii, ISSN 0008-7386, vol. 98, no. 1, pp. 62-76 (bibtex)
Aleš Tamchyna, Petra Barančíková (2016): Manual and Automatic Paraphrases for MT Evaluation. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 3543-3548, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (bibtex)
Aleš Tamchyna, Alexander Fraser, Ondřej Bojar, Marcin Junczys-Dowmunt (2016): Target-Side Context for Discriminative Models in Statistical Machine Translation. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1704-1714, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-00-5 (pdf, bibtex)
Aleš Tamchyna, Roman Sudarikov, Ondřej Bojar, Alexander Fraser (2016): CUNI-LMU Submissions in WMT2016: Chimera Constrained and Beaten. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 385-390, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (url, bibtex)
Aleš Tamchyna, Kateřina Veselovská (2016): UFAL at SemEval-2016 Task 5: Recurrent Neural Networks for Sentence Classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: International Workshop on Semantic Evaluation (SemEval), pp. 367-371, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-941643-95-2 (bibtex)
Le Thanh, Hoa Vu Throng, Jonathan Oberländer, Ondřej Bojar (2016): Using Term Position Similarity and Language Modeling for Bilingual Document Alignment. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 710-716, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Zdeňka Urešová, Eduard Bejček, Jan Hajič (2016): Inherently Pronominal Verbs in Czech: Description and Conversion Based on Treebank Annotation. In: Proceedings of the 12th Workshop on Multiword Expressions (ACL 2016), pp. 78-83, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-945626-06-7 (url, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Jan Hajič (2016): Non-projectivity and valency. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Workshop on Discontinuous Structures in Natural Language Processing (DiscoNLP), pp. 12-21, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-941643-85-3 (pdf, bibtex)
Zdeňka Urešová, Eva Fučíková, Jana Šindlerová (2016): CzEngVallex: a bilingual Czech-English valency lexicon. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 105, pp. 17-50 (pdf, bibtex)
Dušan Variš (2016): Automatic Error Correction of Machine Translation Output (masters thesis). In: (url, bibtex)
Jernej Vičič, Petr Homola, Vladislav Kuboň (2016): Automated implementation process of machine translation system for related languages. In: Computing and Informatics, ISSN 1335-9150, vol. 35, no. 2, pp. 441-469 (bibtex)
Martin Víta, Vincent Kríž (2016): Word2vec Based System for Recognizing Partial Textual Entailment. In: Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, pp. 513-516, Institute of Electrical and Electronics Engineers, New York City, NY, USA, ISBN 978-83-60810-90-3 (bibtex)
Miroslav Vodolán, Filip Jurčíček (2016): Data Collection for Interactive Learning through the Dialog. In: Workshop on Collecting and Generating Resources for Chatbots and Conversational Agents - Development and Evaluation, pp. 1-5, European Language Resources Association, Paris, France (bibtex)
Phil Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Barry Haddow, Ondřej Bojar (2016): Edinburgh’s Statistical Machine Translation Systems for WMT16. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 399-410, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, bibtex)
Zhiwei Yu, David Mareček, Zdeněk Žabokrtský, Daniel Zeman (2016): If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 96-103, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
Daniel Zeman (2016): Universal Annotation of Slavic Verb Forms. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 105, pp. 143-193 (pdf, local PDF, bibtex)
Daniel Zeman, David Mareček, Zhiwei Yu, Zdeněk Žabokrtský (2016): Planting Trees in the Desert: Delexicalized Tagging and Parsing Combined. In: Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, pp. 199-207, Kyung Hee University, Seoul, Korea, ISBN 978-89-6817-428-5 (pdf, local PDF, local PDF, bibtex)
Guido Zuccon, João Palotti, Lorraine Goeuriot, Liadh Kelly, Mihai Lupu, Pavel Pecina, Henning Müller, Julie Budaher, Anthony Deacon (2016): The IR Task at the CLEF eHealth Evaluation Lab 2016: User-centred Health Information Retrieval. In: CLEF 2016 Working Notes, pp. 15-27, CEUR-WS (bibtex)
Zdeněk Žabokrtský, Magda Ševčíková, Milan Straka, Jonáš Vidra, Adéla Limburská (2016): Merging Data Resources for Inflectional and Derivational Morphology in Czech. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 1307-1314, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (pdf, local PDF, bibtex)
Ahmad Aghaebrahimian (2015): Constraint-based semantic parsing. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 57-64, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Ahmad Aghaebrahimian, Filip Jurčíček (2015): Machine Learning for Semantic Parsing in Review. In: Proceedings of 7th Language and Technology Conference, pp. 535-539, Fundacja Uniwersytetu im. Adama Mickiewicza w Poznaniu, Poznań, Poland, ISBN 978-83-932640-8-7 (pdf, bibtex)
Sarah Berenji Ardestani, Carl Johan Håkansson, Erwin Laure, Ilja Livenson, Pavel Straňák, Emanuel Dima, Dennis Blommesteijn, Mark van de Sanden (2015): B2SHARE: An Open eScience Data Sharing Platform. In: 2015 IEEE 11th International Conference on e-Science (e-Science), pp. 448-453, IEEE computer society, Munich, Germany, ISBN 978-1-4673-9325-6 (url, local PDF, bibtex)
Vít Baisa, Jane Bradbury, Silvie Cinková, Ismail El Maarouf, Adam Kilgarriff, Octavian Popescu (2015): SemEval-2015 Task 15: A CPA dictionary-entry-building task. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 315-324, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-40-2 (url, local PDF, bibtex)
Petra Barančíková, Rudolf Rosa (2015): Slovakoczech NLP workshop (SloNLP 2015) (ProceedingsPart). In: ITAT 2015: Information Technologies – Applications and Theory, Proceedings of the 15th conference ITAT 2015, pp. 65-105, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1515120650 (url)
Petra Barančíková, Rudolf Rosa (2015): Targeted Paraphrasing on Deep Syntactic Layer for MT Evaluation. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 20-27, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (url, local PDF, local PDF, bibtex)
Eduard Bejček (2015): Automatické propojování lexikografických zdrojů a korpusových dat (PhD thesis). In: (local PDF, bibtex)
Ondřej Bojar (2015): Machine translation. In: The Oxford Handbook of Inflection, pp. 323-347, Oxford University Press, Oxford, UK, ISBN 978-0-19-959142-8 (url, bibtex)
Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Barry Haddow, Matthias Huck, Chris Hokamp, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Carolina Scarton, Lucia Specia, Marco Turchi (2015): Findings of the 2015 Workshop on Statistical Machine Translation. In: Proceedings of the 10th Workshop on Machine Translation, pp. 1-46, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (pdf, bibtex)
Ondřej Bojar, Aleš Tamchyna (2015): CUNI in WMT15: Chimera Strikes Again. In: Proceedings of the 10th Workshop on Machine Translation, pp. 79-83, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (url, bibtex)
Radek Čech, Ján Mačutek, Zdeněk Žabokrtský, Aleš Horák (2015): Polysemy and Synonymy in Syntactic Dependency Networks. In: Digital Scholarship in the Humanities, ISSN 2055-7671, 32, pp. 1-14 (url, bibtex)
Kira Droganova (2015): Building a Dependency Parsing Model for Russian with MaltParser and MyStem Tagset. In: 14th International Workshop on Treebanks and Linguistic Theories (TLT 2015), pp. 268-272, IPIPAN, Warszawa, Poland, ISBN 978-83-63159-18-4 (local PDF, bibtex)
Ondřej Dušek, Eva Fučíková, Jan Hajič, Martin Popel, Jana Šindlerová, Zdeňka Urešová (2015): Using Parallel Texts and Lexicons for Verbal Word Sense Disambiguation. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 82-90, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (pdf, local PDF, bibtex)
Ondřej Dušek, Luís Gomes, Michal Novák, Martin Popel, Rudolf Rosa (2015): New Language Pairs in TectoMT. In: Proceedings of the 10th Workshop on Machine Translation, pp. 98-104, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (pdf, local PDF, bibtex)
Ondřej Dušek, Filip Jurčíček (2015): Training a Natural Language Generator From Unaligned Data. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 451-461, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-72-3 (pdf, local PDF, bibtex)
Franky, Ondřej Bojar, Kateřina Veselovská (2015): Resources for Indonesian Sentiment Analysis. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 103, pp. 21-41 (pdf, bibtex)
Eva Fučíková, Jan Hajič, Jana Šindlerová, Zdeňka Urešová (2015): Czech-English Bilingual Valency Lexicon Online. In: 14th International Workshop on Treebanks and Linguistic Theories (TLT 2015), pp. 61-71, IPIPAN, Warszawa, Poland, ISBN 978-83-63159-18-4 (pdf, local PDF, bibtex)
Petra Galuščáková, Pavel Pecina (2015): CUNI at MediaEval 2015 Search and Anchoring in Video Archives: Anchoring via Information Retrieval. In: Working Notes Proceedings of the MediaEval 2015 Workshop, CEUR-WS.org, Aachen, Germany (pdf, bibtex)
Petra Galuščáková, Pavel Pecina (2015): Audio Information for Hyperlinking of TV Content. In: Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia, pp. 27-30, ACM, New York, NY, USA, ISBN 978-1-4503-3749-6 (bibtex)
Barbara J. Grosz, Eva Hajičová, Aravind K. Joshi (2015): Obituary - Jane J. Robinson. In: Computational Linguistics, ISSN 1530-9312, vol. 41, no. 4, pp. 723-726 (url, bibtex)
Jan Hajič (2015): Treebanks and MWEs (LectureNotes). (url)
Jan Hajič (2015): LINDAT/CLARIN: data a technologie pro výzkum založený na analýze psaného a mluveného jazyka. In: Seminář o digitálních zdrojích a službách ve společenských a humanitních vědách, pp. 1-4, Charles University in Prague, Praha, Czechia, ISBN 978-80-904571-9-5 (url, bibtex)
Jan Hajič, Eva Hajičová, Marie Mikulová, Jiří Mírovský, Jarmila Panevová, Daniel Zeman (2015): Deletions and node reconstructions in a dependency-based multilevel annotation scheme. In: 16th International Conference on Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science, ISSN 0302-9743, 9041, pp. 17-31, Springer, Berlin / Heidelberg, ISBN 978-3-319-18111-0 (url, bibtex)
Jan Hajič, jr., Pavel Pecina (2015): Matching Illustrative Images to “Soft News” Articles. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 49-56, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Eva Hajičová, Marie Mikulová, Jarmila Panevová (2015): Reconstruction of Deletions in a Dependency-based Description of Czech: Selected Issues. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 131-140, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (pdf, bibtex)
Barbora Hladká, Martin Holub (2015): A Gentle Introduction to Machine Learning for Natural Language Processing: How to start in 16 practical steps. In: Language and Linguistics Compass, ISSN 1749-818X, vol. 9, no. 2, pp. 55-76 (bibtex)
Tam Hoang, Ondřej Bojar (2015): TmTriangulate: A Tool for Phrase Table Triangulation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 104, pp. 75-86 (pdf, bibtex)
Timo Järvinen, Elisabeth Bertol, Septina Dian Larasati, Monica-Mihaela Rizea, Maria Ruiz Santabalbina, Milan Souček (2015): Towards Cross-language Application of Dependency Grammar. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 171-180, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (bibtex)
Václava Kettnerová, Markéta Lopatková (2015): At the Lexicon-Grammar Interface: The Case of Complex Predicates in the Functional Generative Description. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 191-200, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (url, local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková, Jarmila Panevová (2015): Shoda doplňku v reflexivních konstrukcích v češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 76, no. 3, pp. 198-214 (local PDF, bibtex)
Ondřej Klejch, Eleftherios Avramidis, Aljoscha Burchardt, Martin Popel (2015): MT-ComparEval: Graphical evaluation interface for Machine Translation development. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 104, pp. 63-74 (pdf, bibtex)
Ondřej Klejch, Ondřej Plátek, Lukáš Žilka, Filip Jurčíček (2015): CloudASR: Platform & Service. In: Text, Speech, and Dialogue: 18th International Conference, TSD 2015, Lecture Notes in Computer Science, ISSN 0302-9743, 9302, pp. 376-383, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-24032-9 (bibtex)
Radoslav Klíč, Jirka Hana (2015): Resource-Light Acquisition of Inflectional Paradigms. In: Proceedings of the 15th conference ITAT 2015: Slovenskočeský NLP workshop (SloNLP 2015), pp. 66-72, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1515120650 (pdf, bibtex)
Natalia Klyueva (2015): Linguistic Issues in Machine Translation between Czech and Russian (PhD thesis). In: (bibtex)
Natalia Klyueva, Jeevanthi Liyanapathirana (2015): Analysis of MultiWord Expression translation errors in Statistical Machine Translation. In: MUMTTT workshop (2nd Workshop on Multi-word Units in Machine Translation and Translation Technology), pp. 55-57, Gloria Corpas Pastor, Málaga, Spain (url, local PDF, local PDF, local PDF, bibtex)
Vincent Kríž, Barbora Hladká (2015): RExtractor: a Robust Information Extractor. In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pp. 21-25, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-49-5 (bibtex)
Vincent Kríž, Martin Holub, Pavel Pecina (2015): Feature Extraction for Native Language Identification Using Language Modeling. In: Proceedings of Recent Advances in Natural Language Processing, pp. 298-306, Research Group in Computational Linguistics, University of Wolverhampton, UK, Hisarja, Bulgaria (bibtex)
Vladislav Kuboň, Markéta Lopatková (2015): Word-Order Analysis Based Upon Treebank Data. In: MICAI 2015: Advances in Artificial Intelligence and Soft Computing, Part I, pp. 47-58, Springer, Berlin / Heidelberg, ISBN 978-3-319-27059-3 (url, bibtex)
Vladislav Kuboň, Markéta Lopatková (2015): Free or Fixed Word Order: What Can Treebanks Reveal? In: ITAT 2015: Information Technologies – Applications and Theory, Proceedings of the 15th conference ITAT 2015, pp. 23-29, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1515120650 (url, bibtex)
Vladislav Kuboň, Markéta Lopatková, Jiří Mírovský (2015): Analysis of Coordinating Constructions in a Dependency Treebank. In: Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference, FLAIRS 2015, pp. 546-551, AAAI Press, Palo Alto, CA, USA, ISBN 978-1-57735-730-8 (url, bibtex)
Jindřich Libovický, Lukáš Neumann, Pavel Pecina, Jiří Matas (2015): A Machine Learning Approach to Hypothesis Decoding in Scene Text Recognition. In: Computer Vision - ACCV 2014 Workshops, Lecture Notes in Computer Science, ISSN 0302-9743, 9009, pp. 169-180, Springer International Publishing, Switzerland, ISBN 978-3-319-16630-8 (bibtex)
Ivana Lukšová (2015): Delving deep into some UFAL data and tools. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 42-48, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Matouš Macháček, Ondřej Bojar (2015): Evaluating Machine Translation Quality Using Short Segments Annotations. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 103, pp. 85-110 (pdf, bibtex)
David Mareček (2015): Multilingual Unsupervised Dependency Parsing with Unsupervised POS tags. In: MICAI 2015: Advances in Artificial Intelligence and Soft Computing, Part I, pp. 72-82, Springer, Berlin / Heidelberg, ISBN 978-3-319-27059-3 (bibtex)
Jakub Mlynář (2015): Unique collection of interviews with Armenian genocide witnesses and survivors is available at the Charles University in Prague. In: Historická sociologie, ISSN 1804-0616, vol. 2015, no. 2, pp. 126-128 (url, bibtex)
Jakub Mlynář (2015): Malach Center for Visual History. In: Seminář o digitálních zdrojích a službách ve společenských a humanitních vědách, pp. 83-89, Charles University in Prague, Praha, Czechia, ISBN 978-80-904571-9-5 (bibtex)
Anna Nedoluzhko, Ekaterina Lapshinova-Koltunski, Kerstin Anna Kunz (2015): Across Languages and Genres: Creating a Universal Annotation Scheme for Textual Relations. In: Proceedings of the 9th Linguistic Annotation Workshop (LAW IX 2015), pp. 168-177, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-47-1 (local PDF, bibtex)
Anna Nedoluzhko, Svetlana Toldova, Michal Novák (2015): Coreference chains in Czech, English and Russian: Preliminary findings. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, vol. 14, no. 21, pp. 474-486 (pdf, bibtex)
Michal Novák, Anna Nedoluzhko (2015): Correspondences between Czech and English Coreferential Expressions. In: Discours: Revue de linguistique, psycholinguistique et informatique, ISSN 1963-1723, 16, pp. 1-41 (url, bibtex)
Michal Novák, Dieke Oele, Gertjan van Noord (2015): Comparison of Coreference Resolvers for Deep Syntax Translation. In: Proceedings of the Second Workshop on Discourse in Machine Translation, pp. 17-23, Association for Computational Linguistics, Lisboa, Portugal, ISBN 978-1-941643-32-7 (url, bibtex)
Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Silvie Cinková, Dan Flickinger, Jan Hajič, Zdeňka Urešová (2015): SemEval 2015 Task 18: Broad-Coverage Semantic Dependency Parsing. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 915-926, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-40-2 (url, local PDF, bibtex)
João Palotti, Guido Zuccon, Lorraine Goeuriot, Liadh Kelly, Allan Hanbury, Gareth J.F. Jones, Mihai Lupu, Pavel Pecina (2015): Retrieving information about medical symptoms. In: Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, CEUR, Aachen, Germany (bibtex)
Jarmila Panevová (2015): Odešel prof. dr. Fr. Daneš (23.7.1919-18.3.2015) (Survey). In: Korpus – gramatika – axiologie, ISSN 1804-137X, 11, pp. 3-5
Jarmila Panevová, Marie Mikulová (2015): Příslovečné určení srovnání v češtině. In: U prostoru lingvističke slavistike, pp. 597-608, Univerzitet u Beogradu, Beograd, Serbia, ISBN 978-86-6153-364-8 (local PDF, bibtex)
Pavel Pecina, Antonio Toral, Vassilis Papavassiliou, Prokopis Prokopidis, Aleš Tamchyna, Andy Way, Josef Genabith (2015): Domain adaptation of statistical machine translation with domain-focused web crawling. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 49, no. 1, pp. 147-193 (bibtex)
Ondřej Plátek, Filip Jurčíček (2015): Self-awareness: Towards Extracting Knowledge from Dialogue. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 22-29, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Ondřej Plátek, Filip Jurčíček (2015): Self awareness for better common ground. In: SemDial, pp. 207-208, Göteborgs universitet, Göteborg, Sweden (bibtex)
Lucie Poláková (2015): Discourse Relations in Czech (PhD thesis). In: (bibtex)
Lucie Poláková, Pavlína Jínová, Jiří Mírovský (2015): Signals of Attribution in the Prague Dependency Treebank. In: 14th International Workshop on Treebanks and Linguistic Theories (TLT 2015), pp. 292-299, IPIPAN, Warszawa, Poland, ISBN 978-83-63159-18-4 (url, local PDF, bibtex)
Loganathan Ramasamy, Alexandr Rosen, Pavel Straňák (2015): Improvements to Korektor: A case study with native and non-native Czech. In: Proceedings of the 15th conference ITAT 2015: Slovenskočeský NLP workshop (SloNLP 2015), pp. 73-80, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1515120650 (bibtex)
Rudolf Rosa (2015): Multi-source Cross-lingual Delexicalized Parser Transfer: Prague or Stanford? In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 281-290, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (url, local PDF, local PDF, bibtex)
Rudolf Rosa (2015): Parsing Natural Language Sentences by Semi-supervised Methods (Electronic). (pdf, local PDF, local PDF, local PDF)
Rudolf Rosa (2015): A new parsing algorithm. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 8-13, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (local PDF, bibtex)
Rudolf Rosa, Ondřej Dušek, Michal Novák, Martin Popel (2015): Translation Model Interpolation for Domain Adaptation in TectoMT. In: Proceedings of the 1st Deep Machine Translation Workshop, pp. 89-96, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-904571-7-1 (url, local PDF, local PDF, bibtex)
Rudolf Rosa, Zdeněk Žabokrtský (2015): MSTParser Model Interpolation for Multi-source Delexicalized Transfer. In: Proceedings of the 14th International Conference on Parsing Technologies, pp. 71-75, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-98-3 (url, local PDF, local PDF, bibtex)
Rudolf Rosa, Zdeněk Žabokrtský (2015): KLcpos3 - a Language Similarity Measure for Delexicalized Parser Transfer. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pp. 243-249, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-73-0 (url, local PDF, local ZIP, local PDF, local PDF, bibtex)
Victoria Rosén, Gyri Smørdal Losnegaard, Koenraad De Smedt, Eduard Bejček, Agata Savary, Adam Przepiórkowski, Petya Osenova, Verginica Barbu Mititelu (2015): A survey of multiword expressions in treebanks. In: 14th International Workshop on Treebanks and Linguistic Theories (TLT 2015), pp. 179-193, IPIPAN, Warszawa, Poland, ISBN 978-83-63159-18-4 (local PDF, bibtex)
Kateřina Rysová, Jiří Mírovský, Eva Hajičová (2015): On an apparent freedom of Czech word order. A case study. In: 14th International Workshop on Treebanks and Linguistic Theories (TLT 2015), pp. 93-105, IPIPAN, Warszawa, Poland, ISBN 978-83-63159-18-4 (pdf, local PDF, bibtex)
Kateřina Rysová, Karel Oliva (2015): Jaký byl 41. ročník Olympiády v českém jazyce? In: Český jazyk a literatura, ISSN 0009-0786, pp. 53-57 (bibtex)
Kateřina Rysová, Magdaléna Rysová (2015): Analyzing Text Coherence via Multiple Annotation in the Prague Dependency Treebank. In: Text, Speech, and Dialogue: 18th International Conference, TSD 2015, Lecture Notes in Computer Science, ISSN 0302-9743, 9302, pp. 71-79, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-24032-9 (url, bibtex)
Kateřina Rysová, Magdaléna Rysová, Eva Hajičová (2015): Topic–Focus Articulation in English Texts on the Basis of Functional Generative Description (technical report). In: (pdf, local PDF, bibtex)
Magdaléna Rysová (2015): Diskurzní konektory v češtině (Od centra k periferii) (PhD thesis). In: (bibtex)
Magdaléna Rysová, Kateřina Rysová (2015): Secondary Connectives in the Prague Dependency Treebank. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 291-299, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (pdf, local PDF, bibtex)
Shadi Saleh (2015): Cross language information retrieval systems. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 1-7, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Shadi Saleh, Feraena Bibyna, Pavel Pecina (2015): CUNI at the CLEF eHealth 2015 Task 2. In: Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, CEUR, Aachen, Germany (url, bibtex)
Ineke Schuurman, Menzo Windhouwer, Oddrun Ohren, Daniel Zeman (2015): CLARIN Concept Registry: the new semantic registry replacing ISOcat. In: CLARIN Annual Conference 2015, pp. 80-83, CLARIN-PL, Wrocław, Poland (pdf, bibtex)
Miloš Stanojević, Amir Kamran, Ondřej Bojar (2015): Results of the WMT15 Tuning Shared Task. In: Proceedings of the 10th Workshop on Machine Translation, pp. 274-281, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (pdf, bibtex)
Miloš Stanojević, Amir Kamran, Philipp Koehn, Ondřej Bojar (2015): Results of the WMT15 Metrics Shared Task. In: Proceedings of the 10th Workshop on Machine Translation, pp. 256-273, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (pdf, bibtex)
Milan Straka, Jan Hajič, Jana Straková, Jan Hajič, jr. (2015): Parsing Universal Dependency Treebanks using Neural Networks and Search-Based Oracle. In: 14th International Workshop on Treebanks and Linguistic Theories (TLT 2015), pp. 208-220, IPIPAN, Warszawa, Poland, ISBN 978-83-63159-18-4 (pdf, local PDF, bibtex)
Roman Sudarikov, Ondřej Bojar (2015): Giving a Sense: A Pilot Study in Concept Annotation from Multiple Resources. In: Proceedings of the 15th conference ITAT 2015: Slovenskočeský NLP workshop (SloNLP 2015), pp. 88-94, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1515120650 (pdf, bibtex)
Roman Sudarikov, Ondřej Bojar (2015): Giving a Sense: A Pilot Study in Concept Annotation from Multiple Resources. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 14-21, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Roman Sudarikov, Petr Fanta, Ondřej Bojar (2015): TeamUFAL: WSD+EL as Document Retrieval. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 350-354, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-40-2 (url, bibtex)
Magda Ševčíková (2015): Morphology within the Multi-Layered Annotation Scenario of the Prague Dependency Treebank. In: Systems and Frameworks for Computational Morphology, Fourth International Workshop, SFCM 2015, Stuttgart, Germany, September 17-18, 2015. Proceedings, pp. 1-26, Springer, Berlin / Heidelberg, ISBN 978-3-319-23978-1 (url, bibtex)
Jana Šindlerová, Eva Fučíková, Zdeňka Urešová (2015): Zero Alignment of Verb Arguments in a Parallel Treebank. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 330-339, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (pdf, local PDF, bibtex)
Aleš Tamchyna, Ondřej Bojar (2015): What a Transfer-Based System Brings to the Combination with PBMT. In: Proceedings of the Fourth Workshop on Hybrid Approaches to Translation (HyTra), pp. 11-20, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-67-9 (bibtex)
Aleš Tamchyna, Ondřej Fiala, Kateřina Veselovská (2015): Czech Aspect-Based Sentiment Analysis: A New Dataset and Preliminary Results. In: Proceedings of the 15th conference ITAT 2015: Slovenskočeský NLP workshop (SloNLP 2015), pp. 95-99, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1515120650 (bibtex)
Aleš Tamchyna, Chris Quirk, Michel Galley (2015): A Discriminative Model for Semantics-to-String Translation. In: Proceedings of the 1st Workshop on Semantics-Driven Statistical Machine Translation (S2MT 2015), pp. 30-36, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-61-7 (bibtex)
Kristýna Tomšů (2015): Morphology as processed by the human brain. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 36-41, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Antonio Toral, Pavel Pecina, Longyue Wang, Josef Genabith (2015): Linguistically-augmented Perplexity-based Data Selection for Language Models. In: Computer Speech and Language, ISSN 0885-2308, vol. 32, no. 1, pp. 11-26 (bibtex)
Zdeňka Urešová, Ondřej Dušek, Eva Fučíková, Jan Hajič, Jana Šindlerová (2015): Bilingual English-Czech Valency Lexicon Linked to a Parallel Corpus. In: Proceedings of the 9th Linguistic Annotation Workshop (LAW IX 2015), pp. 124-128, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-47-1 (url, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Jana Šindlerová (2015): CzEngVallex: Mapping Valency between Languages (technical report). In: (local PDF, bibtex)
Kateřina Veselovská (2015): On the Linguistic Structure of Emotional Meaning in Czech (PhD thesis). In: (local PDF, bibtex)
Jernej Vičič, Vladislav Kuboň (2015): A Comparison of MT Methods for Closely Related Languages: A Case Study on Czech - Slovak and Croatian - Slovenian Language Pairs. In: Text, Speech, and Dialogue: 18th International Conference, TSD 2015, Lecture Notes in Computer Science, ISSN 0302-9743, 9302, pp. 216-224, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-24032-9 (bibtex)
Jonáš Vidra (2015): Implementation of a Search Engine for DeriNet. In: Proceedings of the 15th conference ITAT 2015: Slovenskočeský NLP workshop (SloNLP 2015), pp. 100-106, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1515120650 (bibtex)
Miroslav Vodolán (2015): Knowledge-based Dialog. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 30-35, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (bibtex)
Miroslav Vodolán, Rudolf Kadlec, Jan Kleindienst (2015): Hybrid Dialog State Tracker. In: Proceedings of NIPS 2015 Workshop on Machine Learning for Spoken Language Understanding and Interaction, pp. 1-6, Neural Information Processing Systems Foundation, La Jolla, CA, USA (bibtex)
Daniel Zeman (2015): Slavic Languages in Universal Dependencies. In: Natural Language Processing, Corpus Linguistics, E-learning, pp. 151-163, RAM-Verlag, Lüdenscheid, Germany, ISBN 978-3-942303-32-3 (local PDF, local PDF, bibtex)
Šárka Zikánová, Eva Hajičová, Barbora Hladká, Pavlína Jínová, Jiří Mírovský, Anna Nedoluzhko, Lucie Poláková, Kateřina Rysová, Magdaléna Rysová, Jan Václ (2015): Discourse and Coherence. From the Sentence Structure to Relations in Text. In: , ISBN 978-80-904571-8-8 (bibtex)
Šárka Zikánová, Lucie Poláková, Pavlína Jínová, Anna Nedoluzhko, Magdaléna Rysová, Jiří Mírovský, Eva Hajičová (2015): Zachycení výstavby textu v Pražském závislostním korpusu. In: Slovo a slovesnost, ISSN 0037-7031, 76, pp. 163-197 (bibtex)
Lukáš Žilka, Filip Jurčíček (2015): Incremental LSTM-based Dialog State Tracker. In: IEEE ASRU '15: Proc. IEEE Automatic Speech Recognition and Understanding, pp. 757-762, IEEE, Phoenix, AZ, USA, ISBN 978-1-4799-7291-3 (bibtex)
Lukáš Žilka, Filip Jurčíček (2015): LecTrack: Incremental Dialog State Tracking with Long Short-Term Memory Networks. In: Text, Speech, and Dialogue: 18th International Conference, TSD 2015, Lecture Notes in Computer Science, ISSN 0302-9743, 9302, pp. 189-197, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-24032-9 (url, bibtex)
Andrea Abel, Katrin Wisniewski, Lionel Nicolas, Adriane Boyd, Jirka Hana, Detmar Meurers (2014): A Trilingual Learner Corpus illustrating European Reference Levels. In: RiCOGNIZIONI. Rivista di lingue, letterature e culture moderne, ISSN 2384-8987, 2, pp. 111-126 (url, bibtex)
Mohammed Attia, Pavel Pecina, Antonio Toral, Josef Genabith (2014): A corpus-based finite-state morphological toolkit for contemporary Arabic. In: Journal of Logic and Computation, ISSN 0955-792X, vol. 42, no. 2, pp. 455-472 (url, bibtex)
Petra Barančíková (2014): Parmesan: Meteor without Paraphrases with Paraphrased References. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 355-361, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (bibtex)
Petra Barančíková, Rudolf Rosa, Aleš Tamchyna (2014): Improving Evaluation of English-Czech MT through Paraphrasing. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 596-601, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, bibtex)
Petra Barančíková, Aleš Tamchyna (2014): Machine Translation within One Language as a Paraphrasing Technique. In: Proceedings of the 14th conference ITAT 2014, pp. 1-6, Institute of Computer Science AS CR, Praha, Czechia, ISBN 978-80-87136-18-8 (bibtex)
Eduard Bejček, Václava Kettnerová, Markéta Lopatková (2014): Automatic Mapping Lexical Resources: A Lexical Unit as the Keystone. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2826-2832, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (url, local PDF, local PDF, bibtex)
Ondřej Bojar, Christian Buck, Christian Federmann, Barry Haddow, Philipp Koehn, Johannes Leveling, Christof Monz, Pavel Pecina, Matt Post, Hervé Saint-Amand, Radu Soricut, Lucia Specia, Aleš Tamchyna (2014): Findings of the 2014 Workshop on Statistical Machine Translation. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 12-58, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (bibtex)
Ondřej Bojar, Vojtěch Diatka, Pavel Rychlý, Pavel Straňák, Vít Suchomel, Aleš Tamchyna, Daniel Zeman (2014): HindEnCorp – Hindi-English and Hindi-only Corpus for Machine Translation. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 3550-3555, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, local PDF, bibtex)
Ondřej Bojar, Daniel Zeman (2014): Czech Machine Translation in the project CzechMATE. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 101, pp. 71-96 (pdf, local PDF, local PDF, bibtex)
Adriane Boyd, Jirka Hana, Lionel Nicolas, Detmar Meurers, Katrin Wisniewski, Andrea Abel, Karin Schöne, Barbora Štindlová, Chiara Vettori (2014): The MERLIN corpus: Learner language and the CEFR. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 1281-1288, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (bibtex)
Koenraad De Smedt, Erhard Hinrichs, Detmar Meurers, Inguna Skadiņa, Bolette Sandford Pedersen, Costanza Navarretta, Núria Bel, Krister Lindén, Markéta Lopatková, Jan Hajič, Gisle Andersen, Przemysław Lenkiewicz (2014): CLARA: A New Generation of Researchers in Common Language Resources and Their Applications. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2166-2174, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (local PDF, bibtex)
Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, Daniel Zeman (2014): Machine Translation of Medical Texts in the Khresmoi Project. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 221-228, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, local PDF, local PDF, bibtex)
Ondřej Dušek, Jan Hajič, Zdeňka Urešová (2014): Verbal Valency Frame Detection and Selection in Czech and English. In: The 2nd Workshop on EVENTS: Definition, Detection, Coreference, and Representation, pp. 6-11, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-14-3 (pdf, local PDF, bibtex)
Ondřej Dušek, Ondřej Plátek, Lukáš Žilka, Filip Jurčíček (2014): Alex: Bootstrapping a Spoken Dialogue System for a New Domain by Real Users. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 79-83, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-21-1 (pdf, bibtex)
Petra Galuščáková, Martin Kruliš, Jakub Lokoč, Pavel Pecina (2014): CUNI at MediaEval 2014 Search and Hyperlinking Task: Visual and Prosodic Features in Hyperlinking. In: Working Notes Proceedings of the MediaEval 2014 Workshop, CEUR-WS.org, Aachen, Germany (bibtex)
Petra Galuščáková, Pavel Pecina (2014): CUNI at MediaEval 2014 Search and Hyperlinking Task: Search Task Experiments. In: Working Notes Proceedings of the MediaEval 2014 Workshop, CEUR-WS.org, Aachen, Germany (bibtex)
Petra Galuščáková, Pavel Pecina (2014): Experiments with Segmentation Strategies for Passage Retrieval in Audio-Visual Documents. In: ICMR '14 Proceedings of International Conference on Multimedia Retrieval, pp. 217-225, ACM, New York, NY, USA, ISBN 978-1-4503-2782-4 (url, local PDF, bibtex)
Lorraine Goeuriot, Liadh Kelly, Wei Li, João Palotti, Pavel Pecina, Guido Zuccon, Allan Hanbury, Gareth J.F. Jones, Henning Müller (2014): ShARe/CLEF eHealth Evaluation Lab 2014, Task 3: User-centred health information retrieval. In: Working Notes for CLEF 2014 Conference, pp. 43-61, CEUR-WS.org, Aachen, Germany (bibtex)
Nathan David Green, Septina Dian Larasati (2014): Votter Corpus: A Corpus of Social Polling Language. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 3693-3697, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (bibtex)
Eva Hajičová (2014): Three dimensions of the so-called "interoperability" of annotation schemes. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 4559-4564, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (bibtex)
Eva Hajičová (2014): Charles Fillmore has passed away. In: Linguistica Pragensia, ISSN 0862-8432, vol. 24, no. 1, pp. 73-75 (bibtex)
Jirka Hana, Barbora Hladká, Ivana Lukšová (2014): Sentence diagrams: their evaluation and combination. In: Proceedings of The 8th Linguistic Annotation Workshop (LAW-VIII), pp. 38-47, Dublin City University (DCU), Dublin, Ireland, ISBN 978-1-941643-29-7 (url, bibtex)
Jirka Hana, Alexandr Rosen, Barbora Štindlová, Jan Štěpánek (2014): Building a learner corpus. In: Language Resources and Evaluation, ISSN 1574-020X, 47, pp. 741-752 (url, bibtex)
Jaroslava Hlaváčová (2014): Vyjádření intenzity slovesného děje pomocí předpon. In: Korpusová lingvistika - abstrakty, pp. 66-67, Ústav Českého národního korpusu, Praha, Czechia (local PDF, bibtex)
Jaroslava Hlaváčová, Anna Nedoluzhko (2014): Productive verb prefixation patterns. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 101, pp. 111-122 (pdf, bibtex)
Irena Holubová, Tomáš Knap, Vincent Kríž, Martin Nečaský, Barbora Hladká (2014): INTLIB - an INTelligent LIBrary. In: Proceedings of the Dateso 2014 Annual International Workshop on DAtabases, TExts, Specifications and Objects, pp. 13-24, Czech Technical University in Prague, Faculty of Information Technology, Praha, Czechia, ISBN 978-80-01-05482-6 (url, bibtex)
Jinho Choi, Marie-Catherine de Marneffe, Timothy Dozat, Filip Ginter, Yoav Goldberg, Jan Hajič, Christopher Manning, Ryan McDonald, Joakim Nivre, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman (2014): Universal Dependencies (Electronic). (url)
Bushra Jawaid, Ondřej Bojar (2014): Two-Step Machine Translation with Lattices. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 682-686, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (local PDF, bibtex)
Bushra Jawaid, Amir Kamran, Ondřej Bojar (2014): English to Urdu Statistical Machine Translation: Establishing a Baseline. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pp. 37-42, Dublin City University and Association for Computational Linguistics, Dublin, Ireland, ISBN 978-1-941643-26-6 (bibtex)
Bushra Jawaid, Amir Kamran, Ondřej Bojar (2014): A Tagged Corpus and a Tagger for Urdu. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2938-2943, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (local TXT, local PDF, bibtex)
Pavlína Jínová, Lucie Poláková, Jiří Mírovský (2014): Sentence Structure and Discourse Structure (Possible parallels). In: Dependency Linguistics. Recent advances in linguistic theory using dependency structures, pp. 53-74, John Benjamins Publishing Company, Amsterdam, The Netherlands, ISBN 978-9027255983 (url, bibtex)
Filip Jurčíček, Ondřej Dušek, Ondřej Plátek (2014): A Factored Discriminative Spoken Language Understanding for Spoken Dialogue Systems. In: Text, Speech and Dialogue: 17th International Conference, TSD, Lecture Notes in Computer Science, ISSN 0302-9743, 8655, pp. 579-586, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-319-10815-5 (url, bibtex)
Filip Jurčíček, Ondřej Dušek, Ondřej Plátek, Lukáš Žilka (2014): Alex: A Statistical Dialogue Systems Framework. In: Text, Speech and Dialogue: 17th International Conference, TSD, Lecture Notes in Computer Science, ISSN 0302-9743, 8655, pp. 587-594, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-319-10815-5 (pdf, bibtex)
Rudolf Kadlec, Jindřich Libovický, Jan Macek, Jan Kleindienst (2014): IBM’s Belief Tracker: Results On Dialog State Tracking Challenge Datasets. In: Proceedings of the Workshop on Dialogue in Motion, pp. 10-18, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-81-7 (bibtex)
Rudolf Kadlec, Miroslav Vodolán, Jindřich Libovický, Jan Macek, Jan Kleindienst (2014): Knowledge-based Dialog State Tracking. In: Proceedings of the 2014 IEEE Spoken Language Technology Workshop (SLT), pp. 348-353, IEEE, Lake Tahoe, NV, USA, ISBN 9781479971305 (bibtex)
Václava Kettnerová (2014): Lexikálně-sémantické konverze ve valenčním slovníku. In: , ISBN 978-80-246-2623-9 (bibtex)
Václava Kettnerová, Markéta Lopatková (2014): Reflexive Verbs in a Valency Lexicon: The Case of Czech Reflexive Morphemes. In: Proceedings of the XVI EURALEX International Congress: The User in Focus, pp. 1007-1023, EURAC research, Bolzano/Bozen, Italy, ISBN 978-88-88906-97-3 (bibtex)
Václava Kettnerová, Markéta Lopatková, Jarmila Panevová (2014): An Interplay between Valency Information and Reflexivity. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 102, pp. 105-126 (url, local PDF, bibtex)
Natalia Klyueva, Vladislav Kuboň (2014): Automatic Valency Derivation for Related Languages. In: Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference, FLAIRS 2014, pp. 437-442, AAAI Press, Palo Alto, CA, USA, ISBN 978-1-57735-658-5 (local PDF, bibtex)
Veronika Kolářová (2014): Preference v souvýskytu aktantů u českých substantiv mluvení. In: Korpus – gramatika – axiologie, ISSN 1804-137X, vol. 5, no. 10, pp. 23-40 (local PDF, bibtex)
Veronika Kolářová (2014): Valence vybraných typů deverbativních substantiv ve valenčním slovníku PDT-Vallex (technical report). In: (url, local PDF, bibtex)
Veronika Kolářová (2014): Nominalizované struktury se dvěma aktanty ve formě bezpředložkového genitivu. In: Naše řeč, ISSN 0027-8203, vol. 97, no. 4-5, pp. 286-299 (local PDF, bibtex)
Veronika Kolářová (2014): Special valency behavior of Czech deverbal nouns. In: Noun Valency, pp. 19-60, John Benjamins Publishing Company, Amsterdam, The Netherlands, ISBN 9789027259233 (bibtex)
Matěj Korvas, Ondřej Plátek, Ondřej Dušek, Lukáš Žilka, Filip Jurčíček (2014): Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 4423-4427, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (url, bibtex)
Vincent Kríž, Barbora Hladká, Martin Nečaský, Jan Dědek (2014): Statistical Recognition of References in Czech Court Decisions. In: 13th Mexican International Conference on Artificial Intelligence, MICAI 2014, Tuxtla Gutiérrez, Mexico, November 16-22, 2014. Proceedings, Part I, pp. 51-61, Springer International Publishing, Switzerland, ISBN 978-3-319-13646-2 (bibtex)
Vincent Kríž, Barbora Hladká, Martin Nečaský, Tomáš Knap (2014): Data Extraction Using NLP Techniques and Its Transformation to Linked Data. In: 13th Mexican International Conference on Artificial Intelligence, MICAI 2014, Tuxtla Gutiérrez, Mexico, November 16-22, 2014. Proceedings, Part I, pp. 113-124, Springer International Publishing, Switzerland, ISBN 978-3-319-13646-2 (bibtex)
Oldřich Krůza, Vladislav Kuboň (2014): Automatic Recognition of Clauses. In: International Journal of Computational Linguistics and Applications, ISSN 0976-0962, vol. 5, no. 1, pp. 125-138 (url, bibtex)
Vladislav Kuboň, Jernej Vičič (2014): A Comparison of MT Methods for Closely Related Languages: a Case Study on Czech – Slovak Language Pair. In: LT4CloseLang, pp. 92-98, The Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-96-1 (url, bibtex)
Jindřich Libovický, Pavel Pecina (2014): Tolerant BLEU: a Submission to the WMT14 Metrics Task. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 409-413, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (local PDF, bibtex)
Markéta Lopatková, Jiří Mírovský, Vladislav Kuboň (2014): Gramatické závislosti vs. koordinace z pohledu redukční analýzy. In: Proceedings of the 14th conference ITAT 2014, pp. 61-67, Institute of Computer Science AS CR, Praha, Czechia, ISBN 978-80-87136-18-8 (pdf, bibtex)
Matouš Macháček, Ondřej Bojar (2014): Results of the WMT14 Metrics Shared Task. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 293-301, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, bibtex)
Daniela Majchráková, Ondřej Dušek, Jan Hajič, Agáta Karčová, Radovan Garabík (2014): Semi-automatic Detection of Multiword Expressions in the Slovak Dependency Treebank. In: Proceedings of the First International Conference "Computational Linguistics in Bulgaria", pp. 32-39, Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences, Sofija, Bulgaria (pdf, local PDF, bibtex)
David Mareček, Zdeněk Žabokrtský (2014): Dealing with Function Words in Unsupervised Dependency Parsing. In: 15th International Conference on Computational Linguistics and Intelligent Text Processing, pp. 250-261, Springer, Berlin / Heidelberg, ISBN 978-3-642-54905-2 (local PDF, bibtex)
Marie Mikulová (2014): Annotation on the tectogrammatical level. Additions to annotation manual (with respect to PDTSC and PCEDT) (technical report). In: (bibtex)
Marie Mikulová (2014): Semantic Representation of Ellipsis in the Prague Dependency Treebanks. In: Proceedings of the Twenty-Sixth Conference on Computational Linguistics and Speech Processing ROCLING XXVI (2014), pp. 125-138, Association for Computational Linguistics and Chinese Language Processing (ACLCLP), Taipei, Taiwan, ISBN 978-957-30792-7-9 (local PDF, bibtex)
Jiří Mírovský, Eva Hajičová (2014): What can linguists learn from some simple statistics on annotated treebanks. In: Proceedings of 13th International Workshop on Treebanks and Linguistic Theories (TLT13), pp. 279-284, University of Tübingen, Tübingen, Germany, ISBN 978-3-9809183-9-8 (local PDF, local PDF, bibtex)
Jiří Mírovský, Pavlína Jínová, Lucie Poláková (2014): Discourse Relations in the Prague Dependency Treebank 3.0. In: The 25th International Conference on Computational Linguistics (Coling 2014), Proceedings of the Conference System Demonstrations, pp. 34-38, Dublin City University (DCU), Dublin, Ireland, ISBN 978-1-941643-27-3 (local PDF, local PDF, local PDF, bibtex)
Yusuke Miyao, Stephan Oepen, Daniel Zeman (2014): In-House: An Ensemble of Pre-Existing Off-the-Shelf Parsers. In: Proceedings of the Eighth International Workshop on Semantic Evaluation (SemEval 2014), pp. 335-340, Dublin City University, Dublin, Ireland, ISBN 978-1-937284-96-1 (url, local PDF, bibtex)
Jakub Mlynář (2014): Malach jako brána do minulosti skrze vyprávění přeživších. In: Židovské listy, vol. 2014, no. 10, pp. 14-15 (bibtex)
Anna Nedoluzhko, Jiří Mírovský, Eva Fučíková, Jiří Pergler (2014): Annotation of coreference in Prague Czech-English Dependency Treebank (technical report). In: (url, bibtex)
Anna Nedoluzhko, Anna Schwarz (Khoroshkina) (2014): "VCHERA NASOCHINYALSYA VOROH STROK": PRODUCTIVE CIRCUMFIXAL INTENSIFYING PATTERNS IN RUSSIAN. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, 13 (20), pp. 466-477 (pdf, bibtex)
Anna Nedoluzhko, Svetlana Toldova, Anna Roitberg, Alina Ladygina, Maria D. Vasilyeva, Ilja Azerkovich, Matvej Kurzukov, Anastasija Ivanova, Julia Grishina (2014): RU-EVAL-2014: EVALUATING ANAPHORA AND COREFERENCE RESOLUTION FOR RUSSIAN. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, 13 (20), pp. 681-694 (local PDF, bibtex)
Michal Novák, Zdeněk Žabokrtský (2014): Cross-lingual Coreference Resolution of Pronouns. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pp. 14-24, Dublin City University and Association for Computational Linguistics, Dublin, Ireland, ISBN 978-1-941643-26-6 (pdf, bibtex)
Stephan Oepen, Marco Kuhlmann, Daniel Zeman, Yusuke Miyao, Dan Flickinger, Jan Hajič, Angelina Ivanova, Yi Zhang (2014): SemEval 2014 Task 8: Broad-Coverage Semantic Dependency Parsing. In: Proceedings of the Eighth International Workshop on Semantic Evaluation (SemEval 2014), pp. 63-72, Dublin City University, Dublin, Ireland, ISBN 978-1-937284-96-1 (local PDF, bibtex)
Jarmila Panevová (2014): Koordinace versus determinace (Forma nebo význam?). In: Korpus – gramatika – axiologie, ISSN 1804-137X, vol. 4, no. 10, pp. 47-56 (bibtex)
Jarmila Panevová (2014): Contribution of Valency to the Analysis of Language. In: Noun Valency, pp. 1-18, John Benjamins Publishing Company, Amsterdam, The Netherlands, ISBN 9789027259233 (bibtex)
Jarmila Panevová, Eva Hajičová, Václava Kettnerová, Markéta Lopatková, Marie Mikulová, Magda Ševčíková (2014): Mluvnice současné češtiny 2, Syntax na základě anotovaného korpusu. In: , ISBN 978-80-246-2497-6 (bibtex)
Jarmila Panevová, Magda Ševčíková (2014): Delimitation of information between grammatical rules and lexicon. In: Dependency Linguistics. Recent advances in linguistic theory using dependency structures, pp. 33-52, John Benjamins Publishing Company, Amsterdam, The Netherlands, ISBN 978-90-272-5598-3 (bibtex)
Pavel Pecina, Ondřej Dušek, Lorraine Goeuriot, Jan Hajič, Jaroslava Hlaváčová, Gareth J.F. Jones, Liadh Kelly, Johannes Leveling, David Mareček, Michal Novák, Martin Popel, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová (2014): Adaptation of machine translation for multilingual information retrieval in medical domain. In: Artificial Intelligence in Medicine, ISSN 0933-3657, vol. 61, no. 3, pp. 165-185 (url, bibtex)
Martin Plátek, Markéta Lopatková, Dana Pardubská (2014): On the Complexity of Reductions by Restarting Automata. In: NCMA, pp. 207-222, Österreichische Computer Gesellschaft, Wien, Austria, ISBN 978-3-85403-304-2 (local PDF, bibtex)
Martin Plátek, Dana Pardubská, Markéta Lopatková (2014): On Minimalism of Analysis by Reduction by Restarting Automata. In: 19th International Conference on Formal Grammar 2014, Lecture Notes in Computer Science, ISSN 0302-9743, 8612, pp. 155-170, Springer, Heidelberg, Germany, ISBN 978-3-662-44121-3 (local PDF, bibtex)
Ondřej Plátek (2014): Speech recognition using Kaldi (masters thesis). In: (bibtex)
Ondřej Plátek, Filip Jurčíček (2014): Integration of an On-line Kaldi Speech Recogniser to the Alex Dialogue Systems Framework. In: Text, Speech and Dialogue: 17th International Conference, TSD, Lecture Notes in Computer Science, ISSN 0302-9743, 8655, pp. 603-610, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-319-10815-5 (url, bibtex)
Ondřej Plátek, Filip Jurčíček (2014): Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 108-112, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-21-1 (url, bibtex)
Natalia Pletneva, Zdeňka Urešová, Jean-Jacques Altman, Nicolas Postel Vinay, Patrice Degoulet, Jan Hajič, Célia Boyer (2014): Observations and Lessons Learnt from Non Health Professionals Evaluating a Health Search Engine. In: 25th Conference of the European Federation of Medical Informatics (MIE), pp. 940-944, IOS PRESS, Amsterdam, Netherlands, ISBN 978-1-61499-432-9 (local DOCX, bibtex)
Lucie Poláková (2014): K možnostem korpusového zpracování nadvětných jevů. In: Naše řeč, ISSN 0027-8203, 4-5/2014, pp. 241-258 (url, bibtex)
Lucie Poláková, Pavlína Jínová, Jiří Mírovský (2014): Genres in the Prague Discourse Treebank. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 1320-1326, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (url, local PDF, bibtex)
Loganathan Ramasamy (2014): Parsing under-resourced languages: Cross-lingual transfer strategies for Indian languages (PhD thesis). In: (local PDF, bibtex)
Loganathan Ramasamy, David Mareček, Zdeněk Žabokrtský (2014): Multilingual Dependency Parsing: Using Machine Translated Texts instead of Parallel Corpora. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 102, pp. 93-104 (pdf, bibtex)
Loganathan Ramasamy, Zdeněk Žabokrtský (2014): Cross-lingual dependency transfer with harmonized Indian language treebanks. In: Proceedings of 13th International Workshop on Treebanks and Linguistic Theories (TLT13), pp. 160-171, University of Tübingen, Tübingen, Germany, ISBN 978-3-9809183-9-8 (pdf, bibtex)
Rudolf Rosa (2014): Depfix, a Tool for Automatic Rule-based Post-editing of SMT. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 102, pp. 47-56 (pdf, local PDF, local PDF, local PDF, bibtex)
Rudolf Rosa (2014): Fairytale Child Chatbot. In: Proceedings of the 14th conference ITAT 2014, pp. 79-84, Institute of Computer Science AS CR, Praha, Czechia, ISBN 978-80-87136-18-8 (local PDF, local PDF, bibtex)
Rudolf Rosa (2014): Depfix Manual (technical report). In: (local HTML, local PDF, bibtex)
Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman, Zdeněk Žabokrtský (2014): HamleDT 2.0: Thirty Dependency Treebanks Stanfordized. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2334-2341, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, local PDF, bibtex)
Kateřina Rysová (2014): O slovosledu z komunikačního pohledu. In: , ISBN 978-80-904571-5-7 (url, local PDF, bibtex)
Kateřina Rysová (2014): On the word order of Actor and Patient in Czech. In: Dependency Linguistics. Recent advances in linguistic theory using dependency structures, pp. 253-271, John Benjamins Publishing Company, Amsterdam, The Netherlands, ISBN 978-9027255983 (url, bibtex)
Kateřina Rysová, Jiří Mírovský (2014): Valency and Word Order in Czech ― A Corpus Probe. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 975-980, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (local PDF, bibtex)
Kateřina Rysová, Karel Oliva (2014): Ohlédnutí za jubilejním 40. ročníkem Olympiády v českém jazyce. In: Český jazyk a literatura, ISSN 0009-0786, vol. 65, no. 2, pp. 53-59 (bibtex)
Magdaléna Rysová, Jiří Mírovský (2014): Use of Coreference in Automatic Searching for Multiword Discourse Markers in the Prague Dependency Treebank. In: Proceedings of The 8th Linguistic Annotation Workshop (LAW-VIII), pp. 11-19, Dublin City University (DCU), Dublin, Ireland, ISBN 978-1-941643-29-7 (url, local PDF, local PDF, local PDF, bibtex)
Magdaléna Rysová, Kateřina Rysová (2014): The Centre and Periphery of Discourse Connectives. In: Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing, pp. 452-459, Department of Linguistics, Faculty of Arts, Chulalongkorn University, Bangkok, Thailand, ISBN 978-616-551-887-1 (url, local PDF, bibtex)
Shadi Saleh, Pavel Pecina (2014): CUNI at the ShARe/CLEF eHealth Evaluation Lab 2014. In: Working Notes for CLEF 2014 Conference, pp. 226-235, CEUR-WS.org, Aachen, Germany (bibtex)
Petr Sgall, Jarmila Panevová (2014): Jak psát a jak nepsat česky. In: , ISBN 978-80-246-2505-8 (bibtex)
Jana Straková, Milan Straka, Jan Hajič (2014): Open-Source Tools for Morphology, Lemmatization, POS Tagging and Named Entity Recognition. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 13-18, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-00-6 (pdf, local PDF, bibtex)
Magda Ševčíková (2014): Building a database of Czech derived words. In: SLE - 47th Annual Meeting of the Societas Linguistica Europaea, pp. 132-133, Societas Linguistica Europaea, Poznań, Poland (pdf, bibtex)
Magda Ševčíková (2014): Kvalitativní a nekvalitativní význam substantiv s příponou -ost. In: Korpus – gramatika – axiologie, ISSN 1804-137X, 9, pp. 41-55 (bibtex)
Magda Ševčíková (2014): Zjišťování slovotvorné produktivity z korpusových dat: přípony odvozující názvy vlastností. In: Naše řeč, ISSN 0027-8203, vol. 97, no. 4-5, pp. 228-240 (bibtex)
Magda Ševčíková, Zdeněk Žabokrtský (2014): Word-Formation Network for Czech. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 1087-1093, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (url, local PDF, bibtex)
Jana Šindlerová, Zdeňka Urešová, Eva Fučíková (2014): Resources in Conflict: A Bilingual Valency Lexicon vs. a Bilingual Treebank vs. a Linguistic Theory. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2490-2494, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, bibtex)
Jana Šindlerová, Kateřina Veselovská, Jan Hajič, jr. (2014): Tracing Sentiments: Syntactic and Semantic Features in a Subjectivity Lexicon. In: Proceedings of the XVI EURALEX International Congress: The User in Focus, pp. 405-414, EURAC research, Bolzano/Bozen, Italy, ISBN 978-88-88906-97-3 (bibtex)
Eduard Šubert, Ondřej Bojar (2014): Twitter Crowd Translation -- Design and Objectives. In: Translating and the Computer 36, pp. 217-227, Editions Tradulex; AsLing, Geneva, Switzerland, ISBN 9782970073628 (url, bibtex)
Aleš Tamchyna, Fabienne Braune, Alexander Fraser, Marine Carpuat, Hal Daumé III, Chris Quirk (2014): Integrating a Discriminative Classifier into Phrase-based and Hierarchical Decoding. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 101, pp. 29-41 (pdf, bibtex)
Aleš Tamchyna, Martin Popel, Rudolf Rosa, Ondřej Bojar (2014): CUNI in WMT14: Chimera Still Awaits Bellerophon. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 195-200, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, local PDF, local PDF, bibtex)
Zdeňka Urešová, Ondřej Dušek, Jan Hajič, Pavel Pecina (2014): Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 3244-3247, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (bibtex)
Zdeňka Urešová, Jan Hajič, Ondřej Bojar (2014): Comparing Czech and English AMRs. In: Proceedings of Workshop on Lexical and Grammatical Resources for Language Processing (LG-LP 2014, at Coling 2014), pp. 55-64, Association for Computational Linguistics and Dublin City University, Dublin, Ireland, ISBN 978-1-873769-44-7 (pdf, local PDF, bibtex)
Dušan Variš, Ondřej Bojar (2014): Japonsko-český strojový překlad. In: Proceedings of the 14th conference ITAT 2014, pp. 1-8, Institute of Computer Science AS CR, Praha, Czechia, ISBN 978-80-87136-18-8 (bibtex)
Anna Vernerová, Václava Kettnerová, Markéta Lopatková (2014): To pay or to get paid: Enriching a Valency Lexicon with Diatheses. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2452-2459, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (local PDF, bibtex)
Kateřina Veselovská (2014): On Linguistic Structure of Evaluative Meaning. In: Language Use and Linguistic Structure, pp. 269-277, Univerzita Palackého v Olomouci, Olomouc, Czechia, ISBN 978-80-244-4060-6 (bibtex)
Kateřina Veselovská (2014): Fear and Trembling: Annotating Emotions in Czech Holocaust Testimonies. In: Proceedings of the 5th International Workshop on Emotion, Social Signals, Sentiment & Linked Open Data, pp. 41-45, ELRA, Reykjavík, Iceland (bibtex)
Kateřina Veselovská (2014): 25. evropská letní škola jazyka, logiky a informace (ESSLLI 2013). In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240 (bibtex)
Kateřina Veselovská, Jan Hajič, jr., Jana Šindlerová (2014): Subjectivity Lexicon for Czech: Implementation and Improvements. In: Journal for Language Technology and Computational Linguistics, ISSN 2190-6858, vol. 29, no. 1, pp. 47-61 (pdf, bibtex)
Kateřina Veselovská, Jan Mašek, Vladislav Kuboň (2014): Sentiment Detection and Annotation in a Treebank. In: Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference, FLAIRS 2014, pp. 1-2, AAAI Press, Palo Alto, CA, USA, ISBN 978-1-57735-658-5 (bibtex)
Kateřina Veselovská, Aleš Tamchyna (2014): ÚFAL: Using Hand-crafted Rules in Aspect Based Sentiment Analysis on Parsed Data. In: Proceedings of the Eighth International Workshop on Semantic Evaluation (SemEval 2014), pp. 694-698, Dublin City University, Dublin, Ireland, ISBN 978-1-937284-96-1 (bibtex)
Nianwen Xue, Ondřej Bojar, Jan Hajič, Martha Palmer, Zdeňka Urešová, Xiuhong Zhang (2014): Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 1765-1772, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, local PDF, local PDF, bibtex)
Daniel Zeman, Ondřej Dušek, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič (2014): HamleDT: Harmonized Multi-Language Dependency Treebank. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 48, no. 4, pp. 601-637 (url, local PDF, bibtex)
Katsiaryna Aharodnik, Marco Chang, Anna Feldman, Jirka Hana (2013): Automatic Identification of Learners’ Language Background based on their Writing in Czech. In: Proceedings of the 6th International Joint Conference on Natural Language Processing, pp. 1428-1436, Asian Federation of Natural Language Processing, Nagoya, Japan, ISBN 978-4-9907348-0-0 (bibtex)
Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, Khalid Choukri, Sarah Cruchet, Hamish Cunningham, Jan Dědek, Ljiljana Dolamic, René Donner, Sebastian Dungs, Ivan Eggel, Antonio Foncubierta, Norbert Fuhr, Adam Funk, Alba García Seco de Herrera, Arnaud Gaudinat, Georgi Georgiev, Julien Gobeill, Lorraine Goeuriot, Paz Gomez, Mark A. Greenwood, Manfred Gschwandtner, Allan Hanbury, Jan Hajič, Jaroslava Hlaváčová, Markus Holzer, Gareth J.F. Jones, Blanca Jordán, Matthias Jordan, Klemens Kaderk, Franz Kainberger, Liadh Kelly, Sascha Kriewel, Marlene Kritz, Georg Langs, Nolan Lawson, Dimitrios Markonis, Iván Martínez, Vassil Momtchev, Alexandre Masselot, Hélène Mazo, Henning Müller, Pavel Pecina, Konstantin Pentchev, Deyan Peychev, Natalia Pletneva, Diana Pottecher, Angus Roberts, Patrick Ruch, Matthias Samwald, Priscille Schneller, Veronika Stefanov, Miguel Angel Tinte, Zdeňka Urešová, Alejandro Vargas, Dina Vishnyakova (2013): Khresmoi - multilingual semantic search of medical text and images. In: MEDINFO 2013 - Proceedings of the 14th World Congress on Medical and Health Informatics, pp. 1266-1266, IOS Press, Amsterdam, Netherlands, ISBN 978-1-61499-288-2 (bibtex)
Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, Khalid Choukri, Sarah Cruchet, Hamish Cunningham, Jan Dědek, Ljiljana Dolamic, René Donner, Ondřej Dušek, Sebastian Dungs, Ivan Eggel, Antonio Foncubierta, Norbert Fuhr, Adam Funk, Alba García Seco de Herrera, Arnaud Gaudinat, Georgi Georgiev, Julien Gobeill, Lorraine Goeuriot, Paz Gomez, Mark A. Greenwood, Manfred Gschwandtner, Allan Hanbury, Jan Hajič, Jaroslava Hlaváčová, Markus Holzer, Gareth J.F. Jones, Blanca Jordán, Matthias Jordan, Klemens Kaderk, Franz Kainberger, Liadh Kelly, Sascha Kriewel, Marlene Kritz, Georg Langs, Nolan Lawson, Johannes Leveling, David Mareček, Dimitrios Markonis, Iván Martínez, Vassil Momtchev, Alexandre Masselot, Hélène Mazo, Henning Müller, Michal Novák, Johann Petrak, João Palotti, Pavel Pecina, Konstantin Pentchev, Deyan Peychev, Natalia Pletneva, Martin Popel, Diana Pottecher, Angus Roberts, Rudolf Rosa, Patrick Ruch, Alexander Sachs, Matthias Samwald, Priscille Schneller, Veronika Stefanov, Aleš Tamchyna, Miguel Angel Tinte, Zdeňka Urešová, Alejandro Vargas, Dina Vishnyakova (2013): Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In: Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, pp. 31-34, Microsoft Research, Cambridge, UK (url, local PDF, bibtex)
Petra Barančíková (2013): Lexical Paraphrasing for Improvement of MT Evaluation. In: WDS'13 Proceedings of Contributed Papers, pp. 7-11, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-250-4 (bibtex)
Eduard Bejček, Pavel Straňák, Pavel Pecina (2013): Syntactic Identification of Occurrences of Multiword Expressions in Text using a Lexicon with Dependency Structures. In: The 9th Workshop on Multiword Expressions (MWE 2013), pp. 106-115, Association for Computational Linguistics, Atlanta, Georgia, USA, ISBN 978-1-937284-47-3 (pdf, local PDF, local ZIP, local PDF, bibtex)
Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, Daniel Zeman (2013): Tools for Machine Translation Quality Inspection (technical report). In: (url, local PDF, local PDF, bibtex)
Karel Bílek, Natalia Klyueva, Vladislav Kuboň (2013): Exploiting Machine Learning for Automatic Semantic Feature Assignment. In: Proceedings of the Twenty-Sixth International Florida Artificial Intelligence Research Society Conference, FLAIRS 2013, pp. 297-302, AAAI Press, Palo Alto, California, ISBN 978-1-57735-605-9 (bibtex)
Karel Bílek, Daniel Zeman (2013): CUni Multilingual Matrix in the WMT 2013 Shared Task. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 85-91, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-57-2 (pdf, local PDF, local PDF, bibtex)
Ondřej Bojar, Christian Buck, Chris Callison-Burch, Christian Federmann, Barry Haddow, Philipp Koehn, Christof Monz, Matt Post, Radu Soricut, Lucia Specia (2013): Findings of the 2013 Workshop on Statistical Machine Translation. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 1-44, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-57-2 (url, bibtex)
Ondřej Bojar, Matouš Macháček, Aleš Tamchyna, Daniel Zeman (2013): Scratching the Surface of Possible Translations. In: Text, Speech and Dialogue: 16th International Conference, TSD 2013. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 8082, pp. 465-474, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-40584-6 (local PDF, bibtex)
Ondřej Bojar, Rudolf Rosa, Aleš Tamchyna (2013): Chimera – Three Heads for English-to-Czech Translation. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 92-98, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-57-2 (url, local PDF, local PDF, bibtex)
Ondřej Bojar, Aleš Tamchyna (2013): The Design of Eman, an Experiment Manager. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 99, pp. 39-58 (pdf, bibtex)
Silvie Cinková, Martin Holub, Ema Krejčová, Lenka Smejkalová (2013): Rule-Based Extraction of English Verb Collocates from a Dependency-Parsed Corpus. In: Proceedings of the Second International Conference on Dependency Linguistics, Depling 2013, pp. 60-67, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-240-5 (bibtex)
Thành Long Dương, Steven Bird, Paul Cook, Pavel Pecina (2013): Increasing the quality and quantity of source language data for unsupervised cross-lingual POS tagging. In: Proceedings of the 6th International Joint Conference on Natural Language Processing, pp. 1243-1249, Asian Federation of Natural Language Processing, Nagoya, Japan, ISBN 978-4-9907348-0-0 (pdf, bibtex)
Thành Long Dương, Steven Bird, Paul Cook, Pavel Pecina (2013): Simpler unsupervised POS tagging with bilingual projections. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 634-639, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-50-3 (pdf, bibtex)
Ondřej Dušek (2013): Towards a Truly Statistical Natural Language Generator for Spoken Dialogues. In: WDS'13 Proceedings of Contributed Papers, pp. 12-17, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-250-4 (bibtex)
Ondřej Dušek, Filip Jurčíček (2013): Robust Multilingual Statistical Morphological Generation Models. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp. 158-164, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-53-4 (pdf, bibtex)
Maria Eskevich, Gareth J.F. Jones, Robin Aly, Roeland Ordelman, Shu Chen, Danish Nadeem, Camille Guinaudeau, Guillaume Gravier, Pascale Sébillot, Tom De Nies, Pedro Debevere, Rik Van de Walle, Petra Galuščáková, Pavel Pecina, Martha A. Larson (2013): Multimedia Information Seeking through Search and Hyperlinking. In: Conference ICMR'13 International Conference on Multimedia Retrieval, pp. 287-294, ACM, New York, NY, USA, ISBN 978-1-4503-2033-7 (url, bibtex)
Franky (2013): A Rule-based Approach for Karmina Generation. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp. 24-31, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-53-4 (bibtex)
Petra Galuščáková, Pavel Pecina (2013): CUNI at MediaEval 2013 Search and Hyperlinking Task. In: Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, CEUR-WS.org, Aachen, Germany (bibtex)
Petra Galuščáková, Pavel Pecina (2013): CUNI at MediaEval 2013 Similar Segments in Social Speech Task. In: Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, CEUR-WS.org, Aachen, Germany (bibtex)
Petra Galuščáková, Martin Popel, Ondřej Bojar (2013): PhraseFix: Statistical Post-Editing of TectoMT. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 141-147, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-57-2 (bibtex)
Nathan David Green, Zdeněk Žabokrtský (2013): Improvements to Syntax-based Machine Translation using Ensemble Dependency Parsers. In: 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Second Workshop on Hybrid Approaches to Translation, pp. 19-24, Omnipress, Inc., Sofija, Bulgaria, ISBN 978-1-937284-63-3 (bibtex)
Jan Hajič, Bernd Bohnet, Joakim Nivre, Igor Boguslavsky, Richárd Farkas, Filip Ginter (2013): Joint Morphological and Syntactic Analysis for Richly Inflected Languages. In: Transactions of the Association for Computational Linguistics, ISSN 2307-387X, 1, pp. 415-428 (bibtex)
Jan Hajič, jr., Kateřina Veselovská (2013): Developing Sentiment Annotator in UIMA – the Unstructured Management Architecture for Data Mining Applications. In: ITAT 2013: Information Technologies - Applications and Theory (Workshops, Posters, and Tutorials), pp. 5-10, CreateSpace Independent Publishing Platform, Donovaly, Slovakia, ISBN 978-1490952086 (local PDF, bibtex)
Eva Hajičová (2013): Vilém Mathesius and functional sentence perspective, and beyond. In: A Centenary of English Studies at Charles University : From Mathesius to Present-day Linguistics, pp. 49-60, Univerzita Karlova v Praze, Filozofická fakulta, Prague, ISBN 978-80-7308-449-3 (bibtex)
Eva Hajičová (2013): Ke stratifikačnímu modelování jazyka. In: Tygramatika: soubor studií věnovaných prof. Janu Kořenskému k 75. narozeninám, pp. 79-90, Dokořán, Praha, Czechia, ISBN 978-80-7363-544-2 (bibtex)
Eva Hajičová (2013): Professor Ladislav Matejka (1919-2012) passed away. In: Linguistica Pragensia, ISSN 0862-8432, vol. XXIII, no. 2, pp. 87-88 (bibtex)
Patrick Hanks (2013): Lexical analysis: Norms and Exploitations. In: , ISBN 978-0-262-01857-9 (url, bibtex)
Barbora Hladká, Martin Holub, Vincent Kríž (2013): Feature Engineering in the NLI Shared Task 2013: Charles University Submission Report. In: Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, pp. 232-241, Association for Computational Linguistics, Atlanta, Georgia, USA (url, bibtex)
Jaroslava Hlaváčová (2013): Special domain data mining through DBpedia on the example of Biology. In: ITAT 2013: Information Technologies - Applications and Theory (Workshops, Posters, and Tutorials), pp. 2-4, CreateSpace Independent Publishing Platform, Donovaly, Slovakia, ISBN 978-1490952086 (local PDF, bibtex)
Jaroslava Hlaváčová (2013): Review of the book "Morfologie českého slovesa a tvoření deverbativ jako problém strojové analýzy češtiny" (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 99, pp. 99-103 (pdf, bibtex)
Jaroslava Hlaváčová, Anna Nedoluzhko (2013): Intensifying Verb Prefix Patterns in Czech and Russian. In: Text, Speech and Dialogue: 16th International Conference, TSD 2013. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 8082, pp. 303-310, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-40584-6 (local PDF, bibtex)
Pavlína Jínová, Lucie Poláková, Jiří Mírovský (2013): Subordinators with Elaborative Meanings in Czech and English. In: Proceedings of the Second International Conference on Dependency Linguistics, Depling 2013, pp. 128-136, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-240-5 (local PDF, bibtex)
Michal Kalina, Ondřej Bojar (2013): Jak překladač z Matfyzu porazil Google. In: Hospodářské noviny IHNED, ISSN 1213-7693 (url, bibtex)
Václava Kettnerová, Markéta Lopatková (2013): Lexikalizované alternace v češtině. In: Linguistica Copernicana, ISSN 2080-1068, vol. 9, no. 1, pp. 31-64 (bibtex)
Václava Kettnerová, Markéta Lopatková (2013): The Representation of Czech Light Verb Constructions in a Valency Lexicon. In: Proceedings of the Second International Conference on Dependency Linguistics, Depling 2013, pp. 147-156, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-240-5 (local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková, Eduard Bejček, Anna Vernerová, Marie Podobová (2013): Corpus Based Identification of Czech Light Verbs. In: Proceedings of the Seventh International Conference Slovko 2013; Natural Language Processing, Corpus Linguistics, E-learning, pp. 118-128, RAM-Verlag, Lüdenscheid, Germany, ISBN 978-3-942303-18-7 (local PDF, bibtex)
Natalia Klyueva (2013): Usage of some non-finite constructions in Czech and Russian. In: 6th Annual International Conference on Languages & Linguistics, pp. 5-12, Atiner, Athîna, Greece (local PDF, bibtex)
Veronika Kolářová (2013): Adverbální předmětový genitiv a jeho protějšky v nominálních konstrukcích: Případ posesiva. In: Zborník Filozofickej fakulty Univerzity Komenského, Philologica LXXII, Slovo a tvar v štruktúre a komunikácii, pp. 411-421, Univerzita Komenského, Bratislava, Bratislava, Slovakia, ISBN 978-80-223-3562-1 (local PDF, bibtex)
Veronika Kolářová (2013): Agents Expressed by Prepositionless Instrumental Modifying Czech Nouns Derived from Intransitive Verbs. In: Proceedings of the Seventh International Conference Slovko 2013; Natural Language Processing, Corpus Linguistics, E-learning, pp. 129-147, RAM-Verlag, Lüdenscheid, Germany, ISBN 978-3-942303-18-7 (local PDF, bibtex)
Matěj Korvas, Vojtěch Diatka (2013): Correspondence Seminar: Bringing Linguistics to High Schools. In: Proceedings of the Fourth Workshop on Teaching NLP and CL, pp. 46-50, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-69-5 (pdf, bibtex)
Lubomír Krčmář, Karel Ježek, Pavel Pecina (2013): Determining Compositionality of Word Expressions Using Word Space Models. In: The 9th Workshop on Multiword Expressions (MWE 2013), pp. 42-52, Association for Computational Linguistics, Atlanta, Georgia, USA, ISBN 978-1-937284-47-3 (bibtex)
Lubomír Krčmář, Karel Ježek, Pavel Pecina (2013): Determining Compositionality of Word Expressions Using Various Word Space Models and Measures. In: Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, pp. 64-73, The Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-67-1 (bibtex)
Vincent Kríž (2013): Detecting Semantic Relations in Texts and Their Integration with External Data Resources. In: WDS'13 Proceedings of Contributed Papers, pp. 18-23, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-250-4 (bibtex)
Vladislav Kuboň, Markéta Lopatková, Jiří Mírovský (2013): Automatic Processing of Linguistic Data as a Feedback for Linguistic Theory. In: Proceedings of the 12th Mexican International Conference on Artificial Intelligence (MICAI 2013), Lecture Notes in Computer Science, ISSN 0302-9743, vol. 1, no. 8265, pp. 252-264, Springer, Heidelberg, Germany, ISBN 978-3-642-45113-3 (local PDF, bibtex)
Vladislav Kuboň, Markéta Lopatková, Jiří Mírovský (2013): A Case Study of a Free Word Order. In: Proceedings of the 27th Pacific Asia Conference on Language, Information and Computation, pp. 222-231, Department of English, National Chengchi University, Taipei, Taiwan, ISBN 978-986-03-8567-0 (local PDF, bibtex)
Matouš Macháček, Ondřej Bojar (2013): Results of the WMT13 Metrics Shared Task. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 45-51, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-57-2 (pdf, bibtex)
David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský, Jan Hajič (2013): Cross-language Study on Influence of Coordination Style on Dependency Parsing Performance (technical report). In: (pdf, local PDF, bibtex)
David Mareček, Milan Straka (2013): Stop-probability estimates computed on a large corpus improve Unsupervised Dependency Parsing. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 281-290, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-50-3 (pdf, local PDF, bibtex)
Thomas Meyer, Lucie Poláková (2013): Machine Translation with Many Manually Labeled Discourse Connectives. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Workshop on Discourse in Machine Translation, pp. 43-50, Omnipress, Inc., Sofija, Bulgaria, ISBN 978-1-937284-68-8 (pdf, local PDF, bibtex)
Marie Mikulová (2013): Anotace na tektogramatické rovině. Dodatky k anotátorské příručce (s ohledem na anotování PDTSC a PCEDT) (technical report). In: (local PDF, bibtex)
Marie Mikulová, Eduard Bejček, Jiří Mírovský, Anna Nedoluzhko, Jarmila Panevová, Lucie Poláková, Pavel Straňák, Magda Ševčíková, Zdeněk Žabokrtský (2013): Úpravy a doplňky Pražského závislostního korpusu (Od PDT 2.0 k PDT 3.0) (technical report). In: (local PDF, bibtex)
Marie Mikulová, Eduard Bejček, Jiří Mírovský, Anna Nedoluzhko, Jarmila Panevová, Lucie Poláková, Pavel Straňák, Magda Ševčíková, Zdeněk Žabokrtský (2013): From PDT 2.0 to PDT 3.0 (Modifications and Complements) (technical report). In: (local PDF, bibtex)
Marie Mikulová, Jan Štěpánek, Zdeňka Urešová (2013): Liší se mluvené a psané texty ve valenci?. In: Korpus – gramatika – axiologie, ISSN 1804-137X, 8, pp. 36-46 (url, local PDF, bibtex)
Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová, Eva Hajičová (2013): (Pre-)Annotation of Topic-Focus Articulation in Prague Czech-English Dependency Treebank. In: Proceedings of the 6th International Joint Conference on Natural Language Processing, pp. 55-63, Asian Federation of Natural Language Processing, Nagoya, Japan, ISBN 978-4-9907348-0-0 (local PDF, bibtex)
Jakub Mlynář (2013): Výroční setkání Centra vizuální historie Malach - 28. 1. 2013. In: Historická sociologie, ISSN 1804-0616, vol. 2013, no. 1, pp. 119-120 (bibtex)
Jakub Mlynář (2013): Nový archiv nahrávek rozhovorů v Centru vizuální historie Malach. In: Maskil, no. 8, pp. 13-13 (url, bibtex)
Martin Nečaský, Tomáš Knap, Jakub Klímek, Irena Holubová, Barbora Hladká (2013): Linked Open Data for Legislative Domain - Ontology and Experimental Data. In: Business Information Systems Workshops, pp. 172-183, Springer, Berlin / Heidelberg, ISBN 978-3-642-41686-6 (url, bibtex)
Anna Nedoluzhko (2013): Generic noun phrases and annotation of coreference and bridging relations in the Prague Dependency Treebank. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the 7th Linguistic Annotation Workshop & Interoperability with Discourse, pp. 103-111, Omnipress, Inc, Sofija, Bulgaria, ISBN 978-1-937284-58-9 (local PDF, bibtex)
Anna Nedoluzhko, Jiří Mírovský (2013): Annotators’ Certainty and Disagreements in Coreference and Bridging Annotation in Prague Dependency Treebank. In: Proceedings of the Second International Conference on Dependency Linguistics, Depling 2013, pp. 236-243, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-240-5 (url, local PDF, bibtex)
Anna Nedoluzhko, Jiří Mírovský (2013): How Dependency Trees and Tectogrammatics Help Annotating Coreference and Bridging Relations in Prague Dependency Treebank. In: Proceedings of the Second International Conference on Dependency Linguistics, Depling 2013, pp. 244-251, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-240-5 (url, local PDF, bibtex)
Anna Nedoluzhko, Jiří Mírovský, Michal Novák (2013): A Coreferentially annotated Corpus and Anaphora Resolution for Czech. In: Computational Linguistics and Intellectual Technologies, pp. 467-475, ABBYY, Moskva, Russia, ISBN 978-1-937284-58-9 (local PDF, bibtex)
Michal Novák, Anna Nedoluzhko, Zdeněk Žabokrtský (2013): Translation of "It" in a Deep Syntax Framework. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Workshop on Discourse in Machine Translation, pp. 51-59, Omnipress, Inc., Sofija, Bulgaria, ISBN 978-1-937284-68-8 (pdf, bibtex)
Michal Novák, Zdeněk Žabokrtský, Anna Nedoluzhko (2013): Two Case Studies on Translating Pronouns in a Deep Syntax Framework. In: Proceedings of the 6th International Joint Conference on Natural Language Processing, pp. 1037-1041, Asian Federation of Natural Language Processing, Nagoya, Japan, ISBN 978-4-9907348-0-0 (pdf, bibtex)
Jarmila Panevová (2013): Slovníková informace a její použití v gramatice (na příkladu českého slovesa). In: Južnoslovenski filolog, ISSN 0350-185X, 69, pp. 75-90 (local PDF, bibtex)
Jarmila Panevová (2013): A Message Sent to Heaven. In: Sprache, Sprachvergleich, Sprachträger (Rudolf Růžička zum 90. Geburtstag von Freunden, wissenschaftlichen Weggefährten und Schülern), pp. 140-144, S. Hirzel Verlag, Stuttgart, Germany, ISBN 978-3-7776-2358-0 (bibtex)
Jarmila Panevová (2013): Vzpomínka na Světlu Čmejrkovou. In: Jazykovědné aktuality, ISSN 1212-5326, vol. 50, no. 1-2, pp. 66-67 (bibtex)
Jarmila Panevová, Magda Ševčíková (2013): The Role of Grammatical Constraints in Lexical Component in Functional Generative Description. In: Proceedings of the 6th International Conference on Meaning-Text Theory, Prague, August 30–31, 2013, pp. 134-143, Univerzita Karlova v Praze, Praha, Czechia, ISBN 978-3-86688-405-2 (pdf, local PDF, bibtex)
Pavel Pecina (2013): Jörg Tiedemann: Bitext Alignment (review). In: Machine Translation, ISSN 0922-6567, vol. 27, no. 1, pp. 77-79 (bibtex)
Martin Plátek, Markéta Lopatková (2013): Formalization of Word-Order Shifts by Restarting Automata. In: ITAT 2013: Information Technologies - Applications and Theory (Proceedings), pp. 2-9, CreateSpace Independent Publishing Platform, Prešov, Slovakia, ISBN 978-1490952000 (bibtex)
Lucie Poláková, Jiří Mírovský, Anna Nedoluzhko, Pavlína Jínová, Šárka Zikánová, Eva Hajičová (2013): Introducing the Prague Discourse Treebank 1.0. In: Proceedings of the 6th International Joint Conference on Natural Language Processing, pp. 91-99, Asian Federation of Natural Language Processing, Nagoya, Japan, ISBN 978-4-9907348-0-0 (local PDF, bibtex)
Martin Popel, David Mareček, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský (2013): Coordination Structures in Dependency Treebanks. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 517-527, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-50-3 (pdf, local PDF, local PDF, local PDF, bibtex)
Rudolf Rosa (2013): Automatic post-editing of phrase-based machine translation outputs (masters thesis). In: (local PDF, local PDF, local PDF, local PDF, bibtex)
Rudolf Rosa, David Mareček, Aleš Tamchyna (2013): Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp. 172-179, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-53-4 (url, local PDF, local PDF, local PDF, bibtex)
Alexandr Rosen, Jiří Hana, Barbora Štindlová, Anna Feldman (2013): Evaluating and automating the annotation of a learner corpus. In: Language Resources and Evaluation, ISSN 1574-020X, 47, pp. 1-2 (bibtex)
Kateřina Rysová (2013): K subjektivnímu slovosledu na základě korpusu. In: Didaktické studie, ISSN 1804-1221, vol. 5, no. 2, pp. 20-38 (local PDF, bibtex)
Kateřina Rysová, Karel Oliva (2013): Skončil 39. ročník Olympiády v českém jazyce. In: Český jazyk a literatura, ISSN 0009-0786, vol. 64, no. 2, pp. 53-59 (bibtex)
Magdaléna Rysová (2013): K explikativním vztahům v češtině. In: Grenzüberschreitungen - Polnische, tschechische und deutsche Sprache, Literatur und Kultur. Beiträge zur VIII. Internationalen Westslawistischen interFaces-Konferenz in Leipzig., pp. 331-342, Olms, Hildesheim, Germany, ISBN 978-3-487-15004-8 (local PDF, local DOC, bibtex)
Jana Straková, Milan Straka, Jan Hajič (2013): A New State-of-The-Art Czech Named Entity Recognizer. In: Text, Speech and Dialogue: 16th International Conference, TSD 2013. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 8082, pp. 68-75, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-40584-6 (url, local PDF, bibtex)
Magda Ševčíková (2013): Česká adverbia s příponou -o v teoretickém popisu a v syntakticky anotovaném korpusu. In: Zborník Filozofickej fakulty Univerzity Komenského, Philologica LXXII, Slovo a tvar v štruktúre a komunikácii, pp. 365-374, Univerzita Komenského, Bratislava, Bratislava, Slovakia, ISBN 978-80-223-3562-1 (local PDF, bibtex)
Magda Ševčíková (2013): Deadjektivní deriváty v češtině jako deriváty syntaktické vs. lexikální. In: Gramatika a korpus / Grammar and Corpora 2012, pp. 1-8, Gaudeamus, Hradec Králové, Czechia, ISBN 978-80-7435-243-0 (bibtex)
Magda Ševčíková (2013): Productivity of selected deadjectival suffixes in Czech. In: SLE - 46th Annual Meeting of the Societas Linguistica Europaea, pp. 338-339, Societas Linguistica Europaea, Split, Croatia (pdf, local PDF, bibtex)
Jana Šindlerová, Zdeňka Urešová, Eva Fučíková (2013): Verb Valency and Argument Non-correspondence in a Bilingual Treebank. In: Proceedings of the Seventh International Conference Slovko 2013; Natural Language Processing, Corpus Linguistics, E-learning, pp. 100-108, RAM-Verlag, Lüdenscheid, Germany, ISBN 978-3-942303-18-7 (local PDF, bibtex)
Jana Šindlerová, Kateřina Veselovská (2013): Building a Corpus of Evaluative Sentences in Multiple Domains. In: Corpus Linguistics 2013 - Abstract Book, pp. 273-275, UCREL, Lancaster, UK (pdf, bibtex)
Barbora Štindlová, Svatava Škodová, Jirka Hana, Alexandr Rosen (2013): A learner corpus of Czech: current state and future directions. In: Twenty Years of Learner Corpus Research - Looking Back, Moving Ahead, Presses universitaires de Louvain, Louvain-la-Neuve, Belgium, ISBN 9782875581990 (url, bibtex)
Aleš Tamchyna (2013): Utilizing Source Context in Statistical Machine Translation. In: WDS'13 Proceedings of Contributed Papers, pp. 24-29, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-250-4 (bibtex)
Aleš Tamchyna, Ondřej Bojar (2013): No Free Lunch in Factored Phrase-Based Machine Translation. In: Lecture Notes in Computer Science, ISSN 0302-9743, 7817, pp. 210-223 (url, bibtex)
Aleš Tamchyna, Ondřej Dušek, Rudolf Rosa, Pavel Pecina (2013): MTMonkey: A Scalable Infrastructure for a Machine Translation Web Service. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 100, pp. 31-40 (pdf, local PDF, local PDF, bibtex)
Zdeňka Urešová, Eva Fučíková, Jan Hajič, Jana Šindlerová (2013): An Analysis of Annotation of Verb-Noun Idiomatic Combinations in a Parallel Dependency Corpus. In: The 9th Workshop on Multiword Expressions (MWE 2013), pp. 58-63, Association for Computational Linguistics, Atlanta, Georgia, USA, ISBN 978-1-937284-47-3 (pdf, local PDF, bibtex)
Anna Vernerová, Markéta Lopatková (2013): Towards Automatic Detection of Applicable Diatheses. In: ITAT 2013: Information Technologies - Applications and Theory (Proceedings), pp. 10-17, CreateSpace Independent Publishing Platform, Prešov, Slovakia, ISBN 978-1490952000 (url, local PDF, bibtex)
Kateřina Veselovská (2013): Czech Subjectivity Lexicon: A Lexical Resource for Czech Polarity Classification. In: Proceedings of the Seventh International Conference Slovko 2013; Natural Language Processing, Corpus Linguistics, E-learning, pp. 279-284, RAM-Verlag, Lüdenscheid, Germany, ISBN 978-3-942303-18-7 (local PDF, bibtex)
Kateřina Veselovská, Jan Hajič, jr. (2013): Why Words Alone Are Not Enough: Error Analysis of Lexicon-based Polarity Classifier for Czech. In: Proceedings of the 6th International Joint Conference on Natural Language Processing, pp. 1-5, Asian Federation of Natural Language Processing, Nagoya, Japan, ISBN 978-4-9907348-0-0 (local PDF, bibtex)
Katrin Wisniewski, Karin Schöne, Lionel Nicolas, Chiara Vettori, Adriane Boyd, Detmar Meurers, Andrea Abel, Jirka Hana (2013): MERLIN: An Online Trilingual Learner Corpus Empirically Grounding the European Reference Levels in Authentic Learner. In: Conference proceedings. ICT for language learning, pp. 1-5, libreriauniversitaria.it, Firenze, Italy, ISBN 978-88-6292-423-8 (pdf, bibtex)
Daniel Zeman (2013): Lingvistické nástroje a data na ÚFALu (LectureNotes). (url, local ODP, local PDF)
Šárka Zikánová (2013): Text annotations in the Prague Dependency Treebank. In: Linguistica Pragensia, ISSN 0862-8432, vol. 23, no. 1, pp. 31-40 (local PDF, local PDF, bibtex)
Lukáš Žilka, David Marek, Matěj Korvas, Filip Jurčíček (2013): Comparison of Bayesian Discriminative and Generative Models for Dialogue State Tracking. In: SIGDIAL '13: 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 452-457, The Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-95-4 (url, bibtex)
Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, Khalid Choukri, Sarah Cruchet, Hamish Cunningham, Jan Dědek, Ljiljana Dolamic, René Donner, Sebastian Dungs, Ivan Eggel, Antonio Foncubierta, Norbert Fuhr, Adam Funk, Alba García Seco de Herrera, Arnaud Gaudinat, Georgi Georgiev, Julien Gobeill, Lorraine Goeuriot, Paz Gomez, Mark A. Greenwood, Manfred Gschwandtner, Allan Hanbury, Jan Hajič, Jaroslava Hlaváčová, Markus Holzer, Gareth J.F. Jones, Blanca Jordán, Matthias Jordan, Klemens Kaderk, Franz Kainberger, Liadh Kelly, Sascha Kriewel, Marlene Kritz, Georg Langs, Nolan Lawson, Dimitrios Markonis, Iván Martínez, Vassil Momtchev, Alexandre Masselot, Hélène Mazo, Henning Müller, Pavel Pecina, Konstantin Pentchev, Deyan Peychev, Natalia Pletneva, Diana Pottecher, Angus Roberts, Patrick Ruch, Matthias Samwald, Priscille Schneller, Veronika Stefanov, Miguel Angel Tinte, Zdeňka Urešová, Alejandro Vargas, Dina Vishnyakova (2012): Khresmoi: Multimodal Multilingual Medical Information Search. In: Proceedings of the 24th International Conference of the European Federation for Medical Informatics, Quality of Life through Quality of Information, Village of the future, IOS Press, Pisa, Italy, ISBN 978-1-61499-101-4 (bibtex)
Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan, Josef van Genabith (2012): Improved Spelling Error Detection and Correction for Arabic. In: Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012), pp. 103-112, Coling 2012 Organizing Committee, Mumbai, India (bibtex)
Eleftherios Avramidis, Marta R. Costa-Jussà, Christian Federmann, Pavel Pecina, Josef van Genabith (2012): A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2189-2193, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (bibtex)
Eduard Bejček, Jarmila Panevová, Jan Popelka, Pavel Straňák, Magda Ševčíková, Jan Štěpánek, Zdeněk Žabokrtský (2012): Prague Dependency Treebank 2.5 -- a revisited version of PDT 2.0. In: Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012), pp. 231-246, Coling 2012 Organizing Committee, Mumbai, India (local PDF, local PDF, bibtex)
Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, Daniel Zeman (2012): Automatic MT Error Analysis: Hjerson Helping Addicter. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2158-2163, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, bibtex)
Ondřej Bojar (2012): Čeština a strojový překlad. In: , ISBN 978-80-904571-4-0 (bibtex)
Ondřej Bojar (2012): Strojový překlad. In: Vesmír, ISSN 0042-4544, 91, pp. 488-490 (bibtex)
Ondřej Bojar, Mauro Cettolo, Silvie Cinková, Philipp Koehn, Miroslav Týnovský, Zdeněk Žabokrtský (2012): Scientific Report on Rich Tree-Based SMT (technical report). In: (bibtex)
Ondřej Bojar, Silvie Cinková, Jan Hajič, Barbora Hladká, Vladislav Kuboň, Jiří Mírovský, Jarmila Panevová, Nino Peterek, Johanka Spoustová, Zdeněk Žabokrtský (2012): The Czech Language in the Digital Age. In: , ISBN 978-3-642-30705-8 (local PDF, bibtex)
Ondřej Bojar, Bushra Jawaid, Amir Kamran (2012): Probes in a Taxonomy of Factored Phrase-Based Models. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 253-260, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (url, local PDF, bibtex)
Ondřej Bojar, Dekai Wu (2012): Towards a Predicate-Argument Evaluation for MT. In: Proceedings of Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6), ACL, pp. 30-38, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-38-1 (url, local PDF, bibtex)
Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, Aleš Tamchyna (2012): The Joy of Parallelism with CzEng 1.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3921-3928, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, bibtex)
Silvie Cinková, Martin Holub, Vincent Kríž (2012): Optimizing semantic granularity for NLP - report on a lexicographic experiment. In: Proceedings of the 15th EURALEX International Congress, pp. 523-531, Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway, ISBN 978-82-303-2228-4 (local PDF, bibtex)
Silvie Cinková, Martin Holub, Vincent Kríž (2012): Managing Uncertainty in Semantic Tagging. In: Proceedings of 13th Conference of the European Chapter of the Association for Computational Linguistics, pp. 840-850, Association for Computational Linguistics, Avignon, France, ISBN 978-1-937284-19-0 (pdf, bibtex)
Silvie Cinková, Martin Holub, Adam Rambousek, Lenka Smejkalová (2012): A database of semantic clusters of verb usages. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3176-3183, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (pdf, bibtex)
Silvie Cinková, Lenka Smejkalová, Anna Vernerová, Jonáš Thál, Martin Holub (2012): Maintaining consistency of monolingual verb entries with interannotator agreement. In: Nordiska studier i lexikografi - Rapport från Konferensen om lexikografi i Norden, pp. 169-180, Nordiska föreningen for lexikografi, Lund, Sweden, ISBN 978-91-85333-42-4 (local PDF, bibtex)
Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, David Mareček (2012): Formemes in English-Czech Deep Syntactic MT. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 267-274, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, local PDF, bibtex)
Christian Federmann, Eleftherios Avramidis, Marta R. Costa-Jussà, Maite Melero, Josef van Genabith, Pavel Pecina (2012): The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3430-3435, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (bibtex)
Christian Federmann, Maite Melero, Pavel Pecina, Josef van Genabith (2012): Towards Optimal Choice Selection for Improved Hybrid Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 97, pp. 5-22 (pdf, bibtex)
Mark Fishel, Ondřej Bojar, Maja Popović (2012): Terra: a Collection of Translation Error-Annotated Corpora. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 7-14, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (local PDF, bibtex)
Mark Fishel, Rico Sennrich, Maja Popović, Ondřej Bojar (2012): TerrorCat: a Translation Error Categorization-based MT Quality Metric. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 64-70, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (url, bibtex)
Petra Galuščáková, Ondřej Bojar (2012): Improving SMT by Using Parallel Data of a Closely Related Language. In: Human Language Technologies – The Baltic Perspective - Proceedings of the Fifth International Conference Baltic HLT 2012, pp. 58-65, IOS Press, Amsterdam, Netherlands, ISBN 978-1-61499-132-8 (url, bibtex)
Petra Galuščáková, Pavel Pecina (2012): CUNI at MediaEval 2012 Search and Hyperlinking Task. In: Working Notes Proceedings of the MediaEval 2012 Workshop, CEUR Workshop Proceedings, Aachen, Germany (pdf, bibtex)
Petra Galuščáková, Pavel Pecina, Jan Hajič (2012): Penalty Functions for Evaluation Measures of Unsegmented Speech Retrieval. In: Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics - Third International Conference of the CLEF Initiative, Lecture Notes in Computer Science, ISSN 0302-9743, 7488, pp. 100-111, Springer, Berlin / Heidelberg, ISBN 978-3-642-33246-3 (url, bibtex)
Milica Gašić, Filip Jurčíček, Blaise Thomson, Steve Young (2012): Optimisation for POMDP-Based Spoken Dialogue Systems. In: Data-driven methods for adaptive Spoken Dialogue Systems, pp. 75-101, Springer New York, New York, ISBN 978-1-4614-4802-0 (url, bibtex)
Nathan David Green (2012): Building parallel corpora through social network gaming. In: Workshop on Collaborative Resource Development and Delivery, pp. 22-25, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (bibtex)
Nathan David Green, Septina Dian Larasati, Zdeněk Žabokrtský (2012): Indonesian Dependency Treebank: Annotation and Parsing. In: Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, pp. 137-145, Faculty of Computer Science, Universitas Indonesia, Bali, Indonesia, ISBN 978-979-1421-17-1 (bibtex)
Nathan David Green, Loganathan Ramasamy, Zdeněk Žabokrtský (2012): Using an SVM Ensemble System for Improved Tamil Dependency Parsing. In: ACL 2012 Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages, pp. 72-77, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-30-5 (bibtex)
Nathan David Green, Zdeněk Žabokrtský (2012): Ensemble Parsing and its Effect on Machine Translation (technical report). In: (pdf, bibtex)
Nathan David Green, Zdeněk Žabokrtský (2012): Hybrid Combination of Constituency and Dependency Trees into an Ensemble Dependency Parser. In: Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, pp. 19-26, Association for Computational Linguistics, Avignon, France, ISBN 978-1-937284-19-0 (url, bibtex)
Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, Zdeněk Žabokrtský (2012): Announcing Prague Czech-English Dependency Treebank 2.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3153-3160, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, bibtex)
Jan Hajič, Barbora Vidová Hladká, Jarmila Panevová (2012): Lingvistika na Matematicko-fyzikální fakultě?. In: Vesmír, ISSN 0042-4544, 91, pp. 523-526 (bibtex)
Eva Hajičová (2012): Topic-Focus revisited (Through the eyes of the Prague Dependency Treebank). In: Смыслы, тексты и другие захватывающие сюжеты. Сборник статей в честь 80-летия Игоря Александровича Мельчука, pp. 218-232, Языки славянской культуры, Москва, Russia, ISBN 978-5-9551-0593-2 (local DOC, bibtex)
Eva Hajičová (2012): The Functional Generative Description as a functionally motivated formal model of language. In: SLE - 45th Annual Meeting of the Societas Linguistica Europaea, pp. 130-131, Societas Linguistica Europaea, Stockholm, Sweden (bibtex)
Eva Hajičová (2012): Vilém Mathesius and Functional Sentence Perspective, and beyond. In: A Centenary of English Studies at Charles University: from Mathesius to Present-day Linguistics, pp. 49-60, Univerzita Karlova v Praze, Praha, Czechia, ISBN 978-80-7308-449-3 (local DOCX, bibtex)
Eva Hajičová (2012): What we have learned from complex annotation of topic-focus articulation in a large Czech corpus. In: Écho des études romanes, ISSN 1801-0865, II, pp. 51-64 (local DOC, bibtex)
Eva Hajičová (2012): On scalarity in information structure. In: Linguistica Pragensia, ISSN 0862-8432, vol. XXII, no. 2, pp. 60-78 (local DOCX, bibtex)
Eva Hajičová, Barbora Hladká (2012): Od kořenů ke stromům: příběh anotovaných korpusů češtiny. In: Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český., pp. 61-66, Karolinum, Praha, Czechia, ISBN 978-80-246-2121-0 (bibtex)
Jirka Hana, Anna Feldman (2012): Resource-light Approaches to Computational Morphology Part 1: Monolingual Approaches. In: Language and Linguistics Compass, ISSN 1749-818X, vol. 6, no. 10, pp. 622-634 (url, bibtex)
Jirka Hana, Barbora Hladká (2012): Getting more data – Schoolkids as annotators. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 4049-4054, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, bibtex)
Jirka Hana, Boris Lehečka, Anna Feldman, Alena Černá, Karel Oliva (2012): Building a Corpus of Old Czech. In: Proceedings of the Adaptation of Language Resources and Tools for Processing Cultural Heritage Objects Workshop, pp. 9-15, European Language Resources Association, İstanbul, Turkey (bibtex)
Jirka Hana, Alexandr Rosen, Barbora Štindlová, Petr Jäger (2012): Building a learner corpus. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3228-3232, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, bibtex)
Jirka Hana, Jan Štěpánek (2012): Prague Markup Language Framework. In: Proceedings of the Sixth Linguistic Annotation Workshop, pp. 12-21, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-32-9 (local PDF, bibtex)
James Henderson, Filip Jurčíček (2012): Data-Driven Methods for Spoken Language Understanding. In: Data-driven methods for adaptive Spoken Dialogue Systems, pp. 19-38, Springer New York, New York, ISBN 978-1-4614-4802-0 (url, bibtex)
Jaroslava Hlaváčová, Anja Nedolužko (2012): Příklad pravidelných slovotvorných vzorců v automatickém zpracování češtiny a ruštiny. In: Zborník príspevkov prezentovaných na konferencii Informačné technológie – Aplikácie a Teória, ITAT 2012, Hotel Magura, 17-21. septembra 2012, pp. 53-56, Slovenská spoločnosť pre umelú inteligenciu, Košice, Slovakia, ISBN 978-80-971144-1-1 (local PostScript, bibtex)
Martin Holub, Vincent Kríž, Silvie Cinková, Eckhard Bick (2012): Tailored Feature Extraction for Lexical Disambiguation of English Verbs Based on Corpus Pattern Analysis. In: Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012), pp. 1195-1209, Coling 2012 Organizing Committee, Mumbai, India (local PDF, bibtex)
Bushra Jawaid, Ondřej Bojar (2012): Tagger Voting for Urdu. In: Proceedings of the Workshop on South and Southeast Asian Natural Language Processing (WSSANLP) at Coling 2012, pp. 135-144, Coling 2012 Organizing Committee, Mumbai, India (pdf, bibtex)
Tomáš Jelínek, Barbora Štindlová, Alexandr Rosen, Jirka Hana (2012): Combining Manual and Automatic Annotation of a Learner Corpus. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 127-134, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (url, bibtex)
Pavlína Jínová (2012): Nejčastější konektivní prostředky kauzálního vztahu v Pražském závislostním korpusu. In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, vol. 2012, no. 1, pp. 35-52 (bibtex)
Pavlína Jínová, Jiří Mírovský, Lucie Poláková (2012): Analyzing the Most Common Errors in the Discourse Annotation of the Prague Dependency Treebank. In: Proceedings of the 11th International Workshop on Treebanks and Linguistic Theories, pp. 127-132, Edicoes Colibri, Lisboa, Lisboa, Portugal, ISBN 978-989-689-274-6 (local PDF, bibtex)
Pavlína Jínová, Jiří Mírovský, Lucie Poláková (2012): Semi-Automatic Annotation of Intra-Sentential Discourse Relations in PDT. In: Proceedings of the Workshop on Advances in Discourse Analysis and its Computational Aspects (ADACA) at Coling 2012, pp. 43-58, Coling 2012 Organizing Committee, Mumbai, India (local PDF, bibtex)
Filip Jurčíček (2012): Reinforcement learning for spoken dialogue systems using off-policy natural gradient method. In: IEEE SLT '12: Proc. IEEE Spoken Language Technology Workshop, pp. 7-12, IEEE, Miami, FL, USA, ISBN 978-1-4673-5126-3 (url, bibtex)
Petr Karlík, Jarmila Panevová (2012): České syntaktické bádání od Šmilauera po dnešek očima Prahy a Brna. In: Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český., pp. 55-59, Karolinum, Praha, Czechia, ISBN 978-80-246-2121-0 (local PDF, bibtex)
Václava Kettnerová (2012): Lexikálně-sémantické konverze ve valenčním slovníku (PhD thesis). In: (bibtex)
Václava Kettnerová (2012): Syntaktické konstrukce typu Včely se hemží na zahradě – Zahrada se hemží včelami. In: Korpus – gramatika – axiologie, ISSN 1804-137X, 5, pp. 54-71 (bibtex)
Václava Kettnerová, Markéta Lopatková, Eduard Bejček (2012): The Syntax-Semantics Interface of Czech Verbs in the Valency Lexicon. In: Proceedings of the 15th EURALEX International Congress, pp. 434-443, Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway, ISBN 978-82-303-2228-4 (local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková, Eduard Bejček (2012): Mapping Semantic Information from FrameNet onto VALLEX. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 97, pp. 23-41 (url, bibtex)
Václava Kettnerová, Markéta Lopatková, Zdeňka Urešová (2012): The Rule-Based Approach to Czech Grammaticalized Alternations. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 158-165, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (bibtex)
Albert Kim, Jana Straková (2012): Concurrent effects of lexical status and letter-rotation during early stages of visual word recognition: evidence from ERPs. In: Brain Research, ISSN 0006-8993, 1468, pp. 52-62 (bibtex)
Natalia Klyueva (2012): Comparing Czech and Russian Valency on the Material of Vallex. In: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012, pp. 446-451, Eigenverlag ÖGAI, Wien, Austria, ISBN 3-85027-005-X (local PDF, bibtex)
Natalia Klyueva (2012): Some differences between Czech and Russian: a parallel corpus study. In: Computational Linguistics and Intellectual Technologies: Proceedings of the International Conference "Dialog 2012", Компьютерная лингвистика и интеллектуальные технологии, Issue 11 (18), pp. 268-276, Изд-во РГГУ, Москва, Russia (local PDF, local PDF, bibtex)
Veronika Kolářová (2012): Valence dějových substantiv odvozených od sloves s předmětovým genitivem. In: Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český., pp. 609-614, Karolinum, Praha, Czechia, ISBN 978-80-246-2121-0 (local PDF, bibtex)
Oldřich Krůza, Nino Peterek (2012): Making Community and ASR Join Forces in Web. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 415-421, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (bibtex)
Vladislav Kuboň, Petr Homola (2012): Machine Translation Among Related Slavic Languages. In: Multilingual Processing in Eastern and Southern EU Languages: Low-Resourced Technologies and Translation, pp. 283-307, Cambridge Scholars Publishing, Cambridge, United Kingdom, ISBN 978-1-4438-3878-8 (bibtex)
Vladislav Kuboň, Markéta Lopatková, Martin Plátek (2012): How to Measure Word Order Freedom for Natural Languages?. In: Proceedings of 22nd Theorietag der Fachgruppe "Automaten und Formale Sprachen" der Gesellschaft für Informatik, pp. 71-76, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-221-4 (pdf, local PDF, bibtex)
Vladislav Kuboň, Markéta Lopatková, Martin Plátek (2012): Studying Formal Properties of a Free Word Order Language. In: Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference, pp. 300-305, AAAI Press, Palo Alto, CA, USA, ISBN 978-1-57735-558-8 (bibtex)
Vladislav Kuboň, Markéta Lopatková, Martin Plátek (2012): On Formalization of Word Order Properties. In: Computational Linguistics and Intelligent Text Processing, 13th International Conference, CICLing 2012, Lecture Notes in Computer Science, ISSN 0302-9743, 7181, pp. 130-141, Springer-Verlag, Berlin / Heidelberg, ISBN 978-3-642-28603-2 (bibtex)
Septina Dian Larasati (2012): A Dataset Comparison for an Indonesian-English Statistical Machine Translation System. In: Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation, pp. 146-152, Faculty of Computer Science, Universitas Indonesia, Bali, Indonesia, ISBN 978-979-1421-17-1 (pdf, bibtex)
Septina Dian Larasati (2012): Improving Word Alignment by Exploiting Adapted Word Similarity. In: Proceedings of the Workshop on Monolingual Machine Translation (MONOMT) at AMTA 2012, pp. 41-45, AMTA 2012 Organizing Committee, San Diego, USA (bibtex)
Septina Dian Larasati (2012): Towards an Indonesian-English SMT System: A Case Study of an Under-Studied and Under-Resourced Language, Indonesian. In: WDS'12 Proceedings of Contributed Papers, pp. 123-129, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-224-5 (pdf, bibtex)
Septina Dian Larasati (2012): IDENTIC Corpus: Morphologically Enriched Indonesian-English Parallel Corpus. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 902-906, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (bibtex)
Markéta Lopatková, Petr Homola, Natalia Klyueva (2012): Annotation of sentence structure: Capturing the relationship between clauses in Czech sentences. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 46, no. 1, pp. 25-36 (url, bibtex)
Martin Majliš (2012): Yet Another Language Identifier. In: Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp. 46-54, Association for Computational Linguistics, Avignon, France, ISBN 978-1-937284-19-0 (pdf, local PDF, bibtex)
Martin Majliš, Zdeněk Žabokrtský (2012): Language Richness of the Web. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2927-2934, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, bibtex)
David Mareček (2012): Unsupervised Dependency Parsing (PhD thesis). In: (local PDF, bibtex)
David Mareček, Zdeněk Žabokrtský (2012): Exploiting Reducibility in Unsupervised Dependency Parsing. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 297-307, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-43-5 (bibtex)
David Mareček, Zdeněk Žabokrtský (2012): Unsupervised Dependency Parsing using Reducibility and Fertility features. In: The NAACL-HLT Workshop on the Induction of Linguistic Structure, pp. 84-89, The Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (bibtex)
Jiří Maršík, Ondřej Bojar (2012): TrTok: A Fast and Trainable Tokenizer for Natural Languages. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 98, pp. 75-85 (pdf, bibtex)
František Martínek, Kateřina Rysová (2012): On a Corpus of Older Czech Texts and Its Usage. In: Prace Filologiczne, ISSN 0138-0567, 63, pp. 219-230 (local PDF, bibtex)
Jiří Mírovský, Pavlína Jínová, Lucie Poláková (2012): Does Tectogrammatics Help the Annotation of Discourse?. In: Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012), pp. 853-862, Coling 2012 Organizing Committee, Mumbai, India (local PDF, bibtex)
Jarmila Panevová (2012): Světová slavistika utrpěla citelné ztráty. In: Slovo a slovesnost, ISSN 0037-7031, vol. 73, no. 1, pp. 74-76 (bibtex)
Jarmila Panevová, Petr Karlík (2012): Dva pohledy na vývoj českého poválečného syntaktického myšlení. In: Korpus – gramatika – axiologie, ISSN 1804-137X, 5, pp. 34-53 (bibtex)
Jarmila Panevová, Marie Mikulová (2012): Асимметрии между глубинным и поверхностным представлением предложения (на примере двух типов обстоятельств в чешском языке). In: Смыслы, тексты и другие захватывающие сюжеты. Сборник статей в честь 80-летия Игоря Александровича Мельчука, pp. 486-497, Языки славянской культуры, Москва, Russia, ISBN 978-5-9551-0593-2 (local PDF, bibtex)
Pavel Pecina, Antonio Toral, Vassilis Papavassiliou, Prokopis Prokopidis, Josef van Genabith (2012): Domain Adaptation of Statistical Machine Translation using Web-Crawled Resources: a Case Study. In: EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, pp. 145-152, European Association for Machine Translation, Trento, Italy (bibtex)
Pavel Pecina, Antonio Toral, Josef van Genabith (2012): Simple and Effective Parameter Tuning for Domain Adaptation of Statistical Machine Translation. In: Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012), pp. 2209-2224, Coling 2012 Organizing Committee, Mumbai, India (bibtex)
Lucie Poláková, Pavlína Jínová, Jiří Mírovský (2012): Interplay of Coreference and Discourse Relations: Discourse Connectives with a Referential Component. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 146-153, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, bibtex)
Lucie Poláková, Pavlína Jínová, Šárka Zikánová, Zuzanna Bedřichová, Jiří Mírovský, Magdaléna Rysová, Jana Zdeňková, Veronika Pavlíková, Eva Hajičová (2012): Manual for Annotation of Discourse Relations in Prague Dependency Treebank (technical report). In: , pp. 1-83 (url, local PDF, bibtex)
Loganathan Ramasamy, Ondřej Bojar, Zdeněk Žabokrtský (2012): Morphological Processing for English-Tamil Statistical Machine Translation. In: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), pp. 113-122, The COLING 2012 Organizing Committee, Mumbai, India (bibtex)
Loganathan Ramasamy, Zdeněk Žabokrtský (2012): Prague Dependency Style Treebank for Tamil. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 23-25, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (bibtex)
Loganathan Ramasamy, Zdeněk Žabokrtský, Sowmya Vajjala (2012): The Study of Effect of Length in Morphological Segmentation of Agglutinative Languages. In: Proceedings of the First Workshop on Multilingual Modeling (MM-2012), pp. 18-24, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-35-0 (bibtex)
Michal Richter, Pavel Straňák, Alexandr Rosen (2012): Korektor – A System for Contextual Spell-checking and Diacritics Completion. In: Proceedings of the 24th International Conference on Computational Linguistics (Coling 2012), pp. 1-12, Coling 2012 Organizing Committee, Mumbai, India (pdf, local PDF, bibtex)
Rudolf Rosa, Ondřej Dušek, David Mareček, Martin Popel (2012): Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors. In: Proceedings of Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6), ACL, pp. 39-48, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-38-1 (pdf, local PDF, local PDF, bibtex)
Rudolf Rosa, David Mareček (2012): Dependency Relations Labeller for Czech. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 256-263, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (url, local PDF, local PDF, bibtex)
Rudolf Rosa, David Mareček, Ondřej Dušek (2012): DEPFIX: A System for Automatic Correction of Czech MT Outputs. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 362-368, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, local PDF, local HTML, local PDF, bibtex)
Kateřina Rysová (2012): Možnosti jednotlivých volných slovesných doplnění být obligatorním členem věty. In: Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český., pp. 615-620, Karolinum, Praha, Czechia, ISBN 978-80-246-2121-0 (local PDF, bibtex)
Magdaléna Rysová (2012): Alternativní vyjádření konektorů v češtině (masters thesis). In: (local PDF, bibtex)
Magdaléna Rysová (2012): Alternative Lexicalizations of Discourse Connectives in Czech. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2800-2807, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (pdf, local PDF, bibtex)
Khaled Shaalan, Mohammed Attia, Pavel Pecina, Younes Samih, Josef van Genabith (2012): Arabic Word Generation and Modelling for Spell Checking. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 719-725, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (bibtex)
Johanka Spoustová, Miroslav Spousta (2012): A High-Quality Web Corpus of Czech. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 311-315, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (pdf, local PDF, bibtex)
Raymond Hendy Susanto, Septina Dian Larasati, Francis M. Tyers (2012): Rule-based Machine Translation between Indonesian and Malaysian. In: Proceedings of the Workshop on South and Southeast Asian Natural Language Processing (WSSANLP) at Coling 2012, pp. 9-10, Coling 2012 Organizing Committee, Mumbai, India (bibtex)
Magda Ševčíková (2012): Predikativum v gramatickém popisu češtiny. In: Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český., pp. 597-602, Karolinum, Praha, Czechia, ISBN 978-80-246-2121-0 (local PDF, bibtex)
Magda Ševčíková, Jiří Mírovský (2012): Sentence Modality Assignment in the Prague Dependency Treebank. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 56-63, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (local PDF, bibtex)
Barbora Štindlová, Svatava Škodová, Jirka Hana, Alexandr Rosen (2012): CzeSL – an error tagged corpus of Czech as a second language. In: Practical Applications in Language and Computers – PALC 2011, pp. 21-32, Peter Lang, Pieterlen, Switzerland, ISBN 978-3-631-62547-7 (bibtex)
Barbora Štindlová, Svatava Škodová, Alexandr Rosen, Jirka Hana (2012): Annotating foreign learners’ Czech. In: Studies in Formal Slavic Linguistics. Contributions from Formal Description of Slavic Languages 8.5, pp. 205-219, Peter Lang GmbH, Frankfurt am Main, Germany, ISBN 978-3-631-63609-1 (bibtex)
Aleš Tamchyna, Petra Galuščáková, Amir Kamran, Miloš Stanojević, Ondřej Bojar (2012): Selecting Data for English-to-Czech Machine Translation. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 374-381, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (url, local PDF, bibtex)
Antonio Toral, Leroy Finn, Dominic Jones, Pavel Pecina, David Lewis, Declan Groves (2012): Retraining Machine Translation with Post-edits to Increase Post-editing Productivity in Content Management Systems. In: International Workshop on Expertise in Translation and Post-editing Research and Application, pp. 39-40, Copenhagen Business School, København, Denmark (bibtex)
Antonio Toral, Marc Poch, Pavel Pecina, Gregor Thurmair (2012): Efficiency-based Evaluation of Aligners for Industrial Applications. In: EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, pp. 57-60, European Association for Machine Translation, Trento, Italy (bibtex)
Zdeňka Urešová (2012): Building the PDT-VALLEX valency lexicon. In: Proceedings of the fifth Corpus Linguistics Conference, pp. 1-18, University of Liverpool, Liverpool, UK (local DOC, bibtex)
Kateřina Veselovská (2012): Sentence-Level Sentiment Analysis in Czech. In: Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics, pp. 1-4, ACM, New York, NY, USA, ISBN 978-1-4503-0915-8 (url, bibtex)
Kateřina Veselovská (2012): Mezinárodní konference závislostní lingvistiky Depling 2011 v Barceloně (review). In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, vol. 2012, no. 1-2, pp. 1-2 (bibtex)
Kateřina Veselovská, Jan Hajič, jr., Jana Šindlerová (2012): Creating annotated resources for polarity classification in Czech. In: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012, pp. 296-304, Eigenverlag ÖGAI, Wien, Austria, ISBN 3-85027-005-X (pdf, bibtex)
Kateřina Veselovská, Giang Linh Nguy, Michal Novák (2012): Using Czech-English Parallel Corpora in Automatic Identification of It. In: The Fifth Workshop on Building and Using Comparable Corpora, pp. 112-120, European Language Resources Association, İstanbul, Turkey (local PDF, bibtex)
Barbora Vidová Hladká, Zdeňka Urešová (2012): Syntactic annotation of transcriptions in the Czech Academic Corpus: Then and now. In: Proceedings of the fifth Corpus Linguistics Conference, pp. 1-4, University of Liverpool, Liverpool, UK (local DOC, bibtex)
Daniel Zeman (2012): CUNI: Feature Selection and Error Analysis of a Transition-Based Parser. In: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), pp. 143-148, The COLING 2012 Organizing Committee, Mumbai, India (url, local PDF, bibtex)
Daniel Zeman (2012): Data Issues of the Multilingual Translation Matrix. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 395-400, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, bibtex)
Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič (2012): HamleDT: To Parse or Not to Parse?. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2735-2741, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, local PDF, bibtex)
Zdeněk Žabokrtský (2012): Machine Translation using Dependency Trees. In: Proceedings of 22nd Theorietag der Fachgruppe "Automaten und Formale Sprachen" der Gesellschaft für Informatik, pp. 21-26, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-221-4 (bibtex)
Eduard Bejček, Pavel Straňák, Daniel Zeman (2011): Influence of Treebank Design on Representation of Multiword Expressions. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6608, pp. 1-14 (url, local PDF, bibtex)
Jan Berka, Martin Černý, Ondřej Bojar (2011): Quiz-Based Evaluation of Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 95, pp. 77-86 (pdf, local PDF, bibtex)
Ondřej Bojar (2011): Analyzing Error Types in English-Czech Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 95, pp. 63-76 (pdf, local PDF, bibtex)
Ondřej Bojar, Miloš Ercegovčević, Martin Popel, Omar F. Zaidan (2011): A Grain of Salt for the WMT Manual Evaluation. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 1-11, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (pdf, local PDF, local PDF, bibtex)
Ondřej Bojar, Petra Galuščáková, Miroslav Týnovský (2011): Evaluating Quality of Machine Translation from Czech to Slovak. In: Information Technologies – Applications and Theory, pp. 3-9, Univerzita Pavla Jozefa Šafárika v Košiciach, Košice, Slovakia, ISBN 978-80-89557-01-1 (local PDF, bibtex)
Ondřej Bojar, Aleš Tamchyna (2011): Improving Translation Model by Monolingual Data. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 330-336, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, local PDF, bibtex)
Radek Čech, Ján Mačutek, Zdeněk Žabokrtský (2011): The role of syntax in complex networks: Local and global importance of verbs in a syntactic dependency network. In: Physica A: Statistical Mechanics and its Applications, ISSN 0378-4371, 390, pp. 3614-3623 (bibtex)
Mark Fishel, Ondřej Bojar, Daniel Zeman, Jan Berka (2011): Automatic Translation Error Analysis. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6836, pp. 72-79 (url, local PDF, local PDF, local ODP, bibtex)
Petra Galuščáková, Ondřej Bojar (2011): Czech-Slovak Parallel Corpora. In: Natural Language Processing, Multilinguality, pp. 65-71, Tribun EU, Bratislava, Slovakia, ISBN 978-80-263-0049-6 (local PDF, bibtex)
Nathan David Green (2011): Effects of Noun Phrase Bracketing in Dependency Parsing and Machine Translation. In: Proceedings of the ACL 2011 Student Session, pp. 69-74, Association for Computational Linguistics, Portland, OR, USA, ISBN 978-1-932432-89-3 (url, bibtex)
Nathan David Green (2011): Dependency Parsing. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 137-142, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (pdf, bibtex)
Jan Hajič, Jakub Mlynář (2011): Archiv vizuální historie přístupný v Centru Malach. In: Archivní časopis, ISSN 0004-0398, vol. 61, no. 4, pp. 428-439 (bibtex)
Eva Hajičová (2011): Computational Linguistics without Linguistics? View from Prague. In: Linguistic Issues in Language Technology, ISSN 1945-3604, vol. 6, no. 6, pp. 1-22 (url, local PDF, bibtex)
Ondřej Hálek, Rudolf Rosa, Aleš Tamchyna, Ondřej Bojar (2011): Named Entities from Wikipedia for Machine Translation. In: Information Technologies – Applications and Theory, pp. 23-30, Univerzita Pavla Jozefa Šafárika v Košiciach, Košice, Slovakia, ISBN 978-80-89557-02-8 (local PDF, local PDF, local PDF, bibtex)
Jirka Hana, Anna Feldman, Katsiaryna Aharodnik (2011): A low-budget tagger for Old Czech. In: Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pp. 10-18, Association for Computational Linguistics, Portland, OR, USA, ISBN 978-1-937284-04-6 (bibtex)
Barbora Hladká, Alevtina Bémová, Zdeňka Urešová (2011): Syntaktická proměna Českého akademického korpusu. In: Slovo a slovesnost, ISSN 0037-7031, 4, pp. 268-287 (local PDF, bibtex)
Barbora Hladká, Jiří Mírovský, Jan Kohout (2011): An attractive game with the document: (im)possible?. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 96, pp. 5-26 (pdf, local PDF, bibtex)
Jaroslava Hlaváčová (2011): Problém variantních tvarů slov při automatickém zpracování jazyka. In: Information Technologies – Applications and Theory, pp. 75-78, Univerzita Pavla Jozefa Šafárika v Košiciach, Košice, Slovakia, ISBN 978-80-89557-01-1 (local PDF, bibtex)
Jaroslava Hlaváčová, Michal Hrušecký (2011): Prefix Recognition Experiments. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6836, pp. 235-242 (url, local PDF, bibtex)
Bushra Jawaid (2011): Machine Translation with Significant Word Reordering and Rich Target-Side Morphology. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 161-166, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (pdf, local PDF, bibtex)
Bushra Jawaid, Daniel Zeman (2011): Word-Order Issues in English-to-Urdu Statistical Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 95, pp. 87-106 (url, bibtex)
Pavlína Jínová (2011): Vybrané problematické aspekty konektivních prostředků v rámci anotace mezivýpovědních významových vztahů v PDT. In: Bohemica Olomucensia, ISSN 1803-876X, 2, pp. 138-147 (local PDF, bibtex)
Pavlína Jínová, Lucie Mladová, Jiří Mírovský (2011): Sentence Structure and Discourse Structure: Possible Parallels. In: Proceedings of the International Conference on Dependency Linguistics (Depling 2011), pp. 233-240, Universitat Pompeu Fabra, Barcelona, Spain, ISBN 978-84-615-1834-0 (local PDF, bibtex)
Václava Kettnerová (2011): Lokativní sémantická diateze v češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 72, no. 2, pp. 83-101 (bibtex)
Václava Kettnerová, Markéta Lopatková (2011): The Lexicographic Representation of Czech Diatheses: Rule Based Approach. In: Natural Language Processing, Multilinguality, pp. 89-100, Tribun EU, Bratislava, Slovakia, ISBN 978-80-263-0049-6 (bibtex)
Natalia Klyueva, Naděžda Runštuková (2011): Translating prepositions from Czech into Russian: challenges for the Machine Translation. In: Natural Language Processing, Multilinguality, pp. 101-108, Tribun EU, Bratislava, Slovakia, ISBN 978-80-263-0049-6 (bibtex)
Vladislav Kuboň, Markéta Lopatková (2011): Studying Properties of Czech Complex Sentences from an Annotated Corpus. In: Proceedings of the 24th International Florida Artificial Intelligence Research Society Conference (FLAIRS 2011), pp. 180-185, The AAAI Press, Menlo Park, CA, USA, ISBN 978-1-57735-501-4 (bibtex)
Septina Dian Larasati, Vladislav Kuboň, Daniel Zeman (2011): Indonesian Morphology Tool (MorphInd): Towards an Indonesian Corpus. In: Communications in Computer and Information Science, ISSN 1865-0929, 100, pp. 119-129 (url, bibtex)
Matouš Macháček, Ondřej Bojar (2011): Approximating a Deep-Syntactic Metric for MT Evaluation and Tuning. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 92-98, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, local PDF, local DOC, local PDF, bibtex)
Martin Majliš, Zdeněk Žabokrtský (2011): W2C - Large Multilingual Corpus (technical report). In: (url, local PDF, bibtex)
David Mareček (2011): Combining Diverse Word-Alignment Symmetrizations Improves Dependency Tree Projection. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6608, pp. 144-154 (url, bibtex)
David Mareček, Rudolf Rosa, Petra Galuščáková, Ondřej Bojar (2011): Two-step translation with grammatical post-processing. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 426-432, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, local PDF, local PDF, bibtex)
David Mareček, Zdeněk Žabokrtský (2011): Gibbs Sampling with Treeness constraint in Unsupervised Dependency Parsing. In: Robust Unsupervised and Semisupervised Methods in Natural Language Processing, pp. 1-8, Incoma, Šumen, Bulgaria, ISBN 978-954-452-017-5 (bibtex)
David Mareček, Zdeněk Žabokrtský (2011): Unsupervised Dependency Parsing (technical report). In: (pdf, bibtex)
Marie Mikulová (2011): Významová reprezentace elipsy. In: , ISBN 978-80-904175-9-5 (bibtex)
Marie Mikulová (2011): Významová reprezentace elipsy (PhD thesis). In: (bibtex)
Marie Mikulová, Jana Hoffmannová (2011): Korpusy mluvené češtiny a možnosti jejich využití pro poznání rozdílných "světů" mluvenosti a psanosti. In: Korpusová lingvistika Praha 2011. 2 Výzkum a výstavba korpusů, pp. 78-92, Lidové noviny, Praha, ISBN 978-80-7422-115-6 (bibtex)
Jakub Mlynář (2011): Centrum vizuální historie Malach při Matematicko-fyzikální fakultě Univerzity Karlovy. In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, vol. 2011, no. 2, pp. 127-129 (bibtex)
Anna Nedoluzhko (2011): Rozšířená textová koreference a asociační anafora (Koncepce anotace českých dat v Pražském závislostním korpusu). In: , ISBN 978-80-904571-2-6 (local PDF, local PDF, bibtex)
Anna Nedoluzhko, Jiří Mírovský (2011): Annotating Extended Textual Coreference and Bridging Relations in the Prague Dependency Treebank (technical report). In: , pp. 1-69 (url, bibtex)
Giang Linh Nguy, Michal Novák, Anna Nedoluzhko (2011): Coreference Resolution in the Prague Dependency Treebank (technical report). In: , pp. 1-66 (pdf, bibtex)
Giang Linh Nguy, Magda Ševčíková (2011): Unstated Subject Identification in Czech. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 149-154, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (pdf, bibtex)
Giang Linh Nguy, Zdeněk Žabokrtský (2011): Coreference of Deletions – The Case of Control. In: Proceedings of the 5th International Conference on Meaning-Text Theory, Barcelona, September 8 – 9, 2011, pp. 186-195, Universitat Pompeu Fabra, Barcelona, Spain, ISBN 978-84-615-1716-9 (pdf, bibtex)
Michal Novák (2011): Utilization of Anaphora in Machine Translation. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 155-160, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (pdf, bibtex)
Michal Novák, Zdeněk Žabokrtský (2011): Resolving Noun Phrase Coreference in Czech. In: Lecture Notes in Computer Science, ISSN 0302-9743, 7099, pp. 24-34 (url, bibtex)
Jarmila Panevová (2011): O rezultativnosti (především) v češtině. In: Граматика и лексика у словенским језицима, pp. 165-176, Матица српска, Нови Сад / Београд, Serbia, ISBN 978-86-82873-32-7 (bibtex)
Jarmila Panevová (2011): On Syntax and Semantics of Czech Infinitival Constructions: A Case Study. In: Slovo i jazyk. Sbornik statej k vosmidesjatiletiju akad. J. D. Apresjana, pp. 537-547, Jazyki slavjanskich kul'tur, Moskva, Russia, ISBN 978-5-9551-0478-2 (bibtex)
Jarmila Panevová (2011): Infinitiv ve funkci atributu. In: Kapitoly z české gramatiky, pp. 945-960, Academia, Praha, Czechia, ISBN 978-80-200-1845-8 (bibtex)
Jarmila Panevová (2011): Absolutní a relativní čas v souvětí a vazbách přechodníkových. In: Kapitoly z české gramatiky, pp. 1121-1126, Academia, Praha, Czechia, ISBN 978-80-200-1845-8 (bibtex)
Jarmila Panevová (2011): Nominalizace vyjádřené slovesnými adjektivy. In: Kapitoly z české gramatiky, pp. 961-973, Academia, Praha, Czechia, ISBN 978-80-200-1845-8 (bibtex)
Jarmila Panevová (2011): Vybrané problémy ze slovesné valence. In: Kapitoly z české gramatiky, pp. 913-920, Academia, Praha, Czechia, ISBN 978-80-200-1845-8 (bibtex)
Jarmila Panevová, Marie Mikulová (2011): Problém elipsy: Co s ním a kam s ním?. In: Prace Filologiczne, ISSN 0138-0567, 60, pp. 225-232 (bibtex)
Jarmila Panevová, Alexandr Rosen (2011): Zvláštní případy shody: doplněk u infinitivu. In: Kapitoly z české gramatiky, pp. 900-909, Academia, Praha, Czechia, ISBN 978-80-200-1845-8 (bibtex)
Jarmila Panevová, Magda Ševčíková (2011): Jak se počítají substantiva v češtině: poznámky ke kategorii čísla. In: Slovo a slovesnost, ISSN 0037-7031, 72, pp. 163-176 (local PDF, bibtex)
Jarmila Panevová, Magda Ševčíková (2011): Delimitation of information between grammatical rules and lexicon. In: Proceedings of the International Conference on Dependency Linguistics (Depling 2011), pp. 173-182, Universitat Pompeu Fabra, Barcelona, Spain, ISBN 978-84-615-1834-0 (pdf, local PDF, bibtex)
Pavel Pecina (2011): Book Reviews: Syntax-Based Collocation Extraction by Violeta Seretan (review). In: Computational Linguistics, ISSN 1530-9312, vol. 37, no. 3, pp. 631-633 (bibtex)
Martin Popel, David Mareček, Nathan David Green, Zdeněk Žabokrtský (2011): Influence of Parser Choice on Dependency-Based MT. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 433-439, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (bibtex)
Česlav Przywara, Ondřej Bojar (2011): eppex: Epochal Phrase Table Extraction for Statistical Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 96, pp. 89-98 (url, local PDF, bibtex)
Loganathan Ramasamy (2011): TamilTB: An Effort Towards Building a Dependency Treebank for Tamil. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 143-148, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (pdf, bibtex)
Loganathan Ramasamy, Zdeněk Žabokrtský (2011): Tamil dependency parsing: results using rule based and corpus based approaches. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6608, pp. 82-95 (url, bibtex)
Loganathan Ramasamy, Zdeněk Žabokrtský (2011): Tamil Dependency Treebank (TamilTB) - 0.1 Annotation Manual (technical report). In: (bibtex)
Kateřina Rysová (2011): The Word Order of Inner Participants in Czech, Considering the Systemic Ordering of Actor and Patient. In: Proceedings of the International Conference on Dependency Linguistics (Depling 2011), pp. 183-192, Universitat Pompeu Fabra, Barcelona, Spain, ISBN 978-84-615-1834-0 (url, local PDF, bibtex)
Kateřina Rysová (2011): Ke slovosledu v konstrukcích s obligatorními participanty (se zaměřením na slovosled patientu a způsobového slovesného doplnění). In: Korpusová lingvistika Praha 2011 Gramatika a značkování korpusů, pp. 62-69, Ústav českého národního korpusu, Prague, Czech Republic, ISBN 978-80-7422-116-3 (local PDF, bibtex)
Petr Sgall (2011): Jazyk, mluvení, psaní. In: , ISBN 978-80-246-1903-3 (url, bibtex)
Petr Sgall (2011): Ke zkoumání českého jazyka. In: Vesmír, ISSN 0042-4544, 90, pp. 372-373 (bibtex)
Petr Sgall (2011): Příběh Oskara Sgalla. In: Apel, Zpravodaj Svazu osvobozených politických vězňů a pozůstalých, 10, pp. 3-4 (bibtex)
Petr Sgall (2011): Pamětnice Věra Holznerová Jílková. In: Apel, Zpravodaj Svazu osvobozených politických vězňů a pozůstalých, 10, pp. 6-7 (bibtex)
Johanka Spoustová, Miroslav Spousta (2011): Comparable Fora. In: Proceedings of the 4th Workshop on Building and Using Comparable Corpora, pp. 96-101, Association for Computational Linguistics, Portland, OR, USA, ISBN 978-1-937284-01-5 (url, local PDF, bibtex)
Magda Ševčíková, Jarmila Panevová, Lenka Smejkalová (2011): Specificity of the number of nouns in Czech and its annotation in Prague Dependency Treebank. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 96, pp. 27-47 (pdf, local PDF, bibtex)
Svatava Škodová, Barbora Štindlová, Jirka Hana, Alexandr Rosen (2011): Víceúrovňová anotace českého žákovského korpusu. In: Korpusová lingvistika Praha 2011: 3 - Gramatika a značkování korpusů, pp. 208-225, Nakladatelství Lidové noviny, Praha, Czechia, ISBN 978-80-7422-116-3 (bibtex)
Zdeňka Urešová (2011): Valence sloves v Pražském závislostním korpusu. In: , ISBN 978-80-904571-0-2 (bibtex)
Zdeňka Urešová (2011): Valence sloves v Pražském závislostním korpusu (PhD thesis). In: (local PDF, local PDF, bibtex)
Zdeňka Urešová (2011): Valenční slovník Pražského závislostního korpusu (PDT-Vallex). In: , ISBN 978-80-904571-1-9 (bibtex)
Anna Vernerová (2011): Nominal Valency in Lexicons. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 171-176, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (local PDF, bibtex)
Kateřina Veselovská (2011): Sentence-Level Polarity Detection in a Computer Corpus. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 167-170, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (pdf, bibtex)
Kateřina Veselovská (2011): Kognice 2010: Reprezentace významu (review). In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, pp. 154-155 (url, bibtex)
Kateřina Veselovská (2011): CLARA Joint Training Programme: Course on Treebank Annotation (review). In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, pp. 163-164 (bibtex)
Kateřina Veselovská (2011): Členská negace a způsoby jejího vyjadřování v datech ČNK a PDT 2.0. In: Korpusová lingvistika Praha 2011: Gramatika a značkování korpusů, pp. 49-61, Lidové Noviny, Praha, Czechia, ISBN 978-80-7422-116-3 (bibtex)
Daniel Zeman (2011): Hierarchical Phrase-Based MT at the Charles University for the WMT 2011 Shared Task. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 496-500, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, local PDF, bibtex)
Daniel Zeman, Mark Fishel, Jan Berka, Ondřej Bojar (2011): Addicter: What Is Wrong with My Translations?. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 96, pp. 79-88 (pdf, local PDF, local PDF, bibtex)
Zdeněk Žabokrtský (2011): Treex – an open-source framework for natural language processing. In: Information Technologies – Applications and Theory, pp. 7-14, Univerzita Pavla Jozefa Šafárika v Košiciach, Košice, Slovakia, ISBN 978-80-89557-02-8 (bibtex)
Eduard Bejček, Václava Kettnerová, Markéta Lopatková (2010): Advanced Searching in the Valency Lexicons Using PML-TQ Search Engine. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 51-58, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (url, local PDF, local PDF, bibtex)
Eduard Bejček, Pavel Straňák (2010): Annotation of Multiword Expressions in the Prague Dependency Treebank. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 44, no. 1-2, pp. 7-21 (url, local PDF, bibtex)
Ondřej Bojar (2010): Vládce jazyků. In: Maxim, ISSN 1214-1569, 2010/10, pp. 52-53 (bibtex)
Ondřej Bojar, Kamil Kos (2010): 2010 Failures in English-Czech Phrase-Based MT. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 60-66, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (url, bibtex)
Ondřej Bojar, Kamil Kos, David Mareček (2010): Tackling Sparse Data Issue in Machine Translation Evaluation. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 86-91, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-69-5 (url, bibtex)
Ondřej Bojar, Adam Liška, Zdeněk Žabokrtský (2010): Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 447-452, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (bibtex)
Ondřej Bojar, Pavel Straňák, Daniel Zeman (2010): Data Issues in English-to-Hindi Machine Translation. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 1771-1777, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, local ODP, local PDF, bibtex)
Ondřej Bojar, Jana Šindlerová (2010): Building a Bilingual ValLex Using Treebank Token Alignment: First Observations. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 304-309, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, bibtex)
Silvie Cinková (2010): Aim and result – A Swedish-Czech comparison of consecutive clauses. In: InterCorp: Exploring a Multilingual Corpus, pp. 70-82, Nakladatelství Lidové noviny, Praha, Czechia, ISBN 978-80-7422-042-5 (local DOC, bibtex)
Silvie Cinková, Martin Holub, Pavel Rychlý, Lenka Smejkalová, Jana Šindlerová (2010): Can Corpus Pattern Analysis Be Used in NLP?. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 67-74, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (local PDF, bibtex)
Silvie Cinková, Martin Holub, Lenka Smejkalová (2010): The Lexical Population of Semantic Types in Hanks’s PDEV. In: A Way with Words: Recent Advances in Lexical Theory and Analysis. A Festschrift for Patrick Hanks, pp. 199-214, Menha, Kampala, Uganda, ISBN 978-9970-101-01-6 (local PDF, bibtex)
Radek Čech, Ján Mačutek, Petr Pajas (2010): Full Valency. Verb Valency without Distinguishing Complements and Adjuncts. In: Journal of Quantitative Linguistics, ISSN 0929-6174, vol. 17, no. 4, pp. 291-302 (bibtex)
Jiří Diviš, Ondřej Bojar (2010): Automatic Source Code Reduction. In: Information Technologies – Applications and Theory, pp. 9-16, PONT s. r. o., Seňa, Slovakia, ISBN 978-80-970179-4-1 (bibtex)
Jinhua Du, Pavel Pecina, Andy Way (2010): An Augmented Three-Pass System Combination Framework: DCU Combination System for WMT 2010. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 143-148, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (bibtex)
Anna Feldman, Jirka Hana (2010): A resource-light approach to morpho-syntactic tagging. In: , ISBN 978-90-420-2768-8 (url, bibtex)
Eva Hajičová (2010): Tři otázky pro Petra Karlíka (a tři oříšky pro Popelku). In: Karlík a továrna na lingvistiku, pp. 156-165, Host, Brno, Czechia, ISBN 978-80-7294-412-5 (bibtex)
Eva Hajičová (2010): Rhematizers Revisited. In: Linguistica Pragensia, ISSN 0862-8432, vol. XX, no. 2, pp. 57-70 (local PDF, bibtex)
Eva Hajičová, Anne Abeillé, Jan Hajič, Jiří Mírovský, Zdeňka Urešová (2010): Treebank Annotation. In: Handbook of Natural Language Processing, Second Edition, pp. 167-188, CRC Press, Taylor and Francis Group, Boca Raton, FL, USA, ISBN 978-1-4200-8592-1 (local PDF, bibtex)
Keith Brendan Hall, Václav Novák (2010): Corrective Dependency Parsing. In: Trends in Parsing Technology: Dependency Parsing, Domain Adaptation, and Deep Parsing, pp. 151-168, Springer Science+Business Media B.V., Dordrecht, Netherlands, ISBN 978-90-481-9351-6 (url, bibtex)
Jirka Hana, Anna Feldman (2010): Challenges of Cheap Resource Creation for Morphological Tagging. In: Proceedings of the Fourth Linguistic Annotation Workshop (LAW IV), pp. 197-201, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-932432-72-5 (bibtex)
Jirka Hana, Anna Feldman (2010): A Positional Tagset for Russian. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 1278-1284, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, local PDF, bibtex)
Jirka Hana, Alexandr Rosen, Svatava Škodová, Barbora Štindlová (2010): Error-tagged Learner Corpus of Czech. In: Proceedings of the Fourth Linguistic Annotation Workshop (LAW IV), pp. 11-19, Association for Computational Linguistics, Stroudsburg, USA, ISBN 978-1-932432-72-5 (bibtex)
Patrick Hanks (2010): Elliptical Arguments: a Problem in relating Meaning to Use. In: eLexicography in the 21st century : New challenges, new applications. Proceedings of eLex 2009, Louvain-la-Neuve, 22-24 October 2009, pp. 109-124, Presses universitaires de Louvain, Louvain-la-Neuve, Belgium, ISBN 978-2-87463-211-2 (local PDF, bibtex)
Patrick Hanks (2010): Lexicography, Printing Technology, and the Spread of Renaissance Culture. In: Proceedings of the XIV Euralex International Congress, Leeuwarden, pp. 988-1006, Fryske Akademy, Leeuwarden / Ljouwert, Netherlands, ISBN 978-90-6273-840-3 (bibtex)
Patrick Hanks (2010): Terminology, Phraseology, and Lexicography. In: Proceedings of the XIV Euralex International Congress, Leeuwarden, pp. 1299-1308, Fryske Akademy, Leeuwarden / Ljouwert, Netherlands, ISBN 978-90-6273-840-3 (pdf, bibtex)
Patrick Hanks (2010): How People use Words to Make Meanings. In: Proceedings of 4th International Workshop on Natural Language Processing and Cognitive Science, SciTePress, Funchal, Madeira, Portugal, ISBN 978-989-8425-13-3 (pdf, bibtex)
Patrick Hanks (2010): Compiling a monolingual dictionary for native speakers. In: Lexikos, ISSN 1684-4904, 20, pp. 580-598 (pdf, bibtex)
Patrick Hanks (2010): Nine issues in metaphor theory and analysis (review). In: International Journal of Corpus Linguistics, ISSN 1384-6655, vol. 15, no. 1, pp. 133-150 (bibtex)
Petr Homola, Vladislav Kuboň (2010): Exploiting Charts in the MT Between Related Languages. In: International Journal of Computational Linguistics and Applications, ISSN 0976-0962, vol. 1, no. 1-2, pp. 185-199 (bibtex)
Petr Homola, Vladislav Kuboň (2010): A Method of Hybrid MT for Related Languages. In: Control and Cybernetics, ISSN 0324-8569, vol. 39, no. 2, pp. 421-438 (bibtex)
Petr Homola, Jernej Vičič (2010): Combining MT Systems Effectively. In: Proceedings of the 23rd International Florida-Artificial-Intelligence-Research-Society Conference (FLAIRS 2010), pp. 198-203, Florida AI Research Society, Daytona Beach, Florida, USA, ISBN 978-1-57735-447-5 (bibtex)
Michal Hrušecký, Jaroslava Hlaváčová (2010): Automatické rozpoznávání předpon a přípon s pomocí nástroje Affisix. In: Informačné technológie – Aplikácie a Teória, Zborník príspevkov prezentovaných na konferencii ITAT, pp. 63-67, PONT s. r. o., Seňa, Slovakia, ISBN 978-80-970179-3-4 (local PDF, bibtex)
Max Jakob, Markéta Lopatková, Valia Kordoni (2010): Mapping between Dependency Structures and Compositional Semantic Representations. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 2491-2497, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, bibtex)
Bushra Jawaid (2010): Statistical Machine Translation between Languages with Significant Word Order Difference (masters thesis). In: (local PDF, bibtex)
Elisabetta Jezek, Patrick Hanks (2010): What lexical sets tell us about conceptual categories. In: Lexis, ISSN 1951-6215, 4, pp. 7-22 (pdf, local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková (2010): Representation of Changes in Valency Structure of Verbs in the Valency Lexicon of Czech Verbs. In: Proceedings of Verb 2010, Interdisciplinary Workshop on Verbs, The Identification and Representation of Verb Features, pp. 154-159, Scuola Normale Superiore - Laboratore di Linguistica, Universita di Pisa - Dipartimento di Linguistica, Pisa, Italy (local PDF, bibtex)
Václava Kettnerová, Markéta Lopatková (2010): The Representation of Diatheses in the Valency Lexicon of Czech Verbs. In: Proceedings of the 7th International Conference on Advances in Natural Language Processing (IceTAL 2010), Lecture Notes in Computer Science, ISSN 0302-9743, 6233, pp. 185-196, Springer, Berlin / Heidelberg, ISBN 978-3-642-14769-2 (bibtex)
Natalia Klyueva, Vladislav Kuboň (2010): Verbal Valency in the MT Between Related Languages. In: Proceedings of Verb 2010, Interdisciplinary Workshop on Verbs, The Identification and Representation of Verb Features, pp. 160-164, Scuola Normale Superiore - Laboratore di Linguistica, Universita di Pisa - Dipartimento di Linguistica, Pisa, Italy (local PDF, local DOC, local PDF, bibtex)
Natalia Klyueva, David Mareček (2010): Towards Parallel Czech-Russian Dependency Treebank. In: Workshop on Annotation and Exploitation of Parallel Corpora, NEALT Proceedings Series, ISSN 1736-6305, 10, pp. 44-52, Northern European Association for Language Technology, Tartu, Estonia (local PDF, local PDF, bibtex)
Veronika Kolářová (2010): Valence deverbativních substantiv v češtině (na materiálu substantiv s dativní valencí). In: , ISBN 978-80-246-1828-9 (bibtex)
David Kolovratník (2010): Exodus – Exploring SMT for EU Institutions. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 116-120, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (pdf, bibtex)
Vladislav Kuboň, Markéta Lopatková (2010): Od segmentů ke klauzím v češtině - analýza vybraných jevů. In: Informačné technológie – Aplikácie a Teória, Zborník príspevkov prezentovaných na konferencii ITAT, pp. 76-80, PONT s. r. o., Seňa, Slovakia, ISBN 978-80-970179-3-4 (local PDF, bibtex)
Septina Dian Larasati, Vladislav Kuboň (2010): A Study of Indonesian-to-Malaysian MT. In: The 4th International MALINDO Workshop 2010, pp. 16-22, Universitas Indonesia, Jakarta, Indonesia (bibtex)
Markéta Lopatková (2010): Valency Lexicon of Czech Verbs: Towards Formal Description of Valency and Its Modeling in an Electronic Language Resource (habilitation). In: (local PDF, bibtex)
Markéta Lopatková, František Mráz, Martin Plátek (2010): Towards a formal model of natural language description based on restarting automata with parallel DR-structures. In: Information Technologies – Applications and Theory, pp. 25-32, PONT s. r. o., Seňa, Slovakia, ISBN 978-80-970179-4-1 (pdf, local PDF, bibtex)
Markéta Lopatková, Martin Plátek (2010): O zlomovém bodu introspekce a souvisejících problémech. In: Padesát je málo. Komorně laděný sborník u příležitosti 50. narozenin profesora Jana Hajiče, pp. 34-37, Univerzita Karlova v Praze, Praha, Czechia (local PDF, bibtex)
David Mareček, Martin Popel, Zdeněk Žabokrtský (2010): Maximum Entropy Translation Model in Dependency-Based MT Framework. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 201-201, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (pdf, bibtex)
Marie Mikulová, Jan Štěpánek (2010): Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 1836-1839, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, local PDF, bibtex)
Jiří Mírovský, Lucie Mladová, Šárka Zikánová (2010): Connective-Based Measuring of the Inter-Annotator Agreement in the Annotation of Discourse in PDT. In: Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010), pp. 775-781, Tsinghua University Press, Beijing, China, ISBN 978-7-302-23456-2 (local PDF, bibtex)
Jiří Mírovský, Lucie Mladová, Zdeněk Žabokrtský (2010): Annotation Tool for Discourse in PDT. In: Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010), pp. 9-12, Tsinghua University Press, Beijing, China, ISBN 978-7-302-23456-2 (local PDF, bibtex)
Jiří Mírovský, Petr Pajas, Anna Nedoluzhko (2010): Annotation Tool for Extended Textual Coreference and Bridging Anaphora. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 168-171, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, bibtex)
Lucie Mladová (2010): Corpus Linguistics Conference 2009 (review). In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, pp. 178-179 (bibtex)
Lucie Mladová, Zuzanna Bedřichová (2010): Anotace mezivýpovědních textových vztahů na Ústavu formální a aplikované lingvistiky MFF UK. In: Studie z aplikované lingvistiky / Studies in Applied Linguistics (SALi), ISSN 1804-3240, vol. 1, no. 1, pp. 160-161 (url, local PDF, bibtex)
Michal Novák (2010): Machine Learning Approach to Anaphora Resolution (masters thesis). In: (pdf, bibtex)
Pavel Novák, Petr Sgall (2010): On the Prague functional approach. In: Travaux linguistiques de Prague /3, pp. 291-297, 1968, reprinted. In: Lingvistika a jazyková realita. Výbor z díla, pp. 90-95, Akropolis, Praha, Czechia, ISBN 978-80-87310-04-5 (bibtex)
Jarmila Panevová (2010): "Být posel dobrých zpráv je mi příjemné" (Několik poznámek k infinitivním konstrukcím). In: Karlík a továrna na lingvistiku, pp. 345-354, Host, Brno, Czechia, ISBN 978-80-7294-412-5 (bibtex)
Jarmila Panevová (2010): Kategorie pojmenovávací a usouvztažňovací (Jak František Daneš rozvíjí Viléma Mathesia). In: Užívání a prožívání jazyka. K 90. narozeninám Františka Daneše, pp. 21-26, Karolinum, Praha, Czechia, ISBN 978-80-246-1756-5 (bibtex)
Jarmila Panevová (2010): Ke vztahu kognitivního obsahu a jazykového významu. In: Korpus – gramatika – axiologie, ISSN 1804-137X, vol. 1, no. 1, pp. 30-40 (pdf, local PDF, bibtex)
Jarmila Panevová (2010): Významné životní jubileum Adely Rechziegelové. In: Slovo a slovesnost, ISSN 0037-7031, vol. 71, no. 2, pp. 153-154 (bibtex)
Jarmila Panevová, Marie Mikulová (2010): A Few Czech Additions to Nedjalkov's Typology of Reciprocals and Reflexives. In: Problemy grammatiki i tipologii, pp. 253-261, Znak, Moskva, ISBN 978-5-9551-0385-3 (bibtex)
Jarmila Panevová, Magda Ševčíková (2010): Annotation of Morphological Meanings of Verbs Revisited. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 1491-1498, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (url, bibtex)
Pavel Pecina (2010): Lexical Association Measures and Collocation Extraction. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 44, no. 1-2, pp. 137-158 (bibtex)
Martin Plátek, Markéta Lopatková (2010): On the Prague Group of Mathematical and Algebraic Linguistics and Its Formal Tools. In: Proceedings of 20. Theorietag der GI-Fachgruppe AFS; Automaten und Formale Sprachen, pp. 27-32, Universität Kassel, Kassel, Germany (local PDF, bibtex)
Martin Plátek, František Mráz, Markéta Lopatková (2010): (In)Dependencies in Functional Generative Description by Restarting Automata. In: Proceedings of the Second Workshop on Non-Classical Models for Automata and Applications, NCMA 2010, pp. 155-170, Österreichische Computer Gesellschaft, Wien, Austria, ISBN 978-3-85403-263-2 (local PDF, bibtex)
Martin Plátek, František Mráz, Markéta Lopatková (2010): Restarting Automata with Structured Output and Functional Generative Description. In: Proceedings of the Fourth International Conference Language and Automata Theory and Applications, LATA 2010, Lecture Notes in Computer Science, ISSN 0302-9743, 6031, pp. 500-511, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-13088-5 (url, bibtex)
Martin Popel (2010): English-Czech Machine Translation Using TectoMT. In: WDS 2010 Proceedings of Contributed Papers, pp. 88-93, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-139-2 (pdf, local PDF, bibtex)
Martin Popel, David Mareček (2010): Perplexity of n-gram and Dependency Language Models. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 173-180, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (local PDF, local PDF, bibtex)
Martin Popel, Zdeněk Žabokrtský (2010): TectoMT: Modular NLP Framework. In: Proceedings of the 7th International Conference on Advances in Natural Language Processing (IceTAL 2010), Lecture Notes in Computer Science, ISSN 0302-9743, 6233, pp. 293-304, Springer, Berlin / Heidelberg, ISBN 978-3-642-14769-2 (local PDF, local PDF, bibtex)
Jan Ptáček, Pavel Ircing, Miroslav Spousta, Jan Romportl, Zdeněk Loose, Silvie Cinková, José Relaño Gil, Raúl Santos (2010): Integration of Speech and Text Processing Modules into a Real-Time Dialogue System. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 6231, no. 6231/2010, pp. 552-559, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (url, local PDF, bibtex)
Kateřina Rysová (2010): Jak volný je "volný český slovosled"? (O slovosledu české a německé výpovědi). In: Kulturní translace / Kulturelle Translationen / Translacje kulturowe: příspěvky z mezinárodní studentské konference InterFaces VII v červnu 2009 v Praze, pp. 77-81, Akropolis Praha, Praha, Czechia, ISBN 978-80-87310-10-6 (bibtex)
Kateřina Rysová (2010): Valence jako slovosledný faktor - jak dalece ovlivňuje slovosled české a německé výpovědi. In: Mnohojazyčný korpus InterCorp: Možnosti studia, pp. 165-170, Lidové Noviny, Praha, Czechia, ISBN 978-80-7422-058-6 (bibtex)
Petr Sgall (2010): V postoloprtském lágru. In: Apel, Zpravodaj Svazu osvobozených politických vězňů a pozůstalých, vol. 9, no. 3, pp. 10-11 (bibtex)
Petr Sgall (2010): Uvedení do syntaxe (Větná stavba). In: Mluvnice současné češtiny 1: Jak se píše a jak se mluví, pp. 301-308, Karolinum, Praha, Czechia, ISBN 978-80-246-1743-5 (url, bibtex)
Petr Sgall (2010): Význam a obsah. In: Užívání a prožívání jazyka. K 90. narozeninám Františka Daneše, pp. 63-66, Karolinum, Praha, Czechia, ISBN 978-80-246-1756-5 (bibtex)
Petr Sgall (2010): Perspektivy standardní češtiny. In: Jazykovědné aktuality, ISSN 1212-5326, XLVII, pp. 73-94 (bibtex)
Petr Sgall (2010): Nadějný mladý lingvista. In: Lingvistika a jazyková realita. Výbor z díla, pp. 28-33, Akropolis, Praha, Czechia, ISBN 978-80-87310-04-5 (bibtex)
Petr Sgall (2010): Markéta Lopatková – Zdeněk Žabokrtský – Václava Kettnerová: Valenční slovník českých sloves. Praha: Karolinum, 2008, 381s. (review). In: Slovo a slovesnost, ISSN 0037-7031, vol. 71, no. 2, pp. 145-149 (bibtex)
Otakar Smrž, Jan Hajič (2010): The Other Arabic Treebank: Prague Dependencies and Functions. In: Arabic Computational Linguistics, pp. 1-33, CSLI Publications, Stanford, CA, USA, ISBN 978-1-57586-544-7 (pdf, bibtex)
Drahomíra Spoustová, Miroslav Spousta (2010): Dependency Parsing as a Sequence Labeling Task. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 94, pp. 7-14 (pdf, local PDF, bibtex)
Drahomíra Spoustová, Miroslav Spousta, Pavel Pecina (2010): Building a Web Corpus of Czech. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 998-1001, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, bibtex)
Jana Straková (2010): When Informatics Meets Neuroscience: Software and Statistics for Human Brain Imaging. In: WDS 2010 Proceedings of Contributed Papers, pp. 94-96, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-139-2 (local PDF, bibtex)
Jana Straková, Pavel Pecina (2010): Czech Information Retrieval with Syntax-based Language Models. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 1359-1362, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (pdf, local PDF, bibtex)
Pavel Straňák (2010): Annotation of Multiword Expressions in The Prague Dependency Treebank (PhD thesis). In: (local PDF, local PDF, bibtex)
Pavel Straňák, Jan Štěpánek (2010): Representing Layered and Structured Data in the CoNLL-ST Format. In: Proceedings of the Second International Conference on Global Interoperability for Language Resources, pp. 143-152, City University of Hong Kong, Hong Kong, China, ISBN 978-962-442-323-5 (local PDF, local PDF, bibtex)
Magda Ševčíková (2010): Kondicionál přítomný jako součást explicitních performativních formulí. In: Korpus – gramatika – axiologie, ISSN 1804-137X, vol. 1, no. 1, pp. 41-62 (pdf, local PDF, bibtex)
Magda Ševčíková, Jarmila Panevová, Zdeněk Žabokrtský (2010): Grammatical number of nouns in Czech: linguistic theory and treebank annotation. In: Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories, NEALT Proceedings Series, ISSN 1736-6305, 9, pp. 211-222, Northern European Association for Language Technology, Tartu, Estonia (pdf, local PDF, bibtex)
Jan Štěpánek, Petr Pajas (2010): Querying Diverse Treebanks in a Uniform Way. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 1828-1835, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, local PDF, bibtex)
Aleš Tamchyna, Ondřej Bojar (2010): Bohatá anotace ve frázovém strojovém překladu. In: Informačné technológie – Aplikácie a Teória, Zborník príspevkov prezentovaných na konferencii ITAT, pp. 99-106, PONT s. r. o., Seňa, Slovakia, ISBN 978-80-970179-3-4 (bibtex)
Zdeňka Urešová (2010): PDT-Vallex - trochu jiný valenční slovník. In: Slovo – Tvorba – Dynamickosť. Na počesť Kláry Buzássyovej, pp. 278-286, Veda, Bratislava, Slovakia, ISBN 978-80-224-1107-3 (local PDF, bibtex)
Kateřina Veselovská (2010): Členská negace a způsob jejího vyjadřování v současné češtině (masters thesis). In: (local PDF, bibtex)
Jernej Vičič, Petr Homola (2010): Speeding up the Implementation Process of a Shallow Transfer Machine Translation System. In: Proceedings of the 14th EAMT Conference, pp. 261-268, European Association for Machine Translation, Saint Raphaël, France (bibtex)
Daniel Zeman (2010): Morphological Stickers for Annotation of Check. In: Padesát je málo. Komorně laděný sborník u příležitosti 50. narozenin profesora Jana Hajiče, pp. 65-70, Univerzita Karlova v Praze, Praha, Czechia (local PDF, bibtex)
Daniel Zeman (2010): Using TectoMT as a Preprocessing Tool for Phrase-Based Statistical Machine Translation. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 216-223, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (local PDF, local DOC, bibtex)
Daniel Zeman (2010): Hierarchical Phrase-Based MT at the Charles University for the WMT 2010 Shared Task. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 212-215, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (url, local PDF, local PDF, local ODP, bibtex)
Daniel Zeman (2010): Hard Problems of Tagset Conversion. In: Proceedings of the Second International Conference on Global Interoperability for Language Resources, pp. 181-185, City University of Hong Kong, Hong Kong, China, ISBN 978-962-442-323-5 (local PDF, bibtex)
Šárka Zikánová, Lucie Mladová, Jiří Mírovský, Pavlína Jínová (2010): Typical Cases of Annotators’ Disagreement in Discourse Annotations in Prague Dependency Treebank. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 2002-2006, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (local PDF, bibtex)
Zdeněk Žabokrtský (2010): From Treebanking to Machine Translation (habilitation). In: (bibtex)
Zdeněk Žabokrtský (2010): Výzkum strojového překladu v Centru komputační lingvistiky MFF UK. In: Forum, ISSN 1211-1724, pp. 42-43 (pdf, bibtex)
Анна Юрьевна Недолужко (2010): Кореферентные отношения в тексте – сравнительный анализ размеченных данных. In: Computational Linguistics and Intellectual Technologies Papers from the Annual International Conference "Dialogue" (2010), Компьютерная лингвистика и интеллектуальные технологии, Issue 9 (16), pp. 350-356, Изд-во РГГУ, Bekasovo, Russia, ISBN 978-5-7281-1148-1 (url, local PDF, bibtex)
Eneko Agirre, Enrique Alfonseca, Keith Brendan Hall, Jana Kravalová, Marius Pasca, Aitor Soroa (2009): A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches. In: Proceedings of NAACL-HLT 09, pp. 19-27, Association for Computational Linguistics, Boulder, CO, USA, ISBN 978-1-932432-41-1 (pdf, bibtex)
Zuzanna Bedřichová (2009): Problems and Possibilities of the Annotation of the Interpropositional Discourse Relations in PDT 2.0. In: Czech in Formal Grammar, pp. 1-8, Lincom München, München, Germany, ISBN 978-3-89586-282-3 (bibtex)
Eduard Bejček (2009): Automatické přiřazování valenčních rámců a jejich slévání. In: Informačné Technológie – Aplikácie a Teória. Zborník príspevkov, ITAT 2009, pp. 9-14, PONT s.r.o., Seňa, Slovakia, ISBN 978-80-970179-1-0 (local PDF, bibtex)
Eduard Bejček, Pavel Straňák, Jan Hajič (2009): Finalising Multiword Annotations in PDT. In: Proceedings of 8th Treebanks and Linguistic Theories Workshop (TLT), pp. 17-25, Università Cattolica del Sacro Cuore, Milano, Italy, ISBN 978-88-8311-712-1 (local PDF, local PDF, local PDF, bibtex)
Ondřej Bojar (2009): Exploiting Linguistic Data in Machine Translation. In: , ISBN 978-80-904175-8-8 (local PDF, bibtex)
Ondřej Bojar, David Mareček, Václav Novák, Martin Popel, Jan Ptáček, Jan Rouš, Zdeněk Žabokrtský (2009): English-Czech MT in 2008. In: Proceedings of the Fourth Workshop on Statistical Machine Translation, pp. 125-129, Association for Computational Linguistics, Athina, Greece (pdf, local PDF, bibtex)
Ondřej Bojar, Pavel Straňák, Daniel Zeman, Gaurav Jain, Michal Hrušecký, Michal Richter, Jan Hajič (2009): English-Hindi Translation – Obtaining Mediocre Results with Bad Data and Fancy Models. In: Proceedings of ICON 2009: 7th International Conference on Natural Language Processing, pp. 316-321, Macmillan Publishers, India, Hyderabad, India, ISBN 978-023-032-845-7 (local PDF, local PDF, bibtex)
Ondřej Bojar, Miroslav Týnovský (2009): Evaluation of Tree Transfer System (technical report). In: (local PDF, bibtex)
Ondřej Bojar, Zdeněk Žabokrtský (2009): CzEng 0.9, Building a Large Czech-English Automatic Parallel Treebank. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 63-83 (pdf, local PDF, bibtex)
Silvie Cinková (2009): Words that Matter: Towards a Swedish-Czech Colligational Dictionary of Basic Verbs. In: , ISBN 978-80-904175-3-3 (pdf, local PDF, local PDF, bibtex)
Silvie Cinková (2009): A Contrastive Lexical description of Basic Verbs. Examples from Swedish and Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 21-62 (pdf, bibtex)
Silvie Cinková (2009): Semantic Representation of Non-Sentential Utterances in Dialog. In: Proceedings of SRSL 2009, the 2nd Workshop on Semantic Representation of Spoken Language, pp. 26-33, Association for Computational Linguistics, Athina, Greece (url, bibtex)
Silvie Cinková, Josef Toman, Jan Hajič, Kristýna Čermáková, Václav Klimeš, Lucie Mladová, Jana Šindlerová, Kristýna Tomšů, Zdeněk Žabokrtský (2009): Tectogrammatical Annotation of the Wall Street Journal. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 85-104 (pdf, local PDF, bibtex)
Radek Čech, Petr Pajas (2009): Pitfalls of the Transitivity Hypothesis: Transitivity in Conversation and Written Language in Czech. In: Glottotheory – International Journal of Theoretical Linguistics, ISSN 1337-7892, vol. 2, no. 2, pp. 41-42 (pdf, bibtex)
Kristýna Čermáková, Lucie Mladová, Eva Fučíková, Kateřina Veselá (2009): Annotation of Selected Non-dependency Relations in a Dependency Treebank. In: Proceedings of 8th Treebanks and Linguistic Theories Workshop (TLT), pp. 51-57, Università Cattolica del Sacro Cuore, Milano, Italy, ISBN 978-88-8311-712-1 (local PDF, bibtex)
Pavel Češka (2009): Speech Reconstruction - Overview of State-of-the-art Systems. In: WDS'09 Proceedings of Contributed Papers, pp. 11-15, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-101-9 (bibtex)
Jan Hajič, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antònia Martí, Lluís Màrquez, Adam Meyers, Joakim Nivre, Sebastian Padó, Jan Štěpánek, Pavel Straňák, Mihai Surdeanu, Nianwen Xue, Yi Zhang (2009): The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL): Shared Task, pp. 1-18, Association for Computational Linguistics, Boulder, CO, USA, ISBN 978-1-932432-29-9 (url, local PDF, bibtex)
Eva Hajičová (2009): From Prague Structuralism to Treebank Annotation. In: Proceedings of 8th Treebanks and Linguistic Theories Workshop (TLT), pp. 3-5, Università Cattolica del Sacro Cuore, Milano, Italy, ISBN 978-88-8311-712-1 (bibtex)
Eva Hajičová (2009): Information structure from the point of view of the relation of function and form. In: The Prague School and Theories of Structure, pp. 107-127, V&R Unipress, Göttingen, Germany, ISBN 978-3-89971-704-4 (bibtex)
Eva Hajičová, Petr Sgall (2009): The fundamental significance of information structure. In: Language in life, and a life in language: Jacob Mey – A festschrift. Studies in Pragmatics 6, pp. 151-157, Emerald, Bingley, UK, ISBN 978-1-84855-316-3 (url, local PDF, bibtex)
Barbora Hladká, Jiří Mírovský, Pavel Schlesinger (2009): Designing a Language Game for Collecting Coreference Annotation. In: Proceedings of the Third Linguistic Annotation Workshop (LAW III), pp. 52-55, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-52-7 (local PDF, bibtex)
Barbora Hladká, Jiří Mírovský, Pavel Schlesinger (2009): Play the Language: Play Coreference. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 209-212, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-61-9 (local PDF, bibtex)
Jaroslava Hlaváčová (2009): Formalizace systému české morfologie s ohledem na automatické zpracování českých textů (PhD thesis). In: (local PDF, bibtex)
Jaroslava Hlaváčová (2009): Stupňování sloves. In: After Half a Century of Slavonic Natural Language Processing, pp. 85-90, Masaryk University, Brno, Czech Republic, ISBN 978-80-7399-815-8 (bibtex)
Petr Homola (2009): Syntactic Analysis in Machine Translation. In: , ISBN 978-80-904175-7-1 (local PDF, bibtex)
Petr Homola (2009): Syntactic Analysis in Machine Translation (PhD thesis). In: (bibtex)
Petr Homola, Natalia Klyueva, Ondřej Bojar (2009): Towards a Rule-Based Machine Translation System Between Czech and Russian. In: Formal Description of Slavic Languages, pp. 37-38, Universität Potsdam, Potsdam, Germany (local PDF, bibtex)
Petr Homola, Vladislav Kuboň, Pavel Pecina (2009): A Simple Automatic MT Evaluation Metric. In: Proceedings of the Fourth Workshop on Statistical Machine Translation, pp. 33-36, Association for Computational Linguistics, Athina, Greece (url, local PDF, bibtex)
Petr Homola, Vladislav Kuboň, Jernej Vičič (2009): Shallow Transfer Between Slavic Languages. In: Proceedings of Balto-Slavonic Natural Language Processing, pp. 219-232, Polska Akademia Nauk, Kraków, Poland, ISBN 978-83-60434-59-8 (bibtex)
Václava Kettnerová (2009): Konstrukce s rozpadem tématu a dikta v češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 70, no. 3, pp. 163-174 (bibtex)
Václava Kettnerová, Markéta Lopatková (2009): Changes in Valency Structure of Verbs: Grammar vs. Lexicon. In: Slovko 2009, NLP, Corpus Linguistics, Corpus Based Grammar Research, pp. 198-210, Slovenská akadémia vied, Bratislava, Slovakia, ISBN 978-80-7399-875-2 (local PDF, bibtex)
Hana Klempová, Michal Novák, Peter Fabian, Jan Ehrenberger, Ondřej Bojar (2009): Získávání paralelních textů z webu. In: Informačné Technológie – Aplikácie a Teória. Zborník príspevkov, ITAT 2009, pp. 47-54, PONT s.r.o., Seňa, Slovakia, ISBN 978-80-970179-1-0 (local PDF, bibtex)
Václav Klimeš (2009): Detecting and Correcting Errors in an English Tectogrammatical Annotation. In: Proceedings of the 12th International Conference, TSD 2009, pp. 32-39, Springer, Berlin / Heidelberg, ISBN 978-3-642-04207-2 (bibtex)
David Kolovratník, Natalia Klyueva, Ondřej Bojar (2009): Statistical Machine Translation between Related and Unrelated Languages. In: Information Technologies – Applications and Theory, pp. 31-36, PONT s.r.o., Seňa, Slovakia, ISBN 978-80-970179-2-7 (local PDF, local PDF, bibtex)
Kamil Kos, Ondřej Bojar (2009): Evaluation of Machine Translation Metrics for Czech as the Target Language. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 135-148 (pdf, local PDF, bibtex)
Jana Kravalová (2009): Využití syntaxe v metodách pro vyhledávání informací (masters thesis). In: (local PDF, bibtex)
Jana Kravalová, Zdeněk Žabokrtský (2009): Czech Named Entity Corpus and SVM-based Recognizer. In: Proceedings of the 2009 Named Entities Workshop: Shared Task on Transliteration (NEWS 2009), pp. 194-201, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-57-2 (url, bibtex)
Oldřich Krůza, Vladislav Kuboň (2009): Obtaining Hidden Relations from a Syntactically Annotated Corpus - From Word Relationships to Clause Relationships. In: Proceedings of the 22nd International Florida-Artificial-Intelligence-Research-Society Conference (FLAIRS 2009), AAAI Press, Sanibel Island, FL, USA, ISBN 978-1-57735-419-2 (local PDF, bibtex)
Oldřich Krůza, Vladislav Kuboň (2009): Automatic Extraction of Clause Relationships from a Treebank. In: Computational Linguistics and Intelligent Text Processing. 10th International Conference, CICLing 2009, Mexico City, Mexico, March 1-7, 2009, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 5449, no. 5449/2009, pp. 195-206, Springer, Berlin / Heidelberg, ISBN 978-3-642-00381-3 (url, local PDF, bibtex)
Vladislav Kuboň (2009): On the Role of Syntactic Analysis of Natural Languages. In: Proceedings of Malý informatický seminář (MIS 2009), pp. 30-43, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-095-1 (local PDF, bibtex)
Jimmy Lin, Craig G. Murray, Bonnie J. Dorr, Jan Hajič, Pavel Pecina (2009): A Cost-effective Lexical Acquisition Process for Large-scale Thesaurus Translation. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 43, no. 1, pp. 27-40 (pdf, bibtex)
Markéta Lopatková, Tomáš Holan (2009): Segmentation Charts for Czech – Relations among Segments in Complex Sentences. In: Language and Automata Theory and Applications. Third International Conference, LATA 2009, Tarragona, Spain, April 2-8, 2009. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 5457, pp. 542-553, Springer, Berlin / Heidelberg, ISBN 978-3-642-00981-5 (bibtex)
Markéta Lopatková, Natalia Klyueva, Petr Homola (2009): Annotation of Sentence Structure; Capturing the Relationship among Clauses in Czech Sentences. In: Proceedings of the Third Linguistic Annotation Workshop (LAW III), pp. 74-81, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-52-7 (bibtex)
David Mareček (2009): Improving Word Alignment Using Alignment of Deep Structures. In: Proceedings of the 12th International Conference, TSD 2009, pp. 56-63, Springer, Berlin / Heidelberg, ISBN 978-3-642-04207-2 (pdf, bibtex)
David Mareček (2009): Using Tectogrammatical Alignment in Phrase‐Based Machine Translation. In: WDS'09 Proceedings of Contributed Papers, pp. 22-27, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-101-9 (pdf, bibtex)
David Mareček, Natalia Klyueva (2009): Converting Russian Treebank SynTagRus into Praguian PDT Style. In: Multilingual resources, technologies and evaluation for Central and Eastern European languages, pp. 30-35, INCOMA Ltd., Shoumen, Bulgaria, ISBN 978-954-452-008-3 (pdf, bibtex)
Marie Mikulová (2009): Pokyny k překladu určené překladatelům, revizorům a korektorům textů z Wall Street Journal pro projekt PCEDT (technical report). In: (bibtex)
Marie Mikulová, Jan Štěpánek (2009): Annotation Quality Checking and Its Implications for Design of Treebank (in Building the Prague Czech-English Dependency Treebank). In: Proceedings of 8th Treebanks and Linguistic Theories Workshop (TLT), pp. 137-148, Università Cattolica del Sacro Cuore, Milano, Italy, ISBN 978-88-8311-712-1 (local PDF, bibtex)
Marie Mikulová, Jan Štěpánek (2009): Annotation Procedure in Building the Prague Czech-English Dependency Treebank. In: Slovko 2009, NLP, Corpus Linguistics, Corpus Based Grammar Research, pp. 241-248, Slovenská akadémia vied, Bratislava, Slovakia, ISBN 978-80-7399-875-2 (local ODT, bibtex)
Jiří Mírovský (2009): Searching in the Prague Dependency Treebank. In: , ISBN 978-80-904175-6-4 (local PDF, bibtex)
Lucie Mladová (2009): Annotation of Discourse Connectives for the PDT. In: WDS'09 Proceedings of Contributed Papers, pp. 16-21, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-101-9 (local PDF, bibtex)
Lucie Mladová, Šárka Zikánová, Zuzanna Bedřichová, Eva Hajičová (2009): Towards a Discourse Corpus of Czech. In: Proceedings of the fifth Corpus Linguistics Conference, pp. 1-8, University of Liverpool, Liverpool, UK (url, local PDF, bibtex)
Anna Nedoluzhko (2009): Razmetka koreferencii na sintaksičeski annotirovannom korpuse češskix tekstov. In: Papers from the Annual International Conference “Dialogue 2009” Issue 8 (15), pp. 332-339, М.: РГГУ, Bekasovo/Moscow, ISBN 978-5-7281-1102-3 (url, bibtex)
Anna Nedoluzhko, Jiří Mírovský, Radek Ocelák, Jiří Pergler (2009): Extended Coreferential Relations and Bridging Anaphora in the Prague Dependency Treebank. In: Proceedings of the 7th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2009), pp. 1-16, AU-KBC Research Centre, Anna University, Chennai, Goa, India, ISBN 978-3-642-04974-3 (local PDF, bibtex)
Anna Nedoluzhko, Jiří Mírovský, Petr Pajas (2009): The Coding Scheme for Annotating Extended Nominal Coreference and Bridging Anaphora in the Prague Dependency Treebank. In: Proceedings of the Third Linguistic Annotation Workshop (LAW III), pp. 108-111, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-52-7 (local PDF, bibtex)
Giang Linh Nguy, Václav Novák, Zdeněk Žabokrtský (2009): Comparison of Classification and Ranking Approaches to Pronominal Anaphora Resolution in Czech. In: Proceedings of the SIGDIAL 2009 Conference, pp. 276-285, The Association for Computational Linguistics, London, UK, ISBN 978-1-932432-64-0 (pdf, local PDF, bibtex)
Václav Novák, Sven Hartrumpf, Keith Brendan Hall (2009): Large-scale Semantic Networks: Annotation and Evaluation. In: Proceedings of the NAACL HLT Workshop on Semantic Evaluations: Recent Achievements and Future Directions, pp. 37-45, Association for Computational Linguistics, Boulder, CO, USA, ISBN 978-1-932432-31-2 (local PDF, bibtex)
Václav Novák, Magda Ševčíková (2009): Unsupervised Detection of Annotation Inconsistencies Using Apriori Algorithm. In: Proceedings of the Third Linguistic Annotation Workshop (LAW III), pp. 138-141, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-52-7 (local PDF, bibtex)
Ondřej Odcházel, Ondřej Bojar (2009): Computer Aided Translation Backed by Machine Translation. In: Translating and the Computer 31, pp. 1-8, ASLIB, London, UK (bibtex)
Petr Pajas, Jan Štěpánek (2009): System for Querying Syntactically Annotated Corpora. In: Proceedings of the ACL-IJCNLP 2009 Software Demonstrations, pp. 33-36, Association for Computational Linguistics, Suntec, Singapore, ISBN 1-932432-61-2 (pdf, bibtex)
Jarmila Panevová (2009): Honorifika v češtině (České vykání - teorie a korpusová data). In: Južnoslovenski filolog, ISSN 0350-185X, 65, pp. 101-108 (bibtex)
Pavel Pecina (2009): Lexical Association Measures: Collocation Extraction. In: , ISBN 978-80-904175-5-7 (local PDF, bibtex)
Martin Plátek, Markéta Lopatková (2009): Restartovací automaty se strukturovaným výstupem a Funkční generativní popis. In: Informačné Technológie – Aplikácie a Teória. Zborník príspevkov, ITAT 2009, pp. 65-72, PONT s.r.o., Seňa, Slovakia, ISBN 978-80-970179-1-0 (bibtex)
Martin Plátek, Markéta Lopatková (2009): Restartovací automaty, DR-stromy a Funkční generativní popis češtiny. In: Proceedings of Malý informatický seminář (MIS 2009), pp. 66-85, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-095-1 (bibtex)
Martin Popel (2009): Ways to Improve the Quality of English-Czech Machine Translation (masters thesis). In: (pdf, local PDF, bibtex)
Martin Popel, Zdeněk Žabokrtský (2009): Improving English-Czech Tectogrammatical MT. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 1-20 (pdf, bibtex)
Petr Sgall (2009): Zdeněk Kirschner died. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 92, pp. 153-155 (pdf, bibtex)
Petr Sgall (2009): Prague school. In: Grammar, meaning and pragmatics, pp. 230-238, John Benjamins Publishing Company, Amsterdam/Philadelphia, ISBN 978-90-272-0782-1 (bibtex)
Petr Sgall (2009): Rozhovor. In: Rozhovory s českými lingvisty II., pp. 245-271, Akropolis, Praha, Czechia, ISBN 978-80-86903-95-8 (bibtex)
Petr Sgall (2009): Where to Look for the Fundamentals of Language. In: Linguistica Pragensia, ISSN 0862-8432, vol. 19, no. 1, pp. 1-35 (bibtex)
Petr Sgall, Jan Hajič, Eva Hajičová (2009): Jak dál v anotacích textových korpusů? In: After Half a Century of Slavonic Natural Language Processing, pp. 57-61, Masaryk University, Brno, Czech Republic, ISBN 978-80-7399-815-8 (bibtex)
Drahomíra Spoustová, Jan Hajič, Jan Raab, Miroslav Spousta (2009): Semi-Supervised Training for the Averaged Perceptron POS Tagger. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, pp. 763-771, Association for Computational Linguistics, Athina, Greece, ISBN 978-1-932432-16-9 (url, local PDF, bibtex)
Magda Ševčíková (2009): Funkce kondicionálu z hlediska významové roviny. In: , ISBN 978-80-904175-2-6 (local PDF, bibtex)
Magda Ševčíková (2009): The meaning of the conditional mood within the tectogrammatical annotation of Prague Dependency Treebank 2.0. In: Slovko 2009, NLP, Corpus Linguistics, Corpus Based Grammar Research, pp. 321-330, Slovenská akadémia vied, Bratislava, Slovakia, ISBN 978-80-7399-875-2 (local PDF, local PDF, bibtex)
Magda Ševčíková (2009): Funkce kondicionálu z hlediska významové roviny (PhD thesis). In: (local PDF, bibtex)
Jana Šindlerová, Ondřej Bojar (2009): Towards English-Czech Parallel Valency Lexicon via Treebank Examples. In: Proceedings of 8th Treebanks and Linguistic Theories Workshop (TLT), pp. 185-195, Università Cattolica del Sacro Cuore, Milano, Italy, ISBN 978-88-8311-712-1 (local PDF, bibtex)
Zdeňka Urešová, Petr Pajas (2009): Diatheses in the Czech Valency Lexicon PDT-Vallex. In: Slovko 2009, NLP, Corpus Linguistics, Corpus Based Grammar Research, pp. 358-376, Slovenská akadémia vied, Bratislava, Slovakia, ISBN 978-80-7399-875-2 (local PDF, bibtex)
Kateřina Veselovská (2009): A Corpus-based Study of the Constituent Negation in Czech. In: Proceedings of the fifth Corpus Linguistics Conference, pp. 1-20, University of Liverpool, Liverpool, UK (bibtex)
Jernej Vičič, Petr Homola, Vladislav Kuboň (2009): A method to restrict the blow-up of hypotheses of a non-disambiguated shallow machine translation system. In: RANLP, pp. 1-8, Bulgarian Academy of Sciences, Borovec, Bulgaria, ISBN 978-954-452-012-0 (bibtex)
Barbora Vidová Hladká, Zdeňka Urešová (2009): Syntactic annotation of spoken utterances: A case study on the Czech Academic Corpus. In: Proceedings of the Third Linguistic Annotation Workshop (LAW III), pp. 90-98, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-52-7 (local PDF, bibtex)
Daniel Zeman (2009): Maximum Spanning Malt: Hiring World's Leading Dependency Parsers to Plant Indian Trees. In: Proceedings of ICON09 NLP Tools Contest: Indian Language Dependency Parsing, pp. 18-23, International Institute of Information Technologies, Hyderabad, Hyderabad, India (local PDF, local PDF, local ODP, local PDF, bibtex)
Daniel Zeman (2009): Using Unsupervised Paradigm Acquisition for Prefixes. In: Evaluating Systems for Multilingual and Multimodal Information Access – 9th Workshop of the Cross-Language Evaluation Forum, Lecture Notes in Computer Science, ISSN 0302-9743, 5706, pp. 983-990, Springer, Berlin / Heidelberg, ISBN 978-3-642-04446-5 (url, local PDF, bibtex)
Daniel Zeman (2009): A Simple Generative Pipeline Approach to Dependency Parsing and Semantic Role Labeling. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL): Shared Task, pp. 120-125, Association for Computational Linguistics, Boulder, CO, USA, ISBN 978-1-932432-29-9 (pdf, local PDF, bibtex)
Šárka Zikánová (2009): Postavení slovesného přísudku ve starší češtině (1500–1620). In: , ISBN 978-80-246-1381-9 (url, bibtex)
Šárka Zikánová, Miroslav Týnovský (2009): Identification of Topic and Focus in Czech: Comparative Evaluation on Prague Dependency Treebank. In: Studies in Formal Slavic Phonology, Morphology, Syntax, Semantics and Information Structure. Formal Description of Slavic Languages 7, pp. 343-353, Peter Lang, Frankfurt am Main, Germany, ISBN 978-3-631-57788-2 (bibtex)
Zdeněk Žabokrtský, Martin Popel (2009): Hidden Markov Tree Model in Dependency-based Machine Translation. In: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 145-148, Association for Computational Linguistics, Suntec, Singapore, ISBN 978-1-932432-61-9 (pdf, local PDF, bibtex)
Анна Юрьевна Недолужко (2009): Разметка кореференции на синтаксически аннотированном корпусе чешских текстов. In: Papers from the Annual International Conference “Dialogue 2009” Issue 8 (15), pp. 350-355, М.: РГГУ, Bekasovo/Moscow, ISBN 978-5-7281-1102-3 (url, bibtex)
Zuzanna Bedřichová (2008): Částice implikující presupozici jako podstatná složka větného významu. In: Čeština doma a ve světě, ISSN 1210-9339, vol. 16, no. 3-4, pp. 119-126 (bibtex)
Eduard Bejček, Pavel Straňák (2008): Anotace víceslovných výrazů v Pražském závislostním korpusu. In: Grammar & Corpora / Gramatika a korpus 2007, pp. 143-149, Academia, Praha, ISBN 978-80-200-1634-8 (local PDF, bibtex)
Eduard Bejček, Pavel Straňák, Pavel Schlesinger (2008): Annotation of Multiword Expressions in the Prague Dependency Treebank. In: IJCNLP 2008 Proceedings of the Third International Joint Conference on Natural Language Processing, pp. 793-798, International Institute of Information Technology, Hyderabad, India (local PDF, local PDF, bibtex)
Václava Benešová (2008): Modality in dependent content clauses with Czech verbs of communication with imperative features. In: Grammar & Corpora / Gramatika a korpus 2007, pp. 199-206, Academia, Praha, ISBN 978-80-200-1634-8 (bibtex)
Václava Benešová, Markéta Lopatková, Klára Hrstková (2008): Enhancing Czech Valency Lexicon with Semantic Information from FrameNet: The Case of Communication Verbs. In: ICGL 2008 Proceedings of the First International Conference on Global Interoperability for Language Resources, pp. 18-25, City University of Hong Kong, Hong Kong, China (local PDF, bibtex)
Viktor Bielický, Otakar Smrž (2008): Building the Valency Lexicon of Arabic Verbs. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 2300-2307, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (pdf, local PDF, bibtex)
Ondřej Bojar (2008): Exploiting Linguistic Data in Machine Translation (PhD thesis). In: (local PDF, local PDF, local PDF, bibtex)
Ondřej Bojar, Silvie Cinková, Jan Ptáček (2008): Towards English-to-Czech MT via Tectogrammatical Layer. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 90, pp. 57-68 (pdf, local PDF, bibtex)
Ondřej Bojar, Jan Hajič (2008): Phrase-Based and Deep Syntactic English-to-Czech Statistical Machine Translation. In: ACL 2008 WMT: Proceedings of the Third Workshop on Statistical Machine Translation, pp. 143-146, Association for Computational Linguistics, Columbus, OH, USA, ISBN 978-1-932432-09-1 (url, local PDF, bibtex)
Ondřej Bojar, Miroslav Janíček, Miroslav Týnovský (2008): Implementation of Tree Transfer System (technical report). In: (local PDF, bibtex)
Ondřej Bojar, Miroslav Janíček, Zdeněk Žabokrtský, Pavel Češka, Peter Beňa (2008): CzEng 0.7: Parallel Corpus with Community-Supplied Translations. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 1203-1208, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (local PDF, bibtex)
Ondřej Bojar, Adam Lopez (2008): Tree-based Translation. In: Proceedings of MT Marathon 2008, University of Edinburgh, Edinburgh, Scotland (url, local PDF, bibtex)
Ondřej Bojar, Pavel Straňák, Daniel Zeman (2008): English-Hindi Translation in 21 Days. In: Proceedings of the 6th International Conference On Natural Language Processing (ICON-2008) NLP Tools Contest, International Institute of Information Technologies, Hyderabad, Pune, India (url, local PDF, local DOC, bibtex)
Silvie Cinková (2008): Lemmatisierung der verbalen Reflexivität im entstehenden Großen Deutsch-Tschechischen akademischen Wörterbuch. In: Beiträge zur bilingualen lexikographie, pp. 141-152, Univerzita Karlova v Praze, Praha, Czechia, ISBN 978-80-7308-217-8 (bibtex)
Silvie Cinková, Jan Hajič, Jan Ptáček (2008): An Annotation Scheme for Speech Reconstruction on a Dialog Corpus. In: Fourth International Workshop on Human-Computer Conversation, The Companions consortium, Bellagio, Italy (pdf, local PDF, bibtex)
Silvie Cinková, Eva Hajičová, Jarmila Panevová, Petr Sgall (2008): The Tectogrammatics of English: on Some Problematic Issues from the Viewpoint of the Prague Dependency Treebank. In: Resourceful Language Technology: Festschrift in Honor of Anna Sågvall Hein, pp. 33-48, Uppsala University, Humanistisk-samhällsvetenskapliga vetenskapsområdet, Faculty of Languages, Department of Linguistics and Philology, Uppsala, Sweden, ISBN 978-91-554-7226-9 (url, local PDF, local PDF, bibtex)
Silvie Cinková, Eva Hajičová, Jarmila Panevová, Petr Sgall (2008): Two Languages - One Annotation Scenario? Experience from the Prague Dependency Treebank. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 89, pp. 5-22 (pdf, local PDF, bibtex)
Silvie Cinková, Marie Mikulová (2008): Speech reconstruction for the syntactic and semantic analysis of the NAP/AAA corpus (technical report). In: (local PDF, local PDF, bibtex)
Pavel Češka, Pavel Pecina (2008): Charles University at CLEF 2007 Ad-Hoc Track. In: Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 5152/2008, no. 5152, pp. 33-36, Springer, Berlin / Heidelberg, ISBN 978-3-540-85759-4 (bibtex)
Tomáš Duběda, Jan Raab (2008): Pitch Accents, Boundary Tones and Contours: Automatic Learning of Czech Intonation. In: Proceedings of the 11th International Conference, TSD 2008, Lecture Notes in Computer Science, ISSN 0302-9743, 5246, pp. 293-301, Springer, Berlin / Heidelberg, ISBN 978-3-540-87390-7 (local PDF, bibtex)
Jan Hajič, Silvie Cinková, Marie Mikulová, Petr Pajas, Jan Ptáček, Josef Toman, Zdeňka Urešová (2008): PDTSL: An Annotated Resource For Speech Reconstruction. In: Proceedings of the 2008 IEEE Workshop on Spoken Language Technology, pp. 93-96, IEEE, Goa, India, ISBN 978-1-4244-3472-5 (bibtex)
Eva Hajičová (2008): What we are talking about and what we are saying about it. In: Computational Linguistics and Intelligent Text Processing, pp. 241-262, Springer, Berlin / Heidelberg, Germany, ISBN 978-3-540-78134-9 (local PDF, bibtex)
Eva Hajičová (2008): Úloha Pražského lingvistického kroužku při vývoji i ve vyhlídkách české jazykovědy. In: Slovo a slovesnost, ISSN 0037-7031, vol. 69, no. 1-2, pp. 5-8 (url, bibtex)
Eva Hajičová (2008): Ověřování lingvistické teorie nad počítačovým korpusem. In: Slovo a slovesnost, ISSN 0037-7031, vol. 69, no. 1-2, pp. 131-142 (url, bibtex)
Eva Hajičová, Lucie Kučová (2008): Coreferential relations in the Prague Dependency Treebank. In: Formal Description of Slavic Languages: The Fifth Conference, Leipzig 2003, pp. 18-28, Peter Lang, Frankfurt am Main, Germany, ISBN 978-3-631-55160-8 (local PDF, bibtex)
Barbora Hladká, Ondřej Kučera (2008): An Annotated Corpus Outside Its Original Context: A Corpus-Based Exercise Book. In: ACL 2008: Proceedings of the Third Workshop on Innovative Use of NLP for Building Educational Applications, pp. 36-43, Association for Computational Linguistics (ACL), Columbus, OH, USA, ISBN 978-1-932432-08-4 (local PDF, bibtex)
Jaroslava Hlaváčová (2008): Pravopisné varianty a morfologická anotace korpusů. In: Grammar & Corpora / Gramatika a korpus 2007, pp. 161-168, Academia, Praha, ISBN 978-80-200-1634-8 (bibtex)
Jaroslava Hlaváčová, Michal Hrušecký (2008): “Affisix” Tool for Prefix Recognition. In: Proceedings of the 11th International Conference, TSD 2008, Lecture Notes in Computer Science, ISSN 0302-9743, 5246, pp. 85-92, Springer, Berlin / Heidelberg, ISBN 978-3-540-87390-7 (bibtex)
Jaroslava Hlaváčová, David Kolovratník (2008): Morfologie češtiny znovu a lépe. In: Informačné Technológie – Aplikácie a Teória. Zborník príspevkov, ITAT 2008, pp. 43-47, PONT s.r.o., Seňa, Slovakia, ISBN 978-80-969184-8-5 (local PDF, bibtex)
Jaroslava Hlaváčová, Markéta Lopatková (2008): Variants and Homographs: Eternal Problem of Dictionary Makers. In: Proceedings of the 11th International Conference, TSD 2008, Lecture Notes in Computer Science, ISSN 0302-9743, 5246, pp. 93-100, Springer, Berlin / Heidelberg, ISBN 978-3-540-87390-7 (bibtex)
Petr Homola (2008): A Distributed Database for Mobile NLP Applications. In: Proceedings of the Mobile Language Processing Workshop (ACL), pp. 27-28, ACL, Columbus, OH, USA, ISBN 978-1-932432-13-8 (bibtex)
Petr Homola (2008): Słowjeńska mjeńšyna w Awstriskej. In: Serbska pratyja 2009, pp. 1-1, Ludowe nakładnistwo Domowina / Domowina-Verlag, Budyšin / Bautzen, Germany, ISBN 978-3-7420-2095-6 (bibtex)
Petr Homola (2008): Pólske Morawarje we pšuskej Šlazyńskej. In: Serbska pratyja 2009, pp. 1-1, Ludowe nakładnistwo Domowina / Domowina-Verlag, Budyšin / Bautzen, Germany, ISBN 978-3-7420-2095-6 (bibtex)
Petr Homola, Vladislav Kuboň (2008): Implementace unifikační gramatiky pro strojový překlad. In: Proceedings of Malý informatický seminář (MIS 2008), pp. 19-26, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-076-0 (bibtex)
Petr Homola, Vladislav Kuboň (2008): Improving Machine Translation Between Closely Related Romance Languages. In: Proceedings of the Twelfth EAMT Conference, pp. 72-77, HITEC e.V., Hamburg, Germany, ISBN 978-3-00-025770-4 (bibtex)
Petr Homola, Vladislav Kuboň (2008): Partial Parsing in a Simple MT System. In: Proceedings of the Partial Parsing: Between Chunking and Deep Parsing Workshop '08 (LREC), pp. 7-13, European Language Resources Association, Paris, France, ISBN 2-9517408-4-0 (bibtex)
Petr Homola, Vladislav Kuboň (2008): A Method of Hybrid MT for Related Languages. In: Proceedings of the International Intelligent Information Systems '08 Conference, pp. 269-278, Academic Publishing House EXIT, Warszawa, Poland, ISBN 978-83-60434-44-4 (bibtex)
Petr Homola, Vladislav Kuboň (2008): A Hybrid Machine Translation System for Typologically Related Languages. In: Proceedings of the 21st International Florida-Artificial-Intelligence-Research-Society Conference (FLAIRS 2008), pp. 227-228, AAAI Press, Coconut Grove, FL, USA, ISBN 978-1-57735-365-2 (bibtex)
Petr Kaderka, Martin Havlík, Zdeňka Svobodová, Nino Peterek, Eva Havlová, Jana Klímová, Patricie Kubáčková (2008): Minulost, současnost a budoucnost korpusu DIALOG. In: Grammar & Corpora / Gramatika a korpus 2007, pp. 181-189, Academia, Praha, ISBN 978-80-200-1634-8 (bibtex)
Václava Kettnerová (2008): Czech Verbs of Communication with respect to Types of Dependent Content Clauses. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 90, pp. 83-108 (url, bibtex)
Václava Kettnerová, Markéta Lopatková, Klára Hrstková (2008): Semantic Roles in Valency Lexicon of Czech Verbs: Verbs of Communication and Exchange. In: Advances in Natural Language Processing (6th International Conference on NLP, GoTAL 2008), Lecture Notes in Computer Science, ISSN 0302-9743, 5221, pp. 217-221, Springer, Berlin / Heidelberg, ISBN 978-3-540-85286-5 (bibtex)
Václava Kettnerová, Markéta Lopatková, Klára Hrstková (2008): Semantic Classes in Czech Valency Lexicon: Verbs of Communication and Verbs of Exchange. In: Proceedings of the 11th International Conference, TSD 2008, Lecture Notes in Computer Science, ISSN 0302-9743, 5246, pp. 109-116, Springer, Berlin / Heidelberg, ISBN 978-3-540-87390-7 (bibtex)
Natalia Klyueva, Ondřej Bojar (2008): UMC 0.1: Czech-Russian-English Multilingual Corpus. In: Proceedings of the Conference "Korpusnaja lingvistika - 2008", pp. 188-195, St.Petersburg State University, Sankt-Peterburg, Russia, ISBN 978-5-288-04769-5 (pdf, local PDF, bibtex)
Michal Křen, Jaroslava Hlaváčová (2008): Corpus as a Means for Study of Lexical Usage Changes. In: Proceedings of the 13th EURALEX International Congress, pp. 437-447, Universitat Pompeu Fabra, Barcelona, Spain, ISBN 978-84-96742-67-3 (bibtex)
Vladislav Kuboň, Miroslav Spousta (2008): Využití jazykových technologií v oblasti eLearningu. In: Proceedings of Malý informatický seminář (MIS 2008), pp. 71-78, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-076-0 (local PDF, bibtex)
Vladislav Kuboň, Miroslav Spousta (2008): Multilingual Approach to e-Learning from a Monolingual Perspective. In: Proceedings of the 21st International Florida-Artificial-Intelligence-Research-Society Conference (FLAIRS 2008), pp. 229-230, AAAI Press, Coconut Grove, FL, USA, ISBN 978-1-57735-365-2 (bibtex)
Ondřej Kučera, Barbora Hladká, Klára Hrstková (2008): Automaticky vytvořená cvičebnice češtiny. In: Pedagogický software 2008, pp. 55-57, Scientific Pedagogical Publishing, České Budějovice, Czechia, ISBN 80-85645-59-9 (local PDF, bibtex)
Markéta Lopatková (2008): Valence a její formální popis. Vybrané aspekty budování slovníku VALLEX. In: Proceedings of Malý informatický seminář (MIS 2008), pp. 58-88, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-076-0 (local PDF, bibtex)
Markéta Lopatková (2008): K významnému životnímu jubileu Jarmily Panevové. In: Slovo a slovesnost, ISSN 0037-7031, vol. 69, no. 3, pp. 245-247 (local PDF, bibtex)
Markéta Lopatková, Tomáš Holan (2008): Vztahy mezi segmenty – segmentační schémata českých vět. In: Informačné Technológie – Aplikácie a Teória. Zborník príspevkov, ITAT 2008, pp. 15-22, PONT s.r.o., Seňa, Slovakia, ISBN 978-80-969184-8-5 (local PDF, bibtex)
Markéta Lopatková, Martin Plátek, Petr Sgall (2008): Functional Generative Description, Restarting Automata and Analysis by Reduction. In: Studies in Formal Slavic Linguistics. Contributions from Formal Description of Slavic Languages 6.5, pp. 173-190, Peter Lang GmbH, Frankfurt am Main, Germany, ISBN 978-3-631-57009-8 (local PDF, bibtex)
Markéta Lopatková, Zdeněk Žabokrtský, Václava Kettnerová (2008): Valenční slovník českých sloves. In: , ISBN 978-80-246-1467-0 (bibtex)
David Mareček (2008): Automatic Alignment of Tectogrammatical Trees from Czech-English Parallel Corpus (masters thesis). In: (local PDF, bibtex)
David Mareček, Zdeněk Žabokrtský, Václav Novák (2008): Automatic Alignment of Czech and English Deep Syntactic Dependency Trees. In: Proceedings of the Twelfth EAMT Conference, pp. 102-111, HITEC e.V., Hamburg, Germany, ISBN 978-3-00-025770-4 (pdf, local PDF, bibtex)
Marie Mikulová (2008): Pražský závislostní korpus: Specifikace významů prostorových určení. In: Grammar & Corpora / Gramatika a korpus 2007, pp. 391-399, Academia, Praha, ISBN 978-80-200-1634-8 (bibtex)
Marie Mikulová (2008): Rekonstrukce standardizovaného textu z mluvené řeči v Pražském závislostním korpusu mluvené češtiny. Manuál pro anotátory (technical report). In: (local PDF, local PDF, bibtex)
Marie Mikulová, Zdeňka Urešová (2008): Rekonstrukce standardizovaného textu z mluvené řeči. In: Čeština v mluveném korpusu, pp. 167-176, Lidové noviny, Praha, Czechia, ISBN 978-80-7106-982-9 (url, local PDF, bibtex)
Jiří Mírovský (2008): Netgraph Query Language for the Prague Dependency Treebank 2.0. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 90, pp. 5-31 (pdf, bibtex)
Jiří Mírovský (2008): PDT 2.0 Requirements on a Query Language. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 37-45, Association for Computational Linguistics, Columbus, OH, USA, ISBN 978-1-932432-04-6 (pdf, bibtex)
Jiří Mírovský (2008): Does Netgraph Fit Prague Dependency Treebank?. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 436-441, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (pdf, local PDF, bibtex)
Jiří Mírovský (2008): Towards a Simple and Full-Featured Treebank Query Language. In: ICGL 2008 Proceedings of the First International Conference on Global Interoperability for Language Resources, pp. 171-178, City University of Hong Kong, Hong Kong, China (pdf, local PDF, bibtex)
Jiří Mírovský (2008): Netgraph - Making Searching in Treebanks Easy. In: IJCNLP 2008 Proceedings of the Third International Joint Conference on Natural Language Processing, pp. 945-950, International Institute of Information Technology, Hyderabad, India (pdf, local PDF, bibtex)
Jiří Mírovský, Jarmila Panevová (2008): Learning to Search in the Prague Dependency Treebank. In: Grammar & Corpora / Gramatika a korpus 2007, pp. 105-111, Academia, Praha, ISBN 978-80-200-1634-8 (local PDF, bibtex)
Lucie Mladová (2008): Diskurzní vztahy v češtině a jejich zachycení v anotovaném korpusu (technical report). In: (local PDF, bibtex)
Lucie Mladová (2008): Od hloubkové struktury věty k diskurzním vztahům (Diskurzní vztahy v češtině a jejich zachycení v anotovaném korpusu) (masters thesis). In: (local PDF, bibtex)
Lucie Mladová (2008): K problematice vztahu rematizátorů a textových konektorů. In: Čeština doma a ve světě, ISSN 1210-9339, vol. 16, no. 3-4, pp. 126-133 (bibtex)
Lucie Mladová, Šárka Zikánová, Eva Hajičová (2008): From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 1-7, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (local PDF, bibtex)
Anja Nedolužko, Jan Hajič (2008): Синтаксически аннотированный корпус чешского языка. In: Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной международной конференции «Диалог», Компьютерная лингвистика и интеллектуальные технологии, 7 (14), pp. 400-406, Moskva, RGGU, Бекасово, Russia, ISBN 978-5-7281-1022-4 (url, local DOC, local PDF, bibtex)
Giang Linh Nguy (2008): Machine Learning Approaches to Coreference Resolution. In: WDS'08 Proceedings of Contributed Papers, pp. 139-143, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-065-4 (local PDF, bibtex)
Václav Novák (2008): Semantic Network Manual Annotation and its Evaluation: Extract of Ph.D. Thesis. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 90, pp. 69-82 (url, bibtex)
Václav Novák (2008): Semantic Network Manual Annotation and its Evaluation (PhD thesis). In: (local PDF, bibtex)
Václav Novák, Keith Brendan Hall (2008): Inter-sentential Coreferences in Semantic Networks: An Evaluation of Manual Annotation. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 2746-2751, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (pdf, local PDF, bibtex)
Petr Pajas, Jan Štěpánek (2008): Recent Advances in a Feature-Rich Framework for Treebank Annotation. In: The 22nd International Conference on Computational Linguistics - Proceedings of the Conference, pp. 673-680, The Coling 2008 Organizing Committee, Manchester, UK, ISBN 978-1-905593-45-3 (bibtex)
Jarmila Panevová (2008): Povaha stupňování adjektiv (K "nesrovnávacímu" užití stupňovaných forem). In: Iugi observatione ... Jubilejný zborník na počesť Ľubomíra Ďuroviča, pp. 149-156, Veda, Bratislava, Slovakia, ISBN 978-80-224-1043-4 (bibtex)
Jarmila Panevová (2008): České konstrukce tzv. slovanského akuzativu s infinitivem. In: Slovo a slovesnost, ISSN 0037-7031, vol. 69, no. 3, pp. 163-175 (local PDF, bibtex)
Jarmila Panevová (2008): Problémy se slovanským reflexivem. In: Slavia, ISSN 0037-6736, vol. 77, no. 1-3, pp. 153-163 (local PDF, bibtex)
Pavel Pecina (2008): Lexical Association Measures: Collocation Extraction (PhD thesis). In: (bibtex)
Pavel Pecina (2008): A Machine Learning Approach to Multiword Expression Extraction. In: Proceedings of the LREC 2008 Workshop Towards a Shared Task for Multiword Expressions, pp. 54-57, ELRA, Marrakech, Morocco, ISBN 2-9517408-4-0 (local PDF, bibtex)
Pavel Pecina (2008): Reference Data for Czech Collocation Extraction. In: Proceedings of the LREC 2008 Workshop Towards a Shared Task for Multiword Expressions, pp. 11-14, ELRA, Marrakech, Morocco, ISBN 2-9517408-4-0 (local PDF, bibtex)
Pavel Pecina, Petra Hoffmannová, Gareth J.F. Jones, Jianqiang Wang, Douglas W. Oard (2008): Overview of the CLEF-2007 Cross-Language Speech Retrieval Track. In: Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 5152/2008, no. 5152, pp. 674-686, Springer, Berlin / Heidelberg, ISBN 978-3-540-85759-4 (bibtex)
Jan Ptáček (2008): Two Tectogrammatical Realizers Side by Side: Case of English and Czech. In: Fourth International Workshop on Human-Computer Conversation, The Companions consortium, Bellagio, Italy (local PDF, bibtex)
Magdaléna Rysová (2008): Nejednoznačnost aktuálního členění výpovědi v básnické tvorbě Otokara Březiny. In: Čeština doma a ve světě, ISSN 1210-9339, vol. 16, no. 3-4, pp. 147-156 (bibtex)
Petr Sgall (2008): Jakou češtinu máme, a jak se k ní chováme? Potřebujeme češtinu správnou, spisovnou, nebo standardní? (Electronic). In: i-Forum, ISSN 1214-5726 (url)
Petr Sgall (2008): Velikonoce a jejich lexikální příslušenství (Electronic). In: i-Forum, ISSN 1214-5726 (url)
Petr Sgall (2008): Ideje Pražského lingvistického kroužku jsou i dnes aktuální. In: Slovo a slovesnost, ISSN 0037-7031, vol. 69, no. 1-2, pp. 34-43 (bibtex)
Otakar Smrž, Viktor Bielický, Iveta Kouřilová, Jakub Kráčmar, Jan Hajič, Petr Zemánek (2008): Prague Arabic Dependency Treebank: A Word on the Million Words. In: Proceedings of the Workshop on Arabic and Local Languages (LREC 2008), pp. 16-23, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (pdf, local PDF, bibtex)
Miroslav Spousta, Michal Marek, Pavel Pecina (2008): Victor: the Web-Page Cleaning Tool. In: Proceedings of the 4th Web as Corpus Workshop, pp. 12-17, ACL SIGWAC, Marrakech, Morocco, ISBN 2-9517408-4-0 (local PDF, bibtex)
Drahomíra Spoustová (2008): Combining Statistical and Rule-Based Approaches to Morphological Tagging of Czech Texts. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 89, pp. 23-40 (pdf, local PDF, bibtex)
Drahomíra Spoustová, Pavel Pecina, Jan Hajič, Miroslav Spousta (2008): Validating the Quality of Full Morphological Annotation. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 1-4, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (local PDF, bibtex)
Magda Ševčíková (2008): Proper Nouns in Czech Corpora. In: Proceedings of the Corpus Linguistics Conference Series, pp. 1-10, University of Birmingham, Birmingham, UK (pdf, local PDF, bibtex)
Magda Ševčíková (2008): Pronouns Introducing Content Clauses. In: Grammar & Corpora / Gramatika a korpus 2007, pp. 277-284, Academia, Praha, ISBN 978-80-200-1634-8 (bibtex)
Magda Ševčíková, Zdeněk Žabokrtský (2008): Petr Sgall: Language in its multifarious aspects. Ed. Eva Hajičová – Jarmila Panevová. Karolinum, Charles University Press, Praha 2006. 556 s. (review). In: Slovo a slovesnost, ISSN 0037-7031, vol. 69, no. 3, pp. 221-227 (bibtex)
Jan Štěpánek (2008): Pražský závislostní korpus. In: Varia XV, pp. 581-587, Slovenská jazykovedná spoločnosť pri SAV v Bratislave, Katedra slovenského jazyka a literatúry FHV UMB v Banskej Bystrici, Banská Bystrica, Slovakia, ISBN 80-89037-04-6 (bibtex)
Miroslav Týnovský (2008): Hybrid Approaches in Machine Translation. In: WDS'08 Proceedings of Contributed Papers, pp. 124-128, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-065-4 (pdf, bibtex)
Barbora Vidová Hladká, Jan Hajič, Jiří Hana, Jaroslava Hlaváčová, Jiří Mírovský, Jan Raab (2008): The Czech Academic Corpus 2.0 Guide. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 89, pp. 41-96 (url, bibtex)
Daniel Zeman (2008): Unsupervised Acquiring of Morphological Paradigms from Tokenized Text. In: Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 5152/2008, no. 5152, pp. 892-899, Springer, Berlin / Heidelberg, ISBN 978-3-540-85759-4 (pdf, local PDF, bibtex)
Daniel Zeman (2008): Using Unsupervised Paradigm Acquisition for Prefixes. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2008 Workshop, pp. 1-7, Århus Universitet, Århus, Denmark (pdf, local PDF, local DOC, local PDF, bibtex)
Daniel Zeman (2008): Reusable Tagset Conversion Using Tagset Drivers. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 213-218, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (url, local PDF, local PDF, bibtex)
Daniel Zeman, Philip Resnik (2008): Cross-Language Parser Adaptation between Related Languages. In: IJCNLP 2008 Workshop on NLP for Less Privileged Languages, pp. 35-42, International Institute of Information Technology, Hyderabad, India (url, local PDF, local PDF, local DOC, bibtex)
Šárka Zikánová (2008): Problematické syntaktické struktury: k rozborům aktuálního členění v Pražském závislostním korpusu. In: Svět za slovy a jejich tvary, svět za spojením slov, pp. 233-240, Vydavatelství Univerzity Palackého v Olomouci, Olomouc, Czechia, ISBN 978-80-244-1984-8 (bibtex)
Šárka Zikánová (2008): Několik vět a mezivětných vztahů úvodem. In: Čeština doma a ve světě, ISSN 1210-9339, vol. 16, no. 3-4, pp. 118-119 (bibtex)
Šárka Zikánová, Marko Malink (2008): Clitic Climbing and Theta-Roles in Upper Sorbian and Czech. In: Formal Description of Slavic Languages: The Fifth Conference, Leipzig 2003, pp. 396-407, Peter Lang, Frankfurt am Main, Germany, ISBN 978-3-631-55160-8 (bibtex)
Zdeněk Žabokrtský, Ondřej Bojar (2008): TectoMT, Developer's Guide (technical report). In: (bibtex)
Zdeněk Žabokrtský, Jan Ptáček, Petr Pajas (2008): TectoMT: Highly Modular MT System with Tectogrammatics Used as Transfer Layer. In: ACL 2008 WMT: Proceedings of the Third Workshop on Statistical Machine Translation, pp. 167-170, Association for Computational Linguistics, Columbus, OH, USA, ISBN 978-1-932432-09-1 (pdf, local PDF, bibtex)
Eduard Bejček (2007): Selected Sense Enumerated Lexical Resources for Czech. In: WDS'07 Proceedings of Contributed Papers, pp. 125-130, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-023-4 (local PDF, bibtex)
Ondřej Bojar (2007): English-to-Czech Factored Machine Translation. In: ACL 2007 WMT: Proceedings of the Second Workshop on Statistical Machine Translation, pp. 232-239, Association for Computational Linguistics, Praha, Czechia, ISBN 978-1-932432-86-2 (url, bibtex)
Ondřej Bojar, Silvie Cinková, Jan Ptáček (2007): Towards English-to-Czech MT via Tectogrammatical Layer. In: Proceedings of the 6th International Workshop on Treebanks and Linguistic Theories (TLT 2007), NEALT Proceedings Series, ISSN 1736-6305, 1, pp. 7-18, North European Association for Language Technology, Bergen, Norway (url, local PDF, bibtex)
Ondřej Bojar, Martin Čmejrek (2007): Mathematical Model of Tree Transformations (technical report). In: (local PDF, bibtex)
Ondřej Bojar, Magdalena Prokopová (2007): Czech-English Machine Translation Dictionary (technical report). In: (local PDF, bibtex)
Silvie Cinková (2007): “Movement towards Structure”: Foreign Learners, Language Patterns and Learners' Lexicons. In: Rapport fra konference om leksikografi i Norden, LexicoNordica, ISSN 0805-2735, 9, Nordisk Forening for Leksikografi, Akureyri, Iceland (local PDF, bibtex)
Pavel Češka, Pavel Pecina (2007): Charles University at CLEF 2007 Ad-Hoc Track. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2007 Workshop, Magyar Tudományos Akadémia, Budapest, Hungary, ISBN 2-912335-31-0 (pdf, bibtex)
Pavel Češka, Pavel Pecina (2007): Charles University at CLEF 2007 CL-SR Track. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2007 Workshop, Magyar Tudományos Akadémia, Budapest, Hungary, ISBN 2-912335-31-0 (pdf, bibtex)
Johanka Doležalová, Vladimír Petkevič (2007): Shallow Parsing of Czech Sentence Based on Correct Morphological Disambiguation. In: Linguistic Investigations into Formal Description of Slavic Languages, pp. 53-63, Peter Lang, Frankfurt am Main, Germany, ISBN 978-3-631-55376-3 (local PDF, bibtex)
Jurjen Duintjer Tebbens, Pavel Schlesinger (2007): Improving implementation of linear discriminant analysis for the high dimension/small sample size problem. In: Computational Statistics and Data Analysis, ISSN 0167-9473, vol. 52, no. 1, pp. 423-437 (url, local PDF, bibtex)
Jan Hajič, Eva Hajičová (2007): Some of Our Best Friends are Statisticians. In: Proceedings of the 10th International Conference on Text, Speech and Dialogue, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 4629, no. XVII, pp. 2-11, Springer, Berlin / Heidelberg, ISBN 978-3-540-74627-0 (bibtex)
Eva Hajičová (2007): The Need of Deep Annotation of Corpus: A Case Study. In: Gramatika a korpus / Grammar and Corpora 2005, pp. 18-19, ÚJČ AV ČR Praha, Praha, Czechia, ISBN 978-80-86496-32-0 (bibtex)
Eva Hajičová (2007): On some aspects of Praguian functionalism. In: Abstracts of the 40th Annual Meeting of the Societas Linguistica Europaea, pp. 17-17, Joensuun Yliopisto, Joensuu, Finland (bibtex)
Eva Hajičová (2007): The Position of TFA (Information Structure) in a Dependency Based Description of Language. In: Proceedings of the 3rd International Conference on Meaning-Text Theory (MTT 2007), pp. 159-178, Verlag Otto Sagner, c/o Kubon & Sagner, München / Wien, ISBN 978-3-86688-017-7 (url, local PDF, bibtex)
Eva Hajičová (2007): Information Structure from the Point of View of the Relation of Function and Form. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 88, pp. 53-72 (pdf, bibtex)
Eva Hajičová (2007): Corpus annotation as a test of a linguistic theory: The case of Prague Dependency Treebank. In: Language Resources and Linguistic Theory (Materiali linguistici 59), pp. 15-24, Franco Angeli, Milano, ISBN 978-88-464-8944-9 (bibtex)
Eva Hajičová (2007): Medailon. In: Rozhovory s českými lingvisty I., pp. 53-83, Dauphin, Praha, Czechia, ISBN 978-80-7272-107-8 (bibtex)
Eva Hajičová, Jan Cuřín, Jan Hajič, Ondřej Kučera, Barbora Vidová Hladká (2007): Jazyk a umělá inteligence: kudy a kam?. In: Umělá inteligence 5, pp. 272-283, Academia, Praha, Czechia, ISBN 978-80-200-1407-2 (bibtex)
Eva Hajičová, Petr Sgall, Kateřina Veselá (2007): Contextual Boundness and Contrast in the Prague Dependency Treebank. In: Interfaces and interface conditions, pp. 231-243, De Gruyter, Berlin, Germany, ISBN 978-3-11-019547-7 (bibtex)
Keith Brendan Hall, Jiří Havelka, David A. Smith (2007): Log-linear Models of Non-projective Trees, k-best MST Parsing and Tree-ranking. In: Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, pp. 962-966, Association for Computational Linguistics, Praha, Czechia, ISBN 978-1-932432-89-3 (url, bibtex)
Jiří Havelka (2007): Mathematical Properties of Dependency Trees and their Application to Natural Language Syntax (PhD thesis). In: (bibtex)
Jiří Havelka (2007): Beyond Projectivity: Multilingual Evaluation of Constraints and Measures on Non-Projective Structures. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, pp. 608-615, Association for Computational Linguistics, Praha, Czechia, ISBN 978-1-932432-86-2 (url, bibtex)
Jiří Havelka (2007): Relationship between Non-Projective Edges, Their Level Types, and Well-Nestedness. In: NAACL HLT 2007 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers, pp. 61-64, Association for Computational Linguistics, Rochester, NY, USA, ISBN 1-932432-94-9 (url, bibtex)
Barbora Hladká (2007): Our lucky moments with Frederick Jelinek. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 88, pp. 111-112 (pdf, bibtex)
Jaroslava Hlaváčová (2007): Korpusové chyby. In: Gramatika a korpus / Grammar and Corpora 2005, pp. 77-86, ÚJČ AV ČR Praha, Praha, Czechia, ISBN 978-80-86496-32-0 (bibtex)
Petr Homola (2007): Current development tendencies in the dialect of Jablunkov. In: Proceedings of the Slavic Linguistics Society Conference, pp. 1-8, Zentrum für allgemeine Sprachwissenschaft und Universalienforschung, Berlin, Germany (bibtex)
Petr Homola (2007): Morphosyntaktische Unterbestimmtheit und der Schwund von Klitika. In: Proceedings of the 3rd person workshop, Zentrum für allgemeine Sprachwissenschaft und Universalienforschung, Berlin, Germany (bibtex)
Petr Homola, Natalia Klyueva (2007): K některým aspektům strojového překladu mezi baltoslovanskými jazyky. In: MIS 2007, 13.–20. ledna 2007, Josefův Důl, Sborník semináře, pp. 36-43, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-033-3 (bibtex)
Klára Hrstková (2007): Czech Prefixed Verbs in a Valency Lexicon. Preliminary Study. In: WDS'07 Proceedings of Contributed Papers, pp. 131-136, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-023-4 (local PDF, bibtex)
Pavel Ircing, Pavel Pecina, Douglas W. Oard, Jianqiang Wang, Ryen W. White, Jan Hoidekr (2007): Information Retrieval Test Collection for Searching Spontaneous Czech Speech. In: Proceedings of the 10th International Conference on Text, Speech and Dialogue, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 4629, no. XVII, pp. 439-446, Springer, Berlin / Heidelberg, ISBN 978-3-540-74627-0 (bibtex)
Václav Klimeš (2007): Transformation-Based Tectogrammatical Dependency Analysis of English. In: Proceedings of the 10th International Conference on Text, Speech and Dialogue, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 4629, no. XVII, pp. 15-22, Springer, Berlin / Heidelberg, ISBN 978-3-540-74627-0 (bibtex)
Natalia Klyueva (2007): Semantics in Machine Translation. In: WDS'07 Proceedings of Contributed Papers, pp. 141-144, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-023-4 (bibtex)
Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Corbett Moran, Richard Zens, Chris Dyer, Ondřej Bojar, Alexandra Constantin, Evan Herbst (2007): Moses: Open Source Toolkit for Statistical Machine Translation. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume, Proceedings of the Student Research Workshop, Proceedings of Demo and Poster Sessions, Tutorial Abstracts, pp. 177-180, Association for Computational Linguistics, Praha, Czechia, ISBN 978-1-932432-87-9 (url, local PDF, bibtex)
Vladislav Kuboň, Markéta Lopatková, Martin Plátek, Patrice Pognan (2007): A Linguistically-Based Segmentation of Complex Sentences. In: Proceedings of FLAIRS 2007 (20th International Florida Artificial Intelligence Research Society Conference), pp. 368-373, AAAI Press, Key West, FL, USA, ISBN 978-1-57735-319-5 (bibtex)
Markéta Lopatková, Jarmila Panevová (2007): Valence vybraných sloves pohybu v češtině. In: Зборник Матице српске за славистику / Zbornik Matice srpske za slavistiku, ISSN 0352-5007, 71-72, pp. 101-115 (pdf, bibtex)
Markéta Lopatková, Martin Plátek, Petr Sgall (2007): Towards a Formal Model for Functional Generative Description: Analysis by Reduction and Restarting Automata. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 87, pp. 7-26 (url, bibtex)
Michal Marek, Pavel Pecina, Miroslav Spousta (2007): Web Page Cleaning with Conditional Random Fields. In: Proceedings of the 3rd Web As a Corpus Workshop, Incorporating CLEANEVAL, pp. 155-162, UCL Presses Universitaires de Louvain, Louvain-la-Neuve, Belgium, ISBN 978-2-8746-3082-8 (bibtex)
Domen Marinčič, Matjaž Gams, Zdeněk Žabokrtský (2007): Parsing Aided by Intra-Clausal Coordination Detection. In: Proceedings of the 6th International Workshop on Treebanks and Linguistic Theories (TLT 2007), NEALT Proceedings Series, ISSN 1736-6305, 1, pp. 79-84, North European Association for Language Technology, Bergen, Norway (url, bibtex)
Marie Mikulová, Alevtina Bémová, Jan Hajič, Eva Hajičová, Jiří Havelka, Veronika Kolářová, Lucie Kučová, Markéta Lopatková, Petr Pajas, Jarmila Panevová, Magda Ševčíková, Petr Sgall, Jan Štěpánek, Zdeňka Urešová, Kateřina Veselá, Zdeněk Žabokrtský (2007): Annotation on the tectogrammatical level in the Prague Dependency Treebank (technical report). In: (local PDF, bibtex)
Jiří Navrátil, David Klusáček (2007): On Linear DETs. In: Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference, pp. 229-232, Institute of Electrical and Electronics Engineers (IEEE), Honolulu, HI, USA, ISBN 1-4244-0728-1 (bibtex)
Petr Němec (2007): Capturing the Meaning of Time Expressions: A Functional Approach. In: Proceedings of 3rd Language and Technology Conference, pp. 320-324, Wydawnictwo Poznańskie Sp. z o. o., Poznań, Poland, ISBN 978-83-7177-407-2 (local PDF, bibtex)
Petr Němec (2007): Automatic Analysis of Temporal Relations within a Discourse. In: Proceedings of the 14th International Symposium on Temporal Representation and Reasoning, pp. 117-128, IEEE Computer Society, Washington, DC, USA, ISBN 0-7695-2836-8 (bibtex)
Giang Linh Nguy, Zdeněk Žabokrtský (2007): Rule-based Approach to Pronominal Anaphora Resolution Applied on the Prague Dependency Treebank 2.0 Data. In: Proceedings of the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC 2007), pp. 77-81, CLUP-Center for Linguistics of the University of Oporto, Lagos (Algarve), Portugal, ISBN 978-989-95343-0-8 (bibtex)
Václav Novák (2007): Cedit - Semantic Networks Manual Annotation Tool. In: Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT), pp. 11-12, Association for Computational Linguistics, Rochester, NY, USA, ISBN 1-932432-94-9 (url, bibtex)
Václav Novák (2007): Large Semantic Network Manual Annotation. In: Proceedings of the Seventh International Workshop on Computational Semantics IWCS-7, pp. 355-358, Universiteit van Tilburg, Tilburg, The Netherlands, ISBN 90-74029-31-0 (bibtex)
Václav Novák, Zdeněk Žabokrtský (2007): Feature Engineering in Maximum Spanning Tree Dependency Parser. In: Proceedings of the 10th International Conference on Text, Speech and Dialogue, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 4629, no. XVII, pp. 92-98, Springer, Berlin / Heidelberg, ISBN 978-3-540-74627-0 (url, local PDF, bibtex)
Douglas W. Oard, Jianqiang Wang, Gareth J.F. Jones, Ryen W. White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang, Izhak Shafran (2007): Overview of the CLEF-2006 Cross-Language Speech Retrieval Track. In: Evaluation of Multilingual and Multi-modal Information retrieval. 7th Workshop of the Cross-language Evaluation Forum, CLEF 2006, Revised Selected Papers, Lecture Notes in Computer Science, ISSN 0302-9743, 4730, pp. 744-758, Springer, Berlin / Heidelberg, ISBN 978-3-540-74998-1 (bibtex)
Karel Oliva, Šárka Zikánová (2007): Ohlédnutí za 32. ročníkem Olympiády v českém jazyce. In: Český jazyk a literatura, ISSN 0009-0786, vol. 57, no. 3, pp. 109-116 (bibtex)
Petr Pajas (2007): Structure of Submodels - Diagonal indiscernibility in Models of Arithmetic (PhD thesis). In: (bibtex)
Jarmila Panevová (2007): Gradation of adjectives and valency. In: Gramatika a korpus / Grammar and Corpora 2005, pp. 197-204, ÚJČ AV ČR Praha, Praha, Czechia, ISBN 978-80-86496-32-0 (bibtex)
Jarmila Panevová (2007): Znovu o reciprocitě. In: Slovo a slovesnost, ISSN 0037-7031, vol. 68, no. 2, pp. 91-100 (bibtex)
Jarmila Panevová, Marie Mikulová (2007): On Reciprocity. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 87, pp. 27-40 (pdf, bibtex)
Pavel Pecina, Petra Hoffmannová, Gareth J.F. Jones, Ying Zhang, Douglas W. Oard (2007): Overview of the CLEF-2007 Cross-Language Speech Retrieval Track. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2007 Workshop, Magyar Tudományos Akadémia, Budapest, Hungary, ISBN 2-912335-31-0 (pdf, bibtex)
Nino Peterek, Petr Kaderka, Zdeňka Svobodová, Eva Havlová, Martin Havlík, Jana Klímová, Patricie Kubáčková (2007): Digitisation and Automatic Alignment of the DIALOG Corpus: Prosodically Annotated Corpus of Czech Television Debates. In: Proceedings of the 10th International Conference on Text, Speech and Dialogue, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 4629, no. XVII, pp. 607-612, Springer, Berlin / Heidelberg, ISBN 978-3-540-74627-0 (bibtex)
Martin Plátek, Markéta Lopatková (2007): Funkční generativní popis a formální teorie překladů. In: Proceedings of ITAT 2007 (Information Technologies - Application and Theory), pp. 3-14, Univerzita Pavla Jozefa Šafárika, Košice, Slovakia, ISBN 978-80-969184-7-8 (bibtex)
Adam Przepiórkowski, Łukasz Degórski, Miroslav Spousta, Kiril Simov, Petya Nacheva Osenova, Lothar Lemnitzer, Vladislav Kuboň, Beata Wójtowicz (2007): Towards the Automatic Extraction of Definitions in Slavic. In: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing 2007, pp. 43-50, Association for Computational Linguistics, Praha, Czechia, ISBN 978-1-932432-88-6 (bibtex)
Jan Ptáček (2007): Sentence Synthesis in Machine Translation. In: WDS'07 Proceedings of Contributed Papers, pp. 151-156, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-023-4 (bibtex)
Jan Ptáček, Zdeněk Žabokrtský (2007): Dependency-based Sentence Synthesis Component for Czech. In: Proceedings of the 3rd International Conference on Meaning-Text Theory (MTT 2007), pp. 407-415, Verlag Otto Sagner, c/o Kubon & Sagner, München / Wien, ISBN 978-3-86688-017-7 (bibtex)
Jan Raab (2007): Comparing Prosody Formalisms for Machine Learning. In: WDS'07 Proceedings of Contributed Papers, pp. 137-140, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-023-4 (bibtex)
Jiří Semecký (2007): Verb Valency Frames Disambiguation (PhD thesis). In: (bibtex)
Petr Sgall (2007): Issues of verb valency in syntactic annotation of a large corpus. In: Language Resources and Linguistic Theory (Materiali linguistici 59), pp. 25-37, Franco Angeli, Milano, ISBN 978-88-464-8944-9 (bibtex)
Petr Sgall, Václav Cvrček (2007): O názorové pluralitě a hledání konsenzu v lingvistice. In: Naše řeč, ISSN 0027-8203, vol. 90, no. 3, pp. 132-135 (bibtex)
Otakar Smrž (2007): Functional Arabic Morphology: Dissertation Summary. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 88, pp. 5-30 (pdf, local PDF, bibtex)
Otakar Smrž (2007): Demo Proposal: Extensible Integrated Treebank Annotation Environment. In: Proceedings of the 2nd Workshop on Computational Approaches to Arabic Script-based Languages, pp. 152-155, Linguistic Institute, Stanford, CA, USA (pdf, bibtex)
Otakar Smrž (2007): Functional Arabic Morphology. Formal System and Implementation (PhD thesis). In: (pdf, bibtex)
Otakar Smrž (2007): ElixirFM -- Implementation of Functional Arabic Morphology. In: ACL 2007 Proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, pp. 1-8, Association for Computational Linguistics, Praha, Czechia, ISBN 978-1-932432-86-2 (url, bibtex)
Otakar Smrž, Petr Pajas, Zdeněk Žabokrtský, Jan Hajič, Jiří Mírovský, Petr Němec (2007): Learning to Use the Prague Arabic Dependency Treebank. In: Perspectives on Arabic Linguistics: Papers from the annual symposium on Arabic Linguistics, pp. 77-90, John Benjamins, Amsterdam, The Netherlands, ISBN 978-90-272-4804-6 (pdf, bibtex)
Drahomíra Spoustová (2007): Kombinované statisticko-pravidlové metody značkování češtiny (PhD thesis). In: (local PDF, bibtex)
Drahomíra Spoustová, Jan Hajič, Jan Votrubec, Pavel Krbec, Pavel Květoň (2007): The Best of Two Worlds: Cooperation of Statistical and Rule-Based Taggers for Czech. In: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing 2007, pp. 67-74, Association for Computational Linguistics, Praha, Czechia, ISBN 978-1-932432-88-6 (url, local PDF, local PDF, bibtex)
Drahomíra Spoustová, Tomáš Jelínek (2007): Pravidlová disambiguace a získávání informací o povrchové valenci sloves a adjektiv z ČNK. In: Gramatika a korpus / Grammar and Corpora 2005, pp. 42-48, ÚJČ AV ČR Praha, Praha, Czechia, ISBN 978-80-86496-32-0 (bibtex)
Magda Ševčíková, Zdeněk Žabokrtský, Oldřich Krůza (2007): Named Entities in Czech: Annotating Data and Developing NE Tagger. In: Proceedings of the 10th International Conference on Text, Speech and Dialogue, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 4629, no. XVII, pp. 188-195, Springer, Berlin / Heidelberg, ISBN 978-3-540-74627-0 (bibtex)
Magda Ševčíková, Zdeněk Žabokrtský, Oldřich Krůza (2007): Zpracování pojmenovaných entit v českých textech (technical report). In: (pdf, bibtex)
Jana Šindlerová, Lucie Mladová, Josef Toman, Silvie Cinková (2007): An Application of the PDT-scheme to a Parallel Treebank. In: Proceedings of the 6th International Workshop on Treebanks and Linguistic Theories (TLT 2007), NEALT Proceedings Series, ISSN 1736-6305, 1, pp. 163-174, North European Association for Language Technology, Bergen, Norway (bibtex)
Barbora Vidová Hladká, Jan Hajič, Jiří Hana, Jaroslava Hlaváčová, Jiří Mírovský, Jan Votrubec (2007): Czech Academic Corpus 1.0 Guide. In: , ISBN 978-80-246-1315-4 (url, bibtex)
Daniel Zeman (2007): Unsupervised Acquiring of Morphological Paradigms from Tokenized Text. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2007 Workshop, Magyar Tudományos Akadémia, Budapest, Hungary, ISBN 2-912335-31-0 (pdf, bibtex)
Šárka Zikánová (2007): Connectives as Discourse Landmarks. Agnès Celle, Ruth Huart (eds.) Amsterdam, Philadelphia: John Benjamins Publishing Company, 2007, 212 pp. ISBN 978-90-272-5404-7 (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 88, pp. 95-98 (pdf, bibtex)
Šárka Zikánová, Miroslav Týnovský, Jiří Havelka (2007): Identification of Topic and Focus in Czech: Evaluation of Manual Parallel Annotations. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 87, pp. 61-70 (pdf, bibtex)
Zdeněk Žabokrtský, Markéta Lopatková (2007): Valency Information in VALLEX 2.0: Logical Structure of the Lexicon. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 87, pp. 41-60 (url, bibtex)
Eduard Bejček (2006): Automatické přiřazování významu - "Sense-tagging" (masters thesis). In: (bibtex)
Eduard Bejček, Petra Möllerová, Pavel Straňák (2006): The lexico-semantic annotation of PDT: Some results, problems and solutions. In: Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 21-28 (url, local PDF, local PDF, bibtex)
Václava Benešová, Ondřej Bojar (2006): Czech Verbs of Communication and the Extraction of their Frames. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 29-36, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (url, bibtex)
Ondřej Bojar (2006): Strojový překlad: zamyšlení nad účelností hloubkových jazykových analýz. In: Proceedings of Malý informatický seminář (MIS), pp. 3-13, Matfyzpress, Praha, Czechia, ISBN 80-7378-000-3 (bibtex)
Ondřej Bojar, Evgeny Matusov, Hermann Ney (2006): Czech-English Phrase-Based Machine Translation. In: Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006), pp. 214-224, Springer, Berlin / Heidelberg, ISBN 978-3-540-37334-6 (url, bibtex)
Ondřej Bojar, Magdalena Prokopová (2006): Czech-English Word Alignment. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 1236-1239, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (url, local PDF, bibtex)
Ondřej Bojar, Zdeněk Žabokrtský (2006): CzEng: Czech-English Parallel Corpus, Release version 0.5. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 86, pp. 59-62 (bibtex)
Silvie Cinková (2006): From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 2170-2175, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (local PDF, bibtex)
Silvie Cinková, Jan Hajič, Marie Mikulová, Lucie Mladová, Anja Nedolužko, Petr Pajas, Jarmila Panevová, Jiří Semecký, Jana Šindlerová, Josef Toman, Zdeňka Urešová, Zdeněk Žabokrtský (2006): Annotation of English on the tectogrammatical level (technical report). In: (local PDF, bibtex)
Silvie Cinková, Veronika Kolářová (2006): Nouns as Components of Support Verb Constructions in the Prague Dependency Treebank. In: Korpusy a korpusová lingvistika v zahraničí a na Slovensku (in press), pp. 113-139, Veda, Bratislava, Slovakia, ISBN 80-224-0880-8 (bibtex)
Silvie Cinková, Petr Podveský, Pavel Pecina, Pavel Schlesinger (2006): Semi-automatic Building of Swedish Collocation Lexicon. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 1890-1893, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (bibtex)
Silvie Cinková, Jan Pomikálek (2006): LEMPAS: A Make-Do Lemmatizer for the Swedish PAROLE-Corpus. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 86, pp. 47-54 (local PDF, bibtex)
František Čermák, Petr Sgall, Petr Vybíral (2006): K diskusi o standardní a "spisovné" češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 4, pp. 267-282 (bibtex)
Jurjen Duintjer Tebbens, Pavel Schlesinger (2006): Efficient Implementation of Optimal Linear Discriminant Analysis. In: Proceedings of the Seminar on Numerical Analysis (SNA'06), Modelling and Simulation of Challenging Engineering Problems, pp. 29-32, Computer Science of the Academy of Sciences of the Czech Republic, Sedlec-Prčice, Czechia (bibtex)
Boštjan Dvořák, Petr Homola, Vladislav Kuboň (2006): Exploiting Similarity in the MT into a Minority Language. In: Proceedings of the 5th SALTMIL Workshop on Minority Languages, pp. 59-64, European Language Resources Association, Paris, France, ISBN 2-9517408-2-4 (bibtex)
Sašo Džeroski, Tomaž Erjavec, Nina Ledinek, Petr Pajas, Zdeněk Žabokrtský, Andreja Žele (2006): Towards a Slovene Dependency Treebank. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 1388-1391, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (pdf, bibtex)
Jan Hajič (2006): Treebanks and Tagsets. In: Encyclopedia of Language and Linguistics, pp. 109-114, Elsevier, Amsterdam, The Netherlands, ISBN 0-08-044299-4 (bibtex)
Jan Hajič (2006): Complex Corpus Annotation: The Prague Dependency Treebank. In: Insight into the Slovak and Czech Corpus Linguistics, pp. 54-73, Veda, Bratislava, Slovakia, ISBN 80-224-0880-8 (pdf, bibtex)
Jan Hajič (2006): Complex Corpus Annotation: The Prague Dependency Treebank. In: Insight into Slovak and Czech Corpus Linguistics, pp. 54-73, Jazykovedný ústav Ľ. Štúra, SAV, Bratislava, Slovakia, ISBN 80-224-0880-8 (bibtex)
Jan Hajič, Marie Mikulová, Martina Otradovcová, Petr Pajas, Petr Podveský, Zdeňka Urešová (2006): Pražský závislostní korpus mluvené češtiny. Rekonstrukce standardizovaného textu z mluvené češtiny (technical report). In: (local PDF, local PDF, bibtex)
Eva Hajičová (2006): On translating and Understanding, Plurality of Languages and Cultures. In: Tradurre e comprendere. Pluralità dei linguaggi e delle culture. Atti del XII congresso nazionale della Società Italiana di Filosofia del Linguaggio, pp. 253-269, Aracne Editrice, Roma, Italy, ISBN 88-548-0733-8 (bibtex)
Eva Hajičová (2006): Old linguists never die, they only get obligatorily deleted. In: Computational Linguistics, ISSN 1530-9312, vol. 32, no. 4, pp. 457-469 (bibtex)
Eva Hajičová (2006): Natural Language Comprehension and Translation. In: Tradurre e comprendere: Pluralità dei linguaggi e delle culture, pp. 253-268, ARACNE editrice, Roma, ISBN 88-548-0733-8 (bibtex)
Eva Hajičová (2006): K některým otázkám závislostní gramatiky. In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 1, pp. 3-26 (bibtex)
Eva Hajičová (2006): Využití korpusu pro ověřování lingvistických hypotéz. In: Korpusová lingvistika: Stav a modelové přístupy, pp. 118-130, Nakladatelství Lidové noviny, Praha, ISBN 80-7106-865-9 (bibtex)
Eva Hajičová (2006): K tzv. vzdáleným závislostem očima Pražského závislostního korpusu. In: Gramatika a korpus, pp. 181-194, ÚJČ AV ČR, Praha, Czechia, ISBN 80-200-1463-2 (bibtex)
Eva Hajičová (2006): Towards the Underlying Structure Annotation of a Large Corpus of Texts. In: Insight into Slovak and Czech Corpus Linguistics, pp. 74-82, Veda, Bratislava, Slovakia, ISBN 80-224-0880-8 (bibtex)
Eva Hajičová (2006): O jazyce, jeho exaktním popisu (Pražská škola v lingvistice kdysi a dnes). In: Vesmír, ISSN 0042-4544, 85, pp. 512-513 (bibtex)
Eva Hajičová (2006): 80th Birthday of Petr Sgall. In: Linguistica Pragensia, ISSN 0862-8432, vol. 16, no. 1, pp. 40-43 (bibtex)
Eva Hajičová, Barbora Hladká, Lucie Kučová (2006): An Annotated Corpus as a Test Bed for Discourse Structure Analysis. In: Proceedings of the Workshop on Constraints in Discourse, pp. 82-89, National University of Ireland, Maynooth, Ireland (bibtex)
Eva Hajičová, Jarmila Panevová (2006): Petr Sgall Octogenarian. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 85, pp. 73-74 (bibtex)
Eva Hajičová, Jarmila Panevová (2006): K osmdesátinám Petra Sgalla. In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 3, pp. 234-237 (bibtex)
Eva Hajičová, Jarmila Panevová (2006): Introduction. In: Language in its multifarious aspects, pp. 7-16, Karolinum Press, Praha, Czechia, ISBN 80-246-1158-9 (bibtex)
Eva Hajičová, Petr Sgall (2006): Corpus annotation as a test of a linguistic theory. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 879-884, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (bibtex)
Eva Hajičová, Petr Sgall (2006): Eighty years of the Prague Linguistic Circle. In: Linguistica Pragensia, ISSN 0862-8432, vol. 16, no. 2, pp. 57-76 (bibtex)
Eva Hajičová, Věra Schmiedtová (2006): Standardní čeština a korpus. In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 4, pp. 262-266 (bibtex)
Jaroslava Hlaváčová (2006): New Approach to Frequency Dictionaries - Czech Example. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 373-378, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (bibtex)
Tomáš Holan, Zdeněk Žabokrtský (2006): Combining Czech Dependency Parsers. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 95-102, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (bibtex)
Petr Homola (2006): Die Sprachsituation im Teschener Schlesien aus historischer Perspektive (ArticleTranslation). In: Bildung eines Netzwerkes zur Förderung der Minderheitensprachen Polnisch und Tschechisch in der Grenzregion Těšín/Cieszyn, pp. 1-10
Petr Homola, Vladislav Kuboň (2006): A Structural Similarity Measure. In: Proceedings of the Workshop Linguistic Distances Coling/ACL 2006, pp. 91-99, Association for Computational Linguistics, Sydney, Australia, ISBN 1-932432-83-3 (bibtex)
Věra Jílková, Petr Sgall (2006): Osudy litomyšlské rodiny Sgallových. In: Pomezí Čech, Moravy a Slezska 7, pp. 182-195, Regionální muzeum Litomyšl, Litomyšl, ISBN 80-239-8105-6 (bibtex)
Václav Klimeš (2006): Transformation-Based Tectogrammatical Analysis of Czech. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 135-142, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (bibtex)
Václav Klimeš (2006): Analytical and Tectogrammatical Analysis of a Natural Language (PhD thesis). In: (url, local PDF, bibtex)
Václav Klimeš (2006): Rule-Based Analytical Parsing of Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 85, pp. 5-21 (local PDF, bibtex)
David Klusáček (2006): Maximum Mutual Information and Word Classes. In: WDS'06 Proceedings of Contributed Papers, pp. 185-190, Matfyzpress, Charles University, Praha, Czechia, ISBN 80-86732-84-3 (bibtex)
Philipp Koehn, Marcello Federico, Wade Shen, Nicola Bertoldi, Ondřej Bojar, Chris Callison-Burch, Brooke Cowan, Chris Dyer, Hieu Hoang, Richard Zens, Alexandra Constantin, Christine Corbett Moran, Evan Herbst (2006): Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Confusion Network Decoding (technical report). In: (local PDF, bibtex)
Veronika Kolářová (2006): Valency of Deverbal Nouns in Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 86, pp. 5-20 (bibtex)
Ivana Kruijffová, Klára Chvátalová, Oana Postolache (2006): Annotation Guidelines for Czech-English Word Alignment. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 1256-1261, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (bibtex)
Vladislav Kuboň, Markéta Lopatková, Martin Plátek, Patrice Pognan (2006): Segmentation of Complex Sentences. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 151-158, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (local PostScript, bibtex)
Vladislav Kuboň, Markéta Lopatková, Martin Plátek, Patrice Pognan (2006): O segmentaci českých vět. In: Proceedings of ITAT 2006 (Information Technologies - Application and Theory), pp. 81-86, Univerzita Pavla Jozefa Šafárika, Košice, Slovakia, ISBN 80-969184-4-3 (local PDF, bibtex)
Ondřej Kučera (2006): Pražský závislostní korpus jako elektronická cvičebnice jazyka českého. In: Proceedings of the 4th Student Research Competition in Informatics and Information Technologies (finalists papers), pp. 41-47, Association for Computing Machinery, Praha, Czechia (bibtex)
Ondřej Kučera (2006): Pražský závislostní korpus jako cvičebnice jazyka českého (masters thesis). In: (bibtex)
Ondřej Kučera (2006): A corpus-based exercise book of Czech language. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 85, pp. 35-56 (bibtex)
Markéta Lopatková, Jarmila Panevová (2006): Recent developments of the theory of valency in the light of the Prague Dependency Treebank. In: Insight into Slovak and Czech Corpus Linguistics, pp. 83-92, Veda, Bratislava, Slovakia, ISBN 80-224-0880-8 (bibtex)
Markéta Lopatková, Zdeněk Žabokrtský, Václava Benešová (2006): Valency Lexicon of Czech Verbs VALLEX 2.0 (technical report). In: (bibtex)
Markéta Lopatková, Zdeněk Žabokrtský, Karolína Skwarska (2006): Valency Lexicon of Czech Verbs: Alternation-Based Model. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 1728-1733, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (local PDF, bibtex)
Marie Mikulová, Alevtina Bémová, Jan Hajič, Eva Hajičová, Jiří Havelka, Veronika Kolářová, Lucie Kučová, Markéta Lopatková, Petr Pajas, Jarmila Panevová, Magda Razímová, Petr Sgall, Jan Štěpánek, Zdeňka Urešová, Kateřina Veselá, Zdeněk Žabokrtský (2006): Annotation on the tectogrammatical level in the Prague Dependency Treebank. Annotation manual (technical report). In: (bibtex)
Marie Mikulová, Alevtina Bémová, Jan Hajič, Eva Hajičová, Jiří Havelka, Veronika Kolářová, Lucie Kučová, Markéta Lopatková, Petr Pajas, Jarmila Panevová, Magda Ševčíková, Petr Sgall, Jan Štěpánek, Zdeňka Urešová, Kateřina Veselá, Zdeněk Žabokrtský (2006): Annotation on the tectogrammatical level in the Prague Dependency Treebank. Reference book (technical report). In: (bibtex)
Marie Mikulová, Alevtina Bémová, Jan Hajič, Eva Hajičová, Jiří Havelka, Veronika Kolářová, Lucie Kučová, Markéta Lopatková, Petr Pajas, Jarmila Panevová, Magda Ševčíková, Petr Sgall, Jan Štěpánek, Zdeňka Urešová, Kateřina Veselá, Zdeněk Žabokrtský (2006): Anotace na tektogramatické rovině Pražského závislostního korpusu. Referenční příručka (technical report). In: (bibtex)
Jiří Mírovský (2006): Netgraph: A Tool for Searching in Prague Dependency Treebank 2.0. In: Proceedings of the Fifth Workshop on Treebanks and Linguistic Theories (TLT), pp. 211-222, ÚFAL MFF UK, Praha, Czechia, ISBN 80-239-8009-2 (pdf, bibtex)
Craig G. Murray, Bonnie J. Dorr, Jimmy Lin, Pavel Pecina, Jan Hajič (2006): Leveraging Reusability: Cost-Effective Lexical Acquisition for Large-scale Ontology Translation. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pp. 945-952, Association for Computational Linguistics, Sydney, Australia, ISBN 1-932432-65-5 (bibtex)
Craig G. Murray, Bonnie J. Dorr, Jimmy Lin, Pavel Pecina, Jan Hajič (2006): Leveraging Recurrent Phrase Structure in Large-scale Ontology Translation. In: Proceedings of the 11th Annual conference of the European Association for Machine Translation (EAMT), pp. 1-10, European Association for Machine Translation, Oslo, Norway, ISBN 82-7368-294-3 (bibtex)
Petr Němec (2006): Annotation of Temporal Relations within a Discourse. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 4188, no. 9, pp. 181-188, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (bibtex)
Petr Němec (2006): Tree Searching/Rewriting Formalism. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 2194-2199, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (bibtex)
Václav Novák (2006): On Distance between Deep Syntax and Semantic Representation. In: Proceedings of Frontiers in Linguistically Annotated Corpora, pp. 78-85, The Association for Computational Linguistics, Sydney, Australia, ISBN 1-932432-78-7 (bibtex)
Václav Novák, Jan Hajič (2006): Perspectives of Turning Prague Dependency Treebank into a Knowledge Base. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 439-442, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (local PDF, bibtex)
Douglas W. Oard, Jianqiang Wang, Gareth J.F. Jones, Ryen W. White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang, Izhak Shafran (2006): Overview of the CLEF-2006 Cross-Language Speech Retrieval Track. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2006 Workshop, Delos Network of Excellence, Alicante, Spain, ISBN 2-912335-23-X (bibtex)
Petr Pajas, Jan Štěpánek (2006): XML-Based Representation of Multi-Layered Annotation in the PDT 2.0. In: Proceedings of the LREC Workshop on Merging and Layering Linguistic Information (LREC 2006), pp. 40-47, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (url, bibtex)
Jarmila Panevová (2006): Na co jsme pyšní a čemu zůstaneme věrní (Poznámky k valenci několika českých adjektiv). In: Možnosti a meze české gramatiky, pp. 153-166, Academia, Praha, Czechia, ISBN 80-200-1463-2 (bibtex)
Jarmila Panevová (2006): Dvě poznámky k tzv. vágnosti. In: Od fonemu do tekstu. Prace dedykowane Profesorowi Romanowi Laskowskiemu, pp. 301-304, Wydawnictwo LEXIS, Kraków, ISBN 83-89425-24-6 (bibtex)
Pavel Pecina, Pavel Schlesinger (2006): Combining Association Measures for Collocation Extraction. In: Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pp. 651-658, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 1-932432-65-5 (bibtex)
Nino Peterek (2006): Tools and Data for Analysis of Spoken Czech and its Prosody (PhD thesis). In: (bibtex)
Jan Ptáček, Zdeněk Žabokrtský (2006): Synthesis of Czech Sentences from Tectogrammatical Trees. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 221-228, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (pdf, bibtex)
Magda Razímová, Zdeněk Žabokrtský (2006): Annotation of Grammatemes in the Prague Dependency Treebank 2.0. In: Proceedings of the LREC Workshop on Annotation Science, pp. 12-19, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (bibtex)
Kiril Ribarov, Alevtina Bémová, Barbora Hladká (2006): When a statistically oriented parser was more efficient than a linguist: A case of treebank conversion. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 86, pp. 21-38 (bibtex)
Jiří Semecký (2006): Automatic Verb Valency Frames Disambiguation from Czech. In: Proceedings of the 11th ESSLLI Student Session, pp. 250-274, Universidad de Málaga, Málaga, Spain (bibtex)
Jiří Semecký (2006): On Automatic Assignment of Verb Valency Frames in Czech. In: Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC 2006), pp. 1941-1944, ELRA, Genova, Italy, ISBN 2-9517408-2-4 (pdf, local PDF, bibtex)
Jiří Semecký, Silvie Cinková (2006): Constructing an English Valency Lexicon. In: Proceedings of Frontiers in Linguistically Annotated Corpora, pp. 94-97, The Association for Computational Linguistics, Sydney, Australia, ISBN 1-932432-78-7 (pdf, bibtex)
Jiří Semecký, Petr Podveský (2006): Extensive Study on Automatic Verb Sense Disambiguation in Czech. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 237-244, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (url, bibtex)
Květa Sgallová, Petr Sgall (2006): Zemřel Miroslav Červenka. In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 2, pp. 159-160 (bibtex)
Petr Sgall (2006): Review of Prager Strukturalismus. Methodologische Grundlagen. - Prague Structuralism. Methodological Fundamentals. (review). In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 2, pp. 146-150 (bibtex)
Petr Sgall (2006): Poznámka k pojmu hyperkorektnost. In: Naše řeč, ISSN 0027-8203, vol. 89, no. 1, pp. 21-25 (bibtex)
Petr Sgall (2006): Proti černobílé spisovnosti. Rozhovor s Bohumilem Vykypělem. In: A2 - Kulturní týdeník, ISSN 1801-4542, 43, pp. 24-25 (bibtex)
Petr Sgall (2006): Běžná mluva a lingvisté v Čechách a na Moravě. In: Teorie a empirie. Bichla pro Krčmovó, pp. 27-38, Masarykova univerzita, Brno, ISBN 80-210-3955-8 (bibtex)
Petr Sgall (2006): Valence jako jádro jazykového systému. In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 3, pp. 163-178 (bibtex)
Petr Sgall (2006): Kořeny mé životní dráhy – Můj otec, má matka a má mateřština. In: Jazykovědné aktuality, ISSN 1212-5326, vol. XLIII, no. 3-4, pp. 4-17 (bibtex)
Petr Sgall (2006): Language in its multifarious aspects. In: , ISBN 80-246-1158-9 (bibtex)
Petr Sgall, Concetta Maglione (2006): Čeština standardní a běžně mluvená. In: Český jazyk a literatura, ISSN 0009-0786, vol. 56, no. 2, pp. 80-87 (bibtex)
Otakar Smrž (2006): Tips and Tricks of the Prague Arabic Dependency Treebank. In: Proceedings of The Challenge of Arabic for NLP/MT Conference, pp. 25-34, The British Computer Society, London, UK (pdf, bibtex)
Miroslav Spousta (2006): Web as a Corpus. In: WDS'06 Proceedings of Contributed Papers, pp. 179-184, Matfyzpress, Charles University, Praha, Czechia, ISBN 80-86732-84-3 (bibtex)
Magda Ševčíková-Razímová, Zdeněk Žabokrtský (2006): Systematic Parameterized Description of Pro-forms in the Prague Dependency Treebank 2.0. In: Proceedings of the Fifth Workshop on Treebanks and Linguistic Theories (TLT), pp. 175-186, ÚFAL MFF UK, Praha, Czechia, ISBN 80-239-8009-2 (pdf, bibtex)
Jan Štěpánek (2006): Post-annotation Checking of Prague Dependency Treebank 2.0 Data. In: Text, Speech and Dialogue. 9th International Conference, TSD 2006, Brno, Czech Republic, September 11–15, 2006, Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 4188, pp. 277-284, Springer, Berlin / Heidelberg, ISBN 978-3-540-39090-9 (bibtex)
Jan Štěpánek (2006): Závislostní zachycení větné struktury v anotovaném syntaktickém korpusu (nástroje pro zajištění konzistence dat) (PhD thesis). In: (pdf, local PDF, bibtex)
Jan Štěpánek (2006): Post-annotation checking of Prague Dependency Treebank 2.0 data. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 85, pp. 23-33 (local PDF, bibtex)
Zdeňka Urešová (2006): Verbal Valency in the Prague Dependency Treebank from the Annotator's Viewpoint. In: Insight into Slovak and Czech Corpus Linguistics, pp. 93-112, Veda, Bratislava, Slovakia, ISBN 80-224-0880-8 (local PDF, bibtex)
Barbora Vidová Hladká (2006): One day with Eva Hajičová. In: Linguistica Pragensia, ISSN 0862-8432, vol. 16, no. 1, pp. 36-38 (bibtex)
Barbora Vidová Hladká, Jan Králík (2006): Proměny Českého akademického korpusu. In: Slovo a slovesnost, ISSN 0037-7031, vol. 67, no. 4, pp. 179-194 (bibtex)
Jan Votrubec (2006): Morphological Tagging Based on Averaged Perceptron. In: WDS'06 Proceedings of Contributed Papers, pp. 191-195, Matfyzpress, Charles University, Praha, Czechia, ISBN 80-86732-84-3 (bibtex)
Šárka Zikánová (2006): What do the data in Prague Dependency Treebank say about systemic ordering in Czech?. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 86, pp. 39-46 (bibtex)
Šárka Zikánová (2006): Slovosled ve starší češtině (1500-1620): Postavení syntetického přísudku ve větě hlavní (PhD thesis). In: (url, bibtex)
Václava Benešová (2005): Valency and Semantic Features of Verbs. In: WDS'05 Proceedings of Contributed Papers, pp. 66-71, Matfyzpress, Charles University, Praha, Czechia, ISBN 80-86732-59-2 (local PDF, bibtex)
Ondřej Bojar (2005): Budování česko-anglického slovníku pro strojový překlad. In: ITAT 2005 Information Technologies - Applications and Theory, pp. 201-211, Univerzita Pavla Jozefa Šafárika, Račkova dolina, Slovakia, ISBN 80-7097-609-8 (bibtex)
Ondřej Bojar, Cyril Brom, Milan Hladík, Vojtěch Toman (2005): The Project ENTs: Towards Modelling Human-like Artificial Agents. In: SOFSEM 2005: Communications, pp. 111-122, Society for Computer Science, Liptovský Ján, Slovakia, ISBN 80-969255-4-7 (bibtex)
Ondřej Bojar, Jan Hajič (2005): Extracting Translations Verb Frames. In: Proceedings of Modern Approaches in Translation Technologies, pp. 2-6, Bulgarian Academy of Sciences, Borovec, Bulgaria, ISBN 954-90906-9-8 (bibtex)
Ondřej Bojar, Petr Homola, Vladislav Kuboň (2005): Problems of Reusing an existing MT System. In: Second International Joint Conference on Natural Language Processing: Companion Volume including Posters/Demos and tutorial abstracts, pp. 179-184, Asian Federation of Natural Language Processing, Jeju Island, Korea, ISBN 978-3-540-29172-5 (bibtex)
Ondřej Bojar, Petr Homola, Vladislav Kuboň (2005): An MT System Recycled. In: Proceedings of the 10th Machine Translation Summit, pp. 380-387, Phuket, Thailand, ISBN 974-7431-26-2 (bibtex)
Ondřej Bojar, Petr Homola, Vladislav Kuboň (2005): Problémy recyklování systému automatického překladu. In: ITAT 2005 Information Technologies - Applications and Theory, pp. 335-344, Univerzita Pavla Jozefa Šafárika, Račkova dolina, Slovakia, ISBN 80-7097-609-8 (bibtex)
Ondřej Bojar, Jiří Semecký, Václava Benešová (2005): VALEVAL: Testing VALLEX Consistency and Experimenting with Word-Frame Disambiguation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 83, pp. 5-17 (bibtex)
Silvie Cinková, Zdeněk Žabokrtský (2005): Swedish-Czech Combinatorial Valency Lexicon of Predicate Nouns: Describing Event Structure in Support Verb Constructions. In: Proceedings of the 8th International Conference on Computational Lexicography COMPLEX, pp. 50-59, Nyelvtudományi Intézet, Magyar Tudományos Akadémia, Budapest, Hungary, ISBN 963-9074-35-7 (local PDF, bibtex)
Silvie Cinková, Zdeněk Žabokrtský (2005): Treating support verb constructions in a lexicon: Swedish-Czech combinatorial valency lexicon of predicate nouns. In: Proceedings of Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes, pp. 22-27, Universität des Saarlandes, Saarbrücken, Germany (bibtex)
Jan Cuřín, Martin Čmejrek, Jiří Havelka, Vladislav Kuboň (2005): Building a parallel bilingual syntactically annotated corpus. In: Natural Language Processing – IJCNLP 2004 First International Joint Conference, Hainan Island, China, March 22-24, 2004, Revised Selected Papers, Lecture Notes in Computer Science, ISSN 0302-9743, 3248, pp. 168-176, Springer, Berlin / Heidelberg, ISBN 978-3-540-24475-2 (bibtex)
František Čermák, Petr Sgall, Petr Vybíral (2005): Od školské spisovnosti ke standardní češtině. In: Slovo a slovesnost, ISSN 0037-7031, vol. 66, no. 2, pp. 103-115 (bibtex)
Martin Čmejrek, Jan Cuřín, Jan Hajič, Jiří Havelka (2005): Prague Czech-English Dependency Treebank: Resource for Structure-based MT. In: Proceedings of the 10th EAMT Conference, pp. 73-78, European Association for Machine Translation, Budapest, Hungary, ISBN 963-9206-04-0 (bibtex)
Martin Čmejrek, Jarmila Panevová (2005): Strojový překlad (50 let od zrodu). In: Proceedings of EUROLINGUA ’04, pp. 124-133, Technická univerzita, Pedagogická fakulta, Liberec, Czechia, ISBN 80-7083-958-9 (bibtex)
Drahomíra Doležalová (2005): Automatic Construction of a Valency Lexicon of Czech Adjectives. In: Proceedings of the 8th International Conference, TSD 2005, Lecture Notes in Computer Science, ISSN 0302-9743, 3658, pp. 56-60, Springer, Berlin / Heidelberg, ISBN 3-540-28789-2 (bibtex)
Jan Hajič, Otakar Smrž, Tim Buckwalter, Hubert Jin (2005): Feature-based Tagger of Approximations of Functional Arabic Morphology. In: Proceedings of the Fourth Workshop on Treebanks and Linguistic Theories (TLT 2005), pp. 53-64, Universitat de Barcelona, Barcelona, Spain, ISBN 978-84-475-2992-6 (pdf, local PDF, bibtex)
Eva Hajičová (2005): Written and spoken language: The case of ambiguities. In: Proceedings of SPECOM 2005 (10th International Conference Speech and Computer), pp. 33-38, International Speech and Communication Organization (ISCA), Patras, Greece, ISBN 5-7452-0110-X (bibtex)
Eva Hajičová (2005): Harvesting the fruit from Treebanks: Information structure and the stock of shared knowledge. In: Proceedings of the Annual Conference of Societas Linguistica Europaea: Formal, Functional and Typological Perspectives on Discourse and Grammar, pp. 1-12, Valencia, Spain (bibtex)
Eva Hajičová (2005): What Golem was not yet able to do (and what e-Golem should learn). In: Interdisciplinary Aspects of Human-Machine Co-existence and Co-operation, Czech-Argentine Biennale Workshop “e-Golems”, pp. 207-213, České vysoké učení technické, Praha, Czechia, ISBN 80-01-03275-2 (bibtex)
Eva Hajičová (2005): On some aspects of translation. In: A Festschrift for Libuše Dušková, pp. 47-57, Univerzita Karlova, Praha, Czechia, ISBN 80-7308-108-3 (bibtex)
Eva Hajičová (2005): Edward Göbbel: Syntactic and focus-structural aspects of triadic constructions (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 83, pp. 77-81 (bibtex)
Eva Hajičová, Jiří Havelka, Kateřina Veselá (2005): Corpus Evidence of Contextual Boundness and Focus. In: Proceedings of the Corpus Linguistics Conference Series, pp. 1-9, University of Birmingham, Birmingham, UK (url, local DOC, local HTML, bibtex)
Eva Hajičová, Petr Sgall (2005): The position of information structure in the core of language. In: The Partee Effect, pp. 289-302, CSLI, Palo Alto, CA, USA, ISBN 1-57586-504-1 (bibtex)
Keith Brendan Hall, Václav Novák (2005): Corrective Modeling for Non-Projective Dependency Parsing. In: Proceedings of the Ninth International Workshop on Parsing Technologies (IWPT), pp. 42-52, Association for Computational Linguistics, Vancouver, BC, Canada, ISBN 1-932432-58-2 (local PostScript, bibtex)
Jiří Hana, Daniel Zeman, Jan Hajič, Hana Hanová, Barbora Hladká, Emil Jeřábek (2005): Manual for Morphological Annotation, Revision for the Prague Dependency Treebank 2.0 (technical report). In: (pdf, local PDF, bibtex)
Jiří Havelka (2005): Projektivita v úplně uspořádaných kořenových stromech: alternativní definice projektivity a optimální algoritmy pro zprojektivnění a nalezení neprojektivních hran. In: Proceedings of Malý informatický seminář (MIS), pp. 11-28, Matfyzpress, Charles University, Josefův Důl, Czechia, ISBN 80-86732-70-3 (bibtex)
Jiří Havelka (2005): Projectivity in Totally Ordered Rooted Trees: An Alternative Definition of Projectivity and Optimal Algorithms for Detecting Non-Projective Edges and Projectivizing Totally Ordered Rooted Trees. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 84, pp. 13-30 (pdf, local PDF, bibtex)
Jaroslava Hlaváčová (2005): Orwell's 1984 - playing with Czech and Slovak versions. In: Computer Treatment of Slavic and East European Languages (Proceedings of Slovko 2005), pp. 116-123, Veda, Bratislava, Slovakia, ISBN 80-224-0895-6 (bibtex)
Martin Holub (2005): Models, Similarity, and Topics of Texts (PhD thesis). In: (bibtex)
Martin Holub (2005): Review of Peter Jackson and Isabelle Moulinier: Natural Language Processing for Online Applications: Text Retrieval, Extraction and Categorization (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 83, pp. 83-84 (bibtex)
Petr Homola, Vladislav Kuboň (2005): A Machine Translation System into a Minority Language. In: Proceedings of Modern Approaches in Translation Technologies, pp. 31-35, Bulgarian Academy of Sciences, Borovec, Bulgaria, ISBN 954-90906-9-8 (bibtex)
Veronika Kolářová (2005): Valence vybraných skupin verbálních a dějových substantiv z pohledu čísel. In: Conference Grammar & Corpora Abstracts / Konference Gramatika & korpus Anotace příspěvků, pp. 39-41, Ústav pro jazyk český, Akademie věd České republiky, Praha, Czechia (bibtex)
Veronika Kolářová (2005): Valence deverbativních substantiv v češtině (PhD thesis). In: (bibtex)
Iveta Kouřilová, Otakar Smrž (2005): Review of A Student Grammar of Modern Standard Arabic by Eckehard Schultz, Cambridge University Press, 264p. (review). In: The Linguist List, ISSN 1068-4875, 16.2221, pp. 1-2 (url, bibtex)
Pavel Krbec (2005): Language Modeling for Speech Recognition of Czech (PhD thesis). In: (bibtex)
Lucie Kučová, Kateřina Veselá, Eva Hajičová, Jiří Havelka (2005): Topic-focus articulation and anaphoric relations: A corpus based probe. In: Proceedings of Discourse Domains and Information Structure workshop, pp. 37-46, Edinburgh, Scotland, UK (pdf, local PDF, bibtex)
Lucie Kučová, Zdeněk Žabokrtský (2005): Anaphora in Czech: Large Data and Experiments with Automatic Anaphora Resolution. In: Proceedings of the 8th International Conference, TSD 2005, Lecture Notes in Computer Science, ISSN 0302-9743, 3658, pp. 93-98, Springer, Berlin / Heidelberg, ISBN 3-540-28789-2 (pdf, local PDF, bibtex)
Markéta Lopatková (2005): Formální specifikace podkladové struktury pro popis přirozeného jazyka. In: Proceedings of Malý informatický seminář (MIS), pp. 49-60, Matfyzpress, Charles University, Josefův Důl, Czechia, ISBN 80-86732-70-3 (bibtex)
Markéta Lopatková, Ondřej Bojar, Jiří Semecký, Václava Benešová, Zdeněk Žabokrtský (2005): Valency Lexicon of Czech Verbs VALLEX: Recent Experiments with Frame Disambiguation. In: Proceedings of the 8th International Conference, TSD 2005, Lecture Notes in Computer Science, ISSN 0302-9743, 3658, pp. 99-106, Springer, Berlin / Heidelberg, ISBN 3-540-28789-2 (bibtex)
Markéta Lopatková, Martin Plátek, Vladislav Kuboň (2005): Modeling syntax of Free Word-Order Languages: Dependency Analysis By Reduction. In: Proceedings of the 8th International Conference, TSD 2005, Lecture Notes in Computer Science, ISSN 0302-9743, 3658, pp. 140-147, Springer, Berlin / Heidelberg, ISBN 3-540-28789-2 (bibtex)
Ryan McDonald, Fernando Pereira, Kiril Ribarov, Jan Hajič (2005): Non-Projective Dependency Parsing using Spanning Tree Algorithms. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp. 523-530, Association for Computational Linguistics, Vancouver, BC, Canada, ISBN 1-932432-55-8 (pdf, local PDF, bibtex)
Marie Mikulová, Alevtina Bémová, Jan Hajič, Eva Hajičová, Jiří Havelka, Veronika Kolářová, Markéta Lopatková, Petr Pajas, Jarmila Panevová, Magda Razímová, Petr Sgall, Jan Štěpánek, Zdeňka Urešová, Kateřina Veselá, Zdeněk Žabokrtský, Lucie Kučová (2005): Anotace na tektogramatické rovině Pražského závislostního korpusu. Anotátorská příručka (technical report). In: (bibtex)
Jiří Mírovský (2005): Pražský závislostní korpus a jeho dostupnost pomocí vyhledávacího programu Netgraph. In: Jazyky v kontaktu / jazyky v konfliktu a evropský jazykový prostor, pp. 258-258, Univerzita Palackého, Olomouc, Olomouc, Czechia, ISBN 80-244-1027-3 (bibtex)
Petr Němec (2005): Application of Backpropagation in Morphological Tagging for Czech. In: Proceedings of Microsoft Association for Computing Machinery Regional Competition, pp. 49-56, Praha, Czechia (bibtex)
Petr Němec, Kiril Ribarov (2005): Making the Good Taggers Even Better: Application of Artificial Neural Networks in Morphological Tagging of Czech. In: Proceedings of the 2nd Language and Technology Conference, pp. 85-89, Wydawnictvo Poznańskie Sp. z o.o., Poznań, Poland, ISBN 83-7177-341-2 (bibtex)
Petr Pajas, Jan Štěpánek (2005): A Generic XML-Based Format for Structured Linguistic Annotation and Its Application to Prague Dependency Treebank 2.0 (technical report). In: (pdf, local PDF, bibtex)
Jarmila Panevová (2005): Sloveso: centrum věty, valence: centrální pojem syntaxe. In: Aktuálne otázky súčasnej syntaxe, pp. 73-77, Veda Bratislava, Slovakia, Bratislava, Slovakia, ISBN 80-224-0879-4 (local PDF, bibtex)
Pavel Pecina (2005): An Extensive Empirical Study of Collocation Extraction Methods. In: Proceedings of the ACL Student Research Workshop, pp. 13-18, Association for Computational Linguistics, Ann Arbor, MI, USA, ISBN 1-932432-51-5 (local PDF, bibtex)
Martin Plátek, František Mráz, Friedrich Otto, Markéta Lopatková (2005): O roztržitosti a volnosti slovosledu pomocí restartovacích automatů. In: ITAT 2005 Information Technologies - Applications and Theory, pp. 145-156, Univerzita Pavla Jozefa Šafárika, Račkova dolina, Slovakia, ISBN 80-7097-609-8 (local PostScript, bibtex)
Petr Podveský, Pavel Machek (2005): Speech Recognition of Czech - Inclusion of Rare Words Helps. In: Proceedings of the ACL Student Research Workshop, pp. 121-126, Association for Computational Linguistics, Ann Arbor, MI, USA, ISBN 1-932432-51-5 (url, local BIB, local PDF, bibtex)
Josef Psutka, Pavel Ircing, Josef V. Psutka, Jan Hajič, William Byrne, Jiří Mírovský (2005): Automatic transcription of Czech, Russian, and Slovak Spontaneous Speech in the MALACH Project. In: Proceedings of Eurospeech 2005, pp. 1349-1352, ISCA, Lisboa, Portugal (pdf, local PDF, bibtex)
Magda Razímová (2005): Meanings of Morphological Categories on the Tectogrammatical Level. In: WDS'05 Proceedings of Contributed Papers, pp. 72-77, Matfyzpress, Charles University, Praha, Czechia, ISBN 80-86732-59-2 (local PDF, bibtex)
Magda Razímová, Zdeněk Žabokrtský (2005): Morphological Meanings in the Prague Dependency Treebank 2.0. In: Proceedings of the 8th International Conference, TSD 2005, Lecture Notes in Computer Science, ISSN 0302-9743, 3658, pp. 148-155, Springer, Berlin / Heidelberg, ISBN 3-540-28789-2 (local PDF, bibtex)
Kiril Ribarov (2005): Kořenové stromy a závislostní parsing. In: Proceedings of Malý informatický seminář (MIS), pp. 61-81, Matfyzpress, Charles University, Josefův Důl, Czechia, ISBN 80-86732-70-3 (bibtex)
Erika Rimkutė, Giedrė Jarašiūnaitė, Petr Homola (2005): Morfologinių samplaikų atpažinimas ir klasifikavimas. In: Lituanistica, ISSN 0235-716X, vol. 62, no. 2, pp. 58-75 (bibtex)
Veronika Řezníčková, Zdeňka Urešová (2005): K syntaktické anotaci textu Českého národního korpusu: od analytické k tektogramatické rovině. In: Aktuálne otázky súčasnej syntaxe, pp. 57-72, Veda Bratislava, Slovakia, Bratislava, Slovakia, ISBN 80-224-0879-4 (bibtex)
Jiří Semecký (2005): Automatic assignment of Frame Semantics using Syntax-Semantics Interface in LFG. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 83, pp. 19-46 (bibtex)
Petr Sgall (2005): Recenze: R. Nicolaï and P. Zima (eds.): Lexical and structural diffusion: Interplay of internal and external factors of language development in the west African Sahel. Corpus. Les cahiers 1. Nice: Faculté des lettres, arts et sciences humaines de Nice (review). In: Slovo a slovesnost, ISSN 0037-7031, vol. 66, no. 1, pp. 52-54 (bibtex)
Petr Sgall (2005): Eva Hajičová's birthday. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 84, pp. 59-60 (bibtex)
Petr Sgall (2005): A note on the destruction and the revival of the Prague Linguistic Circle. In: Linguistica Pragensia, ISSN 0862-8432, vol. 15, no. 1, pp. 43-45 (bibtex)
Miroslav Spousta (2005): Automatické přiřazování tvaroslovných vzorů v češtině (master's thesis). In: (local PDF, bibtex)
Pavel Straňák (2005): Review of Leonard Talmy: Toward a Cognitive Semantics, Volume I, Concept Structuring Systems (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 83, pp. 85-86 (local PDF, bibtex)
Alfonso Medina Urrea, Jaroslava Hlaváčová (2005): Automatic Recognition of Czech Derivational Prefixes. In: Computational Linguistics and Intelligent Text Processing. 6th International Conference, CICLing 2005, Mexico City, Mexico, February 13-19, 2005. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 3406, pp. 189-197, Springer, Berlin / Heidelberg, ISBN 978-3-540-24523-0 (bibtex)
Hans Uszkoreit, Valia Kordoni, Vladislav Kuboň, Michael Rosner, Sabine Kirschmeyer Andersen (2005): Language Technology from a European Perspective. In: Proceedings of the Second ACL Workshop on Effective Tools and Methodologies for Teaching NLP and CL, pp. 43-48, Association for Computational Linguistics, Ann Arbor, MI, USA, ISBN 1-932432-51-5 (bibtex)
Barbora Vidová Hladká, Ondřej Kučera (2005): Prague Dependency Treebank as an Exercise Book of Czech. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp. 14-15, Association for Computational Linguistics, Vancouver, BC, Canada, ISBN 1-932432-55-8 (url, local PDF, bibtex)
Jan Votrubec (2005): Volba vhodné sady rysů pro morfologické značkování češtiny (master's thesis). In: (bibtex)
Daniel Zeman, Zdeněk Žabokrtský (2005): Improving Parsing Accuracy by Combining Diverse Dependency Parsers. In: Proceedings of the Ninth International Workshop on Parsing Technologies (IWPT), pp. 171-178, Association for Computational Linguistics, Vancouver, BC, Canada, ISBN 1-932432-58-2 (pdf, local PDF, bibtex)
Šárka Zikánová (2005): Postavení syntetického predikátu v humanistické češtině. In: Aktuálne otázky súčasnej syntaxe, pp. 93-102, Veda Bratislava, Slovakia, Bratislava, Slovakia, ISBN 80-224-0879-4 (bibtex)
Šárka Zikánová (2005): Latinské vlivy v humanistické češtině. In: Jazyky v kontaktu / jazyky v konfliktu a evropský jazykový prostor, pp. 179-191, Univerzita Palackého, Olomouc, Olomouc, Czechia, ISBN 80-244-1027-3 (url, local PDF, bibtex)
Šárka Zikánová (2005): Příspěvek k aktuálnímu členění věty v humanistické češtině. In: Slova a dějiny: Igoru Němcovi k 80. narozeninám, pp. 391-399, Albis international, Prague, Czech Republic, ISBN 80-86496-20-1 (bibtex)
Zdeněk Žabokrtský (2005): Resemblances between Meaning-Text Theory and Functional Generative Description. In: Proceedings of the 2nd International Conference of Meaning-Text Theory, pp. 549-557, Slavic Culture Languages Publishers House, Moskva, Russia, ISBN 5-9551-0094-6 (local PDF, bibtex)
Zdeněk Žabokrtský (2005): Valency Lexicon of Czech Verbs (PhD thesis). In: (pdf, local PDF, bibtex)
Václava Benešová (2004): Delimitace lexií českých sloves z hlediska jejich syntaktických vlastností (diplomová práce). In: , Linguistic Data Consortium, University of Pennsylvania, ISBN 1-58563-280-5 (bibtex)
Renata Blatná, František Čermák, Jaroslava Hlaváčová, Milena Hnátková, Jan Kocek, Marie Kopřivová, Michal Křen, Vladimír Petkevič, Věra Schmiedtová, Martin Stluka, Michal Šulc (2004): Frekvenční slovník češtiny. In: , pp. 595 s., Linguistic Data Consortium, University of Pennsylvania, ISBN 80-7106-676-1 (bibtex)
Ondřej Bojar (2004): Problems of Inducing Large Coverage Constraint-Based Dependency Grammar for Czech. In: Proceedings of International Workshop on Constraint Solving and Language Processing, CSLP 2004, pp. 29-42, Roskilde University, Roskilde (bibtex)
Ondřej Bojar (2004): Automated Extraction of Lexico-Syntactic Information. In: WDS, pp. 211-217, Charles University, Matfyzpress, Prague (bibtex)
Ondřej Bojar (2004): Czech Syntactic Analysis Constraint-Based, XDG: One Possible Start. In: , pp. 43-54 (bibtex)
Ondřej Bojar, Jiří Semecký, Shravan Vasishth, Ivana Kruijff-Korbayová (2004): Processing noncanonical word order in Czech. In: Proceedings of Architectures and Mechanisms for Language Processing, AMLaP 2004, pp. 91-91, Université de Provence, Aix en Provence (bibtex)
William J. Byrne, David Doermann, Martin Franz, Samuel Gustman, Jan Hajič, Douglas W. Oard, Michael Picheny, Josef V. Psutka, Bhuvana Ramabhadran, Dagobert Soergel, Todd Ward, Wang Zhu (2004): Automatic Recognition of Spontaneous Speech for Access to Multilingual Oral History Archives. In: IEEE Transactions on Speech and Audio Processing, ISSN 1063-6676, pp. 420-435 (bibtex)
Silvie Cinková (2004): Mats Wahlberg (ed.): Svenskt ortnamnslexikon (Švédský slovník místních jmen) (review). In: Acta Onomastica, ISSN 1211-4413, XLV, pp. 105-106 (bibtex)
Silvie Cinková (2004): Extraction of Swedish Verb-Noun Collocations from a Large Msd-Annotated Corpus. In: , pp. 99-102 (bibtex)
Silvie Cinková (2004): Recenze - Ruslan Mitkov (ed.) The Oxford Handbook of Computational Linguistics. In: , pp. 87-94 (bibtex)
Silvie Cinková (2004): Manuál pro tektogramatickou anotaci angličtiny (technical report). In: , pp. 2-172 (bibtex)
Jan Cuřín, Martin Čmejrek, Jiří Havelka, Jan Hajič, Vladislav Kuboň, Zdeněk Žabokrtský (2004): Prague Czech-English Dependency Treebank Version 1.0. In: Linguistic Data Consortium (LDC), Linguistic Data Consortium (LDC), University of Pennsylvania, ISBN 1-58563-321-6 (bibtex)
Martin Čmejrek, Jan Cuřín, Jiří Havelka (2004): Prague Czech-English Dependency Treebank: Any Hopes for a Common Annotation Scheme? In: HLT-NAACL 2004 Workshop: Frontiers in Corpus Annotation, pp. 47-54, Association for Computational Linguistics, Boston (pdf, local PDF, bibtex)
Martin Čmejrek, Jan Cuřín, Jiří Havelka, Jan Hajič, Vladislav Kuboň (2004): Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation. In: Proceedings of the 4th International Conference on Language Resources and Evaluation, pp. 1597-1600, European Language Resources Association, Lisboa, ISBN 2-9517408-1-6 (bibtex)
Anett Frank, Jiří Semecký (2004): Corpus-based Induction of an LFG Syntax-Semantics Interface for Frame Semantic Processing. In: Proceedings of the 5th International Conference on Linguistically Interpreted Corpora, LINC 2004 (bibtex)
Louise Guthrie, Roberto Basili, Fabio Zanzotto, Kalina Boncheva, Hamish Cunningham, David Guthrie, Jia Cui, Marco Cammisa, Jerry Cheng-Chieh Liu, Cassia Farria Martin, Kristiyan Haralambiev, Martin Holub, Klaus Machery, Frederick Jelinek (2004): Large Scale Experiments for Semantic Labeling of Noun Phrases in Raw Text. In: Proceedings of LREC 2004 (bibtex)
Jan Hajič (2004): History of Computational Linguistics. In: A Companion to Digital Humanities, pp. 79-87, Blackwell Publishing, ISBN 978-1-4051-0321-3 (url, bibtex)
Jan Hajič (2004): Disambiguation of Rich Inflection (Computational Morphology of Czech). In: , Linguistic Data Consortium, University of Pennsylvania, ISBN 80-246-0282-2 (bibtex)
Jan Hajič, Martin Holub, Marie Hučínová, Martin Pavlík, Pavel Pecina, Pavel Straňák, Pavel Šidák (2004): Validating and Improving the Czech WordNet via Lexico-Semantic Annotation of the Prague Dependency Treebank. In: Proceedings of LREC 2004 (bibtex)
Jan Hajič, Jarmila Panevová, Eva Buráňová, Zdeňka Urešová, Alevtina Bémová, Jan Štěpánek, Petr Pajas, Jiří Kárník (2004): Anotace na analytické rovině. Návod pro anotátory (technical report). In: (bibtex)
Jan Hajič, Otakar Smrž, Petr Zemánek, Petr Pajas, Jan Šnaidauf, Emanuel Beška, Jakub Kráčmar, Kamila Hassanová (2004): Prague Arabic Dependency Treebank 1.0. In: , Linguistic Data Consortium, University of Pennsylvania, ISBN 1-58563-319-4 (bibtex)
Jan Hajič, Otakar Smrž, Petr Zemánek, Jan Šnaidauf, Emanuel Beška (2004): Prague Arabic Dependency Treebank: Development in Data and Tools. In: Proceedings of the NEMLAR International Conference on Arabic Language Resources and Tools, pp. 110-117, ELDA, Cairo (pdf, local PDF, bibtex)
Jan Hajič, Zdeňka Urešová, Alevtina Bémová, Marie Kaplanová (2004): Anotace na tektogramatické rovině (úroveň 3) (technical report). In: (bibtex)
Jan Hajič, Zdeňka Urešová, Alevtina Bémová, Marie Kaplanová (2004): The Prague Dependency Treebank. Annotation on tectogrammatical level (technical report). In: (bibtex)
Eva Hajičová (2004): The Prague Dependency Treebank: How much of the underlying syntactic structure can be tagged automatically? In: Corpus Linguistics: Readings in a Widening Discipline, pp. 427-433, Continuum, London, UK, ISBN 0-8264-6013-5 (HB), ISBN 0-8264-8803-X (PB) (url, bibtex)
Eva Hajičová (2004): Kontrast v základu výpovědi ve světle Pražského závislostního korpusu. In: Korpus jako zdroj dat o češtině, pp. 103-112, Masarykova univerzita, Brno, ISBN 80-210-3595-1 (bibtex)
Eva Hajičová, Jiří Havelka, Petr Sgall (2004): Topic and focus, anaphoric relations and degrees of salience. In: Prague Linguistic Circle Papers / Travaux du cercle linguistique de Prague N.S. (in press), John Benjamins, Amsterdam (bibtex)
Eva Hajičová, Jiří Havelka, Petr Sgall, Kateřina Veselá, Daniel Zeman (2004): Issues of Projectivity in the Prague Dependency Treebank. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 81, pp. 5-22 (pdf, local PDF, bibtex)
Eva Hajičová, Petr Sgall (2004): Translation and Information Structure. In: Neue Perspektiven in der Übersetzung- und Dolmetscherwissenschaft, pp. 235-247, AKS-Verlag, Bochum (bibtex)
Eva Hajičová, Petr Sgall (2004): Degrees of Contrast and the Topic-Focus Articulation. In: , Language, Context and Cognition, ISSN 1866-8313, 1, pp. 1-13, Linguistic Data Consortium, University of Pennsylvania, ISBN 1-58563-280-5 (bibtex)
Jaroslava Hlaváčová (2004): Automatické rozpoznávání českých derivačních předpon. In: accepted for publication in proceedings CICLING 2005 (bibtex)
Jaroslava Hlaváčová, Jana Klímová (2004): Derivational Relations in Flectional Languages - Czech Case. In: Proceedings of LREC 2004, pp. 1239-1242, Lisbon (bibtex)
Martin Holub, Jiří Semecký, Jiří Diviš (2004): Searching for Topics in a Large Collection of Texts. In: Proceedings of ACL 2004 (bibtex)
Petr Homola (2004): On some aspects on machine translation among related languages. In: Proceedings of the Ninth ESSLLI Student Session (bibtex)
Petr Homola, Vladislav Kuboň (2004): A translation model for languages of acceding countries. In: Proceedings of the EAMT Workshop (bibtex)
Petr Homola, Jakub Piskorski (2004): How can shallow NLP help a machine translation system. In: Proceedings of the Conference Human Language Technologies - The Baltic Perspective (bibtex)
Petr Homola, Erika Rimkutė (2004): Mašininis vertimas tarp artimų kalbų. In: in press, Kaunas Technology University, Kaunas (bibtex)
Petr Homola, Béla Tolvaj (2004): Distributed translation memories and shallow MT. In: MIS 2004, MATFYZPRESS, Praha (bibtex)
David Klusáček (2004): Optimal Detection in Case of the Sparse Training Data. In: Proceedings of ODYSSEY04, pp. 97-104 (bibtex)
Veronika Kolářová (2004): Valence deverbálních substantiv: Některé specifické posuny v povrchových realizacích participantů. In: Korpus jako zdroj dat o češtině (in press) (bibtex)
Vladislav Kuboň (2004): Využití existujících zdrojů pro systém automatického překladu. In: Sborník ze semináře MIS 04, pp. 7, MFF UK, Praha (bibtex)
Lucie Kučová, Eva Hajičová (2004): Prague Dependency Treebank: Enrichment of the Underlying Syntactic Annotation by Coreferential Mark-Up. In: , 81, pp. 23-34 (bibtex)
Lucie Kučová, Eva Hajičová (2004): Coreferential Relations in the Prague Dependency Treebank. In: Proceedings of DAARC2004, pp. 97-102, Azores (bibtex)
Markéta Lopatková, Jarmila Panevová (2004): Valence vybraných skupin sloves (k některým slovesům dandi a recipiendi). In: , 5, pp. 348-356 (bibtex)
Markéta Lopatková, Martin Plátek, Vladislav Kuboň (2004): Závislostní redukční analýza přirozených jazyků. In: Proceedings of Informačné (inteligentné) technológie - aplikácie a teória, pp. 165-176, Univerzita Pavla Jozefa Šafárika, Košice, Slovakia, ISBN 80-7097-589-X (bibtex)
Karel Oliva, Drahomíra Doležalová (2004): O korpusu jako o zdroji jazykových dat. In: Korpus jako zdroj dat o češtině, pp. 7-10, Ústav českého jazyka, Filozofická fakulta, Masarykova univerzita, Brno, Brno, Czechia, ISBN 80-210-3595-1 (bibtex)
Jarmila Panevová (2004): Všeobecné aktanty očima Pražského závislostního korpusu (PZK). In: Korpus jako zdroj dat o češtině. Sborník konference ve Šlapanicích (in press) (bibtex)
Jakub Piskorski, Petr Homola, Małgorzata Marciniak, Agnieszka Mykowiecka, Adam Przepiórkowski, Marcin Woliński (2004): Information extraction for Polish using the SProUT platform. In: Intelligent Information Processing and Web Mining, pp. 227-236, Springer Verlag, Berlin / Heidelberg, ISBN 3-540-21331-7 (bibtex)
Markéta Pravdová (2004): Reklama jako zvláštní typ sdělování. In: Vztah langue a parole v perspektivě "interaktivního obratu" v lingvistickém zkoumání, pp. 248-253, UP Olomouc, Olomouc (bibtex)
Markéta Pravdová (2004): K způsobům persvaze v reklamních projevech. In: Beiträge der Europäischen Slavistischen Linguistik (POLYSLAV), pp. 131-136, Otto Sagner, München (bibtex)
Josef V. Psutka, Jan Hajič, William J. Byrne (2004): The Development of ASR for Slavic Languages in the MALACH Project. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2004, ISSN 1520-6149, pp. 749-752, Montreal, ISBN 0-7803-8484-9 (bibtex)
Josef V. Psutka, Pavel Ircing, Jan Hajič, Vlasta Radová, Josef Psutka, William J. Byrne (2004): Issues in annotation of the Czech spontaneous speech corpus in the MALACH project. In: Proceedings of the 4th International Conference on Language Resources and Evaluation LREC, pp. 607-610, Lisbon, ISBN 2-9517408-1-6 (bibtex)
Magda Razímová (2004): Funkce adverbálního dativu v hloubkové a povrchové stavbě české věty (diplomová práce FF UK Praha). In: , Linguistic Data Consortium, University of Pennsylvania, ISBN 1-58563-280-5 (bibtex)
Kiril Ribarov (2004): Automatic Building of a Dependency Tree - The Rule-Based Approach and Beyond (PhD thesis). In: , Linguistic Data Consortium, University of Pennsylvania, ISBN 1-58563-280-5 (bibtex)
Kiril Ribarov (2004): Towards Intelligent Written Cultural Heritage Processing - Lexical Processing. In: Proceedings of LREC 2004, Lisbon, Portugal (bibtex)
Kiril Ribarov, Jiří Bubník, Jiří Čelák, Vojtěch Janota, Alexandr Kara, Václav Novák, Tomáš Vondra (2004): ACT - Computer Processing of Written Cultural Heritage Sources. In: Proceedings of INFORUM 2004 Conference, Praha (bibtex)
Kiril Ribarov, Jiří Bubník, Jiří Čelák, Vojtěch Janota, Alexandr Kara, Václav Novák, Tomáš Vondra (2004): The Annotation Corpora of Text (ACT) Tool. In: Scripta & e-Scripta, ISSN 1312-238X, 2, pp. 49-78 (bibtex)
Petr Sgall (2004): Types of Languages and the Simple Pattern of the Core of Language. In: Linguistics Today - Facing a Greater Challenge (Plenary lectures from CIL 17), pp. 243-265, Benjamins, Amsterdam/Philadelphia (url, local ZIP, bibtex)
Petr Sgall (2004): K obohacování spisovné češtiny. In: , 5, pp. 77-85 (bibtex)
Petr Sgall (2004): Co pomůže češtině. O potřebě přejít od školské spisovnosti ke standardnímu vyjadřování. In: , Přítomnost, ISSN 1211-3883, léto 2004, pp. 52-53, Linguistic Data Consortium, University of Pennsylvania, ISBN 1-58563-280-5 (bibtex)
Petr Sgall, Jarmila Panevová (2004): Jak psát a nepsat česky. In: (bibtex)
Petr Sgall, Jarmila Panevová, Eva Hajičová (2004): Deep Syntactic Annotation: Tectogrammatical Representation and Beyond. In: HLT-NAACL 2004 Workshop: Frontiers in Corpus Annotation, pp. 32-38, Association for Computational Linguistics, Boston (pdf, local PDF, bibtex)
Otakar Smrž (2004): Finite State Morphology (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 81, pp. 73-76 (pdf, local PDF, bibtex)
Otakar Smrž, Petr Pajas (2004): MorphoTrees of Arabic and Their Annotation in the TrEd Environment. In: Proceedings of the NEMLAR International Conference on Arabic Language Resources and Tools, pp. 38-41, ELDA, Cairo (pdf, local PDF, bibtex)
Zdeňka Urešová (2004): The verbal valency in the Prague Dependency Treebank from the annotator's point of view. In: sborník přednášek JÚLŠ SAV (in press), Bratislava (bibtex)
Kateřina Veselá, Jiří Havelka, Eva Hajičová (2004): Condition of Projectivity in the Underlying Dependency Structures. In: Proceedings of Coling 2004, pp. 289-295, COLING, Geneva (bibtex)
Kateřina Veselá, Jiří Havelka, Eva Hajičová (2004): Annotators' Agreement: The Case of Topic-Focus Articulation. In: Proceedings of the 4th International Conference on Language Resources and Evaluation, pp. 2191-2194, European Language Resources Association, Lisboa, ISBN 2-9517408-1-6 (bibtex)
Kateřina Veselá, Nino Peterek, Eva Hajičová (2004): Prosodic Characteristics of Czech Contrastive Topic. In: Proceedings of 8th International Conference on Spoken Language Processing, Interspeech 2004, pp. 4, Sunjin Printing Co., Korea (bibtex)
Daniel Zeman (2004): Parsing with a Statistical Dependency Model (PhD thesis). In: (url, local PDF, bibtex)
Daniel Zeman (2004): Data-Oriented Parsing by Rens Bod, Remko Scha, and Khalil Sima'an (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 81, pp. 69-72 (bibtex)
Daniel Zeman (2004): Neprojektivity v Pražském závislostním korpusu (PDT) (technical report). In: (pdf, local PDF, bibtex)
Šárka Zikánová (2004): Slovosled a porozumění textu: predikát v humanistické češtině. In: Vztah langue a parole v perspektivě "interaktivního obratu" v lingvistickém zkoumání, pp. 106-111, UP Olomouc, Olomouc (pdf, local PDF, bibtex)
Zdeněk Žabokrtský, Markéta Lopatková (2004): Valency Frames of Czech Verbs in VALLEX 1.0. In: HLT-NAACL 2004 Workshop: Frontiers in Corpus Annotation, pp. 70-77, Association for Computational Linguistics, Boston (pdf, local PDF, bibtex)
Christian Bering, Witold Drożdżyński, Gregor Erbach, Clara Guasch, Petr Homola, Sabine Lehmann, Hong Li, Hans-Ulrich Krieger, Jakub Piskorski, Ulrich Schäfer, Atsuko Shimada, Melanie Siegel, Feiyu Xu, Dorothee Ziegler-Eisele (2003): Corpora and evaluation tools for multilingual named entity grammar development. In: Proceedings of Multilingual Corpora Workshop at Corpus Linguistics, pp. 42-52, Universität des Saarlandes, Saarbrücken, Germany (pdf, local PDF, bibtex)
Alena Böhmová, Jan Hajič, Eva Hajičová, Barbora Hladká (2003): The Prague Dependency Treebank: A Three-Level Annotation Scenario. In: Treebanks: Building and Using Syntactically Annotated Corpora, pp. 103-128, Kluwer Academic Publishers, Dordrecht, The Netherlands, ISBN 1-4020-1334-5 (url, bibtex)
Alena Böhmová, Eva Hajičová (2003): Large Language Data and the Degrees of Automation. In: Proceedings of XVII International Congress of Linguists, CD-ROM, pp. x1-x6, Matfyzpress, MFF UK, Prague, ISBN 80-86732-21-5 (bibtex)
Ondřej Bojar (2003): Building Subcorpora Suitable for Extraction of Lexico-Syntactic Information. In: Proceedings of the Student Session, ESSLLI, pp. 25-34 (pdf, local PDF, bibtex)
Ondřej Bojar (2003): AX - Systém pro automatizovanou extrakci lexikálně-syntaktických údajů. In: MIS 2003, pp. 15-24, MATFYZPRESS, Praha, ISBN 80-86732-22-3 (url, local PostScript, bibtex)
Ondřej Bojar (2003): Towards Automatic Extraction of Verb Frames. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 79-80, pp. 101-120 (bibtex)
Ondřej Bojar, Cyril Brom, Milan Hladík, Mikuláš Vejlupek, Vojtěch Toman, David Voňka (2003): ENTI - Simulátor přirozeného prostředí lidského světa. In: MIS 2003, pp. 3-14, MATFYZPRESS, Praha, ISBN 80-86732-22-3 (url, local PostScript, bibtex)
Giuseppe Camuglia, Monia Camuglia Ribarov, Kiril Ribarov (2003): Computer Processing of a Clopen Language System: Old-Church Slavonic. In: Linguistica Computazionale, ISSN 1824-1573, XVI-XVII, pp. 133-149 (bibtex)
Monia Camuglia, Kiril Ribarov (2003): Old-Church Slavonic in Codes. In: Computational Approaches to the study of Early and Modern Slavic Languages and Texts - Proceedings of the Electronic Description and Edition of Slavic Sources, pp. 201-204, Sofia (bibtex)
Silvie Cinková (2003): Belegsuche bei der lexikographischen Bearbeitung von selten gebrauchtem Wortschatz. In: Das Wort. Germanistisches Jahrbuch 2003, pp. 353-365, Deutscher akademischer Austauschdienst, Moskva (pdf, local PDF, bibtex)
Martin Čmejrek, Jan Cuřín, Jiří Havelka (2003): Treebanks in Machine Translation. In: Proceedings of The Second Workshop on Treebanks and Linguistic Theories, pp. 209-212, Vaxjo University Press, Vaxjo, Sweden, ISBN 91-7636-394-5 (bibtex)
Martin Čmejrek, Jan Cuřín, Jiří Havelka (2003): Czech-English Dependency-based Machine Translation. In: EACL 2003 Proceedings of the Conference, pp. 83-90, Association for Computational Linguistics, Budapest, Hungary, ISBN 1-932432-00-0 (bibtex)
Witold Drożdżyński, Petr Homola, Jakub Piskorski, Vytautas Zinkevičius (2003): Adapting SProUT to processing Baltic and Slavonic languages. In: Proceedings of Information Extraction for Slavonic and other Central and Eastern European Languages, Borovets (pdf, local PDF, bibtex)
Radu Gramatovici (2003): On the Recognition Power of Non-Expansive Go-Through Automata. In: Analele universitaţii Bucareşti Mathematica / Annals of Bucharest University, ISSN 1010-5433, vol. LII, no. 1, pp. 45-54 (bibtex)
Jan Hajič, Petr Homola, Vladislav Kuboň (2003): A Simple Multilingual Machine Translation System. In: Proceedings of Machine Translation Summit IX, pp. 157-164, New Orleans, USA (bibtex)
Jan Hajič, Václav Honetschläger (2003): Annotation Lexicons: Using the Valency Lexicon for Tectogrammatical Annotation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 79-80, pp. 61-86 (bibtex)
Jan Hajič, Vladislav Kuboň (2003): Tagging as a Key to Successful MT. In: MIS 2003, pp. 56-65, MATFYZPRESS, Praha, ISBN 80-86732-22-3 (bibtex)
Jan Hajič, Jarmila Panevová, Zdeňka Urešová, Alevtina Bémová, Veronika Kolářová, Petr Pajas (2003): PDT-VALLEX: Creating a Large-coverage Valency Lexicon for Treebank Annotation. In: Proceedings of The Second Workshop on Treebanks and Linguistic Theories, pp. 57-68, Vaxjo University Press, Vaxjo, Sweden, ISBN 91-7636-394-5 (pdf, local PDF, bibtex)
Jan Hajič, Josef Psutka, Pavel Ircing, William Byrne, Jiří Mírovský, Bhuvana Ramabhadran, Samuel Gustman, Josef V. Psutka, Vlasta Radová (2003): Language Model Data Selection for Czech ASR in the MALACH Project. In: ICASSP 2003, (submitted), Hong Kong (bibtex)
Jan Hajič, Zdeňka Urešová (2003): Linguistic Annotation: from Links to Cross-Layer Lexicons. In: Proceedings of The Second Workshop on Treebanks and Linguistic Theories, pp. 69-80, Vaxjo University Press, Vaxjo, Sweden, ISBN 91-7636-394-5 (local PDF, bibtex)
Eva Hajičová (2003): Contextual boundness and discourse patterns. In: Proceedings of XVII International Congress of Linguists, CD-ROM, pp. x1-x7, Matfyzpress, MFF UK, Prague, ISBN 80-86732-21-5 (bibtex)
Eva Hajičová (2003): Topic-focus articulation in the Czech National Corpus. In: Language and function. To the memory of Jan Firbas, pp. 185-194, John Benjamins, Amsterdam/Philadelphia (bibtex)
Eva Hajičová (2003): Syntactic theory and corpus annotation need each other. In: Zbornik povzetkov, 13. mednarodni slavistični kongres, 2. del, pp. 289, Mednarodni slavistični komite, Ljubljana (bibtex)
Eva Hajičová (2003): Aspects of discourse structure. In: Natural language processing between linguistic inquiry and system engineering, pp. 47-54, Editura Universitatii Alexandru Ioan Cuza, Iasi (bibtex)
Eva Hajičová (2003): Information structure and syntactic complexity. In: Investigations into formal Slavic linguistics, pp. 169-180, Peter Lang, Frankfurt/M. (bibtex)
Eva Hajičová, Jiří Havelka, Petr Sgall (2003): Discourse Semantics and the Salience of Referents. In: , pp. 127-140 (local DOC, bibtex)
Eva Hajičová, Petr Sgall (2003): Dependency syntax in Functional Generative Description. In: Dependenz und Valenz - Dependency and Valency, pp. 570-592, Walter de Gruyter, Berlin, New York (bibtex)
Eva Hajičová, Petr Sgall (2003): Information Structure, Translation and Discourse. In: Textologie und Translation, pp. 107-123, Gunter Narr, Tuebingen (bibtex)
Eva Hajičová, Petr Sgall, Eva Buráňová (2003): Topic-Focus Articulation and degrees of salience in the Prague Dependency Treebank. In: Formal Approaches to Function in Grammar. In honor of Eloise Jelinek, Arizona, pp. 165-177, John Benjamins, Amsterdam/Philadelphia (bibtex)
Eva Hajičová, Petr Sgall, Kateřina Veselá (2003): Information structure and contrastive topic. In: Formal approaches to Slavic linguistics. The Amherst Meeting 2002, pp. 219-234, Michigan Slavic Publications, Ann Arbor (bibtex)
Václav Hlaváč, Jaroslava Hlaváčová (2003): Rozpoznávání jako jeden z přístupů porozumění složitým jevům. In: Softwarové noviny, ISSN 1801-2345, vol. 72, no. XIV(6), pp. 70-72 (bibtex)
Tomáš Holan, Vladislav Kuboň, Martin Plátek, Karel Oliva (2003): A Theoretical Basis of an Architecture of a Shell of a Reasonably Robust Syntactic Analyser. In: Proceedings of Text, Speech and Dialogue 2003, pp. 58-65, Springer, Berlin/Heidelberg (bibtex)
Martin Holub (2003): A New Approach to Conceptual Document Indexing: Building a Hierarchical System of Concepts Based on Document Clusters. In: ISICT 2003 Proceedings of the International Symposium on Information and Communication Technologies, pp. 311-316, Trinity College Dublin, Dublin, Ireland, ISBN 0-9544145-2-7 (bibtex)
Martin Holub, Pavel Straňák (2003): Approaches to Building Semantic Lexicons. In: WDS'03 Proceedings of Contributed Papers, Part I, pp. 173-178, MATFYZPRESS, Prague, ISBN 80-86732-18-5 (bibtex)
Petr Homola, Erika Rimkutė (2003): Shallow machine translation - in between of two extremes. In: Proceedings of The Fifth International Tbilisi Symposium on Language, Logic and Computation, pp. !!!, (in press), Tbilisi (bibtex)
Václav Honetschläger (2003): Using a Czech Valency Lexicon for Annotation Support. In: Proceedings of Text, Speech and Dialogue 2003, pp. 120--126, Springer, Berlin/Heidelberg (url, local PostScript, bibtex)
Jana Klímová, Veronika Kolářová-Řezníčková (2003): Využití ČNK a PZK pro ověřování valenčních vlastností deverbativních substantiv se zabudovanou rolí. In: Slovanské jazyky v počítačovom spracovaní, pp. !!!, (in press), Bratislava (bibtex)
Jiří Kocanda (2003): Statistical Parsing. In: WDS'03 Proceedings of Contributed Papers, Part I, pp. 161--166, MATFYZPRESS, Prague, ISBN 80-86732-18-5 (bibtex)
Pavel Krbec, Petr Podveský, Jan Hajič (2003): Combination of a Hidden Tag Model and a Traditional N-gram Model: A Case Study in Czech Speech Recognition. In: EUROSPEECH 2003 Proceedings (8th European Conference on Speech Communication and Technology), pp. 2289--2291, ISCA, Geneva (bibtex)
Vladislav Kuboň (2003): Multilingual Aspects of Monolingual Corpora. In: Proceedings of Sprachtechnologie fuer die Multilinguale Kommunikation, GLDV-Fruehjahrstagung 2003, pp. 283--298, Gardez-Verlag, Sankt Augustin (bibtex)
Ivona Kučerová, Veronika Řezníčková (2003): Korpus jako výzva k syntaktické analýze. Poznámky k syntaktické derivaci deverbativních substantiv v češtině. In: Slavia, ISSN 0037-6736, 72, pp. 267-274 (bibtex)
Lucie Kučová, Veronika Kolářová, Zdeněk Žabokrtský, Petr Pajas, Oliver Čulo (2003): Annotation of Coreference in the Prague Dependency Treebank (technical report). In: (url, local PostScript, bibtex)
Břetislav Kupera (2003): Genetic Algorithms and Artificial Neural Network in Natural Language Processing. In: WDS'03 Proceedings of Contributed Papers, Part I, pp. 156--160, MATFYZPRESS, Prague, ISBN 80-86732-18-5 (bibtex)
Pavel Květoň (2003): Language for Grammatical Rules (technical report). In: (bibtex)
Markéta Lopatková (2003): Valency in the Prague Dependency Treebank: Building the Valency Lexicon. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 79--80, pp. 37--60 (pdf, local PDF, bibtex)
Markéta Lopatková (2003): Issue of Valency in Prague Dependency Treebank: Creating valency lexicon of Verbs. (Abstract). In: XVII International Congress of Linguists Abstracts, pp. 153-153, MFF UK, Praha (bibtex)
Markéta Lopatková (2003): O homonymii předložkových skupin v češtině (Co umí počítač?). In: (bibtex)
Markéta Lopatková, Zdeněk Žabokrtský (2003): Testování konzistence a úplnosti valenčního slovníku českých sloves. In: Proceedings of ITAT 2003, pp. 73-82, University of P. J. Šafařík, Košice (url, local PostScript, bibtex)
Markéta Lopatková, Zdeněk Žabokrtský, Karolína Skwarska, Václava Benešová (2003): VALLEX 1.0 Valency Lexicon of Czech Verbs (technical report). In: (url, bibtex)
Karel Oliva, Pavel Květoň, Roman Ondruška (2003): The Computational Complexity of Rule-Based Part-of-Speech Tagging. In: Proceedings of Text, Speech and Dialogue 2003, pp. 82--89, Springer, Berlin/Heidelberg (bibtex)
Roman Ondruška, Jarmila Panevová, Jan Štěpánek (2003): An Exploitation of the Prague Dependency Treebank: A Valency Case. In: Proceedings of the Workshop on Shallow Processing of Large Corpora (SProLaC 2003), pp. 69--77, UCREL, Lancaster University, Lancaster, ISBN 1-86220-134-X (url, local HTML, bibtex)
Jarmila Panevová (2003): Some Issues of Syntax and Semantics of Verbal Modifications. In: Proceedings MTT 2003, First International Conference on Meaning-Text Theory, pp. 139--146, Ecole Normale Supérieure, Paris (bibtex)
Jarmila Panevová (2003): O jednom typu kauzativní konstrukce v češtině. In: Etudes linguistiques Romano-Slaves offertes a Stanislaw Karolak, pp. 379--385, Oficyna Wydawnicza Edukacja, Cracovie, ISBN 83-917539-0-5 (pdf, local PDF, bibtex)
Jarmila Panevová (2003): Existuje chyba v syntaxi?. In: Sborník prací Filozoficko-přírodovědecké fakulty Slezské univerzity v Opavě, pp. 145--153, Slezská univerzita v Opavě, Opava (bibtex)
Martin Plátek, Markéta Lopatková, Karel Oliva (2003): Restarting Automata: Motivations and Applications. In: Proceedings of the workshop Petrinetze, pp. 90--96, Technische Universitaet Muenchen, Muenchen (url, local PostScript, bibtex)
Josef Psutka, Ilja Iljuchin, Pavel Ircing, Josef V. Psutka, Václav Trejbal, William J. Byrne, Jan Hajič, Samuel Gustman (2003): Building LVCSR System for Transcription of Spontaneously Pronounced Russian Testimonies in the MALACH Project: Initial Steps and First Results. In: Proceedings of Text, Speech and Dialogue 2003, pp. 327--332, Springer, Berlin/Heidelberg (bibtex)
Josef Psutka, Pavel Ircing, Josef V. Psutka, Vlasta Radová, William Byrne, Jan Hajič, Jiří Mírovský, Samuel Gustman (2003): Large Vocabulary ASR for Spontaneous Czech in the MALACH Project. In: EUROSPEECH 2003 Proceedings (8th European Conference on Speech Communication and Technology), pp. 1821--1824, ISCA, Geneva (bibtex)
Josef Psutka, Pavel Ircing, Josef V. Psutka, Vlasta Radová, William J. Byrne, Veera Venkataramani, Jan Hajič, Samuel Gustman (2003): Towards Automatic Transcription of Spontaneous Czech Speech in the MALACH Project. In: Proceedings of Text, Speech and Dialogue 2003, pp. 214--219, Springer, Berlin/Heidelberg (bibtex)
Owen Rambow, Bonnie J. Dorr, Karin Kipper, Ivona Kučerová, Martha Palmer (2003): Automatically Deriving Tectogrammatical Labels from Other Resources: A Comparison of Semantic Labels Across Frameworks. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 79--80, pp. 23--35 (bibtex)
Kiril Ribarov, Monia Camuglia (2003): Incorporation of Old-Church Slavonic Card-Files into a Corpus. In: Scripta & e-Scripta, ISSN 1312-238X, 1, pp. 65--74 (bibtex)
Veronika Řezníčková (2003): Czech Deverbal Nouns: Issues of Their Valency in Linear and Dependency Corpora. In: Proceedings of the Workshop on Shallow Processing of Large Corpora (SProLaC 2003), pp. 88--97, UCREL, Lancaster University, Lancaster, ISBN 1-86220-134-X (url, local HTML, bibtex)
Jiří Semecký (2003): Semantic Word Classes Extracted from Text Clusters. In: WDS'03 Proceedings of Contributed Papers, Part I, pp. 167--172, MATFYZPRESS, Prague, ISBN 80-86732-18-5 (bibtex)
Petr Sgall (2003): Introductory remarks (to the Workshop on Discourse Patterns). In: Proceedings of XVII International Congress of Linguists, CD-ROM, pp. x1-x5, Matfyzpress, MFF UK, Prague, ISBN 80-86732-21-5 (bibtex)
Petr Sgall (2003): Slavistics and the history of topic-focus studies. In: Investigations into formal Slavic linguistics, pp. 201--212, Peter Lang, Frankfurt/M. (bibtex)
Petr Sgall (2003): From Data to Speech. Language Generation in Context. In: Journal of Pragmatics, ISSN 0378-2166, vol. 35, no. 2, pp. 315--319 (pdf, local PDF, bibtex)
Petr Sgall (2003): From functional sentence perspective to topic-focus articulation. In: Language and function. To the memory of Jan Firbas, pp. 279--287, John Benjamins, Amsterdam/Philadelphia (bibtex)
Petr Sgall (2003): Lingvistické ohlédnutí za dvacátým stoletím. In: Český jazyk a literatura, ISSN 0009-0786, vol. 53, no. 4, pp. 157-164 (bibtex)
Petr Sgall (2003): Topic-Focus Articulation in Corpus Annotation. In: Natural language processing between linguistic inquiry and system engineering, pp. 95--101, Editura Universitatii Alexandru Ioan Cuza, Iasi (bibtex)
Petr Sgall (2003): Dynamics in the meaning of the sentence and of discourse. In: Meaning: The Dynamic Turn, pp. 169--184, Elsevier Science Ltd., Oxford (pdf, local PDF, bibtex)
Kateřina Veselá, Jiří Havelka (2003): Anotování aktuálního členění věty v Pražském závislostním korpusu (technical report). In: (bibtex)
Kateřina Veselá, Nino Peterek, Eva Hajičová (2003): Some observations on contrastive topic in Czech spontaneous speech. In: Proceedings of XVII International Congress of Linguists, CD-ROM, pp. !!!, Matfyzpress, MFF UK, Prague (bibtex)
Kateřina Veselá, Nino Peterek, Eva Hajičová (2003): Topic-Focus Articulation in PDT: Prosodic Characteristics of Contrastive Topic. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 79--80, pp. 5--22 (bibtex)
Zdeněk Žabokrtský (2003): Word Sense Disambiguation. The Case for Combinations of Knowledge Sources. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 79--80, pp. 151--153 (bibtex)
Zdeněk Žabokrtský, Otakar Smrž (2003): Arabic Syntactic Trees: from Constituency to Dependency. In: EACL 2003 Conference Companion, pp. 183--186, Association for Computational Linguistics, Budapest, Hungary, ISBN 1-932432-01-9 (pdf, local PDF, bibtex)
Martin Čmejrek, Jan Cuřín, Jiří Havelka (2002): Czech-English Dependency-based Machine Translation: Data Preparation for the Starting up Experiments. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 103--118 (bibtex)
Łukasz Dębowski, Jan Hajič, Vladislav Kuboň (2002): Testing the Limits -- Adding a New Language to an MT System. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 95--101 (bibtex)
Jan Hajič (2002): Tectogrammatical Representation: Towards a Minimal Transfer in Machine Translation. In: Proceedings of the 6th International Workshop on Tree Adjoining Grammars and Related Frameworks (TAG+6), pp. 216--226, Universita di Venezia, Venezia (pdf, local PDF, bibtex)
Jan Hajič, Martin Čmejrek, Jason Eisner, Gerald Penn, Owen Rambow, Drago Radev, Yuan Ding, Terry Koo, Kristen Parton (2002): Natural Language Generation in the Context of Machine Translation (technical report). In: (bibtex)
Jan Hajič, Douglas W. Oard, Dina Demner-Fushman, Bhuvana Ramabhadran, Samuel Gustman, William J. Byrne, Dagobert Soergel, Bonnie J. Dorr, Philip Resnik, Michael Picheny (2002): Cross-Language Access to Recorded Speech in the MALACH Project. In: Text, Speech and Dialogue. 5th International Conference, TSD 2002, pp. 57--64, Springer, Berlin/Heidelberg, ISBN 3-540-44129-8 (pdf, local PDF, bibtex)
Jan Hajič, Josef Psutka, Pavel Ircing, Bhuvana Ramabhadran, Samuel Gustman, William J. Byrne, Josef V. Psutka, Vlasta Radová (2002): Automatic Transcription of Czech Language Oral History in the MALACH Project: Resources and Initial Experiments. In: Text, Speech and Dialogue. 5th International Conference, TSD 2002, pp. 253--260, Springer, Berlin/Heidelberg, ISBN 3-540-44129-8 (url, local PostScript, bibtex)
Eva Hajičová (2002): Recenze knihy: Studie z korpusové lingvistiky (review). In: Slovo a slovesnost, ISSN 0037-7031, 63, pp. 65-68 (bibtex)
Eva Hajičová (2002): Theoretical description of language as a basis of corpus annotation: The case of Prague Dependency Treebank. In: Prague Linguistic Circle Papers, pp. 111--127, John Benjamins, Amsterdam/Philadelphia, ISBN 902725444 (bibtex)
Eva Hajičová, Ivona Kučerová (2002): Argument/Valency Structure in PropBank, LCS Database and Prague Dependency Treebank: A Comparative Pilot Study. In: Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002), pp. 846--851, ELRA (url, local TXT, bibtex)
Eva Hajičová, Petr Pajas, Kateřina Veselá (2002): Corpus Annotation on the Tectogrammatical Layer: Summarizing the First Stages of Evaluations. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 77, pp. 5--18 (bibtex)
Eva Hajičová, Petr Sgall (2002): Are Linguistic Frameworks Comparable?. In: Computational Linguistics for the New Millennium: Divergence or Synergy?, pp. 113--122, Peter Lang, Frankfurt-M. (bibtex)
Eva Hajičová, Petr Sgall (2002): Dependency syntax in Functional Generative Description. In: Festschrift for P. Hellwig, pp. xx--yy, Heidelberg (bibtex)
Eva Hajičová, Petr Sgall, J. Hronek, František Čermák, Karel Kučera, Věra Schmiedtová, Neil Bermel, Henry Kucera, Jaroslav Suk, Laura Janda, Charles E. Townsend (2002): Umějí děti česky?. In: Český jazyk a literatura, ISSN 0009-0786, vol. 52, no. 9-10, pp. 237-243 (bibtex)
Jiří Hana, Hana Hanová, Jan Hajič, Barbora Vidová Hladká, Emil Jeřábek (2002): Manual for Morphological Annotation (technical report). In: (url, bibtex)
Martin Holub (2002): Word Frequency Distributions. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 77, pp. 113--116 (bibtex)
Petr Homola (2002): Machine translation among Slavic languages. In: WDS 2002, pp. 39--43, MATFYZPRESS, Prague (bibtex)
Václav Honetschläger (2002): Analytical and Tectogrammatical Syntactic Parsing. In: WDS 2002, pp. 33--38, MATFYZPRESS, Prague (bibtex)
Ivona Kučerová (2002): Subjekt-predikátová shoda v češtině: univerzální, nebo specifická jazyková forma?. In: Čeština -- univerzália a specifika, pp. x01--x10, Lidové noviny, Prague (pdf, local PDF, bibtex)
Ivona Kučerová, Zdeněk Žabokrtský (2002): Transforming Penn Treebank Phrase Trees into (Praguian) Tectogrammatical Dependency Trees. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 77--94 (bibtex)
Pavel Květoň, Karel Oliva (2002): (Semi-)Automatic Detection of Errors in PoS-Tagged Corpora. In: Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), pp. 509--515, Morgan Kaufmann Publishers, San Francisco (bibtex)
Markéta Lopatková, Veronika Řezníčková, Zdeněk Žabokrtský (2002): Valency Lexicon for Czech: from Verbs to Nouns. In: Text, Speech and Dialogue. 5th International Conference, TSD 2002, pp. 147--150, Springer, Berlin/Heidelberg, ISBN 3-540-44129-8 (url, local PostScript, bibtex)
Markéta Lopatková, Zdeněk Žabokrtský, Karolína Skwarska, Václava Benešová (2002): Tektogramaticky anotovaný valenční slovník českých sloves (technical report). In: (url, bibtex)
Jiří Mírovský, Roman Ondruška (2002): NetGraph System: Searching through the Prague Dependency Treebank. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 77, pp. 101--104 (bibtex)
Karel Mokrý, Otakar Smrž (2002): External Tools Not Only for ArabTeX Documents. In: Proceedings of the International Symposium on the Processing of Arabic, pp. 161--165, Department of Arabic, Faculty of Arts, University of Manouba, Tunisia (url, local ZIP, bibtex)
Karel Oliva, Pavel Květoň (2002): Achieving an Almost Correct PoS-Tagged Corpus. In: Text, Speech and Dialogue. 5th International Conference, TSD 2002, pp. 19--26, Springer, Berlin/Heidelberg, ISBN 3-540-44129-8 (bibtex)
Karel Oliva, Pavel Květoň (2002): (German) Corpus representativity, bigrams, and PoS-tagging quality. In: KONVENS 2002, pp. x1--x8, DFKI, Saarbruecken (pdf, local PDF, bibtex)
Karel Oliva, Pavel Květoň (2002): Linguistically Motivated Bigrams in Part-of-Speech Tagging of Language Corpora. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 23--36 (bibtex)
Roman Ondruška, Jiří Mírovský, Daniel Průša (2002): Searching through Prague Dependency Treebank-Conception and Architecture. In: Proceedings of The First Workshop on Treebanks and Linguistic Theories, pp. 114--122, LML, Bulgarian Academy of Sciences and SfS, Tuebingen University, Sofia, Bulgaria and Tuebingen, Germany (pdf, bibtex)
Jarmila Panevová (2002): Několik poznámek k potřebě terminologických slovníků jazykovědných pojmů pro školy (na okraj slovníku P. Hausera a kol.). In: Školská jazykovědná terminologie, pp. 75--78, PeF UK (pdf, local PDF, bibtex)
Jarmila Panevová (2002): Corpus-based Grammar or Corpus Grammar-based?. In: Referát přednesený na zasedání Komise pro gramatickou stavbu slovanských jazyků, pp. xx--yy (bibtex)
Jarmila Panevová (2002): řada hesel publikace. In: Encyklopedický slovník češtiny, pp. xx--yy, Lidové noviny (bibtex)
Jarmila Panevová (2002): K valenci substantiv (s ohledem na jejich derivaci). In: Zbornik matice srpske za slavistiku, pp. 29--36, Novi Sad (pdf, local PDF, bibtex)
Jarmila Panevová (2002): Čím může bohemistice přispět současná počítačová lingvistika?. In: Sborník Slezské univerzity v Opavě, pp. x1--x7 (pdf, local PDF, bibtex)
Jarmila Panevová (2002): Towards a Relational - Perspective Approach to Syntactic Semantics. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 133--134 (bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (2002): K nové úrovni bohemistické práce: Využití anotovaného korpusu. Část 1. In: Slovo a slovesnost, ISSN 0037-7031, 63, pp. 161-177 (bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (2002): K nové úrovni bohemistické práce: Využití anotovaného korpusu. Část 2. In: Slovo a slovesnost, ISSN 0037-7031, 63, pp. 241-262 (bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (2002): Úvod do teoretické a počítačové lingvistiky I. -- Teoretická lingvistika. In: (bibtex)
Jarmila Panevová, Kateřina Marková (2002): Ješčo raz po povodu nulevyx elementov v strukture predloženija. In: Festschrift for Chrakovsky, pp. x01--x12 (pdf, local PDF, bibtex)
Jarmila Panevová, Kiril Ribarov (2002): Za poleznosta na elektronskite jazični korpusi (vrz primerot na eden tip na imenskata fraza vo češkiot jazik). In: Slavistički studii, pp. 307--316, Univerzitet Sv. Kiril i Metodij, Skopje, Macedonia (pdf, local PDF, bibtex)
Jarmila Panevová, Veronika Řezníčková, Zdeňka Urešová (2002): The Theory of Control Applied to the Prague Dependency Treebank (PDT). In: Proceedings of the 6th International Workshop on Tree Adjoining Grammars and Related Frameworks (TAG+6), pp. 175--180, Universita di Venezia, Venezia (pdf, local PDF, bibtex)
Pavel Pecina, Martin Holub (2002): Sémanticky signifikantní kolokace (technical report). In: (url, bibtex)
Martin Plátek, Radu Gramatovici (2002): D-trivial Dependency Grammars with Global Word-Order Restrictions (technical report). In: (url, bibtex)
Petr Podveský (2002): Finite-state machines in speech recognition. In: WDS 2002, pp. 27--32, MATFYZPRESS, Prague (bibtex)
Markéta Pravdová (2002): McSvět a místo člověka v něm. In: Studentská vědecká konference v Praze, pp. 418--431, Matfyzpress, UK Praha (bibtex)
Markéta Pravdová (2002): K povaze reklamního diskurzu. In: Naše řeč, ISSN 0027-8203, vol. 85, no. 3, pp. 177-189 (bibtex)
Kiril Ribarov (2002): Old Sources and Modern Procedures: Computer Processing of Old-Church Slavonic. In: Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002), pp. 1622--1626, ELRA (bibtex)
Kiril Ribarov (2002): On the Rule-Based Parsing of Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 77, pp. 77--99 (bibtex)
Kiril Ribarov, Otakar Smrž (2002): Searching for non-linearities in natural language. In: 7th Experimental Chaos Conference Abstract Booklet, pp. 63--63, UCSD, San Diego (pdf, local PDF, bibtex)
Veronika Řezníčková (2002): PDT: Two Steps in Tectogrammatical Annotation with respect to some Issues of Deletion. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 37--52 (bibtex)
Veronika Řezníčková, David Blažek (2002): Recenze: Beiträge der Europäischen Slavistischen Linguistik (review). In: Slovo a slovesnost, ISSN 0037-7031, 63, pp. 227-232 (bibtex)
Petr Savický, Jaroslava Hlaváčová (2002): Measures of Word Commonness. In: Journal of Quantitative Linguistics, ISSN 0929-6174, vol. 9, no. 3, pp. 215--231 (bibtex)
Petr Sgall (2002): Formalizing a Functional Description. In: Current Approaches to Formal Slavic Linguistics. Contributions of the Second European Conference on Formal Description of Slavic Languages FDSL II, pp. 299--306, Peter Lang, Frankfurt/M. (bibtex)
Petr Sgall (2002): Underlying Structures in Annotating Czech National Corpus. In: Current issues in formal Slavic linguistics, pp. 499--505, Peter Lang (2001), Frankfurt/M. (bibtex)
Petr Sgall (2002): Recenze: Czech through Russian (review). In: Slovo a slovesnost, ISSN 0037-7031, 63, pp. 135-137 (bibtex)
Petr Sgall (2002): The freedom of language. In: Prague Linguistic Circle Papers, pp. 309--329, John Benjamins, Amsterdam/Philadelphia, ISBN 902725444 (bibtex)
Petr Sgall (2002): Spoken Czech revisited. In: Where One's Tongue Rules Well. A Festschrift for Charles E. Townsend, pp. 299--309, Slavica Publishers, Columbus (bibtex)
Petr Sgall (2002): Moravská a pražská (malostranská) koncepce aktuálního členění. In: Čeština -- univerzália a specifika, pp. 51--58, Lidové noviny, Prague (bibtex)
Petr Sgall (2002): Češskij jazyk v povsednevnom razgovore. In: Vstreči étničeskix kul'tur v zerkale jazyka v sopostavitel'nom lingvokul'turnom aspekte, pp. 311--329, Nauka, Moskva (pdf, local PDF, bibtex)
Petr Sgall, Alena Böhmová (2002): The Simple Core and the Complex Periphery of Natural Language -- a Formal and a Computational View. In: Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), pp. 925--931, Morgan Kaufmann Publishers, San Francisco (pdf, local PDF, bibtex)
Petr Sgall, Zdeněk Žabokrtský, Sašo Džeroski (2002): A Machine Learning Approach to Automatic Functor Assignment in the Prague Dependency Treebank. In: Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002), pp. 1513--1520, ELRA (bibtex)
Otakar Smrž, Jan Šnaidauf, Petr Zemánek (2002): Prague Dependency Treebank for Arabic: Multi-Level Annotation of Arabic Corpus. In: Proceedings of the International Symposium on the Processing of Arabic, pp. 147--155, Department of Arabic, Faculty of Arts, University of Manouba, Tunisia (pdf, local PDF, bibtex)
Otakar Smrž, Petr Zemánek (2002): Sherds from an Arabic Treebanking Mosaic. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 63--76 (pdf, local PDF, bibtex)
Markéta Straňáková-Lopatková, Zdeněk Žabokrtský (2002): Valency Dictionary of Czech Verbs: Complex Tectogrammatical Annotation. In: Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002), pp. 949--956, ELRA (url, local PostScript, bibtex)
Markéta Straňáková-Lopatková, Zdeněk Žabokrtský (2002): Valenční slovník stokrát jinak: co je pod povrchem?. In: Čeština -- univerzália a specifika, pp. 361--363, Lidové noviny, Prague (pdf, local PDF, bibtex)
Jan Štěpánek (2002): Building on Frege. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 139--142 (bibtex)
Barbora Vidová Hladká (2002): Pražský závislostní korpus aneb Co tady před padesáti lety nebylo. In: Pokroky matematiky, fyziky a astronomie, ISSN 0032-2423, vol. 47, no. 4, pp. 298--306 (pdf, local PDF, bibtex)
Barbora Vidová Hladká, Kiril Ribarov (2002): Exploring Textual Data. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 135--137 (bibtex)
Daniel Zeman (2002): How to Decrease the Performance of a Statistical Parser. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 53-62 (local PDF, bibtex)
Daniel Zeman (2002): Can Subcategorization Help a Statistical Dependency Parser?. In: Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), pp. 1156-1162, Morgan Kaufmann Publishers, San Francisco (url, local ZIP, bibtex)
Alena Böhmová (2001): Automatic Procedures in Tectogrammatical Tagging. In: PBML 76, MFF UK, Praha (bibtex)
Silvie Cinková (2001): /Sýnihefti sagnorðabókar/: andere Betrachtungsweise der lexikographischen Bearbeitung der Verben. In: Germanistica Pragensia, ISSN 0567-8269, XVII, pp. 133--139 (bibtex)
Jan Cuřín (2001): Jak počítače umějí český jazyk. In: Softwarové noviny, September 2001, no. 6, vol. XII, pp. 106-107 (bibtex)
Martin Čmejrek, Jan Cuřín (2001): Automatic Extraction of Terminological Lexicon from Czech-English Parallel Texts. In: International Journal of Corpus Linguistics Special Issue 2001, John Benjamins Publishing Co. (bibtex)
Jan Hajič (2001): Statistické modelování a automatická analýza přirozeného jazyka (morfologie, syntax, překlad). In: Slovenčina a čeština v počítačovom spracovaní (zborník referátov zo seminára Bratislava 26.-27.10.2001, ed. A. Jarošová), VEDA, vydavateľstvo SAV, Bratislava, ISBN 80-224-0692-9 (url, local DOC, bibtex)
Jan Hajič, Pavel Krbec, Karel Oliva, Pavel Květoň, Vladimír Petkevič (2001): Serial Combination of Rules and Statistics: A Case Study in Czech Tagging. In: Proceedings of ACL 2001, Association for Computational Linguistics, Toulouse, France (bibtex)
Jan Hajič, Barbora Vidová Hladká, Petr Pajas (2001): The Prague Dependency Treebank: Annotation Structure and Support. In: Proceedings of the IRCS Workshop on Linguistic Databases, pp. 105--114, University of Pennsylvania, Philadelphia, USA (url, local PostScript, bibtex)
Jan Hajič, Barbora Vidová Hladká, Jarmila Panevová, Eva Hajičová, Petr Sgall, Petr Pajas (2001): Prague Dependency Treebank 1.0 (Final Production Label). In: CD-ROM, CAT: LDC2001T10, ISBN 1-58563-212-0 (bibtex)
Eva Hajičová (2001): Čeština a počítače (Abstrakt). In: Proceedings of the ZNALOSTI 2001 conference, 19-21.6.2001, VŠE, Praha, pp. 307 (bibtex)
Eva Hajičová (2001): Syntaktický výzkum nad Českým národním korpusem. In: Čeština - univerzália a specifika 3 (eds. Z. Hladká, P. Karlík), MU Brno, ISBN 80-210-2532-8, pp. 173-181 (bibtex)
Eva Hajičová (2001): Possibilities and Limits of Optimality in Topic-Focus Articulation. In: Current issues in formal Slavic linguistics, pp. 385--394, Peter Lang, Frankfurt/M. (bibtex)
Eva Hajičová (2001): Information Structure and Syntactic Complexity. In: Proceedings of FDSL 4, Potsdam (in press) (bibtex)
Eva Hajičová, Jan Hajič, Barbora Vidová Hladká, Martin Holub, Petr Pajas, Veronika Řezníčková, Petr Sgall (2001): The Current Status of the Prague Dependency Treebank. In: TSD2001 Proceedings (eds. V. Matoušek, P. Mautner, R. Mouček, K. Taušer), LNAI 2166, Springer-Verlag Berlin Heidelberg New York, ISBN 3-540-42557-8, pp. 92-69 (url, local PostScript, bibtex)
Eva Hajičová, Petr Sgall (2001): A reusable corpus needs syntactic annotations: Prague Dependency Treebank. In: (in press), Lancaster, pp. 37-48 (bibtex)
Eva Hajičová, Petr Sgall (2001): Topic-focus and salience. In: Proceedings of 39th Annual Meeting of the Association for Computational Linguistics, 10th Conference of the European Chapter. Proceedings, pp. 268--273, Toulouse: CNRS (bibtex)
Jiří Hana (2001): The AGILE System. In: PBML 75, UK Praha, pp. 5-28 (url, local HTML, bibtex)
Jiří Havelka (2001): Reference and Anaphoric Relations. Studies in Linguistics and Philosophy 72, Kluwer Academic Publishers: Dordrecht, The Netherlands. ISBN 0-7923-6070-2. In: PBML 75, UK Praha, pp. 5-28 (bibtex)
Martin Holub, Pavel Míka (2001): MATES -- An Experimental Linguistic Database System. In: Proceedings of the IRCS Workshop on Linguistic Databases, pp. 134--140, University of Pennsylvania, Philadelphia, USA (url, local PostScript, bibtex)
Vladislav Kuboň (2001): Problems of Robust Parsing of Czech. In: doctoral dissertation, MFF UK (bibtex)
Vladislav Kuboň (2001): A Method for Analyzing Clause Complexity. In: PBML 75, UK Praha, pp. 5-28 (bibtex)
Vladislav Kuboň, Tomáš Holan, Karel Oliva, Martin Plátek (2001): Word-Order Relaxations & Restrictions within a Dependency Grammar (technical report). In: Proceedings of International Workshop on Parsing Technologies, Tsinghua University Press, Beijing, China, ISBN 7-302-04925-4 (bibtex)
Vladislav Kuboň, Martin Plátek (2001): A Method of Accurate Robust Parsing for Czech. In: TSD2001 Proceedings (eds. V. Matoušek, P. Mautner, R. Mouček, K. Taušer), LNAI 2166, Springer-Verlag Berlin Heidelberg New York, ISBN 3-540-42557-8, pp. 92-69 (bibtex)
Ivona Kučerová (2001): Teoretická lingvistika a statistické zpracování přirozeného jazyka. In: sborník řady Linguae bohemicae studentinum IV., (in press) (bibtex)
Jarmila Panevová (2001): Building an Electronic Language Database Nowadays. In: Festschrift for Ferenc Papp, Debrecen (in press) (bibtex)
Jarmila Panevová (2001): Problémy reflexivního zájmena v češtině. In: Přednášky z XLIV. běhu Letní školy slovanských studií (ed. J. Nehasil), UK v Praze, FF, Praha, ISBN 80-7308-004-4, pp. 81-88 (bibtex)
Jarmila Panevová (2001): Zpráva o spolupráci Jazykovědného sdružení ČR s MŠMT ČR. In: Jazykovědné aktuality 37, no. 3-4, ISSN 1212-5326, pp. 28-34 (bibtex)
Jarmila Panevová (2001): Některé typy chyb ve stylu odborném a žurnalistickém a možnost jejich automatického odstranění. In: TERMINA 2000, Sborník příspěvků z II. konference 1996 a III. konference 2000, pp. 40--47, Galén, Praha, ISBN 80-7202-105-X (bibtex)
Jarmila Panevová (2001): Upotreblenie glagoľnych vremen v nekotorych tipach složnopodčinennych predloženij (na materiale češskogo jazyka). In: Kategorii glagola i strukturi predloženij, Sankt-Peterburg (in press) (pdf, local PDF, bibtex)
Jarmila Panevová (2001): Valency Frames: Extension and Reexamination. In: Festschrift fuer Andrzej Boguslawski (eds. V.S. Chrakovskij, M. Grochowski, G. Hentschel), Studia Slavica Oldenburgensia 9, Bibliotheks- und Informationssystem, Oldenburg, ISBN 3-8142-0796-3, pp. 357-368 (bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (2001): Manuál pro tektogramatické značkování (III. verze, prosinec 2001) (technical report). In: (url, local DOC, bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (2001): Tectogrammatics in corpus tagging. In: Perspectives on Semantics, Pragmatics, and Discourse: A Festschrift for Ferenc Kiefer (eds. I. Kenesei, R. M. Harnish), Pragmatics and Beyond New Series, Vol. 90, John Benjamins Publishing Company, Amsterdam/Philadelphia, ISBN 90 272 5109 6, pp. 294-299 (bibtex)
Jarmila Panevová, Veronika Řezníčková (2001): K možnému pojetí všeobecnosti aktantu. In: Čeština - univerzália a specifika 3. Sborník konference ve Šlapanicích u Brna, 22.-24.11.2000 (eds. Zdeňka Hladká, Petr Karlík), MU Brno, ISBN 80-210-2532-8, pp. 139-146 (bibtex)
Nino Peterek (2001): Recent Methods of Prosody Analysis. In: PBML 76, MFF UK, Praha (bibtex)
Petr Sgall (2001): Aspect, Eventuality Types and Nominal Reference. Garland Publishing, New York - London 1999. In: Slovo a slovesnost, ISSN 0037-7031, vol. 62, no. 2, pp. 126--130 (bibtex)
Petr Sgall (2001): Structural and Formal Linguistics in Prague (Preface). In: Towards a Relational - Perspective Approach to Syntactic Semantics, ISBN 7-107-14429-4, pp. xxiii-xxxxviii (bibtex)
Petr Sgall (2001): A remark on Semantics and Pragmatics in Natural Language. In: PBML 76, pp. 13--22, MFF UK, Praha (bibtex)
Petr Sgall (2001): Linguistics: Information Structure. In: International Encyclopaedia of the Social and Behavioral Sciences (eds. N. J. Smelser and P. B. Baltes), pp. 8939--8942, Pergamon, Oxford (bibtex)
Petr Sgall (2001): On the interface between langue and parole (abstract). In: Naturally! Linguistic studies in honour of W. U. Dressler presented on the occasion of his 60th birthday (eds. Ch. Schaner-Wolles, J. Rennison, F. Neubarth), Rosenberg & Sellier, Turin, pp. 469-474 (bibtex)
Petr Sgall (2001): Ohlédnutí pražského lingvisty za dvacátým stoletím. In: Slovo a slovesnost 62, pp. 241--257 (url, local DOC, bibtex)
Petr Sgall (2001): Pravdivost jako východisko sémantiky (Truth as a starting point of semantics). In: Úvahy o pravdivosti (ed. Jiří Nosek), pp. 123--134, Prague:Filosofia (bibtex)
Petr Sgall (2001): Volnost jako univerzální vlastnost jazyka. In: Čeština -- univerzália a specifika 3, pp. 49--57, Masarykova univerzita, Brno, ISBN 80-210-2532-8 (bibtex)
Petr Sgall (2001): Bedeutung, Sinn und Bezeichnung. In: Festschrift fuer Andrzej Boguslawski (eds. V.S. Chrakovskij, M.Grochowski, G.Hentschel), Studia Slavica Oldenburgensia 9, Bibliotheks- und Informationssystem, Oldenburg, ISBN 3-8142-0796-3, pp.357-368 (bibtex)
Petr Sgall (2001): Etničeskij jazyk. Opyt funkcional'noj differenciacii. Specimina philologiae Slavicae, vol. 121, 1999. In: Slovo a slovesnost, ISSN 0037-7031, vol. 62, no. 1, pp. 71--74 (bibtex)
Petr Sgall (2001): Functional Generative Description, Word Order and Focus. In: Theoretical Linguistics 27, pp.3-19 (bibtex)
Markéta Straňáková-Lopatková (2001): Ambiguity of Prepositional Groups: Classification, Criteria and Method for Automatic Processing. In: On Prepositions (eds. L. Šaric, D. F. Reindl), Studia Slavica Oldenburgensia 8, Bibliotheks- und Informationssystem, Oldenburg, pp. 263-282 (url, local DOC, bibtex)
Markéta Straňáková-Lopatková (2001): Homonymie předložkových skupin v češtině a možnost jejího automatického zpracování (technical report). In: (url, bibtex)
Markéta Straňáková-Lopatková (2001): Ambiguity of Prepositional Groups and the Possibility of Its Automatic Processing. In: Summary of Doctoral Thesis, pp. 1-21, MFF UK, Praha, 2001 (bibtex)
Markéta Straňáková-Lopatková (2001): Některé typy syntaktické homonymie (z hlediska možnosti automatického zpracování). In: Čeština - univerzália a specifika 3, Sborník konference ve Šlapanicích u Brna, 22.-24.11.2000 (eds. Z. Hladká, P. Karlík), MU Brno, ISBN 80-210-2532-8, pp. 183-195 (bibtex)
Markéta Straňáková-Lopatková (2001): Homonymie předložkových skupin a možnost jejího automatického zpracování. In: disertační práce MFF UK (url, local ZIP, bibtex)
Markéta Straňáková-Lopatková, Ivan Kopeček, Karel Pala (2001): Ambiguity Problems in Human-Computer Interaction. In: Proceedings of the conference UAHCI, vol. 3 (ed. C. Stephanidis), LEA, Mahwah, New Jersey, ISBN 0-8058-3609-8, pp. 486-490 (bibtex)
Markéta Straňáková-Lopatková, Hana Skoumalová, Zdeněk Žabokrtský (2001): Enhancing the Valency Dictionary of Czech Verbs: Tectogrammatical Annotation. In: TSD2001 Proceedings (eds. V. Matoušek, P. Mautner, R. Mouček, K. Taušer), LNAI 2166, Springer-Verlag Berlin Heidelberg New York, ISBN 3-540-42557-8, pp. 92-69 (url, local PostScript, bibtex)
Jan Štěpánek (2001): CD-ROM Prague Dependency Treebank 1.0. Institute of Formal and Applied Linguistics & Linguistic Data Lab. Published by Linguistic Data Consortium, University of Pennsylvania. In: PBML 76, MFF UK, Praha (bibtex)
Barbora Vidová Hladká, Alena Böhmová, Kiril Ribarov (2001): Corpus Linguistics. Investigating Language Structure and Use. Cambridge Approaches to Linguistics. Cambridge University Press: Cambridge 1998. In: PBML 76, MFF UK, Praha (bibtex)
Daniel Zeman (2001): How Much Will a RE-based Preprocessor Help a Statistical Parser?. In: Proceedings of International Workshop on Parsing Technologies, pp. 253-256, Tsinghua University Press, Beijing, China, ISBN 7-302-04925-4 (url, local RTF, bibtex)
Daniel Zeman (2001): Parsing with Regular Expressions: A Minute to Learn, a Lifetime to Master. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 75, pp. 29-37 (url, bibtex)
Zdeněk Žabokrtský (2001): Automatic Functor Assignment in the Prague Dependency Treebank (technical report). In: (url, bibtex)
Zdeněk Žabokrtský, Martine De Cock, Etienne E. Kerre (2001): Representing Linguistic Hedges by L-Fuzzy Modifiers. In: Proceedings of CIMCA’01 (International Conference on Computational Intelligence for Modelling Control and Automation), pp. 64-72 (bibtex)
Zdeněk Žabokrtský, Mirko Navara (2001): How to Make Constrained Fuzzy Arithmetic Efficient. In: Soft Computing, (in press) (bibtex)
Jan Cuřín (2000): The experimental results from NSF Workshop'99 CLSP Johns Hopkins University. Part I.: Czech/English Statistical Machine Translation. In: PBML 74, Charles University, Prague (bibtex)
Martin Čmejrek (2000): Statistical Modelling in Machine Translation - Segmentation of Parallel Texts. In: WDS'00, Proceedings, part IV. (ed. J. Šafránková), pp. 479--482, MatfyzPress, vydavatelství MFF UK, ISBN 80-85863-59-6 (bibtex)
Jan Hajič (2000): Morphological Tagging: Data vs. Dictionaries. In: 6th ANLP Conference / 1st NAACL Meeting. Proceedings, pp. 94--101, Seattle, Washington, ISBN 1-55860-704-8 (bibtex)
Jan Hajič, Pavel Krbec, Josef Psutka, Pavel Ircing, William Byrne (2000): Morpheme Based Language Models for Speech Recognition of Czech. In: TSD2000, Proceedings (eds. P. Sojka, I. Kopeček, K. Pala), pp. 211--216, Lecture Notes in Artificial Intelligence vol.1902, Springer, ISBN 3-540-41042-2 (bibtex)
Jan Hajič, Vladislav Kuboň, Jan Hric (2000): Machine Translation of Very Close Languages. In: 6th ANLP Conference / 1st NAACL Meeting. Proceedings, pp. 7--12, Seattle, Washington, ISBN 1-55860-704-8 (bibtex)
Jan Hajič, Vladislav Kuboň, Jan Hric (2000): Česílko - an MT system for closely related languages. In: ACL2000, Tutorial Abstracts and Demonstration Notes, pp. 7--8, ACL, ISBN 1-55860-730-7 (bibtex)
Jan Hajič, Barbora Vidová Hladká, Alena Böhmová, Eva Hajičová (2000): The Prague Dependency Treebank: A Three-Level Annotation Scenario. In: Treebanks: Building and Using Syntactically Annotated Corpora (ed. Anne Abeille), Kluwer Academic Publishers, in press (bibtex)
Eva Hajičová (2000): Dependency as a universal ingredient in syntactic theories. In: a volume published jointly by the Paris university INALCO and ÚFAL MFF UK, in press (bibtex)
Eva Hajičová (2000): A TFA-Based Glance at Optimality Theory. In: (url, local DOC, bibtex)
Eva Hajičová (2000): Mysteries of Order. In: Člověk a jeho jazyk; 1. Jazyk jako fenomén kultury (na počesť profesora Jána Horeckého) (ed. Kl. Buzássyová), pp. 260--268, Veda, vydavateľstvo Slovenskej akadémie vied, Bratislava, ISBN 80-224-0641-4 (url, local DOC, bibtex)
Eva Hajičová (2000): Dependency-Based Underlying-Structure Tagging of a Very Large Czech Corpus. In: Special issue of TAL journal, Grammaires de Dépendance / Dependency Grammars (ed. Sylvain Kahane), pp. 57--78, Hermes (url, local DOC, bibtex)
Eva Hajičová (2000): Item Ordering in the Sentence. In: Proceedings of LP'98 (eds. O.Fujimura, B.D.Joseph, B. Palek), pp. 361--371, The Karolinum Press, Praha, ISBN 80-246-0016-1 (bibtex)
Eva Hajičová (2000): The International Scene. In: Rudiments of English Linguistics (ed. Pavol Štekauer), 1 chapter, pp. 196--225, Slovacontact, Prešov, ISBN 80-88876-04-4 (bibtex)
Eva Hajičová (2000): Výzkum syntaxe nad ČNK. In: Čeština - univerzália a specifika 3. Sborník konference ve Šlapanicích u Brna, 22.-24.11.2000 (ed. Zdeňka Hladká, Petr Karlík) (bibtex)
Eva Hajičová (2000): Language Resources and Evaluation Conference in Granada: a View from Prague. In: ELRA Newsletter (bibtex)
Eva Hajičová (2000): Focalizers and Their Status in the Topic/Focus Articulation of the Sentence. In: (bibtex)
Eva Hajičová (2000): A Praguian view on Optimality Theory. In: Linguistica Pragensia, Vol.X/2, pp. 59--72, ÚJČ AV ČR, ISSN 0862-8432 (bibtex)
Eva Hajičová (2000): How Many Topics/Foci. In: Linguistics and Language Studies; Exploring language from different perspectives (eds. I. Kovačič, M. Milojevic-Sheppard, S. Orel-Kos, J. Orešnik), pp. 9--20, Filozofska fakulteta Univerze v Ljubljani, Ljubljana, ISBN 961-227-071-6 (bibtex)
Eva Hajičová (2000): Teorie optimality a aktuální členění věty. In: Slovo a slovesnost 61, pp. 161--169 (bibtex)
Eva Hajičová, Markéta Ceplová (2000): Deletions and their reconstruction in tectogrammatical syntactic tagging of very large corpora. In: Proceedings of the 18th International Conference on Computational Linguistics (COLING), pp. 278--284, Universität des Saarlandes, Saarbrücken, Germany, ISBN 1-55860-717-X (bibtex)
Eva Hajičová, Petr Pajas (2000): Evaluation of Tectogrammatical Annotation of PDT. In: TSD2000, Proceedings (eds. P. Sojka, I. Kopeček, K. Pala), pp. 75--80, Lecture Notes in Artificial Intelligence vol.1902, Springer, ISBN 3-540-41042-2 (bibtex)
Eva Hajičová, Petr Sgall (2000): Semantico-Syntactic Tagging of Very Large Corpora: the Case of Restoration of Nodes on the Underlying Level. In: LREC (2nd Intern. Conference), vol.I (eds. M. Gavrilidou, G. Carayannis, S. Markantonatou, S. Piperidis, G. Stainhaouer), pp. 95--98, Athens, Greece (bibtex)
Eva Hajičová, Petr Sgall (2000): Dependency, Coordination, and Projectivity. In: Slovo v tekste i v slovare (sbornik statej k semidesjatiletiju akademika Ju. D. Apresjana), pp. 456--466, Jazyki russkoj kul'tury, Moskva, ISBN 5-7859-0199-4 (bibtex)
Eva Hajičová, Petr Sgall, Eva Buráňová (2000): Tagging of very large corpora: Topic-Focus Articulation. In: Proceedings of the 18th International Conference on Computational Linguistics (COLING), pp. 139--144, Universität des Saarlandes, Saarbrücken, Germany, ISBN 1-55860-717-X (bibtex)
Jiří Havelka (2000): Lexical Semantics and Knowledge Representation in Multilingual Text Generation. In: PBML 73-74 (bibtex)
Jiří Havelka (2000): Topic-Focus Articulation and Discourse Theories. In: WDS'00, Proceedings, part IV. (ed. J. Šafránková), pp. 472--475, MatfyzPress, vydavatelství MFF UK, ISBN 80-85863-59-6 (bibtex)
Martin Holub (2000): Use of Dependency Microcontexts in IR. In: SOFSEM (bibtex)
Martin Holub (2000): Context Information Mining I., Motivation and Syntactic Issues. In: PBML (bibtex)
Martin Holub, Alena Böhmová (2000): Use of Dependency Tree Structures for the Microcontext Extraction. In: ACL2000, Workshop on Recent Advances in Natural Language Processing and Information Retrieval (eds. Judith Klavans and Julio Gonzalo), pp. 23--33 (local PDF, bibtex)
Geert-Jan M. Kruijff (2000): Categories, Constructions, and Dependency Relations. In: TSD2000, Proceedings (eds. P. Sojka, I. Kopeček, K. Pala), pp. 51--59, Lecture Notes in Artificial Intelligence vol.1902, Springer, ISBN 3-540-41042-2 (bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová (2000): Aggregation and Contextual Reference in Automatically Generated Instructions. In: TSD2000, Proceedings (eds. P. Sojka, I. Kopeček, K. Pala), pp. 87--92, Lecture Notes in Artificial Intelligence vol.1902, Springer, ISBN 3-540-41042-2 (url, local HTML, bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, John Bateman (2000): Contextually Appropriate Ordering of Nominal Expressions. In: Information sharing (eds. Kees van Deemter, Rodger Kibble) (local HTML, bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, John Bateman (2000): Generation of Contextually Appropriate Word Order. In: contributed to: Information Sharing (eds. Kees van Deemter, Rodger Kibble), 20.10. (url, local HTML, bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, Danail Dochev, Ivan Hadjiiliev, Lena Sokolova, Tony Hartley (2000): Text Structuring Specification for the Final Prototype. AGILE project deliverable Work Package 5: TEXS3-Cu, TEXS3-Ru, TEXS3-Bg. In: Technical Report, p. 59, January (bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, Jiří Hana, Martin Čmejrek, Serge Sharoff, Danail Dochev (2000): Evaluation of the intermediate prototype. AGILE project deliverable Work Package 9:EVALI-Bu, EVALI-Cz, EVALI-Ru. In: Technical Report, p. 37, March (bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, Jiří Hana, Serge Sharoff, Danail Dochev, Elke Teich, John Bateman, Lena Sokolova, Kamenka Staykova (2000): Formal specification of full grammar models and Implementation of tactical generation resources for all three languages in a Final Prototype. AGILE project deliverable Work Package 6: SPEC3-Bg, SPEC3. In: Technical report, p. 71, July (bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, Jiří Hana, Hana Skoumalová, Serge Sharoff, Elke Teich, John Bateman, Lena Sokolova, Tony Hartley, Kamenka Staykova (2000): A Multilingual System for Text Generation in Three Slavic Languages. In: Proceedings of the 18th International Conference on Computational Linguistics (COLING), pp. 474--480, Universität des Saarlandes, Saarbrücken, Germany, ISBN 1-55860-717-X (url, local HTML, bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, Serge Sharoff, Ivan Hadjiiliev, Lena Sokolova, Michael Boldasov (2000): Flexible text structuring for the final prototype (documentation to the software). AGILE project deliverable Work Package 5: TEXM3-Cu, TEXM3-Ru, TEXM3-Bg. In: Technical Report, p. 92, June (bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová, Hana Skoumalová, Serge Sharoff, Elke Teich, John Bateman (2000): Resources for multilingual text generation in three Slavic languages. In: LREC (2nd Intern. Conference), vol.III (eds. M. Gavrilidou, G. Carayannis, S. Markantonatou, S. Piperidis, G. Stainhaouer), pp. 1763--1768, Athens, Greece (url, local HTML, bibtex)
Geert-Jan M. Kruijff, Shravan Vasishth (2000): Processing as abduction+deduction: A sentence processing model. In: Proceedings of the ESSLLI 2000 Wsh. on Linguistic Theory and Grammar Implementations, Birmingham UK (bibtex)
Geert-Jan M. Kruijff, Shravan Vasishth (2000): Processing as abduction: A sentence processing model. In: Proceedings of the Japanese Cognitive Society Meeting (bibtex)
Ivana Kruijffová (2000): Intonational Phonology. Cambridge University Press, 1996. In: PBML (bibtex)
Ivana Kruijffová (2000): Papers in Honour of Eva Hajičová (eds. B. H. Partee, P. Sgall; John Benjamins Publishing Company; Amsterdam/Philadelphia; 1995). In: Časopis pro moderní filologii 82(1), pp. 51--54 (bibtex)
Ivana Kruijffová (2000): Linguistic, Cognitive and Computational Perspectives. In: PBML 73-74 (bibtex)
Ivana Kruijffová (2000): Review of: James D. McCawley: The Syntactic Phenomena of English (Second Edition). Chicago University Press. In: JoLLi (bibtex)
Ivana Kruijffová (2000): Language at work. Analyzing communication breakdown in the workplace to inform systems design. CSLI Lecture Notes Number 66. In: (bibtex)
Ivana Kruijffová, Jiří Hana, Martin Čmejrek, Serge Sharoff, Danail Dochev, Elke Teich, Lena Sokolova, Tony Hartley, Kamenka Staykova, Donia Scott (2000): Evaluation of the Final Prototype. AGILE project deliverable Work Package 9:EVALI-Bu, EVALI-Cz, EVALI-Ru. In: Technical Report, p. 102, October (bibtex)
Ivana Kruijffová, Bonnie L. Webber (2000): Discourse Connectives, Inference and Information Structure. In: Proceedings of the International Workshop on Inference in Computational Semantics, ICoS-2 (eds. Johan Bos, Michael Kohlhase), pp. 105--120, Schloss Dagstuhl, July 29-30 (bibtex)
Ivana Kruijffová, Bonnie L. Webber (2000): Presuppositional Interpretation of Causal and Additive "although". Abstract. In: "Making Sense", Groningen (bibtex)
Ivana Kruijffová, Bonnie L. Webber (2000): Information Structure and the Interpretation of Discourse Connectives in English and Czech (Abstract). In: SIC-CSP2000, proceedings, Cambridge (bibtex)
Karel Oliva, Pavel Květoň, Vladimír Petkevič, Milena Hnátková (2000): The Linguistic Basis of a Rule-Based Tagger of Czech. In: TSD, in Lecture Notes in Artificial Intelligence, vol. 1902, pp. 3--8, Springer-Verlag (bibtex)
Roman Ondruška (2000): Syntactic Frames Extraction. In: WDS'00, Proceedings, part IV. (ed. J. Šafránková), pp. 483--486, MatfyzPress, vydavatelství MFF UK, ISBN 80-85863-59-6 (bibtex)
Jarmila Panevová (2000): Poznámky k valenci podstatných jmen. In: Čeština - univerzália a specifika 2. Sborník konference ve Šlapanicích u Brna, 17.-19.11.1999 (ed. Zdeňka Hladká, Petr Karlík), pp. 173--180, Masarykova Univerzita v Brně, ISBN 80-210-2262-0 (bibtex)
Jarmila Panevová (2000): Building an electronic language database nowadays: The Prague Dependency Treebank. In: (bibtex)
Jarmila Panevová (2000): Funkční styly a automatické zpracování jazyka. In: Česká slavistika. České přednášky pro XII. mezinárodní sjezd slavistů, Krakov 1998 (ed. H. Bláhová, S. Wollmann et al.), Slavia, Slovanský ústav AV ČR, ISBN 80-85494-41-8 (url, local DOC, bibtex)
Jarmila Panevová (2000): More Remarks on Control. In: (url, local DOC, bibtex)
Jarmila Panevová (2000): Actants and circonstants: Criteria for their determination. In: (in press, in a joint volume of I.N.A.L.C.O Paris - ÚFAL MFF UK Praha) (bibtex)
Jarmila Panevová (2000): Větná skladba (z hlediska pojetí v osnovách různých vzdělávacích systémů). In: (bibtex)
Jarmila Panevová, Alena Böhmová, Eva Hajičová, Petr Sgall, Markéta Ceplová, Veronika Řezníčková (2000): A Manual for Tectogrammatical Tagging of the Prague Dependency Treebank (technical report). In: (bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (2000): Coreference in Annotating a Large Corpus. In: LREC (2nd Intern. Conference), vol.I (eds. M. Gavrilidou, G. Carayannis, S. Markantonatou, S. Piperidis, G. Stainhaouer), pp. 497--500, Athens, Greece (bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (2000): A Semi-Automatic Resolution of Anaphora and Ellipsis in a Large Corpus of Czech. In: (url, local DOC, bibtex)
Jarmila Panevová, Jana Klímová (2000): Deminutivní substantiva v Českém národním korpusu a možnosti jejich automatického zpracování. In: Člověk a jeho jazyk; 1. Jazyk jako fenomén kultury (na počesť profesora Jána Horeckého) (ed. Kl. Buzássyová), pp. 296--306, Veda, vydavateľstvo Slovenskej akadémie vied, Bratislava, ISBN 80-224-0641-4 (bibtex)
Jarmila Panevová, Kateřina Marková (2000): Opisanije dvux tipov konversivnyx par. In: Slovo v tekste i v slovare (sbornik statej k semidesjatiletiju akademika Ju. D. Apresjana), pp. 202--211, Jazyki russkoj kul'tury, Moskva, ISBN 5-7859-0199-4 (bibtex)
Jarmila Panevová, Markéta Straňáková-Lopatková (2000): Selected Types of Ambiguity of Prepositional Groups; Classification, Criteria and Method for Automatic Processing (Abstract). In: proceedings of the SLE conference, pp. 61--63, Poznań, 31.8.-2.9. (bibtex)
Jarmila Panevová, Barbora Vidová Hladká, Kiril Ribarov, Vladislav Kuboň, Daniel Zeman, Martin Čmejrek, Jan Cuřín, Nino Peterek (2000): Počítačová lingvistika ve vztahu k informatice. In: Pokroky matematiky, fyziky a astronomie, ISSN 0032-2423, vol. 45, no. 3, pp. 207-218 (url, bibtex)
Nino Peterek (2000): Computer Analysis of Czech Speech and Prosody. In: (bibtex)
Nino Peterek (2000): Automatic speech recognition and prosody. In: WDS'00, Proceedings, part IV. (ed. J. Šafránková), pp. 476--478, MatfyzPress, vydavatelství MFF UK, ISBN 80-85863-59-6 (bibtex)
Nino Peterek, William Byrne, Sanjeev Khudanpur, Peter Beyerlein, Juan Manuel Huerta, Bhaskara Marthi, John Picone, Dimitra Vergyri, Wei Wang (2000): Towards Language Independent Acoustic Modeling. In: ICASSP 2000, pp. 1029-1032, Istanbul, Turkey (pdf, bibtex)
Nino Peterek, William Byrne, Sanjeev Khudanpur, Peter Beyerlein, Juan Manuel Huerta, Bhaskara Marthi, John Picone, Wei Wang, J. Morgan (2000): The experimental results from NSF Workshop'99 CLSP Johns Hopkins University. Part II.: Automatic Speech Recognition. In: PBML (bibtex)
Kiril Ribarov (2000): Rule-Based Tagging: Morphological Tagsets versus Tagset of Analytical Functions. In: LREC (2nd Intern. Conference), vol.II (eds. M. Gavrilidou, G. Carayannis, S. Markantonatou, S. Piperidis, G. Stainhaouer), pp. 1123--1125, Athens, Greece (bibtex)
Kiril Ribarov (2000): The (Un)Deterministic Nature of Morphological Context. In: LREC (2nd Intern. Conference), vol.III (eds. M. Gavrilidou, G. Carayannis, S. Markantonatou, S. Piperidis, G. Stainhaouer), pp. 1743--1747, Athens, Greece (bibtex)
Kiril Ribarov, Daniel Zeman (2000): Stochastically-Based Semantic Analysis, Kluwer, 1999 (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 73-74, pp. 105-108 (bibtex)
Veronika Řezníčková (2000): Valency frames of nouns and adjectives from PDT point of view. In: WDS'00, Proceedings, part IV. (ed. J. Šafránková), pp. 466--471, MatfyzPress, vydavatelství MFF UK, ISBN 80-85863-59-6 (bibtex)
Anoop Sarkar, Daniel Zeman (2000): Automatic Extraction of Subcategorization Frames for Czech. In: Proceedings of the 18th International Conference on Computational Linguistics (COLING), pp. 691-697, Universität des Saarlandes, Saarbrücken, Germany, ISBN 1-55860-717-X (pdf, bibtex)
Petr Sgall (2000): Aktuální členění věty - jeho postavení v jazykovém systému a jeho význam pro překlad. In: to appear in a volume for Prof. M. Hrala, Karolinum, Praha (bibtex)
Petr Sgall (2000): Od významové stavby k formální sémantice. In: Člověk a jeho jazyk; 1. Jazyk jako fenomén kultury (na počesť profesora Jána Horeckého) (ed. Kl. Buzássyová), pp. 244--250, Veda, vydavateľstvo Slovenskej akadémie vied, Bratislava, ISBN 80-224-0641-4 (bibtex)
Petr Sgall (2000): The Freedom of Language. In: PBML 73-74 (bibtex)
Petr Sgall (2000): Konvence v jazyku. In: Konvence ve vědě a filosofii (ed. J. Nosek), pp. 11--21, Filosofia, nakladatelství Filosofického ústavu AV ČR, Praha, ISBN 80-7007-138-9 (bibtex)
Petr Sgall (2000): English Syntax in Functional Generative Description, Topic-focus articulation (information structure) of the sentence, Syntax and semantics. In: Rudiments of English Linguistics (ed. Pavol Štekauer), 3 chapters, pp. 225--265, Slovacontact, Prešov, ISBN 80-88876-04-4 (url, local DOC, bibtex)
Petr Sgall (2000): Remarks on the Semantics of the Focus. In: Recent Topics in Mathematical and Computational Linguistics (papers in honour of S. Marcus) (eds. C. Martín-Vide, G. Paun), pp. 271--278, Editura Academiei Romane, Bucuresti, ISBN 973-27-0770-4 (bibtex)
Petr Sgall (2000): Sémantika a pragmatika v jazycích různých typů. In: Čeština - univerzália a specifika 2. Sborník konference ve Šlapanicích u Brna, 17.-19.11.1999 (ed. Zdeňka Hladká, Petr Karlík), pp. 107--113, Masarykova Univerzita v Brně, ISBN 80-210-2262-0 (bibtex)
Petr Sgall (2000): Jakobson entre l'Est et l'Ouest, 1915-1939. In: Slovo a slovesnost, ISSN 0037-7031, vol. 61, no. 4, pp. 307--309 (bibtex)
Petr Sgall (2000): On Comparison of Approaches. In: Linguistica Pragensia, Vol.X/2, pp. 73--84, ÚJČ AV ČR, ISSN 0862-8432 (url, local DOC, bibtex)
Petr Sgall (2000): Sketches of Slavic Scholars. In: SaS 61, pp. 149--151 (bibtex)
Petr Sgall (2000): Problémy mluvené češtiny v Praze. In: Město a jeho jazyk (ed. Slavomír Ondrejovič), pp. 75--83, Veda, vydavateľstvo Slovenskej akadémie vied, in the series Sociolinguistica Slovaca, Bratislava, ISBN 80-224-0605-8 (bibtex)
Petr Sgall (2000): Foundations of Computational Linguistics. Man-Machine Communication in Natural Language. In: PBML 73-74 (bibtex)
Petr Sgall (2000): Jan Firbas, Functional sentence perspective in written and spoken communication. In: Journal of Pragmatics, ISSN 0378-2166, 32, pp. 639--644 (bibtex)
Markéta Straňáková-Lopatková (2000): Selected Types of Pg-ambiguity; Processing Based on Analysis by Reduction. In: TSD2000, Proceedings (eds. P. Sojka, I. Kopeček, K. Pala), pp. 139--144, Lecture Notes in Artificial Intelligence vol.1902, Springer, ISBN 3-540-41042-2 (bibtex)
Jan Štěpánek (2000): Knowledge, Language and Logic: Questions for Quine, Kluwer Academic Publishers Dordrecht/Boston/London. In: PBML (bibtex)
Jan Štěpánek (2000): Formalism for a Description of Relations between Tectogrammatics and Morphemics. In: WDS'00, Proceedings, part IV. (ed. J. Šafránková), pp. 462--465, MatfyzPress, vydavatelství MFF UK, ISBN 80-85863-59-6 (bibtex)
Barbora Vidová Hladká (2000): Czech Language Tagging. PhD Thesis, UFAL, MFF UK. In: (bibtex)
Barbora Vidová Hladká (2000): The Context (not only) for Human. In: LREC (2nd Intern. Conference), vol.II (eds. M. Gavrilidou, G. Carayannis, S. Markantonatou, S. Piperidis, G. Stainhaouer), pp. 1113--1116, Athens, Greece (bibtex)
Barbora Vidová Hladká, Alena Böhmová, Kiril Ribarov (2000): Corpus Linguistics - Investigating Language Structure and Use, Cambridge, 1998. In: (bibtex)
Daniel Zeman, Anoop Sarkar (2000): Learning Verb Subcategorization from Corpora: Counting Frame Subsets. In: Proceedings of the Second International Conference on Language Resources and Evaluation, pp. 227-233, European Language Resources Association, Athîna, Greece (pdf, bibtex)
Alevtina Bémová, Jan Hajič, Barbora Vidová Hladká, Jarmila Panevová (1999): Morphological and Syntactic Tagging of the Prague Dependency Treebank. In: Journées ATALA - Corpus annotés pour la syntaxe; ATALA Workshop - Treebanks, Paris (bibtex)
Alena Böhmová, Eva Hajičová (1999): The Prague Dependency Treebank I; How Much of the Underlying Syntactic Structure Can Be Tagged Automatically?. In: PBML 71, Universita Karlova, Praha (bibtex)
Alena Böhmová, Eva Hajičová (1999): How Much of the Underlying Syntactic Structure Can Be Tagged Automatically?. In: Journées ATALA - Corpus annotés pour la syntaxe; ATALA Workshop - Treebanks, Paris (bibtex)
Martin Čmejrek, Jan Cuřín (1999): Automatic Translation Lexicon Extraction from Czech-English Parallel Texts. In: PBML 71, Universita Karlova, Praha (bibtex)
Jan Hajič, Ondřej Cikhart (1999): Word Sense Disambiguation of Czech Texts. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Jan Hajič, Michael Collins, Lance Ramshaw, Christoph Tillmann (1999): A Statistical Parser for Czech. In: ACL'99, Maryland, USA (bibtex)
Jan Hajič, Jarmila Panevová (1999): The syntactic Tagging of Corpora: New Issues for Explicit Syntactic Description of Czech. In: (bibtex)
Jan Hajič, Nino Peterek, Josef Psutka, Pavel Ircing, William Byrne, Frederick Jelinek, Sanjeev Khudanpur, John McDonough (1999): Large Vocabulary Speech Recognition for Read and Broadcast Czech. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Eva Hajičová (1999): Silver Medal of Charles University to John Benjamins. In: PBML 71, Universita Karlova, Praha (bibtex)
Eva Hajičová (1999): Aktuální členění věty a výstavba promluvy. In: Čeština - univerzália a specifika. Sborník konference ve Šlapanicích u Brna 17.-18. 11. 1998 (ed. Zdeňka Hladká, Petr Karlík) (bibtex)
Eva Hajičová (1999): The Prague Dependency Treebank: Crossing the Sentence Boundary. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Eva Hajičová, Petr Sgall, Ivana Kruijffová (1999): Prague Dependency Treebank: Restoration of Deletions. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Geert-Jan M. Kruijff (1999): Robustness in Tabular Deduction for Multimodal Logical Grammar - Part 1. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Geert-Jan M. Kruijff (1999): Dordrecht, Netherlands, 1997, ISBN 0-7923-4446-4, 466p. In: PBML 71, Universita Karlova, Praha (bibtex)
Geert-Jan M. Kruijff (1999): Implementation Tabular Deduction for Multimodal Logical Grammar. In: PBML 71, Universita Karlova, Praha (bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová (1999): Text Structuring in a Multilingual System for Generation of Instructions. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (url, local HTML, bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová (1999): Handling Word Order in a Multilingual System for Generation of Instructions. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (url, local HTML, bibtex)
Ivana Kruijffová (1999): Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová (ed. Eva Hajičová), 1998, 307 pp. In: SaS 60 (bibtex)
Pavel Květoň (1999): A maximum entropy approach to the natural language modeling. In: WDS 1999, Charles University, Prague (bibtex)
Jarmila Panevová (1999): Valence a její univerzální a specifické projevy. In: Čeština - univerzália a specifika. Sborník konference ve Šlapanicích u Brna 17.-18. 11. 1998 (ed. Zdeňka Hladká, Petr Karlík) (bibtex)
Jarmila Panevová (1999): Úvahy nad novou Skladbou češtiny M. Grepla a P. Karlíka. II. In: SaS 60 (bibtex)
Jarmila Panevová, Alena Böhmová, Petr Sgall (1999): Syntactic Tagging: Procedure for the Transition from the Analytic to the Tectogrammatical Tree Structure. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Jarmila Panevová, Markéta Straňáková-Lopatková (1999): Some Types of Syntactic Ambiguity; How to Treat Them in an Automatic Procedure. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Petr Sgall (1999): Na gymnáziu za války (a potom v Praze). In: Almanach 1909-1999, Gymnázium v České Třebové (bibtex)
Petr Sgall (1999): Čekající možnosti a číhající propasti. In: Slovo a slovesnost, 60 (bibtex)
Petr Sgall (1999): Sbližování spisovné a obecné češtiny. In: Naše řeč 82, no. 4 (bibtex)
Petr Sgall (1999): D. G. Hays o předstupních civilizace. In: Slovo a slovesnost (bibtex)
Petr Sgall (1999): Prague School Typology. In: Approaches to language typology (ed. Masayoshi Shibatani, Theodora Bynon), Oxford: Oxford University Press (1st edition 1995) (bibtex)
Petr Sgall (1999): Remarks on Sentence Prosody and Topic-Focus Articulation. In: TSD'99, Proceedings (eds. V. Matoušek, P. Mautner, J. Ocelíková, P. Sojka), Lecture Notes in Artificial Intelligence vol.1692, Springer (bibtex)
Petr Sgall (1999): Proměny hovorové češtiny a škola. In: Proceedings of international conference Teachers and their University Education at the Turn of Millenium (Jana Kohnová) (url, bibtex)
Petr Sgall (1999): Lingvistika a zákon schválnosti. In: Jazykovědné aktuality, vol. XXXVI - 1999, nos. 1 and 2 (bibtex)
Petr Sgall (1999): Závislostní gramatika a slovosled v češtině a v analytických jazycích. In: Čeština - univerzália a specifika. Sborník konference ve Šlapanicích u Brna 17.-18. 11. 1998 (ed. Zdeňka Hladká, Petr Karlík) (bibtex)
Petr Sgall (1999): The Archimedes Problem. In: PBML 71, Universita Karlova, Praha (bibtex)
Petr Sgall (1999): Issues of Colloquial and Standard Czech. In: Přednášky z XLII. Běhu Letní školy slovanských studií, Lšss FF UK (bibtex)
Petr Sgall (1999): Nestát vývoji v cestě. Rozhovor připravil L. Kasal. In: Tvar 15 (bibtex)
Petr Sgall (1999): Types of languages and probabilistic implication laws. In: Prague Linguistic Circle Papers 3, pp. 25--34, Amsterdam/Philadelphia: Benjamins (in press) (bibtex)
Markéta Straňáková-Lopatková (1999): Selected Types of Pg-Ambiguity. In: PBML 72 (bibtex)
Alevtina Bémová, Geert-Jan M. Kruijff, Ivana Kruijffová, Jiří Trojánek (1998): Modelling Lexical Resources in KPML for Generating Instructions in Slavic languages. In: (bibtex)
Alevtina Bémová, Jiří Trojánek (1998): Tagging and analysis of instructional texts in the software domain. In: (bibtex)
Jan Hajič (1998): Building a Syntactically Annotated Corpus: The Prague Dependency Treebank. In: Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová (ed. Eva Hajičová), Karolinum, Charles University Press, Prague, ISBN 80-7184-601-5 (bibtex)
Jan Hajič, Eric Brill, Michael Collins, Barbora Hladká, Douglas Jones, Cynthia Kuo, Lance Ramshaw, Oren Schwartz, Christoph Tillmann, Daniel Zeman (1998): Core Natural Language Processing Technology Applicable to Multiple Languages. In: An NSF Workshop: Language Engineering for Students and Professionals Integrating Research and Education, Johns Hopkins University, Baltimore, MD, USA (url, bibtex)
Jan Hajič, Eric Brill, Michael Collins, Barbora Hladká, Douglas Jones, Cynthia Kuo, Lance Ramshaw, Oren Schwartz, Christoph Tillmann, Daniel Zeman (1998): Core Natural Language Technology Applicable to Multiple Languages, Workshop ´98 Final Report. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 70, pp. 73-82 (local HTML, bibtex)
Jan Hajič, Jarmila Panevová, Eva Hajičová, Petr Sgall (1998): Syntax v českém národním korpusu. In: Slovo a slovesnost 59, č. 3 (bibtex)
Jan Hajič, Barbora Vidová Hladká (1998): Tagging Inflective Languages: Prediction of Morphological Categories for a Rich, Structured Tagset. In: Proceedings of the Conference COLING - ACL '98, Montreal, Canada (bibtex)
Jan Hajič, Barbora Vidová Hladká (1998): Czech Language Processing – PoS Tagging. In: Proceedings of the First International Conference on Language Resources & Evaluation, Granada, Spain (bibtex)
Eva Hajičová (1998): The ordering of valency slots from a communicative point of view. In: Productivity and Creativity. Studies in General and Descriptive Linguistics in Honour of E. M. Uhlenbeck (ed. Mark Janse), Mouton de Gruyter, Berlin, New York (bibtex)
Eva Hajičová (1998): Prague Dependency Treebank: From analytic to tectogrammatical annotations. In: Text, Speech, Dialogue. Proceedings of the First Workshop on Text, Speech, Dialogue-TSD'98 (eds. P. Sojka, V. Matoušek, K. Pala, I. Kopeček), Brno: Masaryk University, Czech Republic, ISBN 80-210-1900-X (bibtex)
Eva Hajičová (1998): Oldřich Leška's seventieth birthday. In: Linguistica Pragensia, Vol. VIII/2 (bibtex)
Eva Hajičová (1998): Movement rules revisited. In: Processing of Dependency-Based Grammars, Proceedings from the Workshop, COLING/ACL (ed. A. Polguere, S. Kahane), Montreal (bibtex)
Eva Hajičová (1998): Jarmila Panevová's anniversary. In: Linguistica Pragensia, Vol. VIII/1 (bibtex)
Eva Hajičová, Geert-Jan M. Kruijff, Ivana Kruijffová (1998): Salience in dialogues. In: Dialoganalyse VI (ed. by S. Čmejrková, J. Hoffmannová, O. Müllerová, J. Světlá), Max Niemeyer Verlag, Tübingen, ISBN 3-484-75016-2 (bibtex)
Eva Hajičová, Petr Sgall, Renata Blatná, František Čermák, Karel Kučera, Věra Schmiedtová (1998): Šestnáctý kongres lingvistů. In: Slovo a slovesnost 59, č. 3 (bibtex)
Eva Hajičová, Petr Sgall, Barbara Partee (1998): Focus, Topic, and Semantics. In: Proceedings of Workshop on Focus. Univ. of Massachusetts Occasional Papers in Linguistics, Vol. 21 (eds. E. Benedicto, M. Romero, S. Tomioka), GLSA, Univ. of Massachusetts, Amherst (bibtex)
Eva Hajičová, Petr Sgall, Barbara Partee (1998): Topic-focus articulation, tripartite structures, and semantic content. In: (bibtex)
Jiří Hana (1998): Lexical-morphological specifications and resources (European Union grant within the INCO-Copernicus programme, grant number PL961104). In: web page (url, bibtex)
Romana Králíková (1998): Tagging of Parallel Corpora. In: Telri. Proceedings of the Third European Seminar "Translation Equivalence" (eds. W. Teubert, E. Tognini Bonelli, N. Volz), Montecatini Terme, Italy, ISBN 0 9528026 1 X (bibtex)
Geert-Jan M. Kruijff (1998): Clarendon Press/Oxford Science Publications, 1996. ISBN 0-19-853833-2. In: Journal of Logic, Language & Information, vol. 7, No. 4 (bibtex)
Geert-Jan M. Kruijff (1998): Basic dependency-based grammar (technical report). In: (bibtex)
Geert-Jan M. Kruijff (1998): Kluwer Academic Press, Dordrecht, the Netherlands, 1997. ISBN 079234345x. In: PBML 69, r. 35 (bibtex)
Geert-Jan M. Kruijff, Ivana Kruijffová (1998): Proceedings of the ESSLLI Third Student Session (editors). In: (bibtex)
Geert-Jan M. Kruijff, Richard T. Oehrle, Gosse Bouma (1998): Proceedings of the Joint Conference on Formal Grammar, Head-driven Phrase Structure Grammar, and Categorial Grammar (editors). In: (bibtex)
Geert-Jan M. Kruijff, J. Schaake (1998): Discerning Relevance in Discourse Using TFA. In: chapter in Recent Advances in Natural Language Processing (eds. R. Mitkov, N. Nikolov), John Benjamins Publishing, Amsterdam (bibtex)
Ivana Kruijffová (1998): Topic-Focus Articulation and the Dynamics of Discourse Representation. In: contribution to the colloquium "The Dynamic Turn" (bibtex)
Ivana Kruijffová (1998): Automatic Generation of Instructions in a Multilingual Environment. In: Text, Speech, Dialogue. Proceedings of the First Workshop on Text, Speech, Dialogue-TSD'98 (eds. P. Sojka, V. Matoušek, K. Pala, I. Kopeček), Brno: Masaryk University, Czech Republic, ISBN 80-210-1900-X (bibtex)
Ivana Kruijffová (1998): The Dynamic Potential of Topic and Focus: A Praguian Approach to Discourse Representation Theory. In: PhD. Thesis, MFF UK, Prague (url, bibtex)
Ivana Kruijffová, Jiří Hana (1998): Specification of grammatical resources for the Initial Demonstrator. In: (bibtex)
Ivana Kruijffová, Jiří Hana (1998): Grammatical Resource Implementation for Bulgarian, Czech and Russian. In: (bibtex)
Ivana Kruijffová, Jiří Hana (1998): Generation of simple text structures in Bulgarian, Czech and Russian. In: (bibtex)
Vladislav Kuboň, Tomáš Holan, Karel Oliva, Martin Plátek (1998): Two Useful Measures of Word Order Complexity. In: (bibtex)
Jarmila Panevová (1998): Ellipsis and zero elements in the structure of the sentence. In: Tipologija, grammatika, semantika. K 65-letiju Viktora Samuiloviča Chrakovskogo. (ed. N. A. Kozinceva, A.K. Ogloblin), Nauka, Sankt-Peterburg, ISBN 5-02-028355-X (bibtex)
Jarmila Panevová (1998): New Prague School Publications. In: Theoretical Linguistics, Vol. 24, No. 1 (bibtex)
Jarmila Panevová (1998): Úvahy nad novou Skladbou češtiny M. Grepla a P. Karlíka. I. In: SaS 59 (bibtex)
Jarmila Panevová (1998): Ještě k teorii valence. In: Slovo a slovesnost 59, č.1 (bibtex)
Jarmila Panevová (1998): Ještě k teorii valence - tentokrát na materiálu českých adjektiv primárních. In: Jazykovědné aktuality, roč. XXXV - 1998, zvl. číslo (bibtex)
Jarmila Panevová (1998): Koreference v gramatice a v textu (nutnost strukturního popisu a jeho hranice). In: Nowe czasy, nowe jezyki, nowe (i stare) problemy (ed. E. Jedrzejko), Katowice: Wydawnictwo Uniwersitetu Slaskiego, ISBN 83-226-0768-7 (bibtex)
Jarmila Panevová, Eva Hajičová, Petr Sgall (1998): Language Resources Need Annotations To Make Them Really Reusable: The Prague Dependency Treebank. In: Proceedings of the First International Conference on Language Resources & Evaluation, Granada, Spain (bibtex)
Jarmila Panevová, Petr Sgall (1998): Verbal Categories, Meaning and Typology. In: Typology of Verbal Categories (eds. L. Kulikov, H. Vater), Tuebingen: Max Niemeyer Verlag, ISBN 3-484-30782-4 (bibtex)
Jarmila Panevová, Petr Sgall (1998): K asymetrii mezi rovinami jazykového systému. In: Časopis pro moderní filologii, 79, č. 2, 1997 (bibtex)
Kiril Ribarov, Zdeňka Ribarova (1998): Living Conservation of the Lexis of Old Church Slavonic. In: Paleobulgarica XXI, 2, Bulgaria (bibtex)
Petr Sgall (1998): Valency and underlying structure: An alternative view on dependency. In: Recent trends in Meaning-Text Theory. Ed. by Leo Wanner, Amsterdam/Philadelphia: Benjamins, ISBN 90-272-3042-0 (url, local DOC, bibtex)
Petr Sgall (1998): Teorie valence a její formální zpracování. In: Slovo a slovesnost 59, č. 1 (bibtex)
Petr Sgall (1998): K poznámce O. Uličného v ČJL 1998, č. 1-2. In: Český jazyk a literatura č. 3-4, 49/98-99, SPN + FORTUNA (bibtex)
Petr Sgall (1998): Problems of Dialogue Research in Spoken Czech. In: Dialoganalyse VI (ed. by S. Čmejrková, J. Hoffmannová, O. Müllerová, J. Světlá), Max Niemeyer Verlag, Tübingen, ISBN 3-484-75016-2 (bibtex)
Petr Sgall (1998): Revisiting the Classification of the Dependents. Interesting Results and Tempting Topics for Further Research. In: Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová (ed. Eva Hajičová), Karolinum, Charles University Press, Prague, ISBN 80-7184-601-5 (url, local DOC, bibtex)
Petr Sgall (1998): Structure, meaning and use. In: Reconnecting language: Morphology and syntax in functional perspectives. Red. Anne-Marie Simon-Vandenbergen, Amsterdam/Philadelphia: Benjamins, ISBN 90-272-3659-3 (bibtex)
Petr Sgall (1998): Věta, kontext a slovosled. In: Časopis pro moderní filologii 80, č. 1 (bibtex)
Petr Sgall (1998): Functionalism in Czech Linguistics and in the World. In: Linguistica Pragensia, 1997 (bibtex)
Petr Sgall (1998): Word, sentence, and discourse. In: Productivity and Creativity. Studies in General and Descriptive Linguistics in Honour of E. M. Uhlenbeck (ed. Mark Janse), Mouton de Gruyter, Berlin, New York (bibtex)
Petr Sgall (1998): Český pohled na dějiny lingvistiky. In: Slovo a slovesnost 59, č. 4 (bibtex)
Petr Sgall (1998): Remarks on Parsing Written and Spoken Discourse. In: Text, Speech, Dialogue. Proceedings of the First Workshop on Text, Speech, Dialogue-TSD'98 (eds. P. Sojka, V. Matoušek, K. Pala, I. Kopeček), Brno: Masaryk University, Czech Republic, ISBN 80-210-1900-X (bibtex)
Petr Sgall (1998): Neochuzujme spisovnou češtinu. In: Český jazyk a literatura č. 1-2, 49/98-99, SPN + FORTUNA (bibtex)
Petr Sgall (1998): Introduction. In: Structural and Functional Linguistics: The Prague School, by Jun Qian, Jilin Education Press, Changchun, Jilin (bibtex)
Petr Sgall, Jaroslav Peregrin (1998): Meaning and "propositional attitudes". In: In the World of Signs. Essays in honour of Professor Jerzy Pelc. Poznan Studies in the Philosophy of the Sciences and the Humanities, Vol. 62 (eds. J. Jadacki, W. Strawinski), Adam Mickiewicz University, 1998, ISBN 90-420-0399-5 (bibtex)
Petr Sgall, Jaroslav Peregrin (1998): Věta, promluva a slovosled. In: Jazykovědné aktuality, roč. XXXV - 1998, zvl. číslo (bibtex)
Petr Sgall, Kiril Ribarov (1998): The Micro and the Macro of Linguistic Description. In: Elsnet in Wonderland, Proceedings, Utrecht University (bibtex)
Markéta Straňáková-Lopatková (1998): Ambiguity in Czech Sentences, Its Classification and Searching for It. In: Proceedings of WDS'98 (ed. J. Šafránková), ISBN 80-85863-29-4 (bibtex)
Barbora Vidová Hladká, Kiril Ribarov (1998): PoS tags for automatic tagging and syntactic structures. In: Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová (ed. Eva Hajičová), Karolinum, Charles University Press, Prague, ISBN 80-7184-601-5 (bibtex)
Daniel Zeman (1998): A Statistical Approach to Parsing of Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 69, pp. 29-37 (url, local PDF, bibtex)
Daniel Zeman (1997): Pravděpodobnostní model významových zápisů vět (masters thesis). In: (url, bibtex)