My publications and talks
Of course, you can also find them on Google Scholar.
Video recordings of my talks
Sometimes someone records me on video while I am giving a talk. I try to collect such videos.
- From Balustrades to Pierre Vinken: Looking for Syntax in Transformer Self-Attentions, BlackBoxNLP workshop, Florence, Italy (video with slides)
- Cross-lingual Transfer of Dependency Parsers, ÚFAL Monday Seminar, 2017, Prague, Czechia (video and slides)
- KLcpos3 - a Language Similarity Measure for Delexicalized Parser Transfer, ACL 2015, Beijing, China (video with slides)
- Using a Collection of Many Treebanks for Exploring the Structure of Natural Language Sentences, ÚFAL Doctoral Students Workshop 2014, Prague, Czechia (video and slides)
- DEPFIX: Automatic Post-editing of Phrase-based Machine Translation Outputs, ÚFAL Monday Seminar, 2013, Prague, Czechia (video and slides)
- Error Correction of PB SMT Outputs with automatic post-editing shown on English to Czech translation, MTM 2013, Prague, Czechia (video)
- Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis, ACL SRW 2013, Sofia, Bulgaria (video with slides)
An automatic listing of my publications
For each publication, there is also a link to the paper in PDF, and also to presentation(s) and/or poster(s).
However, the names of the files are always something like batt1.pdf and I cannot change that as it gets generated automatically, so you have to try out the files to see which is which...
Or, you can follow the links named "biblio", which lead to a page of the publication with detailed information about it and a more user-friendly list of files for download.
- Eyes on the Parse: Using Gaze Features in Syntactic Parsing. In: Proceedings of the Second Workshop on Beyond Vision and LANguage: inTEgrating Real-world kNowledge (LANTERN), pp. 1-16, Association for Computational Linguistics, Barcelona, Spain, ISBN 978-1-952148-51-4 (url, local PDF, bibtex)
- Dočkáme se digitálního Shakespeara? AI jako autor divadelní hry. In: TA.DI, 11/2020, pp. 28-31 (url, local PDF, bibtex)
- THEaiTRE: A theatre play written entirely by machines (Electronic). (url)
- R.U.R. v dobách umělé inteligence: Divadelní hru k 100 letům Čapkova díla píše robot z Matfyzu (Electronic). (url)
- Umělá inteligence píše divadelní hru (Electronic). (url)
- On the Language Neutrality of Pre-trained Multilingual Representations. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 1663-1674, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-90-3 (url, local PDF, bibtex)
- Universal Dependencies according to BERT: both more specific and more general. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 2710-2722, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-90-3 (url, bibtex)
- Hidden in the Layers: Interpretation of Neural Networks for Natural Language Processing. In: , ISBN 978-80-88132-10-3 (bibtex)
- Scénář: Robot. Ve Švandově divadle píše hru k výročí R.U.R. umělá inteligence. In: Hospodářské noviny IHNED, ISSN 1213-7693, pp. 1-3 (url, local PDF, bibtex)
- Deliverable D7.2 Report on NLP Technologies Workshop at EUROSAI Congress (technical report). In: (bibtex)
- THEaiTRE: Artificial Intelligence to Write a Theatre Play. In: Proceedings of AI4Narratives — Workshop on Artificial Intelligence for Narratives, pp. 9-13, RWTH Aachen University, Aachen, Germany (pdf, bibtex)
- Measuring Memorization Effect in Word-Level Neural Networks Probing. In: 23rd International Conference on Text, Speech and Dialogue, pp. 180-188, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (url, local PDF, bibtex)
- Ze života robotů. In: Respekt, ISSN 1801-1446, 46/2020, pp. 52-55 (url, bibtex)
- Predicting Typological Features in WALS using Language Embeddings and Conditional Probabilities: ÚFAL Submission to the SIGTYP 2020 Shared Task. In: Proceedings of the Second Workshop on Computational Research in Linguistic Typology, pp. 29-35, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-73-6 (url, local PDF, bibtex)
- How Language-Neutral is Multilingual BERT? (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, vol. arXiv:1911.03310 [cs.CL], no. arXiv:1911.03310 [cs.CL], pp. 1-6 (url, local PDF)
- From Balustrades to Pierre Vinken: Looking for Syntax in Transformer Self-Attentions. In: The BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL 2019, pp. 263-275, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-30-7 (url, local PDF, local PDF, obd, bibtex)
- Attempting to separate inflection and derivation using vector space representations. In: Proceedings of the Second International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2019), pp. 61-70, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-08-0 (url, local PDF, local PDF, local PDF, obd, bibtex)
- Unsupervised Lemmatization as Embeddings-Based Word Clustering (Electronic). In: ArXiv.org Computing Research Repository, ISSN 2331-8422, arXiv:1908.08528 [cs.CL], pp. 1-5 (url, local PDF)
- Solving Three Czech NLP Tasks End-to-End with Neural Models. In: Proceedings of the 18th conference ITAT 2018: Slovenskočeský NLP workshop (SloNLP 2018), pp. 138-143, CreateSpace Independent Publishing Platform, Košice, Slovakia, ISBN 978-1727267198 (pdf, local PDF, local PDF, obd, bibtex)
- Extracting Syntactic Trees from Transformer Encoder Self-Attentions. In: Proceedings of the First Workshop on Analyzing and Interpreting Neural Networks for NLP, pp. 347-349, The Assotiation of Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-71-1 (url, local PDF, local PDF, obd, bibtex)
- Discovering the structure of natural language sentences by semi-supervised methods (PhD thesis). In: (local PDF, local PDF, local PDF, local PDF, bibtex)
- CUNI x-ling: Parsing under-resourced languages in CoNLL 2018 UD Shared Task. In: Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 187-196, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-82-7 (pdf, local PDF, local PDF, obd, bibtex)
- CUNI Experiments for WMT17 Metrics Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 604-611, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (obd, bibtex)
- MonoTrans: Statistical Machine Translation from Monolingual Data. In: Proceedings of the 17th conference ITAT 2017: Slovenskočeský NLP workshop (SloNLP 2017), pp. 201-208, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (pdf, local PDF, local PDF, local PDF, obd, bibtex)
- Slavic Forest, Norwegian Wood. In: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial4), pp. 210-219, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-43-2 (pdf, local PDF, local PDF, obd, bibtex)
- Error Analysis of Cross-lingual Tagging and Parsing. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, pp. 106-118, Univerzita Karlova, Praha, Czechia, ISBN 978-80-88132-04-2 (pdf, local PDF, local PDF, obd, bibtex)
- Findings of the WMT 2017 Biomedical Translation Shared Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 234-247, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (pdf, obd, bibtex)
- TectoMT – a deep-linguistic core of the combined Chimera MT system. In: Baltic Journal of Modern Computing, ISSN 2255-8942, vol. 4, no. 2, pp. 377-377 (pdf, local PDF, local PDF, local PDF, obd, bibtex)
- Czechizator. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 74-79, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (pdf, local PDF, local PDF, obd, bibtex)
- Moses & Treex Hybrid MT Systems Bestiary. In: Proceedings of the 2nd Deep Machine Translation Workshop, pp. 1-10, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-02-8 (url, local PDF, local PDF, obd, bibtex)
- Dictionary-based Domain Adaptation of MT Systems without Retraining. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 449-455, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, obd, bibtex)
- Targeted Paraphrasing on Deep Syntactic Layer for MT Evaluation. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 20-27, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (url, local PDF, local PDF, obd, bibtex)
- New Language Pairs in TectoMT. In: Proceedings of the 10th Workshop on Machine Translation, pp. 98-104, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (pdf, local PDF, obd, bibtex)
- Multi-source Cross-lingual Delexicalized Parser Transfer: Prague or Stanford?. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 281-290, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (url, local PDF, local PDF, obd, bibtex)
- Parsing Natural Language Sentences by Semi-supervised Methods (Electronic). (pdf, local PDF, local PDF, local PDF)
- A new parsing algorithm. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 8-13, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (local PDF, obd, bibtex)
- Translation Model Interpolation for Domain Adaptation in TectoMT. In: Proceedings of the 1st Deep Machine Translation Workshop, pp. 89-96, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-904571-7-1 (url, local PDF, local PDF, obd, bibtex)
- KLcpos3 - a Language Similarity Measure for Delexicalized Parser Transfer. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pp. 243-249, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-73-0 (url, local PDF, local PDF, local ZIP, local PDF, obd, bibtex)
- MSTParser Model Interpolation for Multi-source Delexicalized Transfer. In: Proceedings of the 14th International Conference on Parsing Technologies, pp. 71-75, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-98-3 (url, local PDF, local PDF, obd, bibtex)
- Improving Evaluation of English-Czech MT through Paraphrasing. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 596-601, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, obd, bibtex)
- Machine Translation of Medical Texts in the Khresmoi Project. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 221-228, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, local PDF, local PDF, obd, bibtex)
- Adaptation of machine translation for multilingual information retrieval in medical domain. In: Artificial Intelligence in Medicine, ISSN 0933-3657, vol. 61, no. 3, pp. 165-185 (url, obd, bibtex)
- Depfix, a Tool for Automatic Rule-based Post-editing of SMT. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 102, pp. 47-56 (local PDF, local PDF, local PDF, obd, bibtex)
- Fairytale Child Chatbot. In: Proceedings of the 14th conference ITAT 2014, pp. 79-84, Institute of Computer Science AS CR, Praha, Czechia, ISBN 978-80-87136-18-8 (local PDF, local PDF, obd, bibtex)
- Depfix Manual (technical report). In: (local HTML, local PDF, bibtex)
- HamleDT 2.0: Thirty Dependency Treebanks Stanfordized. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2334-2341, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, local PDF, obd, bibtex)
- CUNI in WMT14: Chimera Still Awaits Bellerophon. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 195-200, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, local PDF, local PDF, obd, bibtex)
- Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In: Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, pp. 31-34, Microsoft Research, Cambridge, UK (url, local PDF, obd, bibtex)
- Chimera – Three Heads for English-to-Czech Translation. In: Proceedings of the Eight Workshop on Statistical Machine Translation, pp. 92-98, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-57-2 (url, local PDF, local PDF, obd, bibtex)
- Automatic post-editing of phrase-based machine translation outputs (masters thesis). In: (local PDF, local PDF, local PDF, local PDF, bibtex)
- Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp. 172-179, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-53-4 (url, local PDF, local PDF, local PDF, obd, bibtex)
- MTMonkey: A Scalable Infrastructure for a Machine Translation Web Service. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 100, pp. 31-40 (pdf, local PDF, local PDF, obd, bibtex)
- Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors. In: Proceedings of Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6), ACL, pp. 39-48, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-38-1 (pdf, local PDF, local PDF, obd, bibtex)
- Dependency Relations Labeller for Czech. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 256-263, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (url, local PDF, local PDF, obd, bibtex)
- DEPFIX: A System for Automatic Correction of Czech MT Outputs. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 362-368, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, local HTML, local PDF, local PDF, obd, bibtex)
- Named Entities from Wikipedia for Machine Translation. In: Information Technologies – Applications and Theory, pp. 23-30, Univerzita Pavla Jozefa Šafárika v Košiciach, Košice, Slovakia, ISBN 978-80-89557-02-8 (local PDF, local PDF, local PDF, obd, bibtex)
- Two-step translation with grammatical post-processing. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 426-432, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, local PDF, local PDF, obd, bibtex)