My publications and talks

Of course, you can also find them on Google Scholar.

Video recordings of my talks

Sometimes someone records me on video while I am giving a talk. I try to collect such videos.

  • KLcpos3 - a Language Similarity Measure for Delexicalized Parser Transfer, ACL 2015, Beijing, China (video with slides)
  • Using a Collection of Many Treebanks for Exploring the Structure of Natural Language Sentences, ÚFAL Doctoral Students Workshop 2014, Prague, Czechia (video and slides)
  • DEPFIX: Automatic Post-editing of Phrase-based Machine Translation Outputs, ÚFAL Monday Seminar, 2013, Prague, Czechia (video and slides)
  • Error Correction of PB SMT Outputs with automatic post-editing shown on English to Czech translation, MTM 2013, Prague, Czechia (video)
  • Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis, ACL SRW 2013, Sofia, Bulgaria (video with slides)

An automatic listing of my publications

For each publication, there is also a link to the paper in PDF, and also to presentation(s) and/or poster(s).

However, the names of the files are always something like batt1.pdf and I cannot change that as it gets generated automatically, so you have to try out the files to see which is which...

Or, you can follow the links named "biblio", which lead to a page of the publication with detailed information about it and a more user-friendly list of files for download.

  1. David Mareček, Ondřej Bojar, Ondřej Hübsch, Rudolf Rosa, Dušan Variš (2017): CUNI Experiments for WMT17 Metrics Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 604-611, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (biblio, bibtex)
  2. Rudolf Rosa (2017): MonoTrans: Statistical Machine Translation from Monolingual Data. In: Proceedings of the 17th conference ITAT 2017: Slovenskočeský NLP workshop (SloNLP 2017), pp. 201-208, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (pdf, biblio, batt1.pdf, batt2.pdf, batt3.pdf, bibtex)
  3. Rudolf Rosa, Daniel Zeman, David Mareček, Zdeněk Žabokrtský (2017): Slavic Forest, Norwegian Wood. In: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial4), pp. 210-219, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-43-2 (pdf, biblio, batt1.pdf, batt2.pdf, bibtex)
  4. Martin Popel, Roman Sudarikov, Ondřej Bojar, Rudolf Rosa, Jan Hajič (2016): TectoMT – a deep-­linguistic core of the combined Chimera MT system. In: Baltic Journal of Modern Computing, ISSN 2255-8942, vol. 4, no. 2, pp. 377-377 (pdf, biblio, batt1.pdf, batt2.pdf, batt3.pdf, obd, bibtex)
  5. Rudolf Rosa (2016): Czechizator. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 74-79, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (pdf, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  6. Rudolf Rosa, Martin Popel, Ondřej Bojar, David Mareček, Ondřej Dušek (2016): Moses & Treex Hybrid MT Systems Bestiary. In: Proceedings of the 2nd Deep Machine Translation Workshop, pp. 1-10, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-02-8 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  7. Rudolf Rosa, Roman Sudarikov, Michal Novák, Martin Popel, Ondřej Bojar (2016): Dictionary-based Domain Adaptation of MT Systems without Retraining. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 449-455, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, biblio, obd, bibtex)
  8. Petra Barančíková, Rudolf Rosa (2015): Targeted Paraphrasing on Deep Syntactic Layer for MT Evaluation. In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 20-27, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  9. Ondřej Dušek, Luís Gomes, Michal Novák, Martin Popel, Rudolf Rosa (2015): New Language Pairs in TectoMT. In: Proceedings of the 10th Workshop on Machine Translation, pp. 98-104, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (pdf, biblio, batt1.pdf, obd, bibtex)
  10. Rudolf Rosa (2015): Multi-source Cross-lingual Delexicalized Parser Transfer: Prague or Stanford? In: Proceedings of the Third International Conference on Dependency Linguistics (Depling 2015), pp. 281-290, Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  11. Rudolf Rosa (2015): Parsing Natural Language Sentences by Semi-supervised Methods (Electronic). (pdf, biblio, batt1.pdf, batt2.pdf, batt3.pdf)
  12. Rudolf Rosa (2015): A new parsing algorithm. In: UFAL WDS 2015 (Conference of PhD Students in Mathematical Linguistics), pp. 8-13, Institute of Formal and Applied Linguistics, Charles University in Prague, Praha, Czechia (biblio, batt1.pdf, obd, bibtex)
  13. Rudolf Rosa, Ondřej Dušek, Michal Novák, Martin Popel (2015): Translation Model Interpolation for Domain Adaptation in TectoMT. In: Proceedings of the 1st Deep Machine Translation Workshop, pp. 89-96, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-904571-7-1 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  14. Rudolf Rosa, Zdeněk Žabokrtský (2015): KLcpos3 - a Language Similarity Measure for Delexicalized Parser Transfer. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pp. 243-249, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-73-0 (url, biblio, batt1.pdf, batt2.zip, batt3.pdf, batt4.pdf, obd, bibtex)
  15. Rudolf Rosa, Zdeněk Žabokrtský (2015): MSTParser Model Interpolation for Multi-source Delexicalized Transfer. In: Proceedings of the 14th International Conference on Parsing Technologies, pp. 71-75, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-98-3 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  16. Petra Barančíková, Rudolf Rosa, Aleš Tamchyna (2014): Improving Evaluation of English-Czech MT through Paraphrasing. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 596-601, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, biblio, batt1.pdf, obd, bibtex)
  17. Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, Daniel Zeman (2014): Machine Translation of Medical Texts in the Khresmoi Project. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 221-228, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  18. Pavel Pecina, Ondřej Dušek, Lorraine Goeuriot, Jan Hajič, Jaroslava Hlaváčová, Gareth J.F. Jones, Liadh Kelly, Johannes Leveling, David Mareček, Michal Novák, Martin Popel, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová (2014): Adaptation of machine translation for multilingual information retrieval in medical domain. In: Artificial Intelligence in Medicine, ISSN 0933-3657, vol. 61, no. 3, pp. 165-185 (url, biblio, obd, bibtex)
  19. Rudolf Rosa (2014): Depfix, a Tool for Automatic Rule-based Post-editing of SMT. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 102, pp. 47-56 (biblio, batt1.pdf, batt2.pdf, batt3.pdf, obd, bibtex)
  20. Rudolf Rosa (2014): Fairytale Child Chatbot. In: Proceedings of the 14th conference ITAT 2014, pp. 79-84, Institute of Computer Science AS CR, Praha, Czechia, ISBN 978-80-87136-18-8 (biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  21. Rudolf Rosa (2014): Depfix Manual (technical report). ÚFAL MFF UK (biblio, batt1.pdf, batt2.html, bibtex)
  22. Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman, Zdeněk Žabokrtský (2014): HamleDT 2.0: Thirty Dependency Treebanks Stanfordized. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2334-2341, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  23. Aleš Tamchyna, Martin Popel, Rudolf Rosa, Ondřej Bojar (2014): CUNI in WMT14: Chimera Still Awaits Bellerophon. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 195-200, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  24. Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, Khalid Choukri, Sarah Cruchet, Hamish Cunningham, Jan Dědek, Ljiljana Dolamic, René Donner, Ondřej Dušek, Sebastian Dungs, Ivan Eggel, Antonio Foncubierta, Norbert Fuhr, Adam Funk, Alba García Seco de Herrera, Arnaud Gaudinat, Georgi Georgiev, Julien Gobeill, Lorraine Goeuriot, Paz Gomez, Mark A. Greenwood, Manfred Gschwandtner, Allan Hanbury, Jan Hajič, Jaroslava Hlaváčová, Markus Holzer, Gareth J.F. Jones, Blanca Jordán, Matthias Jordan, Klemens Kaderk, Franz Kainberger, Liadh Kelly, Sascha Kriewel, Marlene Kritz, Georg Langs, Nolan Lawson, Johannes Leveling, David Mareček, Dimitrios Markonis, Iván Martínez, Vassil Momtchev, Alexandre Masselot, Hélène Mazo, Henning Müller, Michal Novák, Johann Petrak, João Palotti, Pavel Pecina, Konstantin Pentchev, Deyan Peychev, Natalia Pletneva, Martin Popel, Diana Pottecher, Angus Roberts, Rudolf Rosa, Patrick Ruch, Alexander Sachs, Matthias Samwald, Priscille Schneller, Veronika Stefanov, Aleš Tamchyna, Miguel Angel Tinte, Zdeňka Urešová, Alejandro Vargas, Dina Vishnyakova (2013): Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In: Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, pp. 31-34, Microsoft Research, Cambridge, UK (url, biblio, batt1.pdf, obd, bibtex)
  25. Ondřej Bojar, Rudolf Rosa, Aleš Tamchyna (2013): Chimera – Three Heads for English-to-Czech Translation. In: Proceedings of the Eight Workshop on Statistical Machine Translation, pp. 92-98, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-57-2 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  26. Rudolf Rosa (2013): Automatic post-editing of phrase-based machine translation outputs (masters thesis). Charles University in Prague, Faculty of Mathematics and Physics, Praha, Czechia (biblio, batt1.pdf, batt2.pdf, batt3.pdf, batt4.pdf, bibtex)
  27. Rudolf Rosa, David Mareček, Aleš Tamchyna (2013): Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp. 172-179, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-53-4 (url, biblio, batt1.pdf, batt2.pdf, batt3.pdf, obd, bibtex)
  28. Aleš Tamchyna, Ondřej Dušek, Rudolf Rosa, Pavel Pecina (2013): MTMonkey: A Scalable Infrastructure for a Machine Translation Web Service. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 100, pp. 31-40 (pdf, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  29. Rudolf Rosa, Ondřej Dušek, David Mareček, Martin Popel (2012): Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors. In: Proceedings of Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6), ACL, pp. 39-48, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-38-1 (pdf, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  30. Rudolf Rosa, David Mareček (2012): Dependency Relations Labeller for Czech. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 256-263, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  31. Rudolf Rosa, David Mareček, Ondřej Dušek (2012): DEPFIX: A System for Automatic Correction of Czech MT Outputs. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 362-368, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, biblio, batt1.html, batt2.pdf, batt3.pdf, obd, bibtex)
  32. Ondřej Hálek, Rudolf Rosa, Aleš Tamchyna, Ondřej Bojar (2011): Named Entities from Wikipedia for Machine Translation. In: Information Technologies – Applications and Theory, pp. 23-30, Univerzita Pavla Jozefa Šafárika v Košiciach, Košice, Slovakia, ISBN 978-80-89557-02-8 (biblio, batt1.pdf, batt2.pdf, batt3.pdf, obd, bibtex)
  33. David Mareček, Rudolf Rosa, Petra Galuščáková, Ondřej Bojar (2011): Two-step translation with grammatical post-processing. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 426-432, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)