Michal Novák

office
425
email
mnovak@ufal.mff.cuni.cz
phone
+420 951 554 366
fax
+420 257 223 293
address
Malostranské náměstí 25
118 00 Praha 1
Czech Republic

Main Research Interests

  • coreference / anaphora resolution
  • machine translation
  • machine learning

Projects

Current

  • GAUK 3389/2015 - Cross-lingual approaches to coreference resolution
  • GAČR 16-05394S - Structure of coreferential chains in parallel language data
  • NAKI II DG16P02B016 - Automatic Evaluation of Text Coherence in Czech

Former

  • EuroMatrix+
  • GAUK 4226/2011 – Utilization of coreference in machine translation
  • Khresmoi – Medical information retrieval (working on Machine Translation)
  • QTLeap - Quality Translation by Deep Language Engineering Approaches

Curriculum Vitae

  • 2010 Mgr. (Master's degree) in Computational Linguistics, Faculty of Mathematics and Physics, Charles University in Prague.
  • 2008 Bc. (Bachelor's degree) in Computer Science, Faculty of Mathematics and Physics, Charles University in Prague.

Selected Bibliography

  1. Michal Novák, Anna Nedoluzhko, Zdeněk Žabokrtský (2017): Projection-based Coreference Resolution Using Deep Syntax. In: Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes (CORBON 2017), pp. 56-64, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-945626-46-3 (pdf, biblio, bibtex)
  2. Ondřej Bojar, Ondřej Dušek, Tom Kocmi, Jindřich Libovický, Michal Novák, Martin Popel, Roman Sudarikov, Dušan Variš (2016): CzEng 1.6: Enlarged Czech-English Parallel Corpus with Processing Tools Dockered. In: Text, Speech, and Dialogue: 19th International Conference, TSD 2016, Lecture Notes in Computer Science, ISSN 0302-9743, 9924, pp. 231-238, Springer International Publishing, Cham / Heidelberg / New York / Dordrecht / London, ISBN 978-3-319-45509-9 (url, biblio, obd, bibtex)
  3. Anna Nedoluzhko, Michal Novák, Silvie Cinková, Marie Mikulová, Jiří Mírovský (2016): Coreference in Prague Czech-English Dependency Treebank. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 169-176, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, biblio, batt1.pdf, obd, bibtex)
  4. Anna Nedoluzhko, Anna Schwarz (Khoroshkina), Michal Novák (2016): Possessives in Parallel English‑Czech-Russian Texts. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, 15, pp. 483-497 (pdf, biblio, batt1.pdf, obd, bibtex)
  5. Michal Novák (2016): Pronoun Prediction with Linguistic Features and Example Weighing. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 602-608, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, biblio, obd, bibtex)
  6. Rudolf Rosa, Roman Sudarikov, Michal Novák, Martin Popel, Ondřej Bojar (2016): Dictionary-based Domain Adaptation of MT Systems without Retraining. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 449-455, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, biblio, obd, bibtex)
  7. Ondřej Dušek, Luís Gomes, Michal Novák, Martin Popel, Rudolf Rosa (2015): New Language Pairs in TectoMT. In: Proceedings of the 10th Workshop on Machine Translation, pp. 98-104, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-941643-32-7 (pdf, biblio, batt1.pdf, obd, bibtex)
  8. Anna Nedoluzhko, Svetlana Toldova, Michal Novák (2015): Coreference chains in Czech, English and Russian: Preliminary findings. In: Computational Linguistics and Intellectual Technologies, ISSN 2221-7932, vol. 14, no. 21, pp. 474-486 (pdf, biblio, obd, bibtex)
  9. Michal Novák, Anna Nedoluzhko (2015): Correspondences between Czech and English Coreferential Expressions. In: Discours: Revue de linguistique, psycholinguistique et informatique., ISSN 1963-1723, 16, pp. 1-41 (url, biblio, obd, bibtex)
  10. Michal Novák, Dieke Oele, Gertjan van Noord (2015): Comparison of Coreference Resolvers for Deep Syntax Translation. In: Proceedings of the Second Workshop on Discourse in Machine Translation, pp. 17-23, Association for Computational Linguistics, Lisboa, Portugal, ISBN 978-1-941643-32-7 (url, biblio, obd, bibtex)
  11. Rudolf Rosa, Ondřej Dušek, Michal Novák, Martin Popel (2015): Translation Model Interpolation for Domain Adaptation in TectoMT. In: Proceedings of the 1st Deep Machine Translation Workshop, pp. 89-96, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-904571-7-1 (url, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  12. Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, Daniel Zeman (2014): Machine Translation of Medical Texts in the Khresmoi Project. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 221-228, Association for Computational Linguistics, Baltimore, MD, USA, ISBN 978-1-941643-17-4 (pdf, biblio, batt1.pdf, batt2.pdf, obd, bibtex)
  13. Michal Novák, Zdeněk Žabokrtský (2014): Cross-lingual Coreference Resolution of Pronouns. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pp. 14-24, Dublin City University and Association for Computational Linguistics, Dublin, Ireland, ISBN 978-1-941643-26-6 (pdf, biblio, obd, bibtex)
  14. Pavel Pecina, Ondřej Dušek, Lorraine Goeuriot, Jan Hajič, Jaroslava Hlaváčová, Gareth J.F. Jones, Liadh Kelly, Johannes Leveling, David Mareček, Michal Novák, Martin Popel, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová (2014): Adaptation of machine translation for multilingual information retrieval in medical domain. In: Artificial Intelligence in Medicine, ISSN 0933-3657, vol. 61, no. 3, pp. 165-185 (url, biblio, obd, bibtex)
  15. Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, Khalid Choukri, Sarah Cruchet, Hamish Cunningham, Jan Dědek, Ljiljana Dolamic, René Donner, Ondřej Dušek, Sebastian Dungs, Ivan Eggel, Antonio Foncubierta, Norbert Fuhr, Adam Funk, Alba García Seco de Herrera, Arnaud Gaudinat, Georgi Georgiev, Julien Gobeill, Lorraine Goeuriot, Paz Gomez, Mark A. Greenwood, Manfred Gschwandtner, Allan Hanbury, Jan Hajič, Jaroslava Hlaváčová, Markus Holzer, Gareth J.F. Jones, Blanca Jordán, Matthias Jordan, Klemens Kaderk, Franz Kainberger, Liadh Kelly, Sascha Kriewel, Marlene Kritz, Georg Langs, Nolan Lawson, Johannes Leveling, David Mareček, Dimitrios Markonis, Iván Martínez, Vassil Momtchev, Alexandre Masselot, Hélène Mazo, Henning Müller, Michal Novák, Johann Petrak, João Palotti, Pavel Pecina, Konstantin Pentchev, Deyan Peychev, Natalia Pletneva, Martin Popel, Diana Pottecher, Angus Roberts, Rudolf Rosa, Patrick Ruch, Alexander Sachs, Matthias Samwald, Priscille Schneller, Veronika Stefanov, Aleš Tamchyna, Miguel Angel Tinte, Zdeňka Urešová, Alejandro Vargas, Dina Vishnyakova (2013): Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In: Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, pp. 31-34, Microsoft Research, Cambridge, UK (url, biblio, batt1.pdf, obd, bibtex)
  16. Anna Nedoluzhko, Jiří Mírovský, Michal Novák (2013): A Coreferentially annotated Corpus and Anaphora Resolution for Czech. In: Computational Linguistics and Intellectual Technologies, pp. 467-475, ABBYY, Moskva, Russia, ISBN 978-1-937284-58-9 (biblio, batt1.pdf, obd, bibtex)
  17. Michal Novák, Anna Nedoluzhko, Zdeněk Žabokrtský (2013): Translation of "It" in a Deep Syntax Framework. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Workshop on Discourse in Machine Translation, pp. 51-59, Omnipress, Inc., Sofija, Bulgaria, ISBN 978-1-937284-68-8 (pdf, biblio, obd, bibtex)
  18. Michal Novák, Zdeněk Žabokrtský, Anna Nedoluzhko (2013): Two Case Studies on Translating Pronouns in a Deep Syntax Framework. In: Proceedings of the 6th International Joint Conference on Natural Language Processing, pp. 1037-1041, Asian Federation of Natural Language Processing, Nagoya, Japan, ISBN 978-4-9907348-0-0 (pdf, biblio, obd, bibtex)
  19. Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, Aleš Tamchyna (2012): The Joy of Parallelism with CzEng 1.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3921-3928, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, biblio, batt1.pdf, obd, bibtex)
  20. Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, David Mareček (2012): Formemes in English-Czech Deep Syntactic MT. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 267-274, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, biblio, batt1.pdf, obd, bibtex)
  21. Kateřina Veselovská, Giang Linh Nguy, Michal Novák (2012): Using Czech-English Parallel Corpora in Automatic Identification of It. In: The Fifth Workshop on Building and Using Comparable Corpora, pp. 112-120, European Language Resources Association, İstanbul, Turkey (biblio, batt1.pdf, obd, bibtex)
  22. Giang Linh Nguy, Michal Novák, Anna Nedoluzhko (2011): Coreference Resolution in the Prague Dependency Treebank (technical report). In: , pp. 1-66 (pdf, biblio, bibtex)
  23. Michal Novák (2011): Utilization of Anaphora in Machine Translation. In: WDS'11 Proceedings of Contributed Papers, Part I, pp. 155-160, Matfyzpress, Praha, Czechia, ISBN 978-80-7378-184-2 (pdf, biblio, obd, bibtex)
  24. Michal Novák, Zdeněk Žabokrtský (2011): Resolving Noun Phrase Coreference in Czech. In: Lecture Notes in Computer Science, ISSN 0302-9743, 7099, pp. 24-34 (url, biblio, obd, bibtex)
  25. Michal Novák (2010): Machine Learning Approach to Anaphora Resolution (masters thesis). MFF UK, Prague, Czech Republic (pdf, biblio, bibtex)
  26. Hana Klempová, Michal Novák, Peter Fabian, Jan Ehrenberger, Ondřej Bojar (2009): Získávání paralelních textů z webu. In: Informačné Technológie – Aplikácie a Teória. Zborník príspevkov, ITAT 2009, pp. 47-54, PONT s.r.o., Seňa, Slovakia, ISBN 978-80-970179-1-0 (biblio, batt1.pdf, bibtex)