Martin Popel

office hours
+420 221 914 289
+420 257 223 293
Malostranské náměstí 25
118 00 Praha 1
Czech Republic

Main Research Interests

dependency-based MT, machine learning, parsing, language modeling, MT evaluation, treebanking


Grants: QTLeap, Manyla, Khresmoi, EuroMatrixPlus

Technical editor: The Prague Bulletin of Mathematical Linguistics and ÚFAL technical reports

Projects / Software / Data

Curriculum Vitae


List of classes
NPFL110 Modern Methods in Computational Linguistics II


Selected Bibliography


  • Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman, Zdeněk Žabokrtský: HamleDT 2.0: Thirty Dependency Treebanks Stanfordized. In Proceedings of LREC 2014, Reykjavík, Iceland, May 2014, pp. 2334–2341. [pdf]
  • Aleš Tamchyna, Martin Popel, Rudolf Rosa, Ondřej Bojar: CUNI in WMT14: Chimera Still Awaits Bellerophon. In Proceedings of WMT 2014, Baltimore, MD, USA, June 2014, pp. 195–200. [pdf]
  • Daniel Zeman, Ondřej Dušek, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič: HamleDT: Harmonized Multi-Language Dependency Treebank. In: Language Resources and Evaluation, Vol. 2014, Copyright © Springer Netherlands, ISSN 1574-020X, pp. 1–40. [official pdf] [self-archived pdf]
  • Pavel Pecina et al.: Adaptation of machine translation for multilingual information retrieval in medical domain. In: Artificial Intelligence in Medicine, Vol. 61, No. 3, Copyright © Elsevier, ISSN 0933-3657, Jul 2014, pp. 165–185. [pdf]


  • Martin Popel, David Mareček, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský: Coordination Structures in Dependency Treebanks. In Proceedings of ACL 2013, Sofia, Bulgaria, August 5–7, 2013, pp. 517–527. [pdf] [poster]
  • Petra Galuščáková, Martin Popel, Ondřej Bojar: PhraseFix: Statistical Post-Editing of TectoMT. In Proceedings of WMT 2013, Sofia, Bulgaria, August 8–9, 2013, pp. 141–147. [pdf]
  • David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský, Jan Hajič: Cross-language Study on Influence of Coordination Style on Dependency Parsing Performance. ÚFAL Technical report, [pdf]
  • Niraj Aswani et al.: Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, Microsoft Research, Cambridge, UK, 2013, pp. 31–34. [pdf]


  • Rudolf Rosa, Ondřej Dušek, David Mareček, Martin Popel: Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors. In Proceedings of ACL SSST-6, Jeju, Korea, July 12, 2012, pp. 39–48. [pdf]
  • Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, and David Mareček: Formemes in English-Czech Deep Syntactic MT. In Proceedings of WMT 2012, Montréal, Canada, June 7–8, 2012, pp. 267–274. [pdf]
  • Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, and Aleš Tamchyna: The Joy of Parallelism with CzEng 1.0. In Proceedings of LREC 2012, Istanbul, Turkey, May 21–27, 2012, pp. 3921–3928. [pdf]
  • Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič: HamleDT: To Parse or Not to Parse? In Proceedings of LREC 2012, Istanbul, Turkey, May 21–27, 2012, pp. 2735–2741. [pdf]


  • Ondřej Bojar, Miloš Ercegovčević, Martin Popel and Omar Zaidan: A Grain of Salt for the WMT Manual Evaluation. In Proceedings of WMT 2011, EMNLP 6th Workshop on Statistical Machine Translation, Edinburgh, UK, July 30, 2011, pp. 1–11. [pdf] presentation [pdf]
  • Martin Popel, David Mareček, Nathan Green and Zdeněk Žabokrtský: Influence of Parser Choice on Dependency-Based MT. In Proceedings of WMT 2011, EMNLP 6th Workshop on Statistical Machine Translation, Edinburgh, UK, July 31, 2011, pp. 433–439. [pdf] poster [jpg]


  • Martin Popel, David Mareček: Perplexity of n-gram and Dependency Language Models. In Proceedings of TSD 2010, 13th International Conference on Text, Speech and Dialogue, Brno, Czechia, September 8, 2010, pp. 173–180. [pdf] presentation [pdf]
  • Martin Popel, Zdeněk Žabokrtský: TectoMT: Modular NLP Framework. In Proceedings of IceTAL, 7th International Conference on Natural Language Processing, Reykjavík, Iceland, August 17, 2010, pp. 293–304. [pdf] presentation [pdf]
  • David Mareček, Martin Popel, Zdeněk Žabokrtský: Maximum Entropy Translation Model in Dependency-Based MT Framework. In Proceedings of the Joint 5th Workshop on Statistical Machine Translation and MetricsMATR, Uppsala, Sweden, 15–16 July 2010, pp. 201–206. [pdf] poster [pdf]
  • Martin Popel: English-Czech Machine Translation Using TectoMT. In Proceedings of the 19th Annual Conference of Doctoral Students WDS 2010, Prague, Czech Republic, 1–4 June 2010, pp. 88–93. [pdf] presentation [pdf]


  • Martin Popel, Zdeněk Žabokrtský: Improving English-Czech Tectogrammatical MT. In The Prague Bulletin of Mathematical Linguistics No. 92, 2009, pp. 115–134. [pdf]
  • Zdeněk Žabokrtský, Martin Popel: Hidden Markov Tree Model in Dependency-based Machine Translation. In Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics, Singapore, 2009, pp. 145–148 [pdf] poster [ppt]
  • Ondřej Bojar, David Mareček, Václav Novák, Martin Popel, Jan Ptáček, Jan Rouš, Zdeněk Žabokrtský: English-Czech MT in 2008. In Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece. Association for Computational Linguistics, 2009 [pdf]

For citations see Google Scholar, for bibtex entries see Biblio.


  • 2010–2011 Master student Amir Kamran (Hybrid MT Approaches for Low-Resource Languages)
  • 2011–2012 Bachelor student Michal Koutný (Word prediction using language models)
  • 2011–2013 Bachelor student Ondřej Klejch (Tool for comparison and evaluation of machine translation)
  • 2012–2013 Bachelor student Michal Sedlák (Web Interface for the Treex Framework), see Treex::Web

Other talks