David Mareček - Publications

  1. Michael Hanna, Roberto Zamparelli, David Mareček (2023): The Functional Relevance of Probed Information: A Case Study. In: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pp. 835-848, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-44-9 (url, bibtex)
  2. Bar Iluz, Tomasz Limisiewicz, Gabriel Stanovsky, David Mareček (2023): Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation. In: Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 885-896, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 979-8-89176-014-1 (pdf, local PDF, bibtex)
  3. Tomasz Limisiewicz, Jiří Balhar, David Mareček (2023): Tokenization Impacts Multilingual Language Modeling: Assessing Vocabulary Allocation and Overlap Across Languages. In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 5661-5681, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-62-3 (url, bibtex)
  4. Tomasz Limisiewicz, David Mareček (2022): Don’t Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information. In: Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pp. 17-29, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-68-1 (pdf, bibtex)
  5. Rudolf Rosa, Patrícia Schmidtová, Ondřej Dušek, Tomáš Musil, David Mareček, Saad Obaid ul Islam, Marie Nováková, Klára Vosecká, Josef Doležal (2022): GPT-2-based Human-in-the-loop Theatre Play Script Generation. In: Proceedings of the 4th Workshop of Narrative Understanding, pp. 29-37, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-85-8 (url, local PDF, bibtex)
  6. Rudolf Rosa, Patrícia Schmidtová, Alisa Zakhtarenko, Ondřej Dušek, Tomáš Musil, David Mareček, Saad Obaid ul Islam, Marie Nováková, Klára Vosecká, Daniel Hrbek, David Košťák (2022): THEaiTRobot: An Interactive Tool for Generating Theatre Play Scripts. In: Proceedings of the 15th International Conference on Natural Language Generation: System Demonstrations, pp. 10-13, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-60-5 (url, local PDF, bibtex)
  7. Patrícia Schmidtová, Rudolf Rosa, David Košťák, Tomáš Studeník, Daniel Hrbek, Tomáš Musil, Josef Doležal, Ondřej Dušek, David Mareček, Klára Vosecká, Marie Nováková, Petr Žabka, Alisa Zakhtarenko, Dominik Jurko, Martina Kinská, Tom Kocmi, Ondřej Bojar (2022): THEaiTRE: Generating Theatre Play Scripts using Artificial Intelligence. In: , ISBN 978-80-88132-14-1 (url, bibtex)
  8. 2.0 THEaiTRobot, Josef Doležal, Klára Vosecká, Tomáš Musil, David Mareček, Rudolf Rosa (2022): Permeation (technical report). In: (pdf, bibtex)
  9. Michael Hanna, David Mareček (2021): Analyzing BERT’s Knowledge of Hypernymy via Prompting. In: Proceedings of the 4th Workshop on Analyzing and Interpreting Neural Networks for NLP, pp. 275-282, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-06-3 (pdf, bibtex)
  10. Daniel Hrbek, 1.0 THEaiTRobot, Tomáš Studeník, David Košťák, Martina Kinská, Rudolf Rosa, Ondřej Dušek, Tom Kocmi, David Mareček, Tomáš Musil, Patrícia Schmidtová, Dominik Jurko, Ondřej Bojar, Klára Vosecká, Josef Doležal, Marie Nováková, Petr Žabka (2021): AI: Když robot píše hru (online premiéra divadelní hry) (Electronic). (url)
  11. Tomasz Limisiewicz, David Mareček (2021): Examining Cross-lingual Contextual Embeddings with Orthogonal Structural Probes. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4589-4598, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-955917-09-4 (pdf, bibtex)
  12. Tomasz Limisiewicz, David Mareček (2021): Introducing Orthogonal Constraint in Structural Probes. In: Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 428-442, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-954085-52-7 (pdf, bibtex)
  13. Rudolf Rosa, Tomáš Musil, Ondřej Dušek, Dominik Jurko, Patrícia Schmidtová, David Mareček, Ondřej Bojar, Tom Kocmi, Daniel Hrbek, David Košťák, Martina Kinská, Marie Nováková, Josef Doležal, Klára Vosecká, Tomáš Studeník, Petr Žabka (2021): When a Robot Writes a Play: Automatically Generating a Theatre Play Script. In: Proceedings of the ALIFE 2021: The 2021 Conference on Artificial Life, pp. 565-567, MIT Press, Cambridge, MA, USA (url, local PDF, local PDF, local ZIP, bibtex)
  14. Rudolf Rosa, Tomáš Musil, Ondřej Dušek, Dominik Jurko, Patrícia Schmidtová, David Mareček, Ondřej Bojar, Tom Kocmi, Daniel Hrbek, David Košťák, Martina Kinská, Marie Nováková, Josef Doležal, Klára Vosecká, Tomáš Studeník, Petr Žabka (2021): THEaiTRE 1.0: Interactive Generation of Theatre Play Scripts. In: Proceedings of the Text2Story’21 Workshop, pp. 71-76, RWTH Aachen University, Aachen, Germany (pdf, local PDF, local ZIP, local PDF, bibtex)
  15. Micha Theo Neri de Rijk, David Mareček (2020): Using Word Embeddings and Collocations for Modelling Word Associations. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 114, pp. 35-57 (pdf, bibtex)
  16. Tomasz Limisiewicz, David Mareček (2020): Syntax Representation in Word Embeddings and Neural Networks – A Survey. In: Proceedings of the 20th Conference Information Technologies - Applications and Theory (ITAT 2020), pp. 38-48, Tomáš Horváth, Košice, Slovakia (pdf, bibtex)
  17. Tomasz Limisiewicz, Rudolf Rosa, David Mareček (2020): Universal Dependencies according to BERT: both more specific and more general. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 2710-2722, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-952148-90-3 (url, bibtex)
  18. David Mareček, Hande Celikkanat, Miikka Silfverberg, Vinit Ravishankar, Jörg Tiedemann (2020): Are Multilingual Neural Machine Translation Models Better at Capturing Linguistic Features?. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 115, pp. 143-162 (pdf, bibtex)
  19. David Mareček, Jindřich Libovický, Tomáš Musil, Rudolf Rosa, Tomasz Limisiewicz (2020): Hidden in the Layers: Interpretation of Neural Networks for Natural Language Processing. In: , ISBN 978-80-88132-10-3 (url, bibtex)
  20. Rudolf Rosa, Ondřej Dušek, Tom Kocmi, David Mareček, Tomáš Musil, Patrícia Schmidtová, Dominik Jurko, Ondřej Bojar, Daniel Hrbek, David Košťák, Martina Kinská, Josef Doležal, Klára Vosecká (2020): THEaiTRE: Artificial Intelligence to Write a Theatre Play. In: Proceedings of AI4Narratives — Workshop on Artificial Intelligence for Narratives, pp. 9-13, RWTH Aachen University, Aachen, Germany (pdf, local PDF, local ZIP, local PDF, bibtex)
  21. Rudolf Rosa, Tomáš Musil, David Mareček (2020): Measuring Memorization Effect in Word-Level Neural Networks Probing. In: 23rd International Conference on Text, Speech and Dialogue, pp. 180-188, Springer, Cham, Switzerland, ISBN 978-3-030-58322-4 (url, local PDF, bibtex)
  22. David Mareček, Rudolf Rosa (2019): From Balustrades to Pierre Vinken: Looking for Syntax in Transformer Self-Attentions. In: The BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL 2019, pp. 263-275, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-30-7 (url, local PDF, local PDF, bibtex)
  23. Tomáš Musil, Jonáš Vidra, David Mareček (2019): Derivational Morphological Relations in Word Embeddings. In: The BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL 2019, pp. 173-180, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-950737-30-7 (url, bibtex)
  24. Jindřich Libovický, Jindřich Helcl, David Mareček (2018): Input Combination Strategies for Multi-Source Transformer Decoder. In: Proceedings of the Third Conference on Machine Translation, Volume 1: Research Papers, pp. 253-260, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-81-0 (url, local PDF, local PDF, bibtex)
  25. David Mareček, Rudolf Rosa (2018): Extracting Syntactic Trees from Transformer Encoder Self-Attentions. In: Proceedings of the First Workshop on Analyzing and Interpreting Neural Networks for NLP, pp. 347-349, The Assotiation of Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-71-1 (url, local PDF, local PDF, bibtex)
  26. Rudolf Rosa, David Mareček (2018): CUNI x-ling: Parsing under-resourced languages in CoNLL 2018 UD Shared Task. In: Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 187-196, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-948087-82-7 (pdf, local PDF, local PDF, bibtex)
  27. Ondřej Bojar, Tom Kocmi, David Mareček, Roman Sudarikov, Dušan Variš (2017): CUNI Submission in WMT17: Chimera Goes Neural. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 248-256, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (url, bibtex)
  28. David Mareček, Ondřej Bojar, Ondřej Hübsch, Rudolf Rosa, Dušan Variš (2017): CUNI Experiments for WMT17 Metrics Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 604-611, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (url, bibtex)
  29. Bedřich Pišl, David Mareček (2017): Communication with Robots using Multilayer Recurrent Networks. In: Proceedings of the First Workshop on Language Grounding for Robotics, pp. 44-48, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-64-7 (pdf, bibtex)
  30. Rudolf Rosa, Daniel Zeman, David Mareček, Zdeněk Žabokrtský (2017): Slavic Forest, Norwegian Wood. In: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial4), pp. 210-219, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-43-2 (pdf, local PDF, local PDF, bibtex)
  31. David Mareček (2016): Delexicalized and Minimally Supervised Parsing on Universal Dependencies. In: Statistical Language and Speech Processing, pp. 30-42, Springer International Publishing, Cham, Switzerland, ISBN 978-3-319-45924-0 (local PDF, bibtex)
  32. David Mareček (2016): Twelve Years of Unsupervised Dependency Parsing. In: Proceedings of the 16th ITAT: Slovenskočeský NLP workshop (SloNLP 2016), pp. 56-62, CreateSpace Independent Publishing Platform, Bratislava, Slovakia, ISBN 978-1537016740 (pdf, local PDF, bibtex)
  33. David Mareček (2016): Merged bilingual trees based on Universal Dependencies in Machine Translation. In: Proceedings of the First Conference on Machine Translation (WMT). Volume 2: Shared Task Papers, pp. 333-338, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-10-4 (pdf, local PDF, local PDF, bibtex)
  34. David Mareček, Zdeněk Žabokrtský (2016): Gibbs Sampling Segmentation of Parallel Dependency Trees for Tree-Based Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 105, pp. 101-110 (pdf, local PDF, bibtex)
  35. Rudolf Rosa, Martin Popel, Ondřej Bojar, David Mareček, Ondřej Dušek (2016): Moses & Treex Hybrid MT Systems Bestiary. In: Proceedings of the 2nd Deep Machine Translation Workshop, pp. 1-10, ÚFAL MFF UK, Praha, Czechia, ISBN 978-80-88132-02-8 (url, local PDF, local PDF, bibtex)
  36. Zhiwei Yu, David Mareček, Zdeněk Žabokrtský, Daniel Zeman (2016): If You Even Don't Have a Bit of Bible: Learning Delexicalized POS Taggers. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), pp. 96-103, European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1 (url, local PDF, bibtex)
  37. Daniel Zeman, David Mareček, Zhiwei Yu, Zdeněk Žabokrtský (2016): Planting Trees in the Desert: Delexicalized Tagging and Parsing Combined. In: Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, pp. 199-207, Kyung Hee University, Seoul, Korea, ISBN 978-89-6817-428-5 (pdf, local PDF, local PDF, bibtex)
  38. David Mareček (2015): Multilingual Unsupervised Dependency Parsing with Unsupervised POS tags. In: MICAI 2015: Advances in Artificial Intelligence and Soft Computing, Part I, pp. 72-82, Springer, Berlin / Heidelberg, ISBN 978-3-319-27059-3 (bibtex)
  39. David Mareček, Zdeněk Žabokrtský (2014): Dealing with Function Words in Unsupervised Dependency Parsing. In: 15th International Conference on Computational Linguistics and Intelligent Text Processing, pp. 250-261, Springer, Berlin / Heidelberg, ISBN 978-3-642-54905-2 (local PDF, bibtex)
  40. Pavel Pecina, Ondřej Dušek, Lorraine Goeuriot, Jan Hajič, Jaroslava Hlaváčová, Gareth J.F. Jones, Liadh Kelly, Johannes Leveling, David Mareček, Michal Novák, Martin Popel, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová (2014): Adaptation of machine translation for multilingual information retrieval in medical domain. In: Artificial Intelligence in Medicine, ISSN 0933-3657, vol. 61, no. 3, pp. 165-185 (url, bibtex)
  41. Loganathan Ramasamy, David Mareček, Zdeněk Žabokrtský (2014): Multilingual Dependency Parsing: Using Machine Translated Texts instead of Parallel Corpora. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 102, pp. 93-104 (pdf, bibtex)
  42. Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman, Zdeněk Žabokrtský (2014): HamleDT 2.0: Thirty Dependency Treebanks Stanfordized. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), pp. 2334-2341, European Language Resources Association, Reykjavík, Iceland, ISBN 978-2-9517408-8-4 (pdf, local PDF, local PDF, bibtex)
  43. Daniel Zeman, Ondřej Dušek, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič (2014): HamleDT: Harmonized Multi-Language Dependency Treebank. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 48, no. 4, pp. 601-637 (url, local PDF, bibtex)
  44. Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, Khalid Choukri, Sarah Cruchet, Hamish Cunningham, Jan Dědek, Ljiljana Dolamic, René Donner, Ondřej Dušek, Sebastian Dungs, Ivan Eggel, Antonio Foncubierta, Norbert Fuhr, Adam Funk, Alba García Seco de Herrera, Arnaud Gaudinat, Georgi Georgiev, Julien Gobeill, Lorraine Goeuriot, Paz Gomez, Mark A. Greenwood, Manfred Gschwandtner, Allan Hanbury, Jan Hajič, Jaroslava Hlaváčová, Markus Holzer, Gareth J.F. Jones, Blanca Jordán, Matthias Jordan, Klemens Kaderk, Franz Kainberger, Liadh Kelly, Sascha Kriewel, Marlene Kritz, Georg Langs, Nolan Lawson, Johannes Leveling, David Mareček, Dimitrios Markonis, Iván Martínez, Vassil Momtchev, Alexandre Masselot, Hélène Mazo, Henning Müller, Michal Novák, Johann Petrak, João Palotti, Pavel Pecina, Konstantin Pentchev, Deyan Peychev, Natalia Pletneva, Martin Popel, Diana Pottecher, Angus Roberts, Rudolf Rosa, Patrick Ruch, Alexander Sachs, Matthias Samwald, Priscille Schneller, Veronika Stefanov, Aleš Tamchyna, Miguel Angel Tinte, Zdeňka Urešová, Alejandro Vargas, Dina Vishnyakova (2013): Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In: Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, pp. 31-34, Microsoft Research, Cambridge, UK (url, local PDF, bibtex)
  45. David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský, Jan Hajič (2013): Cross-language Study on Influence of Coordination Style on Dependency Parsing Performance (technical report). In: (pdf, local PDF, bibtex)
  46. David Mareček, Milan Straka (2013): Stop-probability estimates computed on a large corpus improve Unsupervised Dependency Parsing. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 281-290, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-50-3 (pdf, local PDF, bibtex)
  47. Martin Popel, David Mareček, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský (2013): Coordination Structures in Dependency Treebanks. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 517-527, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-50-3 (pdf, local PDF, local PDF, local PDF, bibtex)
  48. Rudolf Rosa, David Mareček, Aleš Tamchyna (2013): Deepfix: Statistical Post-editing of Statistical Machine Translation Using Deep Syntactic Analysis. In: 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp. 172-179, Association for Computational Linguistics, Sofija, Bulgaria, ISBN 978-1-937284-53-4 (url, local PDF, local PDF, local PDF, bibtex)
  49. Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, Aleš Tamchyna (2012): The Joy of Parallelism with CzEng 1.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3921-3928, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, bibtex)
  50. Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, David Mareček (2012): Formemes in English-Czech Deep Syntactic MT. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 267-274, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, local PDF, bibtex)
  51. David Mareček (2012): Unsupervised Dependency Parsing (PhD thesis). In: (local PDF, bibtex)
  52. David Mareček, Zdeněk Žabokrtský (2012): Exploiting Reducibility in Unsupervised Dependency Parsing. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 297-307, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-937284-43-5 (bibtex)
  53. David Mareček, Zdeněk Žabokrtský (2012): Unsupervised Dependency Parsing using Reducibility and Fertility features. In: The NAACL-HLT Workshop on the Induction of Linguistic Structure, pp. 84-89, The Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (bibtex)
  54. Rudolf Rosa, Ondřej Dušek, David Mareček, Martin Popel (2012): Using Parallel Features in Parsing of Machine-Translated Sentences for Correction of Grammatical Errors. In: Proceedings of Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6), ACL, pp. 39-48, Association for Computational Linguistics, Jeju, Korea, ISBN 978-1-937284-38-1 (pdf, local PDF, local PDF, bibtex)
  55. Rudolf Rosa, David Mareček (2012): Dependency Relations Labeller for Czech. In: Text, Speech and Dialogue: 15th International Conference, TSD 2012. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 7499, pp. 256-263, Springer Verlag, Berlin / Heidelberg, ISBN 978-3-642-32789-6 (url, local PDF, local PDF, bibtex)
  56. Rudolf Rosa, David Mareček, Ondřej Dušek (2012): DEPFIX: A System for Automatic Correction of Czech MT Outputs. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 362-368, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, local PDF, local HTML, local PDF, bibtex)
  57. Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič (2012): HamleDT: To Parse or Not to Parse?. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2735-2741, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, local PDF, local PDF, bibtex)
  58. David Mareček (2011): Combining Diverse Word-Alignment Symmetrizations Improves Dependency Tree Projection. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6608, pp. 144-154 (url, bibtex)
  59. David Mareček, Rudolf Rosa, Petra Galuščáková, Ondřej Bojar (2011): Two-step translation with grammatical post-processing. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 426-432, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, local PDF, local PDF, bibtex)
  60. David Mareček, Zdeněk Žabokrtský (2011): Gibbs Sampling with Treeness constraint in Unsupervised Dependency Parsing. In: Robust Unsupervised and Semisupervised Methods in Natural Language Processing, pp. 1-8, Incoma, Šumen, Bulgaria, ISBN 978-954-452-017-5 (bibtex)
  61. David Mareček, Zdeněk Žabokrtský (2011): Unsupervised Dependency Parsing (technical report). In: (pdf, bibtex)
  62. Martin Popel, David Mareček, Nathan David Green, Zdeněk Žabokrtský (2011): Influence of Parser Choice on Dependency-Based MT. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 433-439, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (bibtex)
  63. Ondřej Bojar, Kamil Kos, David Mareček (2010): Tackling Sparse Data Issue in Machine Translation Evaluation. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 86-91, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-69-5 (url, bibtex)
  64. Natalia Klyueva, David Mareček (2010): Towards Parallel Czech-Russian Dependency Treebank. In: Workshop on Annotation and Exploitation of Parallel Corpora, NEALT Proceedings Series, ISSN 1736-6305, 10, pp. 44-52, Northern European Association for Language Technology, Tartu, Estonia (local PDF, local PDF, bibtex)
  65. David Mareček, Martin Popel, Zdeněk Žabokrtský (2010): Maximum Entropy Translation Model in Dependency-Based MT Framework. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 201-201, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (pdf, bibtex)
  66. Martin Popel, David Mareček (2010): Perplexity of n-gram and Dependency Language Models. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 173-180, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (local PDF, local PDF, bibtex)
  67. Ondřej Bojar, David Mareček, Václav Novák, Martin Popel, Jan Ptáček, Jan Rouš, Zdeněk Žabokrtský (2009): English-Czech MT in 2008. In: Proceedings of the Fourth Workshop on Statistical Machine Translation, pp. 125-129, Association for Computational Linguistics, Athina, Greece (pdf, local PDF, bibtex)
  68. David Mareček (2009): Improving Word Alignment Using Alignment of Deep Structures. In: Proceedings of the 12th International Conference, TSD 2009, pp. 56-63, Springer, Berlin / Heidelberg, ISBN 978-3-642-04207-2 (pdf, bibtex)
  69. David Mareček (2009): Using Tectogrammatical Alignment in Phrase‐Based Machine Translation. In: WDS'09 Proceedings of Contributed Papers, pp. 22-27, Matfyzpress, Charles University, Praha, Czechia, ISBN 978-80-7378-101-9 (pdf, bibtex)
  70. David Mareček, Natalia Klyueva (2009): Converting Russian Treebank SynTagRus into Praguian PDT Style. In: Multilingual resources, technologies and evaluation for Central and Eastern European languages, pp. 30-35, INCOMA Ltd., Shoumen, Bulgaria, ISBN 978-954-452-008-3 (pdf, bibtex)
  71. David Mareček (2008): Automatic Alignment of Tectogrammatical Trees from Czech-English Parallel Corpus (masters thesis). In: (local PDF, bibtex)
  72. David Mareček, Zdeněk Žabokrtský, Václav Novák (2008): Automatic Alignment of Czech and English Deep Syntactic Dependency Trees. In: Proceedings of the Twelfth EAMT Conference, pp. 102-111, HITEC e.V., Hamburg, Germany, ISBN 978-3-00-025770-4 (pdf, local PDF, bibtex)