Marie Mikulová

office
Room 420
email
mikulova@ufal.mff.cuni.cz
phone
+420 221 914 361
address
Malostranské náměstí 25
118 00 Praha 1
Czech Republic

Main Research Interests

  • theoretical linguistics, computational linguistics, corpus linguistics
  • language description from morphology to (dependency) syntax and meaning
  • linguistic annotation of written and spoken texts
  • coordination of the Prague Dependency Treebank - Consolidated project (annotation, guidelines, publication)

Projects

Principal Investigator

Participation in Grants and Projects

  • LINDAT/CLARIAH-CZ Language Resources and Digital Arts and Humanities Research Infrastructure, (2016-)2023-2026, LM2023062

  • Computational Linguistics: Explicit description of language and annotated data focused on Czech, GAČR P406/2010/0875, 2010-2014, Supported by: Grant Agency of the Czech Republic.

  • LINDAT-Clarin: Large infrastructural grant for language resources, MŠMT LM2010013, 2010-2015, Supported by: Ministry of Education of the Czech Republic.

  • Center for Computational Linguistics, MŠMT LC536, 2005-2011, Supported by: Ministry of Education of the Czech Republic.

  • Využití materiálu z Pražského závislostního korpusu pro systémový popis syntaxe, GAUK 64307, 2007-2008, Supported by: Grant Agency of the Charles University in Prague.

  • Automatická hloubková analýza mluvené češtiny: od akustického signálu k významu, GAUK 375/2005, 2005-2006, Supported by: Grant Agency of the Charles University in Prague.

  • Pražský závislostní korpus: Analýza vybraných jevů z české funkční onomatologie a syntaxe, GAUK 352/2005, 2005-2006, Supported by: Grant Agency of the Charles University in Prague.

  • Centrum komputační lingvistiky (Center of Excellence), LN00A063, 2001-2004, Supported by: Ministry of Education of the Czech Republic.

Curriculum Vitae

Professional Experience

  • from 2012: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (Senior Research Associate)
  • 2001-2012: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (Research Assistant)
  • member of the European Language Resources Association (ELRA; since 2006), Societas Linguistica Europaea (since 2024), Jazykovědné sdružení ČR (since 2024)

Academic Service

  • Language Resources and Evaluation Conference (LREC): Scientific Committee member – since 2020
  • Conference on Computational Linguistics (COLING): Scientific Committee member – since 2025
  • Slovko - Natural Language Processing and Corpus Linguistics: Scientific Committee member – since 2025

Education

  • 2012: Ph.D. in Computational Linguistics, Faculty of Mathematics and Physics, Charles University
  • 2004: Master's degree, Faculty of Education, Charles University
  • 2003: Master's degree, Faculty of Arts, Charles University

Teaching

At the Faculty of Arts (Charles University):

Selected Bibliography

Books

  • Mikulová M., Panevová J.: Formy a funkce okolnostních určení v češtině. Určení prostorová a časová. Institute of Formal and Applied Linguistics, Charles University, Prague, Czech Republic, ISBN 978-80-88132-13-4, 200 pp., 2021. [pdf]
  • Panevová J., Hajičová E., Kettnerová V., Lopatková M., Mikulová M., Ševčíková M.: Mluvnice současné češtiny 2, Syntax na základě anotovaného korpusu. Karolinum, Prague, Czech Republic, ISBN 978-80-246-2497-6, 291 pp., 2014.
  • Mikulová M.: Významová reprezentace elipsy. Institute of Formal and Applied Linguistics, Charles University, Prague, Czech Republic, ISBN 978-80-904175-9-5, 230 pp., 2011. [pdf]

Book Chapters

  • Mikulová M.: Function-to-Form Principle in the Determination of Secondary Prepositions and Conjunctions (On the Case of Delimitation of Circumstantial Meanings). In: Synsémantické slovní druhy ve slovanských jazycích, Academia, Prague, Czech Republic, 2025.
  • Hajičová E., Mikulová M.: Information Structure in a Formal Description of Language as Reflected in an Annotated Corpus of Czech. In: Lifetime Linguistic Inspirations. To Igor Mel’čuk from Colleagues and Friends for his 90th Birthday, Peter Lang, Berlin, ISBN 978-3-631-89042-4, 187-200, 2022.
  • Panevová J., Mikulová M.: Synonymie a homonymie v gramatice. In: Človek a jeho jazyk 5. Povaha jazyka a jej poznávanie, Veda, Bratislava, Slovakia, ISBN 978-80-224-1977-2, 59-67, 2022.
  • Hajič J., Hajičová E., Mikulová M., Mírovský J.: Prague Dependency Treebank. In: Handbook of Linguistic Annotation, Springer Verlag, Berlin, Germany, ISBN 978-94-024-0879-9, 555-594, 2017.
  • Mikulová M., Hoffmannová J.: Korpusy mluvené češtiny a možnosti jejich využití pro poznání rozdílných "světů" mluvenosti a psanosti. In: Korpusová lingvistika. 2 Výzkum a výstavba korpusů, Lidové noviny, Prague, Czech Republic, ISBN 978-80-7422-115-6, 78-92, 2011.

Journal Articles

  • Hajičová E., Panevová J., Mikulová M., Hajič J.: Function Words in Praguian Functional Generative Description. Linguistic Analysis, 43 (3-4), 465-512, ISSN 0098-9053, 2024. [pdf]
  • Mikulová M.: Expressing Measure in Czech (Corpus-based Study). Jazykovedný časopis / Journal of Linguistics, 74 (1), 108-118, ISSN 0021-5597, 2023. [link]
  • Štěpánková B., Mikulová M.: Capturing Numerals and Pronouns at the Morphological Layer in the Prague Dependency Treebanks of Czech. Jazykovedný časopis / Journal of Linguistics, 72 (2), 454-464, ISSN 0021-5597, 2021. [link]
  • Panevová J., Mikulová M.: Subcategorization of Adverbials (The Case of Temporal Meanings). Korpus – gramatika – axiologie, 22, 16-30, ISSN 1804-137X, 2020. [link]
  • Panevová J., Hajičová E., Kettnerová V., Kolářová V., Lopatková M., Mikulová M., Ševčíková M.: Funkční generativní popis – rámec pro konzistentní popis gramatiky. Naše řeč, 103 (1-2), 55-78, ISSN 0027-8203, 2020. [link]
  • Mikulová M., Panevová J.: Subkategorizace adverbiálních významů (hranice mezi obsahem a významem). Korpus – gramatika – axiologie, 20, 33-46, ISSN 1804-137X, 2019. [link]
  • Hlaváčová J., Mikulová M., Štěpánková B., Hajič J.: Modifications of the Czech Morphological Dictionary for Consistent Corpus Annotation. Jazykovedný časopis / Journal of Linguistics, 70 (2), 380-389, ISSN 0021-5597, 2019. [link]
  • Mikulová M., Bejček E., Hajičová E., Panevová J.: Search for the Relation of Form and Function Using the ForFun Database. The Prague Bulletin of Mathematical Linguistics, 110, 71-84, ISSN 0032-6585, 2018. [pdf]
  • Mikulová M., Mírovský J., Nedoluzhko A., Pajas P., Štěpánek J., Hajič J.: PDTSC 2.0 - Spoken Corpus with Rich Multi-layer Structural Annotation. In: Lecture Notes in Computer Science, 10415, 129-137, 20th International Conference, TSD 2017 Prague, Czech Republic, Springer. Berlin / Heidelberg, ISBN 978-3-319-64205-5, ISSN 0302-9743, 2017. [link]
  • Mikulová M., Bejček E., Kolářová V., Panevová J.: Subcategorization of Adverbial Meanings Based On Corpus Data. Jazykovedný časopis / Journal of Linguistics, 68 (2), 268-277, ISSN 0021-5597, 2017. [link]
  • Kolářová V., Kolář J., Mikulová M.: Difference between Written and Spoken Czech: The Case of Verbal Nouns Denoting an Action. The Prague Bulletin of Mathematical Linguistics, 107, 19-38, ISSN 0032-6585, 2017. [pdf]
  • Hajič J., Hajičová E., Mikulová M., Mírovský J., Panevová J., Zeman D.: Deletions and Node Reconstructions in a Dependency-based Multilevel Annotation Scheme. In: Lecture Notes in Computer Science, 9041, 17-31, 16th International Conference on Computational Linguistics and Intelligent Text Processing, Springer, Berlin / Heidelberg, ISBN 978-3-319-18111-0, ISSN 0302-9743, 2015. [link]
  • Mikulová M., Štěpánek J., Urešová Z.: Liší se mluvené a psané texty ve valenci? Korpus – gramatika – axiologie, 8, 36-46, ISSN 1804-137X, 2013. [pdf]
  • Panevová J., Mikulová M.: Problém elipsy: Co s ním a kam s ním? Prace Filologiczne, 60, 225-232, ISSN 0138-0567, 2011. [link]
  • Panevová J., Mikulová M.: On Reciprocity. The Prague Bulletin of Mathematical Linguistics, 87, 27-40, ISSN 0032-6585, 2007. [pdf]

Conference and Workshop Papers

  • Mikulová M., Štěpánková B., Štěpánek J.: From Form to Meaning: The Case of Particles within the Prague Dependency Treebank Annotation Scheme. In: The 31st International Conference on Computational Linguistics, Proceedings of the Main Conference, Abu Dhabi, UAE, ICCL, 2163-2175, 2025. [link]
  • Mikulová M.: Fine-grained Classification of Circumstantial Meanings within the Prague Dependency Treebank Annotation Scheme. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), European Language Resources Association, Torino, Italy, ISBN 978-2-493814-10-4, ISSN 2522-2686, 7314-7323, 2024. [link]
  • Mikulová M., Straka M., Štěpánek J., Štěpánková B., Hajič J.: Quality and Efficiency of Manual Annotation: Pre-annotation Bias. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6, 2909-2918, 2022. [link]
  • Hajičová E., Mikulová M., Štěpánková B., Mírovský J.: Advantages of a complex multilayer annotation scheme: The case of the Prague Dependency Treebank. In: Proceedings of the 16th Linguistic Annotation Workshop (LAW-XVI) within LREC2022, European Language Resources Association, Marseille, France, ISBN 978-2-493814-08-1, 70-78, 2022. [link]
  • Štěpánková B., Mikulová M., Hajič J.: The MorfFlex Dictionary of Czech as a Source of Linguistic Data. In: Proceedings of XIX EURALEX Congress: Lexicography for Inclusion, Alexandroupolis, Greece, ISBN 978-618-85138-1-5, ISSN 2521-7100, 387-392, 2020. [link]
  • Hajič J., Bejček E., Hlaváčová J., Mikulová M., Straka M., Štěpánek J., Štěpánková B.: Prague Dependency Treebank - Consolidated 1.0. In: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), European Language Resources Association, Marseille, France, ISBN 979-10-95546-34-4, 5208-5218, 2020. [link]
  • Mikulová M., Kolářová V., Panevová J., Hajičová E.: Delimiting Adverbial Meanings. A corpus-based Comparative Study on Czech Spatial Prepositions and Their English Equivalents. In: Proceedings of the 5th International Conference on Dependency Linguistics (Depling, Syntaxfest 2019), Association for Computational Linguistics, Paris, France, ISBN 978-1-950737-63-5, 153-159, 2019. [link]
  • Mikulová M., Bejček E., Panevová J.: What Can We Find Out about Time and Space in the ForFun Database? In: Proceedings of the Second Workshop on Corpus-Based Research in the Humanities CRH-2, TU Wien, Wien, Austria, ISBN 978-3-901716-43-0, 133-142, 2018. [pdf]
  • Mikulová M., Bejček E.: ForFun 1.0: Prague Database of Forms and Functions -- An Invaluable Resource for Linguistic Research. In: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018), European Language Resources Association, Miyazaki, Japan, ISBN 979-10-95546-00-9, 2018. [link]
  • Bejček E., Hajičová E., Mikulová M., Panevová J.: The Relation of Form and Function in Linguistic Theory and in a Multi-layer Treebank. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, Charles University, Prague, Czech Republic, ISBN 978-80-88132-04-2, 56-63, 2017. [link]
  • Nedoluzhko A., Novák M., Cinková S., Mikulová M., Mírovský J.: Coreference in Prague Czech-English Dependency Treebank. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1, 169-176, 2016. [link]
  • Hajičová E., Mikulová M., Panevová J.: Reconstruction of Deletions in a Dependency-based Description of Czech: Selected Issues. In: Proceedings of the 3rd International Conference on Dependency Linguistics (Depling 2015), Uppsala University, Uppsala, Sweden, ISBN 978-91-637-8965-6, 131-140, 2015. [link]
  • Mikulová M.: Semantic Representation of Ellipsis in the Prague Dependency Treebanks. In: Proceedings of the 26th Conference on Computational Linguistics and Speech Processing (ROCLING XXVI 2014), Association for Computational Linguistics and Chinese Language Processing (ACLCLP), Taipei, Taiwan, ISBN 978-957-30792-7-9, 125-138, 2014. [link]
  • Hajič J., Hajičová E., Panevová J., Sgall P., Bojar O., Cinková S., Fučíková E., Mikulová M., Pajas P., Popelka J., Semecký J., Šindlerová J., Štěpánek J., Toman J., Urešová Z., Žabokrtský Z.: Announcing Prague Czech-English Dependency Treebank 2.0. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7, 3153-3160, 2012. [link]
  • Mikulová M., Štěpánek J.: Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7, 1836-1839, 2010. [link]
  • Mikulová M., Štěpánek J.: Annotation Quality Checking and Its Implications for Design of Treebank (in Building the Prague Czech-English Dependency Treebank). In: Proceedings of 8th Treebanks and Linguistic Theories Workshop (TLT), Milano, Italy, ISBN 978-88-8311-712-1, 137-148, 2009. [pdf]
  • Hajič J., Cinková S., Mikulová M., Pajas P., Ptáček J., Toman J., Urešová Z.: PDTSL: An Annotated Resource For Speech Reconstruction. In: Proceedings of the 2008 IEEE Workshop on Spoken Language Technology, IEEE, Goa, India, ISBN 978-1-4244-3472-5, 93-96, 2008. [pdf]

Technical Reports

  • Mikulová M., Hajič J., Hana J., Hanová H., Hlaváčová J., Jeřábek E., Štěpánková B., Vidová Hladká B., Zeman D.: Manual for Morphological Annotation. Revision for Prague Dependency Treebank – Consolidated 2020 release. Technical report TR-2020-64, 2020. [pdf]
  • Mikulová M.: Annotation on the Tectogrammatical Level. Additions to Annotation Manual (with respect to PDTSC and PCEDT). Technical report TR-2013-52, 2013. [pdf] [Czech]
  • Mikulová M., Bejček E., Mírovský J., Nedoluzhko A., Panevová J., Poláková L., Straňák P., Ševčíková M., Žabokrtský Z.: From PDT 2.0 to PDT 3.0 (Modifications and Complements). Technical report TR-2013-54, 2013. [pdf] [Czech]
  • Mikulová M.: Pokyny k překladu určené překladatelům, revizorům a korektorům textů z Wall Street Journal pro projekt PCEDT. Technical report TR-2009-41, 2009. [pdf]
  • Cinková S., Mikulová M.: Speech reconstruction for the syntactic and semantic analysis of the NAP/AAA corpus. Technical report TR-2008-37, 2008. [pdf]
  • Mikulová M.: Rekonstrukce standardizovaného textu z mluvené řeči v Pražském závislostním korpusu mluvené češtiny. Manuál pro anotátory. Technical report TR-2008-38, 2008. [pdf] [English]
  • Cinková S., Hajič J., Mikulová M., Mladová L., Nedolužko A., Pajas P., Panevová J., Semecký J., Šindlerová J., Toman J., Urešová Z., Žabokrtský Z.: Annotation of English on the Tectogrammatical Level. Technical report TR-2006-35, 2006. [pdf]
  • Mikulová M., Bémová A., Hajič J., Hajičová E., Havelka J., Kolářová V., Kučová L., Lopatková M., Pajas P., Panevová J., Razímová M., Sgall P., Štěpánek J., Urešová Z., Veselá K., Žabokrtský Z.: Annotation on the Tectogrammatical Level in the Prague Dependency Treebank. Annotation manual. Technical report TR-2006-30, 1287 pp., 2006. [pdf] [html] [Czech]

Data/Software

Prague Dependency Treebank – Consolidated [web]

  • Hajič J., Bejček E., Bémová A., Buráňová E., Fučíková E., Hajičová E., Havelka J., Hlaváčová J., Homola P., Ircing P., Kárník J., Kettnerová V., Klyueva N., Kolářová V., Kučová L., Lopatková M., Mareček D., Mikulová M., Mírovský J., Nedoluzhko A., Novák M., Pajas P., Panevová J., Peterek N., Poláková L., Popel M., Popelka J., Romportl J., Rysová M., Semecký J., Sgall P., Spoustová J., Straka M., Straňák P., Synková P., Ševčíková M., Šindlerová J., Štěpánek J., Štěpánková B., Toman J., Urešová Z., Vidová Hladká B., Zeman D., Zikánová Š., Žabokrtský Z.: Prague Dependency Treebank - Consolidated 2.0 (PDT-C 2.0). Data/software, LINDAT/CLARIAH-CZ digital library, Prague, Czech Republic, 2024. [link]
  • Mikulová M., Bémová A., Hajič J., Hajičová E., Ircing P., Kolářová V., Lopatková M., Mareček D., Mírovský J., Nedoluzhko A., Pajas P., Panevová J., Peterek N., Romportl J., Sgall P., Ševčíková M., Štěpánek J., Urešová Z., Žabokrtský Z.: Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0). Data/software, LINDAT/CLARIAH-CZ digital library, Czech Republic, 2017. [link]
  • Hajič J., Panevová J., Hajičová E., Sgall P., Pajas P., Štěpánek J., Havelka J., Mikulová M., Žabokrtský Z., Ševčíková-Razímová M., Urešová Z.: Prague Dependency Treebank 2.0 (PDT 2.0). Linguistic Data Consortium, Philadelphia, PA, USA, 2006. [link]

Prague Czech-English Dependency Treebank [web]

  • Hajič J., Hajičová E., Panevová J., Sgall P., Cinková S., Fučíková E., Mikulová M., Pajas P., Popelka J., Semecký J., Šindlerová J., Štěpánek J., Toman J., Urešová Z., Žabokrtský Z.: Prague Czech-English Dependency Treebank 2.0 (PCEDT 2.0). Data/software, Linguistic Data Consortium, Philadelphia, PA, USA, 2011. [link]

MorfFlex CZ [web]

  • Hajič J., Hlaváčová J., Mikulová M., Straka M., Štěpánková B.: MorfFlex CZ 2.1. Data/software, LINDAT/CLARIAH-CZ digital library, Prague, Czech Republic, 2024. [link]

PDT-Vallex [web]

  • Urešová Z., Bémová A., Fučíková E., Hajič J., Kolářová V., Mikulová M., Pajas P., Panevová J., Štěpánek J.: PDT-Vallex: Valency Lexicon Linked to Czech Corpora 4.5 (PDT-Vallex 4.5). Data/software, LINDAT/CLARIAH-CZ digital library, Prague, Czech Republic, 2024. [link]

ForFun [web]

  • Mikulová M., Bejček E.: ForFun 1.0. Data/software, LINDAT/CLARIAH-CZ digital library, Czech Republic, 2017. [link]

CorefUD [web]

  • Popel M., Novák M., Žabokrtský Z., Zeman D., Nedoluzhko A., Acar K., Bamman D., Bourgonje P., Cinková S., Eckhoff H., Cebiroğlu E. G., Hajič J., Hardmeier Ch., Haug D., Jørgensen T., Kåsen A., Krielke P., Landragin F., Lapshinova-Koltunski E., Mæhlum P., Martí M. A., Mikulová M., Nøklestad A., Ogrodniczuk M., Øvrelid L., Pamay A. T., Recasens M., Solberg P. E., Stede M., Straka M., Swanson D., Toldova S., Vadász N., Velldal E., Vincze V., Zeldes A., Žitkus V.: CorefUD 1.2. Data/software, LINDAT/CLARIAH-CZ digital library, Czech Republic, 2024. [link]