Daniel Zeman ![[foto]](fotky/dan.jpg)
|
|
Ústav formální a aplikované lingvistiky, Informatická sekce, Matematicko-fyzikální fakulta, Univerzita Karlova, Praha
Elektronická pošta: zeman@ufal.mff.cuni.cz
Telefon: +420 221 914 225
Poštovní adresa:
ÚFAL MFF UK
Malostranské náměstí 25
CZ-11800 Praha
Česko
Kancelář:
Místnost 409, 4. patro od schodů/výtahu doleva
- Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, Daniel Zeman (2013): Tools for Machine Translation Quality Inspection (technical report). In: (url, biblio, bibtex)
- David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Daniel Zeman, Zdeněk Žabokrtský, Jan Hajič (2013): Cross-language Study on Influence of Coordination Style on Dependency Parsing Performance (technical report). In: (pdf, biblio, bibtex)
- Daniel Zeman (2012): CUNI: Feature Selection and Error Analysis of a Transition-Based Parser. In: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), pp. 143-148, The COLING 2012 Organizing Committee, Mumbai, India (url, biblio, bibtex)
- Daniel Zeman (2012): Data Issues of the Multilingual Translation Matrix. In: Proceedings of the Seventh Workshop on Statistical Machine Translation, pp. 395-400, Association for Computational Linguistics, Montréal, Canada, ISBN 978-1-937284-20-6 (pdf, biblio, bibtex)
- Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, Daniel Zeman (2012): Automatic MT Error Analysis: Hjerson Helping Addicter. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2158-2163, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, biblio, bibtex)
- Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič (2012): HamleDT: To Parse or Not to Parse? In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 2735-2741, European Language Resources Association, İstanbul, Turkey, ISBN 978-2-9517408-7-7 (url, biblio, bibtex)
- Daniel Zeman, Mark Fishel, Jan Berka, Ondřej Bojar (2011): Addicter: What Is Wrong with My Translations? In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 96, pp. 79-88 (pdf, biblio, obd, bibtex)
- Mark Fishel, Ondřej Bojar, Daniel Zeman, Jan Berka (2011): Automatic Translation Error Analysis. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6836, pp. 72-79 (url, biblio, obd, bibtex)
- Septina Dian Larasati, Vladislav Kuboň, Daniel Zeman (2011): Indonesian Morphology Tool (MorphInd): Towards an Indonesian Corpus. In: Communications in Computer and Information Science, ISSN 1865-0929, 100, pp. 119-129 (url, biblio, obd, bibtex)
- Daniel Zeman (2011): Hierarchical Phrase-Based MT at the Charles University for the WMT 2011 Shared Task. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 496-500, Association for Computational Linguistics, Edinburgh, UK, ISBN 978-1-937284-12-1 (url, biblio, obd, bibtex)
- Bushra Jawaid, Daniel Zeman (2011): Word-Order Issues in English-to-Urdu Statistical Machine Translation. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 95, pp. 87-106 (url, biblio, obd, bibtex)
- Eduard Bejček, Pavel Straňák, Daniel Zeman (2011): Influence of Treebank Design on Representation of Multiword Expressions. In: Lecture Notes in Computer Science, ISSN 0302-9743, 6608, pp. 1-14 (url, biblio, obd, bibtex)
- Daniel Zeman (2010): Morphological Stickers for Annotation of Check. In: Padesát je málo. Komorně laděný sborník u příležitosti 50. narozenin profesora Jana Hajiče, pp. 65-70, Univerzita Karlova v Praze, Praha, Czechia (biblio, obd, bibtex)
- Daniel Zeman (2010): Using TectoMT as a Preprocessing Tool for Phrase-Based Statistical Machine Translation. In: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings, Lecture Notes in Computer Science, ISSN 0302-9743, 6231, pp. 216-223, Springer, Berlin / Heidelberg, ISBN 978-3-642-15759-2 (biblio, obd, bibtex)
- Daniel Zeman (2010): Hierarchical Phrase-Based MT at the Charles University for the WMT 2010 Shared Task. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 212-215, Association for Computational Linguistics, Uppsala, Sweden, ISBN 978-1-932432-71-8 (url, biblio, obd, bibtex)
- Ondřej Bojar, Pavel Straňák, Daniel Zeman (2010): Data Issues in English-to-Hindi Machine Translation. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pp. 1771-1777, European Language Resources Association, Valletta, Malta, ISBN 2-9517408-6-7 (biblio, obd, bibtex)
- Daniel Zeman (2010): Hard Problems of Tagset Conversion. In: Proceedings of the Second International Conference on Global Interoperability for Language Resources, pp. 181-185, City University of Hong Kong, Hong Kong, China, ISBN 978-962-442-323-5 (biblio, obd, bibtex)
- Ondřej Bojar, Pavel Straňák, Daniel Zeman, Gaurav Jain, Michal Hrušecký, Michal Richter, Jan Hajič (2009): English-Hindi Translation – Obtaining Mediocre Results with Bad Data and Fancy Models. In: Proceedings of ICON 2009: 7th International Conference on Natural Language Processing, pp. 316-321, Macmillan Publishers, India, Hyderabad, India, ISBN 978-023-032-845-7 (biblio, bibtex)
- Daniel Zeman (2009): Maximum Spanning Malt: Hiring World's Leading Dependency Parsers to Plant Indian Trees. In: Proceedings of ICON09 NLP Tools Contest: Indian Language Dependency Parsing, pp. 18-23, International Institute of Information Technologies, Hyderabad, Hyderabad, India (biblio, obd, bibtex)
- Daniel Zeman (2009): Using Unsupervised Paradigm Acquisition for Prefixes. In: Evaluating Systems for Multilingual and Multimodal Information Access – 9th Workshop of the Cross-Language Evaluation Forum, Lecture Notes in Computer Science, ISSN 0302-9743, 5706, pp. 983-990, Springer, Berlin / Heidelberg, ISBN 978-3-642-04446-5 (url, biblio, bibtex)
- Daniel Zeman (2009): A Simple Generative Pipeline Approach to Dependency Parsing and Semantic Role Labeling. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL): Shared Task, pp. 120-125, Association for Computational Linguistics, Boulder, CO, USA, ISBN 978-1-932432-29-9 (pdf, biblio, obd, bibtex)
- Ondřej Bojar, Pavel Straňák, Daniel Zeman (2008): English-Hindi Translation in 21 Days. In: Proceedings of the 6th International Conference On Natural Language Processing (ICON-2008) NLP Tools Contest, International Institute of Information Technologies, Hyderabad, Pune, India (url, biblio, bibtex)
- Daniel Zeman (2008): Using Unsupervised Paradigm Acquisition for Prefixes. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2008 Workshop, pp. 1-7, Århus Universitet, Århus, Denmark (pdf, biblio, bibtex)
- Daniel Zeman (2008): Unsupervised Acquiring of Morphological Paradigms from Tokenized Text. In: Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Lecture Notes in Computer Science, ISSN 0302-9743, vol. 5152/2008, no. 5152, pp. 892-899, Springer, Berlin / Heidelberg, ISBN 978-3-540-85759-4 (pdf, biblio, bibtex)
- Daniel Zeman (2008): Reusable Tagset Conversion Using Tagset Drivers. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), pp. 213-218, European Language Resources Association, Marrakech, Morocco, ISBN 2-9517408-4-0 (url, biblio, bibtex)
- Daniel Zeman, Philip Resnik (2008): Cross-Language Parser Adaptation between Related Languages. In: IJCNLP 2008 Workshop on NLP for Less Privileged Languages, pp. 35-42, International Institute of Information Technology, Hyderabad, India (url, biblio, bibtex)
- Daniel Zeman (2007): Unsupervised Acquiring of Morphological Paradigms from Tokenized Text. In: Working Notes for the Cross Language Evaluation Forum (CLEF) 2007 Workshop, Magyar Tudományos Akadémia, Budapest, Hungary, ISBN 2-912335-31-0 (pdf, biblio, bibtex)
- Daniel Zeman, Zdeněk Žabokrtský (2005): Improving Parsing Accuracy by Combining Diverse Dependency Parsers. In: Proceedings of the Ninth International Workshop on Parsing Technologies (IWPT), pp. 171-178, Association for Computational Linguistics, Vancouver, BC, Canada, ISBN 1-932432-58-2 (pdf, biblio, bibtex)
- Jiří Hana, Daniel Zeman, Jan Hajič, Hana Hanová, Barbora Hladká, Emil Jeřábek (2005): Manual for Morphological Annotation, Revision for the Prague Dependency Treebank 2.0 (technical report). In: (pdf, biblio, bibtex)
- Daniel Zeman (2004): Parsing with a Statistical Dependency Model (PhD thesis). Univerzita Karlova v Praze, Praha, Czechia (url, biblio, bibtex)
- Daniel Zeman (2004): Data-Oriented Parsing by Rens Bod, Remko Scha, and Khalil Sima'an (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 81, pp. 69-72 (biblio, bibtex)
- Eva Hajičová, Jiří Havelka, Petr Sgall, Kateřina Veselá, Daniel Zeman (2004): Issues of Projectivity in the Prague Dependency Treebank. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 81, pp. 5-22 (pdf, biblio, bibtex)
- Daniel Zeman (2004): Neprojektivity v Pražském závislostním korpusu (PDT) (technical report). ÚFAL/CKL MFF UK (pdf, biblio, bibtex)
- Daniel Zeman (2002): Can Subcategorization Help a Statistical Dependency Parser? In: Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), pp. 1156-1162, Morgan Kaufmann Publishers, San Francisco (url, biblio, bibtex)
- Daniel Zeman (2002): How to Decrease the Performance of a Statistical Parser. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 78, pp. 53-62 (url, biblio, bibtex)
- Daniel Zeman (2001): How Much Will a RE-based Preprocessor Help a Statistical Parser? In: Proceedings of International Workshop on Parsing Technologies, pp. 253-256, Tsinghua University Press, Beijing, China, ISBN 7-302-04925-4 (url, biblio, bibtex)
- Daniel Zeman (2001): Parsing with Regular Expressions: A Minute to Learn, a Lifetime to Master. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 75, pp. 29-37 (url, biblio, bibtex)
- Anoop Sarkar, Daniel Zeman (2000): Automatic Extraction of Subcategorization Frames for Czech. In: Proceedings of the 18th International Conference on Computational Linguistics (COLING), pp. 691-697, Universität des Saarlandes, Saarbrücken, Germany, ISBN 1-55860-717-X (pdf, biblio, bibtex)
- Daniel Zeman, Anoop Sarkar (2000): Learning Verb Subcategorization from Corpora: Counting Frame Subsets. In: Proceedings of the Second International Conference on Language Resources and Evaluation, pp. 227-233, European Language Resources Association, Athîna, Greece (pdf, biblio, bibtex)
- Kiril Ribarov, Daniel Zeman (2000): Stochastically-Based Semantic Analysis, Kluwer, 1999 (review). In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 73-74, pp. 105-108 (biblio, bibtex)
- Jarmila Panevová, Barbora Vidová Hladká, Kiril Ribarov, Vladislav Kuboň, Daniel Zeman, Martin Čmejrek, Jan Cuřín, Nino Peterek (2000): Počítačová lingvistika ve vztahu k informatice. In: Pokroky matematiky, fyziky a astronomie, ISSN 0032-2423, vol. 45, no. 3, pp. 207-218 (url, biblio, bibtex)
- Jan Hajič, Eric Brill, Michael Collins, Barbora Hladká, Douglas Jones, Cynthia Kuo, Lance Ramshaw, Oren Schwartz, Christoph Tillmann, Daniel Zeman (1998): Core Natural Language Processing Technology Applicable to Multiple Languages. In: An NSF Workshop: Language Engineering for Students and Professionals Integrating Research and Education, Johns Hopkins University, Baltimore, MD, USA (url, biblio, bibtex)
- Daniel Zeman (1998): A Statistical Approach to Parsing of Czech. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 69, pp. 29-37 (url, biblio, bibtex)
- Jan Hajič, Eric Brill, Michael Collins, Barbora Hladká, Douglas Jones, Cynthia Kuo, Lance Ramshaw, Oren Schwartz, Christoph Tillmann, Daniel Zeman (1998): Core Natural Language Technology Applicable to Multiple Languages, Workshop ´98 Final Report. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 70, pp. 73-82 (biblio, bibtex)
- Daniel Zeman (1997): Pravděpodobnostní model významových zápisů vět (masters thesis). Univerzita Karlova v Praze, Praha, Czechia (url, biblio, bibtex)