Pavel Pecina

Associate Professor, Institute of Formal and Applied Linguistics, Charles University, Prague

Publications

Book

  1. Pavel Pecina. Lexical Association Measures: Collocation Extraction. volume 4 of Studies in Computational and Theoretical Linguistics. UFAL, Praha, Czech Republic, 2009. (bib)

Journals

  1. Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan, and Josef van Genabith. Arabic Spelling Error Detection and Correction. In Natural Language Engineering, 22(5), pp. 751-773, Cambridge University Press, 2016. (bib)
  2. Pavel Pecina, Antonio Toral, Vassilis Papavassiliou, Prokopis Prokopidis, Aleš Tamchyna, Andy Way, and Josef van Genabith. Domain adaptation of statistical machine translation with domain-focused web crawling. Language Resources and Evaluation, 49(1), pp. 147-193. Springer Netherlands, 2015. (bib)
  3. Antonio Toral, Pavel Pecina, Longyue Wang, and Josef van Genabith. Linguistically-augmented Perplexity-based Data Selection for Language Models. In Computer Speech & Language, Special Issue on Hybrid Machine Translation: Integration of Linguistics and Statistics, 32(1), pp. 11-26, Elsevier, 2015. (bib)
  4. Pavel Pecina, Ondřej Dušek, Lorraine Goeuriot, Jan Hajič, Jaroslava Hlaváčová, Gareth J. F. Jones, Liadh Kelly, Johannes Leveling, David Mareček, Michal Novák, Martin Popel, Rudolf Rosa, Aleš Tamchyna, and Zdeňka Urešová. Adaptation of machine translation for multilingual information retrieval in the medical domain. In Artificial Intelligence in Medicine 61, pp. 165-185, Elsevier, 2014. (bib)
  5. Aleš Tamchyna, Ondřej Dušek, Rudolf Rosa, and Pavel Pecina. MTMonkey: A Scalable Infrastructure for a Machine Translation Web Service. In The Prague Bulletin of Mathematical Linguistics, No. 100, pp. 31-40, 2013. (bib)
  6. Mohammed Attia, Pavel Pecina, Antonio Toral, and Josef van Genabith. A corpus-based finite-state morphological toolkit for contemporary Arabic. In Journal of Logic and Computation 24 (2), pp. 455-472, Oxford Journals, 2014. (bib)
  7. Christian Federmann, Maite Melero, Pavel Pecina, and Josef van Genabith. Towards Optimal Choice Selection for Improved Hybrid Machine Translation. In The Prague Bulletin of Mathematical Linguistics, No. 97, pp. 5-22, 2012. (bib)
  8. Mohammed Attia, Pavel Pecina, Antonio Toral, Lamia Tounsi, and Josef van Genabith. A Lexical Database for Modern Standard Arabic Interoperable with a Finite State Morphological Transducer. In Systems and Frameworks for Computational Morphology, volume 100 of Communications in Computer and Information Science, pp. 98-118. Springer Berlin Heidelberg, 2011. (bib)
  9. Pavel Pecina. Lexical association measures and collocation extraction. Language Resources and Evaluation, 44, pp. 137-158. Springer Netherlands, 2010. (bib)
  10. Jimmy Lin, Craig G. Murray, Bonnie Dorr, Jan Hajič, and Pavel Pecina. A Cost-Effective Lexical Acquisition Process for Large-Scale Thesaurus Translation. Language Resources and Evaluation, 43, pp. 27-40. Springer Netherlands, 2009. (bib)
  11. Pavel Pecina, Petra Hoffmannová, Gareth Jones, Ying Zhang, and Douglas Oard. Overview of the CLEF-2007 Cross-Language Speech Retrieval Track. In Advances in Multilingual and Multimodal Information Retrieval, volume 5152 of Lecture Notes in Computer Science, pp. 674-686. Springer Berlin Heidelberg, 2008. (bib)
  12. Pavel Češka and Pavel Pecina. Charles University at CLEF 2007 Ad-Hoc Track. In Advances in Multilingual and Multimodal Information Retrieval, volume 5152 of Lecture Notes in Computer Science, pp. 33-36. Springer Berlin Heidelberg, 2008. (bib)
  13. Pavel Ircing, Pavel Pecina, Douglas Oard, Jianqiang Wang, Ryen White, and Jan Hoidekr. Information Retrieval Test Collection for Searching Spontaneous Czech Speech. In Text, Speech and Dialogue, volume 4629 of Lecture Notes in Computer Science, pp. 439-446. Springer Berlin Heidelberg, 2007. (bib)
  14. Douglas Oard, Jianqiang Wang, Gareth Jones, Ryen White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang, and Izhak Shafran. Overview of the CLEF-2006 Cross-Language Speech Retrieval Track. In Evaluation of Multilingual and Multi-modal Information Retrieval, volume 4730 of Lecture Notes in Computer Science, pp. 744-758. Springer Berlin Heidelberg, 2007. (bib)

Conference Proceedings

  1. Jan Hajič jr. and Pavel Pecina. The MUSCIMA++ Dataset for Handwritten Optical Music Recognition. In Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, vol 1, pp. 39-46, Kyoto, Japan, 2017. (bib)
  2. Jan Hajič jr. and Pavel Pecina. Groundtruthing (Not Only) Music Notation with MUSICMarker: A Practical Overview. In Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, vol 2, pp. 47-48, Kyoto, Japan, 2017. (bib)
  3. Jan Hajič jr. and Pavel Pecina. How to Exploit Music Notation Syntax for OMR? In Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, pp. 55-56, Kyoto, Japan, 2017. (bib)
  4. Antonio Jimeno Yepes, Aurelie Neveol, Mariana Neves, Karin Verspoor, Ondrej Bojar, Arthur Boyer, Cristian Grozea, Barry Haddow, Madeleine Kittner, Yvonne Lichtblau, Pavel Pecina, Roland Roller, Rudolf Rosa, Amy Siu, Philippe Thomas and Saskia Trescher. Findings of the WMT 2017 Biomedical Translation Shared Task. In Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 234-247, Copenhagen, Denmark, 2017. (bib)
  5. Joao Palotti, Guido Zuccon, Jimmy, Pavel Pecina, Mihai Lupu, Lorraine Goeuriot, Liadh Kelly, and Allan Hanbury. CLEF 2017 Task Overview: The IR Task at the eHealth Evaluation Lab - Evaluating Retrieval Methods for Consumer Health Search. In Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum, Dublin, Ireland, 2017. (bib)
  6. Shadi Saleh and Pavel Pecina. Task3 Patient-Centred Information Retrieval: Team CUNI. In Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum, Dublin, Ireland, 2017. (bib)
  7. Petra Galuščáková, Michal Batko, Jan Čech, Jiří Matas, David Novák and Pavel Pecina. Visual Descriptors in Methods for Video Hyperlinking. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, pp. 294-300, Bucharest, Romania, 2017. (bib)
  8. Jindřich Libovický, Jindřich Helcl, Marek Tlustý, Ondřej Bojar and Pavel Pecina. CUNI at Post-editing and Multimodal Translation Tasks. In Proceedings of the First Conference on Machine Translation, pp. 646-654, Berlin, Germany, 2016. (bib)
  9. Shadi Saleh and Pavel Pecina. Reranking Hypotheses of Machine-Translated Queries for Cross-Lingual Information Retrieval. In Experimental IR Meets Multilinguality, Multimodality, and Interaction. 7th International Conference of the CLEF Association, CLEF 2016, pp. 54-66, Évora, Portugal, 2016. (bib)
  10. Jan Hajič jr. and Pavel Pecina. Further Steps Towards a Standard Testbed for Optical Music Recognition. In Proceedings of the 17th International Society for Music Information Retrieval Conference, pp. 157-163, New York City, USA, 2016. (bib)
  11. Petra Galuščáková, Michal Batko, Martin Kruliš, Jakub Lokoč, David Novák, and Pavel Pecina. CUNI at TRECVID 2015 Video Hyperlinking Task. In TRECVID 2015 Workshop Notebook, Gaithersburg, MD, USA, 2016. (bib)
  12. Petra Galuščáková, Shadi Saleh and Pavel Pecina. SHAMUS: UFAL Search and Hyperlinking Multimedia System. In Proceedings of the 38th European Conference on Information Retrieval, demo papers, pp. 853-856, Padova, Italy, 2016. (bib)
  13. Vincent Kríž, Martin Holub, and Pavel Pecina. Feature Extraction for Native Language Identification Using Language Modeling. In Proceedings of Recent Advances in Natural Language Processing, pp. 298-306. Hissar, Bulgaria, 2015. (bib)
  14. Jindřich Libovický, Lukáš Neumann, Pavel Pecina, and Jiří Matas. A Machine Learning Approach to Hypothesis Decoding in Scene Text Recognition. In Computer Vision - ACCV 2014 Workshops. Singapore, 2014. Revised Selected Papers, Part II. Lecture Notes in Computer Science, vol. 9009, pp. 169-180, Springer International Publishing, 2015. (bib)
  15. Zdeňka Urešová, Ondřej Dušek, Jan Hajič, and Pavel Pecina. Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, pp. 3244-3247, Reykjavik, Iceland, 2014. (bib)
  16. Petra Galuščáková and Pavel Pecina. Experiments with Segmentation Strategies for Passage Retrieval in Audio-Visual Documents. In Proceedings of International Conference on Multimedia Retrieval, pp. 217-224, Glasgow, UK, 2014. (bib)
  17. Thanh Long Duong, Steven Bird, Paul Cook, and Pavel Pecina. Increasing the quality and quantity of source language data for unsupervised cross-lingual POS tagging. In Proceedings of the Sixth International Joint Conference on Natural Language Processing, pp. 1243-1249, Nagoya, Japan, 2013. (bib)
  18. Thanh Long Duong, Paul Cook, Steven Bird, and Pavel Pecina. Simpler unsupervised POS tagging with bilingual projections. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 634-639, Sofia, Bulgaria, 2013. (bib)
  19. Maria Eskevich, Gareth J.F. Jones, Shu Chen, Robin Aly, Roeland Ordelman, Danish Nadeem, Camille Guinaudeau, Guillaume Gravier, Pascale Sébillot, Tom De Nies, Pedro Debevere, Rik Van de Walle, Petra Galuščáková, Pavel Pecina, and Martha Larson. Multimedia Information Seeking through Search And Hyperlinking. In Proceedings of the 3rd ACM conference on International conference on multimedia retrieval, pp. 287-294, Dallas, Texas, USA, 2013. (bib)
  20. Pavel Pecina, Antonio Toral, and Josef van Genabith. Simple and Effective Parameter Tuning for Domain Adaptation of Statistical Machine Translation. In Proceedings of the 24th International Conference on Computational Linguistics, pp. 2209-2224, Mumbai, India, 2012. (bib)
  21. Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan, and Josef van Genabith. Improved Spelling Error Detection and Correction for Arabic. In Proceedings of the 24th International Conference on Computational Linguistics, pp. 103-112, Mumbai, India, 2012. (bib)
  22. Petra Galuščáková, Pavel Pecina, and Jan Hajič. Penalty Functions for Evaluation Measures of Unsegmented Speech Retrieval. In Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics. Proceedings of the Third International Conference of the CLEF Initiative - CLEF 2012, Lecture Notes in Computer Science, vol. 7488, pp. 100-111, Springer Berlin Heidelberg, 2012. (bib)
  23. Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, et al. Khresmoi: Multimodal Multilingual Medical Information Search. In Proceedings of the 24th International Conference of the European Federation for Medical Informatics, Quality of Life through Quality of Information, Village of the future, IOS Press, Pisa, Italy, 2012. (bib)
  24. Pavel Pecina, Antonio Toral, Vassilis Papavassiliou, Prokopis Prokopidis, and Josef van Genabith. Domain Adaptation of Statistical Machine Translation using Web-Crawled Resources: A Case Study. In Proceedings of the 16th Annual Conference of the European Association for Machine Translation, pp. 145-152, Trento, Italy, 2012. (bib)
  25. Antonio Toral, Marc Poch, Pavel Pecina, and Gregor Thurmair. Efficiency-Based Evaluation of Aligners for Industrial Applications. In EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, pp. 57-60, Trento, Italy, 2012. (bib)
  26. Eleftherios Avramidis, Marta R. Costa-jussa, Christian Federmann, Maite Melero, Pavel Pecina, and Josef van Genabith. A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, pp. 3430-3435, Istanbul, Turkey, 2012. (bib)
  27. Khaled Shaalan, Younes Samih, Mohammed Attia, Pavel Pecina, and Josef van Genabith. Arabic Word Generation and Modelling for Spell Checking. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, pp. 719-725, Istanbul, Turkey, 2012. (bib)
  28. Pavel Pecina, Antonio Toral, Andy Way, Vassilis Papavassiliou, Prokopis Prokopidis, and Maria Giagkou. Towards Using Web-Crawled Data for Domain Adaptation in Statistical Machine Translation. In Proceedings of the 15th Annual Conference of the European Association for Machine Translation, pp. 297-304, Leuven, Belgium, 2011. (bib)
  29. Antonio Toral, Pavel Pecina, Andy Way, and Marc Poch. Towards a User-Friendly Webservice Architecture for Statistical Machine Translation in the PANACEA project. In Proceedings of the 15th Annual Conference of the European Association for Machine Translation, pp. 63-72, Leuven, Belgium, 2011. (bib)
  30. Mohammed Attia, Pavel Pecina, Lamia Tounsi, Antonio Toral, and Josef van Genabith. Lexical Profiling for Arabic. In Electronic Lexicography in the 21st Century, pp. 22-33, Bled, Slovenia, 2011. (bib)
  31. Pavel Pecina and Pavel Schlesinger. Combining Association Measures for Collocation Extraction. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pp. 651-658, Sydney, Australia, 2006. (bib)
  32. Craig G. Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, and Pavel Pecina. Leveraging Reusability: Cost-Effective Lexical Acquisition for Large-Scale Ontology Translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pp. 945-952, Sydney, Australia, 2006. (bib)
  33. Craig G. Murray, Bonnie Dorr, Jimmy Lin, Pavel Pecina, and Jan Hajič. Leveraging Recurrent Phrase Structure in Large-scale Ontology Translation. In Proceedings of the 11th Annual conference of the European Association for Machine Translation, pp. 1-10, Oslo, Norway, 2006. (bib)
  34. Silvie Cinková, Petr Podveský, Pavel Pecina, and Pavel Schlesinger. Semi-automatic Building of Swedish Collocation Lexicon. In Proceedings of the 5th International Conference on Language Resources and Evaluation, pp. 1890-1893, Genova, Italy, 2006. (bib)
  35. Pavel Pecina. An Extensive Empirical Study of Collocation Extraction Methods. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, Student Research Workshop, pp. 13-18, Ann Arbor, Michigan, 2005. (bib)
  36. William Byrne, Sanjeev Khudanpur, Woosung Kim, Shankar Kumar, Pavel Pecina, Paola Virga, Peng Xu, and David Yarowsky. The Johns Hopkins University 2003 Chinese-English Machine Translation System. In Proceedings of the ninth Machine Translation Summit of the International Association for Machine Translation, pp. 447-450, New Orleans, Louisiana, USA, 2003. (bib)

Workshop Proceedings

  1. Guido Zuccon, Joao Palotti, Lorraine Goeuriot, Liadh Kelly, Mihai Lupu, Pavel Pecina, Henning Müller, Julie Budaher, Anthony Deacon. The IR Task at the CLEF eHealth Evaluation Lab 2016: User-centred Health Information Retrieval. In Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, CEUR Workshop Proceedings 1609, pp. 15-27, Évora, Portugal, 2016. (bib)
  2. Shadi Saleh, Pavel Pecina. Task3 Patient-Centred Information Retrieval: Team CUNI. In Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, CEUR Workshop Proceedings 1609, pp. 123-129, Évora, Portugal, 2016. (bib)
  3. Jindřich Libovický and Pavel Pecina. A Dataset and Evaluation Metric for Coherent Text Recognition from Scene Images. In Multimodal Corpora: Computer vision and language processing, pp. 33-36, Portorož, Slovenia, 2016. (bib)
  4. Shadi Saleh and Pavel Pecina. Adapting SMT Query Translation Reranker to New Languages in Cross-Lingual Information Retrieval. In Proceedings of the Medical Information Retrieval (MedIR) Workshop. A SIGIR 2016 workshop, Pisa, Italy, 2016. (bib)
  5. Petra Galuščáková and Pavel Pecina. Audio Information for Hyperlinking of TV Content. In Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia, pp. 27-30. Brisbane, Australia, 2015. (bib)
  6. Petra Galuščáková and Pavel Pecina. CUNI at MediaEval 2015 Search and Anchoring in Video Archives: Anchoring via Information Retrieval. In Working Notes Proceedings of the MediaEval 2015 Workshop, CEUR Workshop Proceedings, volume 1436, Wurzen, Germany, 2015. (bib)
  7. João Palotti, Guido Zuccon, Lorraine Goeuriot, Liadh Kelly, Allan Hanbury, Gareth JF Jones, Mihai Lupu, and Pavel Pecina. CLEF eHealth Evaluation Lab 2015, Task 2: Retrieving information about medical symptoms. In Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, CEUR Workshop Proceedings 1391, Toulouse, France, 2015. (bib)
  8. Shadi Saleh, Feraena Bibyna, and Pavel Pecina. CUNI at the CLEF eHealth 2015 Task 2. In Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, CEUR Workshop Proceedings 1391, Toulouse, France, 2015. (bib)
  9. Petra Galuščáková and Pavel Pecina. CUNI at MediaEval 2014 Search and Hyperlinking Task: Search Task Experiments. In Working Notes Proceedings of the MediaEval 2014 Workshop, CEUR Workshop Proceedings, volume 1263, Barcelona, Spain, 2014. (bib)
  10. Petra Galuščáková, Martin Kruliš, Jakub Lokoč, and Pavel Pecina. CUNI at MediaEval 2014 Search and Hyperlinking Task: Visual and Prosodic Features in Hyperlinking. In Working Notes Proceedings of the MediaEval 2014 Workshop, CEUR Workshop Proceedings, volume 1263, Barcelona, Spain, 2014. (bib)
  11. Lorraine Goeuriot, Liadh Kelly, Wei Li, Joao Palotti, Pavel Pecina, Guido Zuccon, Allan Hanbury, Gareth Jones and Henning Müller. ShARe/CLEF eHealth Evaluation Lab 2014, Task 3: User-centred health information retrieval. In CLEF Online Working Notes, CEUR Workshop Proceedings 1180, pp. 43-61, Sheffield, UK, 2014. (bib)
  12. Shadi Saleh and Pavel Pecina. CUNI at the ShARe/CLEF eHealth Evaluation Lab 2014. In CLEF Online Working Notes, CEUR Workshop Proceedings 1180, pp. 226-235, Sheffield, UK, 2014. (bib)
  13. Ondřej Bojar, Christian Buck, Christian Federmann, Barry Haddow, Johannes Leveling, Philipp Koehn, Christof Monz, Pavel Pecina, Matt Post, Herve Saint-Amand, Radu Soricut, Lucia Specia and Aleš Tamchyna. Findings of the 2014 Workshop on Statistical Machine Translation. In Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 12-58, Baltimore, USA, 2014. (bib)
  14. Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, and Daniel Zeman. Machine Translation of Medical Texts in the Khresmoi Project. In Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 221-228, Baltimore, USA, 2014. (bib)
  15. Jindřich Libovický and Pavel Pecina. Tolerant BLEU: a Submission to the WMT14 Metrics Task. In Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 409-413, Baltimore, USA, 2014. (bib)
  16. Petra Galuščáková and Pavel Pecina. CUNI at MediaEval 2013 Similar Segments in Social Speech Task. In Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, CEUR Workshop Proceedings, volume 1043, Barcelona, Spain, 2013. (bib)
  17. Petra Galuščáková and Pavel Pecina. CUNI at MediaEval 2013 Search and Hyperlinking Task. In Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, CEUR Workshop Proceedings, volume 1043, Barcelona, Spain, 2013. (bib)
  18. Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, et al. Khresmoi - multilingual semantic search of medical text and images. In Proceedings of the 14th World Congress on Medical and Health Informatics, Copenhagen, Denmark, Volume 192 of Studies in Health Technology and Informatics, page 1266, 2013. (bib)
  19. Lubomír Krčmář, Karel Ježek, and Pavel Pecina. Determining Compositionality of Word Expressions Using Various Word Space Models and Measures. In Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, pp. 64-73, Sofia, Bulgaria, 2013. (bib)
  20. Niraj Aswani, Thomas Beckers, Erich Birngruber, Célia Boyer, Andreas Burner, Jakub Bystroň, et al. Khresmoi Professional: Multilingual Semantic Search for Medical Professionals. In Proceedings of the ACM SIGIR Workshop on Health Search and Discovery: Helping Users and Advancing Medicine, pp. 31-34, Dublin, Ireland, 2013. (bib)
  21. Eduard Bejček, Pavel Straňák, and Pavel Pecina. Syntactic Identification of Occurrences of Multiword Expressions in Text using a Lexicon with Dependency Structures. In Proceedings of the 9th Workshop on Multiword Expressions, pp. 106-115, Atlanta, Georgia, USA, 2013. (bib)
  22. Lubomír Krčmář, Karel Ježek, and Pavel Pecina. Determining Compositionality of Word Expressions Using Word Space Models. In Proceedings of the 9th Workshop on Multiword Expressions, pp. 42-50, Atlanta, Georgia, USA, 2013. (bib)
  23. Petra Galuščáková and Pavel Pecina. CUNI at MediaEval 2012 Search and Hyperlinking Task. In Working Notes Proceedings of the MediaEval 2012 Workshop, CEUR Workshop Proceedings, volume 927, Pisa, Italy, 2012. (bib)
  24. Antonio Toral, Leroy Finn, Dominic Jones, Pavel Pecina, David Lewis, Declan Groves. Retraining Machine Translation with Post-edits to Increase Post-editing Productivity in Content Management Systems. In International Workshop on Expertise in Translation and Post-editing Research and Application, pp. 39-40, Copenhagen, Denmark, 2012. (bib)
  25. Eleftherios Avramidis, Marta R. Costa-jussa, Christian Federmann, Maite Melero, Pavel Pecina, and Josef van Genabith. The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation, pp. 2189-2139, Istanbul, Turkey, 2012. (bib)
  26. Mohammed Attia, Pavel Pecina, Antonio Toral, Lamia Tounsi, and Josef van Genabith. An Open-Source Finite State Morphological Transducer for Modern Standard Arabic. In Proceedings of the 9th International Workshop on Finite-State Methods and Natural Language Processing, pp. 125-133, Blois, France, 2011. (bib)
  27. Mohammed Attia, Antonio Toral, Lamia Tounsi, Pavel Pecina, and Josef van Genabith. Automatic Extraction of Arabic Multiword Expressions. In Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications, pp. 19-27, Beijing, China, 2010. (bib)
  28. Santanu Pal, Sudip Kumar Naskar, Pavel Pecina, Sivaji Bandyopadhyay, and Andy Way. Handling Named Entities and Compound Verbs in Phrase-Based Statistical Machine Translation. In Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications, pp. 46-54, Beijing, China, 2010. (bib)
  29. Jinhua Du, Pavel Pecina, and Andy Way. An Augmented Three-Pass System Combination Framework: DCU Combination System for WMT 2010. In Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 290-295, Uppsala, Sweden, 2010. (bib)
  30. Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K. Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L. Forcada, and Andy Way. MATREX: The DCU MT System for WMT 2010. In Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pp. 143-148, Uppsala, Sweden, 2010. (bib)
  31. Drahomíra Spoustová, Miroslav Spousta, and Pavel Pecina. Building a Web Corpus of Czech. In Proceedings of the 7th International Conference on Language Resources and Evaluation, pp. 998-1001, Valletta, Malta, 2010. (bib)
  32. Jana Straková and Pavel Pecina. Czech Information Retrieval with Syntax-based Language Models. In Proceedings of the 7th International Conference on Language Resources and Evaluation, pp. 1359-1362, Valletta, Malta, 2010. (bib)
  33. Petr Homola, Vladislav Kuboň, and Pavel Pecina. A Simple Automatic MT Evaluation Metric. In Proceedings of the Fourth Workshop on Statistical Machine Translation, pp. 33-36, Athens, Greece, 2009. (bib)
  34. Pavel Pecina. A Machine Learning Approach to Multiword Expression Extraction. In Proceedings of the LREC 2008 Workshop Towards a Shared Task for Multiword Expressions, pp. 54-57, Marrakech, Morocco, 2008. (bib)
  35. Pavel Pecina. Reference Data for Czech Collocation Extraction. In Proceedings of the LREC 2008 Workshop Towards a Shared Task for Multiword Expressions, pp. 11-14, Marrakech, Morocco, 2008. (bib)
  36. Miroslav Spousta, Michal Marek, and Pavel Pecina. Victor: the Web-Page Cleaning Tool. In Proceedings of the 4th Web as Corpus Workshop - Can we beat Google?, pp. 12-17, Marrakech, Morocco, 2008. (bib)
  37. Drahomíra Spoustová, Pavel Pecina, Jan Hajič, and Miroslav Spousta. Validating the Quality of Full Morphological Annotation. In Proceedings of the 6th International Conference on Language Resources and Evaluation, pp. 1-4, Marrakech, Morocco, 2008. (bib)
  38. Michal Marek, Pavel Pecina, and Miroslav Spousta. Web Page Cleaning with Conditional Random Fields. In Proceedings of the 3rd Web As a Corpus Workshop, Incorporating CLEANEVAL, pp. 155-162, Louvain-la-Neuve, Belgium, 2007. (bib)
  39. Jan Hajič, Martin Holub, Marie Hučínová, Martin Pavlík, Pavel Pecina, Pavel Straňák, and Pavel M. Šidák. Validating and Improving the Czech WordNet via Lexico-Semantic Annotation of the Prague Dependency Treebank. In Proceedings of the fourth International conference on Language Resources and Evaluation Workshop: Building Lexical Resources from Semantically Annotated Corpora, pp. 25-30, Lisbon, Portugal, 2004. (bib)

Other

  1. Jan Hajič jr. and Pavel Pecina. Matching Illustrative Images to “Soft News” Articles. In UFAL WDS 2015, Institute of Formal and Applied Linguistics, Charles University, Prague, pp. 49-56, Praha, Czechia, 2015. (bib)
  2. Pavel Pecina. Jörg Tiedemann: Bitext Alignment (book review). Machine Translation, Volume 27, Issue 1, pp. 77-79, Springer Netherlands, 2013. (bib)
  3. Pavel Pecina. Book Reviews: Syntax-Based Collocation Extraction by Violeta Seretan. Computational Linguistics, 37, pp. 631-633, 2011. (bib)
  4. Pavel Pecina. Lexical Association Measures: Collocation Extraction. PhD thesis, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic, 2008. (bib)
  5. Pavel Pecina, Petra Hoffmannová, Gareth Jones, Ying Zhang, and Douglas Oard. Overview of the CLEF-2007 Cross-Language Speech Retrieval Track. In Working Notes for the CLEF 2007 Workshop on Cross-Language Information Retrieval and Evaluation, Budapest, Hungary, 2007. (bib)
  6. Pavel Češka and Pavel Pecina. Charles University at CLEF 2007 Ad-Hoc Track. In Working Notes for the CLEF 2007 Workshop on Cross-Language Information Retrieval and Evaluation, Budapest, Hungary, 2007. (bib)
  7. Pavel Češka and Pavel Pecina. Charles University at CLEF 2007 CL-SR Track. In Working Notes for the CLEF 2007 Workshop on Cross-Language Information Retrieval and Evaluation, Budapest, Hungary, 2007. (bib)
  8. Douglas Oard, Jianqiang Wang, Gareth Jones, Ryen White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang, and Izhak Shafran. Overview of the CLEF-2006 Cross-Language Speech Retrieval Track. In Working Notes for the CLEF 2006 Workshop on Cross-Language Information Retrieval and Evaluation, Alicante, Spain, 2006. (bib)

Contact

UFAL MFF UK
Room 422, 4th floor
Malostranské nám. 25
118 00 Prague 1
Czech Republic

+420 951 554 332