2011
- Pavel Pecina. Book Reviews: Syntax-Based Collocation Extraction by Violeta Seretan. Computational Linguistics, 37:631-633, 2011. (bib, pdf)
- Mohammed Attia, Pavel Pecina, Antonio Toral, Lamia Tounsi, and Josef Genabith. A Lexical Database for Modern Standard Arabic Interoperable with a Finite State Morphological Transducer. In Cerstin Mahlow and Michael Piotrowski, editors, Systems and Frameworks for Computational Morphology, volume 100 of Communications in Computer and Information Science, pages 98-118. Springer Berlin/Heidelberg, 2011. (bib, link)
- Mohammed Attia, Pavel Pecina, Antonio Toral, Lamia Tounsi, and Josef van Genabith. An Open-Source Finite State Morphological Transducer for Modern Standard Arabic. In Proceedings of the 9th International Workshop on Finite-State Methods and Natural Language Processing, Blois, France, 2011. (bib, pdf)
- Pavel Pecina, Antonio Toral, Andy Way, Vassilis Papavassiliou, Prokopis Prokopidis, and Maria Giagkou. Towards Using Web-Crawled Data for Domain Adaptation in Statistical Machine Translation. In Mikel L. Forcada, Heidi Depraetere, and Vincent Vandeghinste, editors, Proceedings of the 15th Annual Conference of the European Associtation for Machine Translation, pages 297-304, Leuven, Belgium, 2011. (bib, pdf)
- Antonio Toral, Pavel Pecina, Andy Way, and Marc Poch. Towards a User-Friendly Webservice Architecture for Statistical Machine Translation in the PANACEA project. In Mikel L. Forcada, Heidi Depraetere, and Vincent Vandeghinste, editors, Proceedings of the 15th Annual Conference of the European Associtation for Machine Translation, pages 63-72, Leuven, Belgium, 2011. (bib, pdf)
- Mohammed Attia, Pavel Pecina, Lamia Tounsi, Antonio Toral, and Josef van Genabith. Lexical Profiling for Arabic. In Electronic Lexicography in the 21st Century, Bled, Slovenia, 2011. (bib, pdf)
2010
- Pavel Pecina. Lexical association measures and collocation extraction. Language Resources and Evaluation, 44:137-158, 2010. (bib, link)
- Mohammed Attia, Antonio Toral, Lamia Tounsi, Pavel Pecina, and Josef van Genabith. Automatic Extraction of Arabic Multiword Expressions. In Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications, pages 19-27, Beijing, China, 2010. Coling 2010 Organizing Committee. (bib, pdf)
- Santanu Pal, Sudip Kumar Naskar, Pavel Pecina, Sivaji Bandyopadhyay, and Andy Way. Handling Named Entities and Compound Verbs in Phrase-Based Statistical Machine Translation. In Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications, pages 46-54, Beijing, China, 2010. (bib, pdf)
- Jinhua Du, Pavel Pecina, and Andy Way. An Augmented Three-Pass System Combination Framework: DCU Combination System for WMT 2010. In Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pages 290-295, Uppsala, Sweden, 2010. (bib, pdf)
- Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K. Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L. Forcada, and Andy Way. MATREX: The DCU MT System for WMT 2010. In Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, pages 143-148, Uppsala, Sweden, 2010. (bib, pdf)
- Drahomíra Spoustová, Miroslav Spousta, and Pavel Pecina. Building a Web Corpus of Czech. In Proceedings of the 7th International Conference on Language Resources and Evaluation, pages 998-1001, Valletta, Malta, 2010. (bib, pdf)
- Jana Straková and Pavel Pecina. Czech Information Retrieval with Syntax-based Language Models. In Proceedings of the 7th International Conference on Language Resources and Evaluation, pages 1359-1362, Valletta, Malta, 2010. (bib, pdf)
2009
- Jimmy Lin, G. Murray, Bonnie Dorr, Jan Hajič, and Pavel Pecina. A cost-effective lexical acquisition process for large-scale thesaurus translation. Language Resources and Evaluation, 43:27-40, 2009. (bib, link)
- Pavel Pecina. Lexical Association Measures: Collocation Extraction, volume 4 of Studies in Computational and Theoretical Linguistics. UFAL, Praha, Czech Republic, 2009. (bib)
- Petr Homola, Vladislav Kuboň, and Pavel Pecina. A Simple Automatic MT Evaluation Metric. In Proceedings of the Fourth Workshop on Statistical Machine Translation, pages 33-36, Athens, Greece, 2009. (bib, pdf)
2008
- Pavel Pecina. Lexical Association Measures: Collocation Extraction, PhD thesis, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic.
- Pavel Pecina, Petra Hoffmannová, Gareth Jones, Ying Zhang, and Douglas Oard. Overview of the CLEF-2007 Cross-Language Speech Retrieval Track. In Carol Peters, Valentin Jijkoun, Thomas Mandl, Henning Muller, Douglas Oard, Anselmo Peňas, Vivien Petras, and Diana Santos, editors, Advances in Multilingual and Multimodal Information Retrieval, volume 5152 of Lecture Notes in Computer Science, pages 674-686. Springer Berlin/Heidelberg, 2008. (bib, link)
- Pavel Češka and Pavel Pecina. Charles University at CLEF 2007 Ad-Hoc Track. In Carol Peters, Valentin Jijkoun, Thomas Mandl, Henning Muller, Douglas Oard, Anselmo Peňas, Vivien Petras, and Diana Santos, editors, Advances in Multilingual and Multimodal Information Retrieval, volume 5152 of Lecture Notes in Computer Science, pages 33-36. Springer Berlin/Heidelberg, 2008. (bib, link)
- Pavel Pecina. A Machine Learning Approach to Multiword Expression Extraction. In Proceedings of the LREC 2008 Workshop Towards a Shared Task for Multiword Expressions, pages 54-57, Marrakech, Morocco, 2008. (bib, pdf)
- Pavel Pecina. Reference Data for Czech Collocation Extraction. In Proceedings of the LREC 2008 Workshop Towards a Shared Task for Multiword Expressions, pages 11-14, Marrakech, Morocco, 2008. (bib, pdf)
- Miroslav Spousta, Michal Marek, and Pavel Pecina. Victor: the Web-Page Cleaning Tool. In Stefan Evert, Adam Kilgarriff, and Serge Sharoff, editors, Proceedings of the 4th Web as Corpus Workshop - Can we beat Google?, pages 12-17, Marrakech, Morocco, 2008. (bib, pdf)
- Drahomíra Spoustová, Pavel Pecina, Jan Hajič, and Miroslav Spousta. Validating the Quality of Full Morphological Annotation. In Proceedings of the 6th International Conference on Language Resources and Evaluation, pages 1-4, Marrakech, Morocco, 2008. (bib, pdf)
2007
- Pavel Ircing, Pavel Pecina, Douglas Oard, Jianqiang Wang, Ryen White, and Jan Hoidekr. Information Retrieval Test Collection for Searching Spontaneous Czech Speech. In Václav Matoušek and Pavel Mautner, editors, Text, Speech and Dialogue, volume 4629 of Lecture Notes in Computer Science, pages 439-446. Springer Berlin/Heidelberg, 2007. (bib, pdf)
- Douglas Oard, Jianqiang Wang, Gareth Jones, Ryen White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang, and Izhak Shafran. Overview of the CLEF-2006 Cross-Language Speech Retrieval Track. In Carol Peters, Paul Clough, Fredric Gey, Jussi Karlgren, Bernardo Magnini, Douglas Oard, Maarten de Rijke, and Maximilian Stempfhuber, editors, Evaluation of Multilingual and Multi-modal Information Retrieval, volume 4730 of Lecture Notes in Computer Science, pages 744-758. Springer Berlin/Heidelberg, 2007. (bib, link)
- Michal Marek, Pavel Pecina, and Miroslav Spousta. Web Page Cleaning with Conditional Random Fields. In Cédrick Fairon, Hubert Naets, Adam Kilgarriff, and Gilles-Maurice de Schryver, editors, Proceedings of the 3rd Web As a Corpus Workshop, Incorporating CLEANEVAL, pages 155-162, Louvain-la-Neuve, Belgium, 2007. (bib, pdf)
- Pavel Pecina, Petra Hoffmannová, Gareth Jones, Ying Zhang, and Douglas Oard. Overview of the CLEF-2007 Cross-Language Speech Retrieval Track. In Alessandro Nardi and Carol Peters, editors, Working Notes for the CLEF 2007 Workshop on Cross-Language Information Retrieval and Evaluation, Budapest, Hungary, 2007. (bib, pdf)
- Pavel Češka and Pavel Pecina. Charles University at CLEF 2007 Ad-Hoc Track. In Alessandro Nardi and Carol Peters, editors, Working Notes for the CLEF 2007 Workshop on Cross-Language Information Retrieval and Evaluation, Budapest, Hungary, 2007. (bib, pdf)
- Pavel Češka and Pavel Pecina. Charles University at CLEF 2007 CL-SR Track. In Alessandro Nardi and Carol Peters, editors, Working Notes for the CLEF 2007 Workshop on Cross-Language Information Retrieval and Evaluation, Budapest, Hungary, 2007. (bib, pdf)
2006
- Pavel Pecina and Pavel Schlesinger. Combining Association Measures for Collocation Extraction. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pages 651-658, Sydney, Australia, 2006. (bib, pdf)
- G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, and Pavel Pecina. Leveraging Reusability: Cost-Effective Lexical Acquisition for Large-Scale Ontology Translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 945-952, Sydney, Australia, 2006. (bib, pdf)
- Craig Murray, Bonnie Dorr, Jimmy Lin, Pavel Pecina, and Jan Hajič. Leveraging Recurrent Phrase Structure in Large-scale Ontology Translation. In Proceedings of the 11th Annual conference of the European Association for Machine Translation, pages 1-10, Oslo, Norway, 2006. (bib, pdf)
- Silvie Cinková, Petr Podveský, Pavel Pecina, and Pavel Schlesinger. Semi-automatic Building of Swedish Collocation Lexicon. In Proceedings of the 5th International Conference on Language Resources and Evaluation, pages 1890-1893, Genova, Italy, 2006. (bib, pdf)
- Douglas Oard, Jianqiang Wang, Gareth Jones, Ryen White, Pavel Pecina, Dagobert Soergel, Xiaoli Huang, and Izhak Shafran. Overview of the CLEF-2006 Cross-Language Speech Retrieval Track. In Working Notes for the CLEF 2006 Workshop on Cross-Language Information Retrieval and Evaluation, Alicante, Spain, 2006. (bib, pdf)
2005
- Pavel Pecina. An Extensive Empirical Study of Collocation Extraction Methods. In Proceedings of the 43th Annual Meeting of the Association for Computational Linguistics, Student Research Workshop, Ann Arbor, Michigan, 2005. (bib, pdf)
2004
- Jan Hajič, Martin Holub, Marie Hučínová, Martin Pavlík, Pavel Pecina, Pavel Straňák, and Pavel M. Šidák. Validating and Improving the Czech WordNet via Lexico-Semantic Annotation of the Prague Dependency Treebank. In Proceedings of the fourth International conference on Language Resources and Evaluation Workshop: Building Lexical Resources from Semantically Annotated Corpora, Lisbon, Portugal, 2004. (bib, pdf)
2003
- William Byrne, Sanjeev Khudanpur, Woosung Kim, Shankar Kumar, Pavel Pecina, Paola Virga, Peng Xu, and David Yarowsky. The Johns Hopkins University 2003 Chinese-English Machine Translation System. In Proceedings of the ninth Machine Translation Summit of the International Association for Machine Translation, New Orleans, Louisiana, USA, 2003. (bib, pdf)
-= Update:10-19-11 =-
