Pavel Pecina
I am an associate professor working in Natural Language Processing, Artificial Intelligence, and related areas at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic. My research interests include information extraction, information retrieval, machine translation, multimodal data interpretation, and optical music recognition.
News
- I was appointed Chair of the Subject Area Board for Computational Linguistics, our PhD program.
- We are co-organizing the IWSLT Dialectal and Low-resource track: Levantine-to-English.
- I am seeking prospective students to pursue a PhD in Natural Language Processing and related areas.
Research Profiles
- Google Scholar
- ORCID: 0000-0002-1855-5931
- Scopus ID: 23393602100
- Researcher ID: K-3770-2017
Teaching
- NPFL103 - Information Retrieval
- NPFL124 - Natural Language Processing
- NPFL147 - Statistical Methods in Natural Language Processing
Recent Publications
- Jiří Mayer, Pavel Pecina, Jan Hajič Jr. (2025). Smashcima: Full-Page Handwritten Music Document Synthesizer. In Proceedings of the 12th International Conference on Digital Libraries for Musicology (DLfM '25), pp. 119–123, ISBN 979-84-0072-083-3, Seoul, South Korea (bib).
- Jiří Mayer, Filip Jebavý, Markéta Herzánová Vlková, Martina Dvořáková, Pavel Pecina, Jan Hajič Jr. (2025). MuNG Studio: Annotation Tool for Music Notation Graph. In Proceedings of the 12th International Conference on Digital Libraries for Musicology (DLfM '25), pp. 114–118, ISBN 979-84-0072-083-3, Seoul, South Korea (bib).
- Vojtěch Lanz, Pavel Pecina (2025). CUNI-a at ArchEHR-QA 2025: Do we need Giant LLMs for Clinical QA?. In Proceedings of the 24th Workshop on Biomedical Language Processing (Shared Tasks), pp. 27–40, Vienna, Austria (bib).
- Idris Abdulmumin et al. (2025). Findings of the IWSLT 2025 Evaluation Campaign. In Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025), pp. 412–481, Vienna, Austria (bib).
- Christopher Brückner, Pavel Pecina (2025). Hierarchical Classification of Propaganda Techniques in Slavic Texts in Hyperbolic Space. In Proceedings of the 10th Workshop on Slavic Natural Language Processing (Slavic NLP 2025), pp. 183–189, Vienna, Austria (bib).
- Vojtěch Lanz, Pavel Pecina (2025). When Multilingual Models Compete with Monolingual Domain-Specific Models in Clinical Question Answering. In Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health), pp. 69–82, Albuquerque, New Mexico (bib).
- Christopher Brückner, Pavel Pecina (2025). Towards Semantic Tagging of Segmented Holocaust Narratives. In Proceedings of the Prague Visual History and Digital Humanities Conference 2025, pp. 177–192, ISBN 978-80-7378-523-9, Prague, Czechia (bib).
(...)
Ongoing Projects
- GI-Insight: New methods for stomach examination using artificial intelligence: Utilization of deep learning for assisted gastroscopy, LUABA24136.
- RES-Q+: Comprehensive solutions of healthcare improvement based on the global Registry of Stroke Care Quality, HORIZON-HLTH-2021-TOOL-06/101057603.
- MEMORISE: Virtualisation and Multimodal Exploration of Heritage on Nazi Persecution, HORIZON-CL2-2021-HERITAGE-01/101061016.
(...)
Current Students
- G.M. Arafat Rahman - New Interactions for Users of Machine Translation (MSc)
- Filip Makara - Semantic search in spoken narratives (Bc)
- Adam Turčan - Data curation tool for NLP tasks (Bc)
- Maria Filtsova - Comparative Document Reader (Bc)
- Igbal Huseynov - Topical segmentation of spoken narratives (MSc)
- Vojtěch Lanz - Information extraction from clinical documents (PhD)
- Christopher Brückner - Information extraction from historical documents (PhD)
- Jiří Mayer - Optical music recognition (PhD)
- Michal Auersperger - Neural representations (PhD)
(...)