Pavel Pecina
I am an associate professor working in Natural Language Processing, Artificial Intelligence, and related areas at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic. My research interests include information extraction, information retrieval, machine translation, multimodal data interpretation, and optical music recognition.
News
- Since 2025/26, Information Retrieval (NPFL103) has been moved to the summer semester.
- Since 2025/26, Statistical Methods in NLP (NPFL147) has replaced NPFL067 and NPFL068.
- I was appointed Chair of the Subject Area Board for Computational Linguistics, our PhD program (P4I3).
- I am seeking prospective students to pursue a PhD in Natural Language Processing and related areas.
Research Profiles
- Google Scholar
- ORCID: 0000-0002-1855-5931
- Scopus ID: 23393602100
- Researcher ID: K-3770-2017
Teaching
- NPFL103 - Information Retrieval (summer)
- NPFL124 - Natural Language Processing (summer)
- NPFL147 - Statistical Methods in Natural Language Processing (winter)
Recent Publications
- Tobias Friedetzki, Naveen Chandraiah, Emil Svoboda, Pavel Pecina, Frank Puppe, Adrian Krenzer (2026). Discriminative Self-Supervised Pre-Training for Esophagitis Detection in Upper GI Endoscopy Images. To appear in Proceedings of Medical Imaging with Deep Learning, Taipei.
(...)
Ongoing Projects
- GI-Insight: New methods for stomach examination using artificial intelligence: Utilization of deep learning for assisted gastroscopy, LUABA24136.
- RES-Q+: Comprehensive solutions of healthcare improvement based on the global Registry of Stroke Care Quality, HORIZON-HLTH-2021-TOOL-06/101057603.
- MEMORISE: Virtualisation and Multimodal Exploration of Heritage on Nazi Persecution, HORIZON-CL2-2021-HERITAGE-01/101061016.
(...)
Current Students
- Rebeka Kampošová - Clinical data analysis (Bc)
- Magdalena Hrubešová - Normalization of clinical notes (Bc)
- G.M. Arafat Rahman - New interactions for users of machine translation (MSc)
- Filip Makara - Semantic search in spoken narratives (Bc)
- Adam Turčan - Data curation tool for NLP tasks (Bc)
- Maria Filtsova - Comparative Document Reader (Bc)
- Igbal Huseynov - Topical segmentation of spoken narratives (MSc)
- Vojtěch Lanz - Information extraction from clinical documents (PhD)
- Christopher Brückner - Information extraction from historical documents (PhD)
- Jiří Mayer - Optical music recognition (PhD)
- Michal Auersperger - Neural representations (PhD)
(...)


