Pavel Pecina

office
422
email
pecina@ufal.mff.cuni.cz
phone
+420 951 554 332
address
Malostranské náměstí 25
118 00 Praha 1
Czech Republic

I am an associate professor working in the area of Natural Language Processing, Artificial Inteligence, and related areas at the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic. My research interests include information extraction, information retrieval, machine translation, multimodal data interpretation, and optical music recognition.

News

Research Profiles

Teaching

  • NPFL103 - Information Retrieval (summer)
  • NPFL124 - Natural Language Processing (summer)
  • NPFL147 - Statistical Methods in Natural Language Processing (winter)

Recent Publications

  • Tobias Friedetzki, Naveen Chandraiah, Emil Svoboda, Pavel Pecina, Frank Puppe, Adrian Krenzer (2026). Discriminative Self-Supervised Pre-Training for Esophagitis Detection in Upper GI Endoscopy Images. To appear in Proceedings of Medical Imaging with Deep Learning, Taipei.
  • Christopher Brückner, Jan Lehečka, Jan Švec, Pavel Pecina (2026). Modeling the Language of Holocaust Survivors' Testimony with Domain-Adapted Transformers. To appear in Proceedings of the Second Workshop on Holocaust Testimonies as Language Resources (HTRes) @ LREC 2026, Palma de Mallorca.
  • Christopher Brückner, Karin Roginer Hofmeister, Jiří Kocián, Pavel Pecina (2026). From Oral History to Structured Data: The MalachNER Dataset. To appear in Proceedings of the Second Workshop on Holocaust Testimonies as Language Resources (HTRes) @ LREC 2026, Palma de Mallorca.

(...)

Ongoing Projects

(...)

Current Students

  • Karel Dolník - Utilizing Machine Learning for Stylometric Analysis for Authorship Attribution (MSc)
  • Petr Ježek - Music synthesis (Bc)
  • Ivana Holpuchová - Clinical data analysis (Bc)
  • Rebeka Kampošová - Conversational analysis editor (Bc)
  • Magdalena Hrubešová - Normalization of clinical notes (Bc)
  • G.M. Arafat Rahman - New interactions for users of machine translation (MSc)
  • Filip Makara - Semantic search in spoken narratives (Bc)
  • Adam Turčan - Data curation tool for NLP tasks (Bc)
  • Maria Filtsova - Comparative Document Reader (Bc)
  • Igbal Huseynov - Topical segmentation of spoken narratives (MSc)
  • Vojtěch Lanz - Information extraction from clinical documents (PhD)
  • Christopher Brückner - Information extraction from historical documents (PhD)
  • Jiří Mayer - Optical music recognition (PhD)

(...)