Main Research Interests
Natural Language Processing
Handwritten Text Recognition
Searching in Handwritten Czech Documents
Disseration topic at KTIML MFF UK
This project focuses on processing Czech handwritten documents mainly from the first half of the 20th century, i.e., from the period of the Austro-Hungarian Empire and the First Republic, when both Czech and German were commonly used. The documents will be digitized into their textual forms and made available for querying by keywords and asking questions related to the content of the text and receive answers to them. Moreover, searching for information and answering the questions within multiple documents will be enabled. All the models will enable an exploration of current documents as well as the historical documents century to retrieve information from them for further historical research.
NAKI: Sources of Krkonoše [link]
Project at UFAL MFF UK
This project is one part of the Program of Applied Research and Development of National and Cultural Identity (NAKI II), provided by the Ministry of Culture. The main intention of the NAKI project is to make information sources about the history and cultural memory of Krkonoše publicly available in one virtual place regardless of locations of institutions where the original documents are stored. It will be achieved by making a web interface and database for evidence, processing, and presentation of the history of Krkonoše. The output of the NAKI project is a web interface that will enable creating metadata descriptions of collected data about the history of Krkonoše and a database for the inserting of documents from the first half of the 20th century and their editing via the web interface.
Multingual Transfer for Question Answering
Master Thesis at UFAL MFF UK
Question Answering is a computer science discipline in the field of Natural Language Processing and Information Retrieval. Its goal is to buil a system that can automatically find the answer to certain question in the text with understanding of the meaning of the text. Reading comprehension used for Question Answering is a well studied task, with huge training datasets in English. This project focuses on building reading comprehension systems for Czech, without requiring any manually annotated Czech training data using text translation and cross-lingual transfer models.
Nature Inspired Algorithms (Přírodou inspirované algoritmy) [NAIL119]
Summer semester (Thursday, 15:40), 2/2, Z+Zk, 5 credits,
In this subject, basic algorithms inspired by nature as EvolutionaryAlgorithms and Neural Networks and their applications for solving optimization problems and machine learning are presented. During the practicals, some of these algorithms are implemented and used to solve interesting problems in given areas.
The materials for the practicals are available on Github.
The lecture is taught by Martin Pilát and the materials are available on this website.
Individual Software Project (Ročníkový projekt) [NPRG045]
Both semesters, 0/1, Z, 4 credits
It is possible to arrange to work on an Individual Software Project under my supervision. I am concerned about topics from Artificial Intelligence and Nature Language Processing. If you are interested in any of these topics, you can contact me by email and we can come up with some interesting topics for you project of I can refer you to other relevant possible supervisors.