Ondřej Dušek

office: N233
office hours: email me to arrange a meeting
email: odusek@ufal.mff.cuni.cz
address: IMPAKT – „N“
V Holešovičkách 747/2
180 00 Praha 8
Czech Republic

Main Research Interests

Natural language generation

Spoken dialogue systems and chatbots

Evaluation methods for NLG/dialogue/NLP

Projects

Current

NG-NLG – Language generation with neural & symbolic methods (ERC StG, 2022-2027)

CEDMO – Digital media safety (Minstry of Industry & Commerce, 2024-2026)

Past

EDU-AI – Education chatbot assistant (Czech Technical Agency, 2021-2023)

THEaiTRE – Automatically generating a theatre play (Czech Technical Agency, 2020-2022)

NaMuDDiS – Natural Multi-domain Dialogue Systems (Charles University funded, 2019-2021)

METOD – dialogue management (industry cooperation with Agnostix co-funded by City of Prague, 2020)

At Heriot-Watt University in Edinburgh (2016-2018):

DILiGENt – natural language generation

MaDrIgAL – spoken dialogue systems

Alexa Prize Challenge – chatbots (Alana – 2x finalist team, 2x 3rd place)

During my Ph.D. study (2011-2016):

AdaNLG – adaptive natural language generator (2014-2016)

Vystadial – statistical spoken dialogue system (2013-2016)

QTLeap – semantic machine translation (2013-2016)

Khresmoi – medical information retrieval (working on machine translation, 2013-2014)

FAUST – improving machine translation fluency (2011-2013)

Teaching

List of classes
NPFL099 Statistical dialogue systems
NPFL123 Dialogue systems

Selected Bibliography

Google Scholar
ORCID: https://orcid.org/0000-0002-1415-1702
Scopus ID: 56075872200
Researcher ID: J-7852-2017

Students

Students I supervise:

Sourabrata Mukherjee (Ph.D. since 2019)

Ondřej Plátek (Ph.D. since 2021)

Patrícia Schmidtová (completed BSc. with Vojtěch Hudeček, 2018–2019; completed MSc. 2020–2022, Ph.D. since 2023)

Nalin Kumar (completed MSc. 2022-2024; Ph.D. since 2024)

Kristýna Onderková (Ph.D. since 2024)

Eliáš Cizl (MSc. since 2024)

Darya Hryhoryeva (MSc. since 2024)

Damian Waloszek (MSc. since 2023)

Ulvi Mamadov (MSc. since 2024)

Rafael Sargsyan (BSc. with Simone Balloccu since 2023)

My former students:

Zdeněk Kasner (completed Ph.D., 2019-2024)

Vojtěch Hudeček (completed Ph.D., 2019-2024, previously supervised by Zdeněk Žabokrtský)

Shubham Agarwal (Ph.D. at Heriot-Watt, with Verena Rieser & Ioannis Konstas, 2017-2019)

Karen Li (completed MSc. with Simone Balloccu & Vera Demberg (LCT Saarland Uni), 2023-2025)

Ivan Kartáč (completed MSc. with Mateusz Lango, 2023-2025)

Michelle Elizabeth (completed MSc. with Lina Rojas-Barajona (LCT, Orange Labs) 2024)

Ekaterina Garanina (completed MSc., LCT with Uni Groningen, with Gertjan van Noord, 2022-2024)

Peter Grajcar (completed MSc. 2022-2023)

Vojtěch John (completed BSc. 2021-2022)

Jonáš Kulhánek (completed MSc. 2020-2021)

Ondřej Motlíček (completed MSc. 2021–2022)

Tomáš Nekvinda (completed MSc. 2019–2020; Ph.D. 2020-2022)

Saad Obaid ul Islam (completed MSc., LCT with Uni Saarbrücken, with Vera Demberg & Iza Škrjanec, 2022-2023)

Borek Požár (completed BSc. with Martin Čmejrek & Jan Cuřín, 2020-2021, completed MSc. 2023-2024)

Jakub Růžička (completed BSc. with Jan Cuřín & Martin Čmejrek, 2022-2023)

Kristína Szabová (completed MSc. 2022-2023)

Jaroslav Šafář (completed MSc. 2021-2023)

Daniel Štancl (Ph.D. 2020-2022)

František Trebuňa (completed MSc. 2021-2024)

Jan Vainer (completed MSc. 2019–2020)

Xinnuo Xu (completed Ph.D., CDT Robotics Edinburgh, with Verena Rieser & Ioannis Konstas, 2016–2021)

News

2023/02/21: Our paper titled Barriers and enabling factors for error analysis in NLG research (van Miltenburg et al.) was accepted at the North European Journal of Language Technology. The journal is open-access.

2023/02/09: I gave a talk on Robust Data-to-text Generation with Pretrained Language Models at the Prague Computer Science seminar.

2023/01/20: Our paper Mind the Labels: Describing Relations in Knowledge Graphs With Pretrained Models (Kasner et al.) was accepted at EACL. A preprint is on arXiv.

2022/09/22: Our paper titled Learning Interpretable Latent Dialogue Actions With Less Supervision (Hudeček & Dušek) was accepted at AACL-IJCNLP. A preprint is on arXiv.

2022/07/15: Our paper titled AARGH! End-to-end Retrieval-Generation for Task-Oriented Dialog (Nekvinda & Dušek) was accepted at SIGDIAL. It is now available on arXiv.

2022/06/30: I gave a talk on Neural Conversational AI at the MLSS^N summer school in Cracow. You can get the slides here, live recording on Youtube is here.

2022/06/01: The ERC NG-NLG project officially started in April. I was hiring a post-doc – see details here.

2022/04/06: Our paper DIASER: A Unifying View on Task-Oriented Dialogue Annotation (Hudeček et al.) was accepted for LREC 2022. You can now see the paper in the proceedings.

2022/03/22: I gave a talk at the AICZECHIA seminar on Large Language Models for Data-to-text Generation. You can get the slides here, recording will be on YouTube soon.

2022/02/28: Our paper titled Neural Pipeline for Data-to-text Generation (Kasner & Dušek) was accepted at the ACL conference. You can see it now in ACL Anthology.

2022/02/14: We're starting another run of the Dialogue Systems course, this time it's included in the prg.ai minor as well.

2022/01/10: I was awarded an ERC grant titled Next-Generation Natural Language Generation. It joins explicit semantics and neural language models in order to get natural and accurate NLG outputs. You can read a short report in the UK Forum magazine, more info will follow.

Biographical

Tools I'm participating on

Alex – spoken dialogue system framework
Flect – statistical morphology generation
MTMonkey – machine translation web services infrastructure
RatPred – trainable NLG quality estimation
TGen – a statistical natural language generator
Treex – a modular NLP toolkit

Institute of Formal and Applied Linguistics

Charles University, Czech Republic
Faculty of Mathematics and Physics

Search form