Main Research Interests
- Natural language generation
- Spoken dialogue systems and chatbots
- Machine learning
- Machine translation
- NaMuDDiS – Natural Multi-domain Dialogue Systems (Charles University funded, 2019-2021)
- THEaiTRE – Automatically generating a theatre play (Czech Technical Agency, 2020-2022)
At Heriot-Watt University in Edinburgh (2016-2018):
- DILiGENt – natural language generation
- MaDrIgAL – spoken dialogue systems
- Alexa Prize Challenge – chatbots (Alana – 2x finalist team, 2x 3rd place)
During my Ph.D. study (2011-2016):
- AdaNLG – adaptive natural language generator (2014-2016)
- Vystadial – statistical spoken dialogue system (2013-2016)
- QTLeap – semantic machine translation (2013-2016)
- Khresmoi – medical information retrieval (working on machine translation, 2013-2014)
- FAUST – improving machine translation fluency (2011-2013)
Students I supervise:
- Vojtěch Hudeček (Ph.D. at Charles Uni, with Zdeněk Žabokrtský, since 2018)
- Zdeněk Kasner (Ph.D. at Charles Uni since 2019)
- Jonáš Kulhánek (MSc. at Charles Uni since 2020)
- Sourabrata Mukherjee (Ph.D. at Charles Uni since 2019)
- Tomáš Nekvinda (completed MSc. at Charles Uni, 2019–2020; Ph.D. since 2020)
- Patrícia Schmidtová (completed BSc. at Charles Uni, with Vojtěch Hudeček, 2018–2019; MSc. since 2020)
- Daniel Štancl (Ph.D. at Charles Uni since 2020)
- Xinnuo Xu (Ph.D., CDT Robotics Edinburgh, with Verena Rieser & Ioannis Konstas, since 2016)
My former students:
2020/12/17: The paper Evaluating data-to-text NLG accuracy by natural language inference by me & Zdeněk kasner has received the INLG 2020 Best Short Paper award!
2020/11/21: Two INLG 2020 papers by Zdeněk Kasner & me are now out on arXiv: Data-to-text generation by iterative text editing and Evaluating data-to-text NLG accuracy by natural language inference.
2020/11/16: The list of accepted papers and a preliminary schedule for the NLG Evaluation Workshop I'm co-organizing at INLG 2020 is now available.
2020/07/25: We got two papers on neural text-to-speech accepted to Interspeech, both are available on arXiv: Tomáš Nekvinda's One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech and Jan Vainer's SpeedySpeech: Efficient Neural Speech Synthesis.
2020/05/26: We (Jindřich Libovický, Zdeněk Kasner, Jindřich Helcl & me) took part in the STAPLE/Duolingo translation & paraphrasing WNGT shared task. Our report is now in the ACL anthology.
2020/04/06: Our paper “Fact-based Content Weighting for Evaluating Abstractive Summarisation” (by Xinnuo Xu, me, Jingiy Li, Yannis Konstas & Verena Rieser) was accepted for ACL. You can read the paper in the ACL anthology.
2020/04/01: Our grant project THEaiTRE for automatically generating theatre plays was accepted by the Czech Technical Agency (and actually got the highest ranking of all submissions)! We plan to have a theatre play performed in January 2021, for the 100th anniversary of Karel Čapek's R.U.R.
2020/03/23: Verena Rieser and I were guests at the NLP Highlights podcast, and we talked about data-to-text generation. You can listen to the podcast on Soundcloud.
Tools I'm participating on
- Alex – spoken dialogue system framework
- Flect – statistical morphology generation
- MTMonkey – machine translation web services infrastructure
- RatPred – trainable NLG quality estimation
- TGen – a statistical natural language generator
- Treex – a modular NLP toolkit