This an archived version of the course from the '20/21 Fall semester. Here is the current version.

Statistical Dialogue Systems

Summary

This course presents advanced problems and current state-of-the-art in the field of dialogue systems, voice assistants, and conversational systems (chatbots). After a brief introduction into the topic, the course will focus mainly on the application of machine learning – especially deep learning/neural networks – in the individual components of the traditional dialogue system architecture as well as in end-to-end approaches (joining multiple components together).

This course is a follow-up to the course NPFL123 Dialogue Systems, but can be taken independently – important basics will be repeated. All required deep learning concepts will be explained, but only briefly, so some machine learning background is recommended.

Logistics

The course will be taught in English, but we're happy to explain in Czech as well. statistical dialogue system schema

The course will be taught online over Zoom, given the current pandemic. All lectures will be recorded so you can catch up later. Note that recordings of students' voices will not be retained to comply with privacy requirements – don't hesitate to talk in the lectures!

There is a shared Slack workspace for the course – email us to get access.

The schedule is the following:

Lectures + Labs: Tue 9:50-12:10
We start at 9:50 each week – the lab session is not taking place everytime, so sometimes we'll finish earlier.

Zoom meeting ID: 953 7826 3918
Password is the SIS code of this course (capitalized)

If you can't access Zoom, email us or text us on Slack.

To successfully finish this course, you'll need to:

pass an exam (covering the lectures, especially parts mentioned in the summary) – the exam is either an in-person test or an essay + Zoom discussion (see details on last lecture slides an video, email us if you need video link)
finish a small lab homework + a big lab project (individual or in groups) – implement some dialogue system experiments and write a report.

Lectures

Slides and from past lectures will appear here (video links are sent directly to students; email us if you want to get video links).

Introduction (29 Sep)
Machine Learning Toolkit (6 Oct)
Data & Evaluation (13 Oct)
Language Understanding (20 Oct)
Dialogue State Tracking (27 Oct)
Dialogue Policy (part 1) (3 Nov)
Dialogue Policy (part 2) + Language generation (part 1) (10 Nov)
Language Generation (part 2) (24 Nov)
End-to-end systems (1 Dec)
Chatbots (non-task-oriented) (8 Dec)
Linguistics & Ethics (15 Dec)
Multimodal Systems (5 Jan) – includes details on the exam

Labs

Lab assignments will appear in the dedicated GitLab repository. There's also videos for the lab instructions (email us to get links).

1st assignment – Datasets: deadline 13 Oct for choosing a dataset, 20 Oct for submitting a report
Semester projects – implement your own dialogue system/component
- Have a look at project topics (or make up your own) & choose a topic by 10 Nov + write-up in detail by 24 Nov
- Dialmonkey – dialogue framework that we'll use for main assignments
- Have a look at hints for experimenting before you start running your experiments.
- Before you start writing, read these writing guidelines.

Covered topics

Brief introduction into dialogue systems
- dialogue systems applications
- basic components of dialogue systems
- knowledge representation in dialogue systems
- data and evaluation
Language understanding (NLU)
- semantic representation of utterances
- statistical methods for NLU
Dialogue management
- dialogue representation as a (Partially Observable) Markov Decision Process
- dialogue state tracking
- action selection
- reinforcement learning
- user simulation
- deep reinforcement learning (using neural networks)
Response generation (NLG)
- introduction to NLG, basic methods (templates)
- generation using neural networks
End-to-end dialogue systems (one network to handle everything)
- sequence-to-sequence systems
- memory/attention-based systems
- pretrained language models
Open-domain systems (chatbots)
- generative systems (sequence-to-sequence, hierarchical models)
- information retrieval
- ensemble systems
Multimodal systems
- component-based and end-to-end systems
- image classification
- visual dialogue

Institute of Formal and Applied Linguistics

Charles University, Czech Republic
Faculty of Mathematics and Physics

Search form