Filip Jurčíček (2012): Reinforcement learning for spoken dialogue systems using off-policy natural gradient method. In: IEEE SLT '12: Proc. IEEE Spoken Language Technology Workshop, pp. 7-12, IEEE, Miami, FL, USA, ISBN 978-1-4673-5126-3