This course deals with speech recognition and generation tasks and feature extraction of voice and utterance characteristics. Of particular interest will be topics related to Hidden Markov Models as applied to speech (FFT, n- dimensional clustering, Gaussian mixtures, parameter value extraction from data, phonetic representation, prosodic analysis etc.). Preparation and training of own speech recognition models.
Steve Young, Dan Kershaw, Julian Odell, Dave Ollason, Valtcho Valtchev, Phil Woodland, The HTK Book, Cambridge, Entropic Ltd. http://htk.eng.cam.ac.uk, 1995-2007
Zdena Palková, Fonetika a fonologie češtiny, Karolinum, Praha, 1997