Visit the ESSLLI 2013 site.​​

Dear students,

our course is over. We really enjoyed a great week with you on the way to machine learning! We've just posted the final versions of our presentations. Feel free to download them and study them. 

All the best, Barbora & Martin

August 27, 2013




Day 1 Day 2 Day 3 Day 4 Day 5





homework-1.1-solution.posted.ods  homework-2.2-solution.pdf hw-3.1-solution.pdfHW-3.1-examples.in.R hw-4.1-solution.pdfHW-4.1.R  
R scripts






R system


Introductory reading

  • Alpaydin, Ethem. Introduction to Machine Learning. The MIT Press. 2004, 2010 (url).

  • Domingos, Pedro. A few useful things to know about Machine learning. Communication of the ACM, vol. 55, Issue 10, October 2012, pp. 78--87, ACM, New York, USA. (pdf) [a nice non-technical reading]

  • Gonick, Larry and Woollcott Smith. The Cartoon Guide to Statistics. Harper Resource. 2005. 

  • Hladka, Barbora and Martin Holub. The course proposal esslli-proposal.2013.pdf, 2013.

  • Kononenko, Igor and Matjaz Kukar. Machine Learning and Data Mining: Introduction to Principles and Algorithms. Horwood Publishing, 2007 (url). [a light survey of the whole field]

Serious textbooks and tutorials (they require deeper mathematical background)

  • Bishop, Christopher M. Pattern Recognition And Machine Learning. Springer, 2006 (url).

  • Burges Christopher J. C.  A Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery, 2(2):121–167, 1998. http://research.microsoft.com/pubs/67119/svmtutorial.pdf

  • Cristianni, Nello and John Shawe-Taylor. An Introduction to Support Vector M​achines and other Kernel-based Learning Methods. Cambridge University Press, 2000.

  • Duda, Richard O., Peter R. Hart and David G. Stork. Pattern Classification. Second Edition. Wiley, 2001.

  • Hsu Chih-Wei, Chang Chih-Chung Chang and Chih-Jen Lin. A Practical Guide to Support Vector Classication. 2010. (pdf).

  • Hastie, Trevor, Robert Tibshirani and Jerome Friedman.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, 2009 (url).

About R

​​About data used

  • Leacock, C., Towell, G. and Voorhees, E. Corpus-Based Statistical Sense Resolution. In Proceedings of the ARPA Workshop on Human  Language Technology, pp. 260--265. 1993. [WSD task]  ​
  • Pecina, Pavel. Reference Data for Czech Collocation Extraction. In Proceedings of the LREC 2008 Workshop Towards a Shared Task for Multiword Expressions, pp. 11-14, Marrakech, Morocco. 2008. [COL task]


Barbora Hladka and Martin Holub

Institute of Formal and Applied Linguistics

Faculty of Mathematics and Physics

Charles University in Prague


Teaching the course was supported by the Czech Science Foundation, grants no. P103/12/G084, P406/12/0658 and Charles University in Prague, Faculty of Mathematics and Physics.