Sort by year / topic / type

Multimodality

Optical Music Recognition

Machine Translation

Lexical Association Measures, Multiword Expresions

Web as a Corpus

  • Drahomíra Spoustová, Miroslav Spousta, Pavel Pecina. (2010). Building a Web Corpus of Czech. In Proceedings of the 7th International Conference on Language Resources and Evaluation, pp. 998-1001, Valletta, Malta (bib).
  • Miroslav Spousta, Michal Marek, Pavel Pecina. (2008). Victor: the Web-Page Cleaning Tool. In Proceedings of the 4th Web as Corpus Workshop - Can we beat Google?, pp. 12-17, Marrakech, Morocco (bib).
  • Michal Marek, Pavel Pecina, Miroslav Spousta. (2007). Web Page Cleaning with Conditional Random Fields. In Proceedings of the 3rd Web As a Corpus Workshop, Incorporating CLEANEVAL, pp. 155-162, Louvain-la-Neuve, Belgium (bib).

Information Retrieval

Arabic Language Processing

Morphology and Tagging

Summarization

Neural Representations

  • Michal Auersperger, Pavel Pecina (2022). Defending Compositionality in Emergent Languages. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, pp. 285-291, Hybrid: Seattle, Washington + Online, ISBN 978-1-955917-73-5 (bib).
  • Michal Auersperger, Pavel Pecina (2021). Solving SCAN Tasks with Data Augmentation and Input Embeddings. In Proceedings of the Recent Advances in Natural Language Processing, pp. 86-91, INCOMA Ltd., Shoumen, Bulgaria, ISBN 978-954-452-072-4 (bib).

Named Entities

Lexical Semantics

Language Identification

Misc