Matyáš Kopp
Projects
- Parliamentary Corpora
- PML Tree Query
- PMLTQ Perl module
- Treex::PML Perl module
- PML-TQ REST API server for querying treebanks (https://github.com/ufal/perl-pmltq-server)
- Frontend web application for PMLTQ server (https://github.com/ufal/perl-pmltq-web)
- TrEd extension for PMLTQ (https://github.com/ufal/tred-extension-pmltq)
Curriculum Vitae
2014 - Bc. degree in Computer Science at the Faculty of Information Technology, Czech Technical University in Prague
Selected Bibliography
- The ParlaMint corpora of parliamentary proceedings (Electronic). In: Language Resources and Evaluation, ISSN 1574-020X (url, local PDF)
- ParlaMint II: The Show Must Go On. In: Proceedings of the LREC 2022 ParlaCLARIN III Workshop on Creating, Enriching and Using Parliamentary Corpora, pp. 1-6, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-85-6 (pdf, local PDF, bibtex)
- ParlaMint: Comparable Corpora of European Parliamentary Data. In: CLARIN Annual Conference 2021, pp. 20-25, CLARIN ERIC, Utrecht, The Netherlands (url, obd, bibtex)
- ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata. In: 24th International Conference on Text, Speech and Dialogue, pp. 293-304, Springer, Cham, Switzerland, ISBN 978-3-030-83526-2 (pdf, local PDF, obd, bibtex)
- Compiling Czech Parliamentary Stenographic Protocols into a Corpus. In: Proceedings of the LREC 2020 Workshop on Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse (ParlaCLARIN II), pp. 18-22, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-47-4 (url, local PDF, obd, bibtex)
- Converting Latin Treebank Data into an SQL Database for Query Purposes. In: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, pp. 117-122, ACM, New York, NY, USA, ISBN 978-1-4503-5265-9 (obd, bibtex)
Data and software
- Multilingual comparable corpora of parliamentary debates ParlaMint 2.0 (DataSW). (url, obd)
- Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0 (DataSW). (url, obd)
- Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1 (DataSW). (url, obd)
- Multilingual comparable corpora of parliamentary debates ParlaMint 2.1 (DataSW). (url, obd)
- ParCzech 3.0 (DataSW). (url, obd)
- ParCzech PS7 2.0 (DataSW). (url, obd)
- ParCzech PS7 1.0 (DataSW). (url, obd)