Matyáš Kopp
Projects
- Parliamentary Corpora
- PML Tree Query
- PMLTQ Perl module
- Treex::PML Perl module
- PML-TQ REST API server for querying treebanks (https://github.com/ufal/perl-pmltq-server)
- Frontend web application for PMLTQ server (https://github.com/ufal/perl-pmltq-web)
- TrEd extension for PMLTQ (https://github.com/ufal/tred-extension-pmltq)
Curriculum Vitae
2014 - Bc. degree in Computer Science, Faculty of Information Technology, Czech Technical University in Prague
Selected Bibliography
- TEI and Git in ParlaMint: Collaborative Development of Language Resources. In: CLARIN Annual Conference Proceedings 2022, pp. 57-60, CLARIN ERIC, Praha, Czechia (url, bibtex)
- The ParlaMint corpora of parliamentary proceedings (Electronic). In: Language Resources and Evaluation, ISSN 1574-020X (url, local PDF)
- Annotating Attribution in Czech News Server Articles. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1817-1823, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, local PDF, bibtex)
- ParlaMint II: The Show Must Go On. In: Proceedings of the LREC 2022 ParlaCLARIN III Workshop on Creating, Enriching and Using Parliamentary Corpora, pp. 1-6, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-85-6 (pdf, local PDF, bibtex)
- ParlaMint: Comparable Corpora of European Parliamentary Data. In: CLARIN Annual Conference Proceedings 2021, pp. 20-25, CLARIN ERIC, Utrecht, The Netherlands (url, bibtex)
- ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata. In: 24th International Conference on Text, Speech and Dialogue, pp. 293-304, Springer, Cham, Switzerland, ISBN 978-3-030-83526-2 (pdf, local PDF, bibtex)
- Compiling Czech Parliamentary Stenographic Protocols into a Corpus. In: Proceedings of the LREC 2020 Workshop on Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse (ParlaCLARIN II), pp. 18-22, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-47-4 (url, local PDF, bibtex)
- Converting Latin Treebank Data into an SQL Database for Query Purposes. In: Proceedings of the 2nd International Conference on Digital Access to Textual Cultural Heritage, pp. 117-122, ACM, New York, NY, USA, ISBN 978-1-4503-5265-9 (bibtex)
Data and software
- SiR 1.0 (DataSW). (url)
- Multilingual comparable corpora of parliamentary debates ParlaMint 2.0 (DataSW). (url)
- Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0 (DataSW). (url)
- Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1 (DataSW). (url)
- Multilingual comparable corpora of parliamentary debates ParlaMint 2.1 (DataSW). (url)
- ParCzech 3.0 (DataSW). (url)
- ParCzech PS7 2.0 (DataSW). (url)
- ParCzech PS7 1.0 (DataSW). (url)