Matyáš Kopp

office
424
email
kopp@ufal.mff.cuni.cz

Projects

Signál a šum v éře Žurnalistiky 5.0 (2021-2024)

Curriculum Vitae

2014 - Bc. degree at Computer Science an Faculty of Information Technology Czech Technical University in Prague

Selected Bibliography

  1. Tomaž Erjavec, Matyáš Kopp (2022): TEI and Git in ParlaMint: Collaborative Development of Language Resources. In: CLARIN Annual Conference Proceedings 2022, pp. 57-60, CLARIN ERIC, Praha, Czechia (url, bibtex)
  2. Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darģis, Orsolya Ring, Ruben van Heusden, Maarten Marx, Darja Fišer (2022): The ParlaMint corpora of parliamentary proceedings. In: Language Resources and Evaluation, ISSN 1574-020X, vol. 57, no. 1, pp. 415-448 (url, local PDF, bibtex)
  3. Barbora Hladká, Jiří Mírovský, Matyáš Kopp, Václav Moravec (2022): Annotating Attribution in Czech News Server Articles. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 1817-1823, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, local PDF, bibtex)
  4. Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, Katja Meden (2022): ParlaMint II: The Show Must Go On. In: Proceedings of the LREC 2022 ParlaCLARIN III Workshop on Creating, Enriching and Using Parliamentary Corpora, pp. 1-6, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-85-6 (pdf, local PDF, local PDF, bibtex)
  5. Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Andrej Pančur, Nikola Ljubešić, Tommaso Agnoloni, Starkaður Barkarson, María Calzada Pérez, Çağrı Çöltekin, Matthew Coole, Roberts Darģis, Luciana de Macedo, Jesse de Does, Katrien Depuydt, Sascha Diwersy, Dorte Haltrup Hansen, Matyáš Kopp, Tomas Krilavičius, Giancarlo Luxardo, Maarten Marx, Vaidas Morkevičius, Costanza Navarretta, Paul Rayson, Orsolya Ring, Michał Rudolf, Kiril Simov, Steinþór Steingrímsson, István Üveges, Ruben van Heusden, Giulia Venturi (2021): ParlaMint: Comparable Corpora of European Parliamentary Data. In: CLARIN Annual Conference Proceedings 2021, pp. 20-25, CLARIN ERIC, Utrecht, The Netherlands (url, bibtex)
  6. Matyáš Kopp, Vladislav Stankov, Jan Oldřich Krůza, Pavel Straňák, Ondřej Bojar (2021): ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata. In: 24th International Conference on Text, Speech and Dialogue, pp. 293-304, Springer, Cham, Switzerland, ISBN 978-3-030-83526-2 (pdf, local PDF, bibtex)
  7. Barbora Hladká, Matyáš Kopp, Pavel Straňák (2020): Compiling Czech Parliamentary Stenographic Protocols into a Corpus. In: Proceedings of the LREC 2020 Workshop on Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse (ParlaCLARIN II), pp. 18-22, European Language Resources Association (ELRA), Paris, France, ISBN 979-10-95546-47-4 (url, local PDF, bibtex)
  8. Christophe Onambélé, Matyáš Kopp, Marco Passarotti, Jiří Mírovský (2017): Converting Latin Treebank Data into an SQL Database for Query Purposes. In: Proceedings of the 2Nd International Conference on Digital Access to Textual Cultural Heritage, pp. 117-122, ACM, New York, NY, USA, ISBN 978-1-4503-5265-9 (bibtex)

Data and software

  1. Barbora Vidová Hladká, Jiří Mírovský, Matyáš Kopp, Václav Moravec (2022): SiR 1.0 (DataSW). (url)
  2. Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Vladislava Grigorova, Michał Rudolf, Andrej Pančur, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Henk van der Pol, Griet Depoorter, Jesse de Does, Bart Jongejan, Dorte Haltrup Hansen, Costanza Navarretta, María Calzada Pérez, Luciana de Macedo, Ruben van Heusden, Maarten Marx, Çağrı Çöltekin, Matthew Coole, Tommaso Agnoloni, Francesca Frontini, Simonetta Montemagni, Valeria Quochi, Giulia Venturi, Manuela Ruisi, Carlo Marchetti, Roberto Battistoni, Miklós Sebők, Orsolya Ring, Roberts Darģis, Andrius Utka, Mindaugas Petkevičius, Monika Briedienė, Tomas Krilavičius, Vaidas Morkevičius (2021): Multilingual comparable corpora of parliamentary debates ParlaMint 2.0 (DataSW). (url)
  3. Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Vladislava Grigorova, Michał Rudolf, Andrej Pančur, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Henk van der Pol, Griet Depoorter, Jesse de Does, Bart Jongejan, Dorte Haltrup Hansen, Costanza Navarretta, María Calzada Pérez, Luciana de Macedo, Ruben van Heusden, Maarten Marx, Çağrı Çöltekin, Matthew Coole, Tommaso Agnoloni, Francesca Frontini, Simonetta Montemagni, Valeria Quochi, Giulia Venturi, Manuela Ruisi, Carlo Marchetti, Roberto Battistoni, Miklós Sebők, Orsolya Ring, Roberts Darģis, Andrius Utka, Mindaugas Petkevičius, Monika Briedienė, Tomas Krilavičius, Vaidas Morkevičius, Roberto Bartolini, Andrea Cimino (2021): Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.0 (DataSW). (url)
  4. Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Vladislava Grigorova, Michał Rudolf, Andrej Pančur, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Henk van der Pol, Griet Depoorter, Jesse de Does, Bart Jongejan, Dorte Haltrup Hansen, Costanza Navarretta, María Calzada Pérez, Luciana de Macedo, Ruben van Heusden, Maarten Marx, Çağrı Çöltekin, Matthew Coole, Tommaso Agnoloni, Francesca Frontini, Simonetta Montemagni, Valeria Quochi, Giulia Venturi, Manuela Ruisi, Carlo Marchetti, Roberto Battistoni, Miklós Sebők, Orsolya Ring, Roberts Darģis, Andrius Utka, Mindaugas Petkevičius, Monika Briedienė, Tomas Krilavičius, Vaidas Morkevičius, Roberto Bartolini, Andrea Cimino, Sascha Diwersy, Giancarlo Luxardo (2021): Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 2.1 (DataSW). (url)
  5. Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Vladislava Grigorova, Michał Rudolf, Andrej Pančur, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Henk van der Pol, Griet Depoorter, Jesse de Does, Bart Jongejan, Dorte Haltrup Hansen, Costanza Navarretta, María Calzada Pérez, Luciana de Macedo, Ruben van Heusden, Maarten Marx, Çağrı Çöltekin, Matthew Coole, Tommaso Agnoloni, Francesca Frontini, Simonetta Montemagni, Valeria Quochi, Giulia Venturi, Manuela Ruisi, Carlo Marchetti, Roberto Battistoni, Miklós Sebők, Orsolya Ring, Roberts Darģis, Andrius Utka, Mindaugas Petkevičius, Tomas Krilavičius, Vaidas Morkevičius, Sascha Diwersy, Giancarlo Luxardo (2021): Multilingual comparable corpora of parliamentary debates ParlaMint 2.1 (DataSW). (url)
  6. Matyáš Kopp, Vladislav Stankov, Ondřej Bojar, Barbora Hladká, Pavel Straňák (2021): ParCzech 3.0 (DataSW). (url)
  7. Barbora Hladká, Matyáš Kopp, Pavel Straňák (2020): ParCzech PS7 2.0 (DataSW). (url)
  8. Barbora Hladká, Matyáš Kopp, Pavel Straňák (2020): ParCzech PS7 1.0 (DataSW). (url)