OP VVV LINDAT

LINDAT/CLARIN - Research infrastructure for language technologies – extension of the repository and its computational power

Project LINDAT/CLARIN aims to reinforce its IT infrastructure and complement the extended capability by carrying out research in progressive natural language and speech processing methods. Part of the extension is an addition of human resources to the research and infrastructure teams. The project covers all four institutional partners of the running LINDAT/CLARIN Research Infrastructure project, i.e., Charles University in Prague, Masaryk University Brno, University of West Bohemia in Pilsen and Institute of the Czech Language of the Academy of Sciences of the Czech Republic. The computational power and data storage of the infrastructure will be extended by additional equipment, in order to enable RI’s basic and applied research in the areas of text processing, speech processing, search in text sources from the repository and for building new and improved software tools. 

 
 

 

 

Publications

  1. Ondřej Bojar, Tom Kocmi, David Mareček, Roman Sudarikov, Dušan Variš (2017): CUNI Submission in WMT17: Chimera Goes Neural. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 248-256, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (biblio, bibtex)
  2. Silvie Cinková, Anna Vernerová (2017): Are Annotators' Word-Sense Disambiguation Decisions Affected by Textual Entailment between Lexicon Glosses? In: Proceedings of the 17th Conference on Information Technologies - Applications and Theory (ITAT 2017), pp. 5-14, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (pdf, biblio, batt1.pdf, bibtex)
  3. David Mareček, Ondřej Bojar, Ondřej Hübsch, Rudolf Rosa, Dušan Variš (2017): CUNI Experiments for WMT17 Metrics Task. In: Proceedings of the Second Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 604-611, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-96-8 (biblio, bibtex)
  4. Jiří Mírovský, Pavlína Synková, Magdaléna Rysová, Lucie Poláková (2017): CzeDLex – A Lexicon of Czech Discourse Connectives. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 109, pp. 61-91 (url, biblio, batt1.pdf, bibtex)
  5. Klára Osolsobě, Jaroslava Hlaváčová, Vladimír Petkevič, Martin Svášek, Josef Šimandl (2017): Nová automatická morfologická analýza češtiny. In: Naše řeč, ISSN 0027-8203, vol. 100, no. 4/2017, pp. 225-234 (biblio, bibtex)
  6. Rudolf Rosa (2017): MonoTrans: Statistical Machine Translation from Monolingual Data. In: Proceedings of the 17th conference ITAT 2017: Slovenskočeský NLP workshop (SloNLP 2017), pp. 201-208, CreateSpace Independent Publishing Platform, Praha, Czechia, ISBN 978-1974274741 (pdf, biblio, batt1.pdf, batt2.pdf, batt3.pdf, bibtex)
  7. Rudolf Rosa, Zdeněk Žabokrtský (2017): Error Analysis of Cross-lingual Tagging and Parsing. In: Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, pp. 106-118, Univerzita Karlova, Praha, Czechia, ISBN 978-80-88132-04-2 (pdf, biblio, batt1.pdf, batt2.pdf, bibtex)
  8. Milan Straka, Jana Straková (2017): Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe. In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp. 88-99, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-945626-70-8 (pdf, biblio, bibtex)
  9. Milan Straka, Jana Straková, Jan Hajič (2017): Prague at EPE 2017: The UDPipe System. In: Proceedings of the 2017 Shared Task on Extrinsic Parser Evaluation at the Fourth International Conference on Dependency Linguistics and the 15th International Conference on Parsing Technologies, pp. 65-74, Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, ISBN 978-1-945626-74-6 (pdf, biblio, bibtex)
  10. Jernej Vičič, Vladislav Kuboň, Petr Homola (2017): Česílko Goes Open-source. In: The Prague Bulletin of Mathematical Linguistics, ISSN 0032-6585, 107, pp. 57-66 (biblio, bibtex)