LINDAT/CLARIAH-CZ Language Resources and Digital Arts and Humanities Research Infrastructure

Description and structure of the project

LINDAT/CLARIAH-CZ was created by the merger of the large research infrastructures LINDAT/CLARIN and DARIAH-CZ; in 2023, four Czech institutions from the Czech EHRI consortium joined. It is a unique large research infrastructure that focuses mainly on language resources, but also on other digital resources and the tools for processing them. It develops these resources and tools and provides them to the scientific community in the relevant fields, to the application development industry and, in specific cases such as language culture, directly to the public.
LINDAT/CLARIAH-CZ is a joint, distributed national hub of the Czech Republic in the European research infrastructures CLARIN ERIC (Common Language Resources and Technology Infrastructure), DARIAH ERIC (Digital Research Infrastructure for the Arts and Humanities) and, from 2024, the future EHRI ERIC. It is composed of 15 leading Czech research organizations working in the humanities and arts: linguistics, history and historical bibliography, culture and cultural studies, art history, philosophy, film culture, visual arts, musicology and history of music, ethnology, folklore, archaeology and several interdisciplinary fields.
The aim of LINDAT/CLARIAH-CZ is to open up access to digitized data resources in these fields for the broad research community and students in the Czech Republic and the EU, while gaining access to similar resources available in the European networks CLARIN, DARIAH and EHRI. LINDAT/CLARIAH-CZ engages in international cooperation both between similar research infrastructures and directly between institutions in all humanities disciplines, and emphasizes digital and interdisciplinary processing methods, including modern machine learning and artificial intelligence methods. Its activities also include analyzing the legal aspects of using digital humanities resources, in particular possible copyright restrictions, with the goal of minimizing their impact on research work.
LINDAT/CLARIAH-CZ also offers know-how, software tools for processing language and other digital resources, and the development of language technologies for the needs of industry and services, including their use in the new cultural and creative industries.

Open Science Principles

LINDAT/CLARIAH-CZ fully supports Open Science principles for access to scientific data and scientific outputs created with public support, including adherence to the FAIR principles to the greatest extent possible. This applies to all cooperating institutions as well as to other institutions in the humanities and arts. LINDAT/CLARIAH-CZ is part of the national and European Open Science networks (EOSC CZ, SSHOC). The LINDAT/CLARIAH-CZ repository itself, CLARIN DSpace, is being developed as open-source software with substantial contributions from LINDAT/CLARIAH-CZ.

Support for Research Projects

Beyond its involvement in infrastructure projects, LINDAT/CLARIAH-CZ directly supports a number of national and international scientific projects (e.g. European projects in Horizon 2020 and Horizon Europe, such as the recent projects HPLT, ELITR, Bergamot, RESQ+, MEMORISE and others). These projects use language data and data from the digital humanities and arts, as well as software tools developed in the LINDAT/CLARIAH-CZ consortium and provided as software packages or as web services, including interfaces in the form of APIs or web-based AI.


Guarantee of sustainability

LINDAT/CLARIAH-CZ is hosted by the Institute of Formal and Applied Linguistics (ÚFAL) of the Faculty of Mathematics and Physics, Charles University (MFF UK), in Prague. ÚFAL guarantees, as a minimum measure, the availability of data stored in the LINDAT/CLARIAH-CZ repository for at least 10 years after any termination of the LINDAT/CLARIAH-CZ project. Both LINDAT/CLARIAH-CZ and ÚFAL MFF UK can transfer this responsibility by written agreement to another CLARIN technical centre or its hosting institution.

Partner institutions in the LINDAT/CLARIAH-CZ consortium

  • Institute of Philosophy of the CAS, v.v.i.
  • Institute of History of the CAS, v.v.i.
  • Library of the Academy of Sciences of the Czech Republic
  • Masaryk University
  • Moravian Library
  • National Film Archive
  • National Gallery
  • National Library of the Czech Republic
  • Czech Language Institute of the CAS, v.v.i.
  • University of West Bohemia in Pilsen
  • Masaryk Institute and Archive of the CAS, v.v.i.
  • Terezín Initiative Institute
  • Terezín Memorial
  • National Archives