Grants

Discourse
Duration Provider
AnaConn: Anaphoricity in Connectives: Lexical Description and Bilingual Corpus Analysis 2017–2019 GAČR
EVALD (Evaluator of Discourse): Automatic Evaluation of Text Coherence in Czech 1. 3. 2016 – 31. 12. 2019 Ministry of Culture
CzeDParse: Automatic Analysis of Discourse Relations in Czech 2019-2021 GAČR
UNCE VITRI: Center for the Transdisciplinary Research of Violence, Trauma and Justice 2018-2023 UK
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
Global Coherence: Global Coherence of Czech Texts in the Corpus-Based Perspective 2020 - 2022 GAČR
Implicit Relations in Text Coherence 2017-2019 GAČR
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
NaMuDDiS: Natural multi-domain dialogue systems 2019-2021 UK
Lexicons
Duration Provider
AnaConn: Anaphoricity in Connectives: Lexical Description and Bilingual Corpus Analysis 2017–2019 GAČR
CzeDParse: Automatic Analysis of Discourse Relations in Czech 2019-2021 GAČR
VALLEX: Between Reciprocity and Reflexivity: The Case of Czech Reciprocal Constructions 2018-2020 GAČR
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
CzEngClass: Contextually-based synonymy and valency of verbs in a bilingual setting 2017-2019 GAČR
NomVallex II.: Valency of Non-verbal Predicates. An Extension of Valency Studies to Adjectives and Deadjectival Nouns. 2019-2021 GAČR
Multilingual
Duration Provider
AnaConn: Anaphoricity in Connectives: Lexical Description and Bilingual Corpus Analysis 2017–2019 GAČR
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR
Multilingual Machine Translation 2018-2020 GAČR
LangTech: Modernization of the Mathematical Linguistics Field of Study MŠMT
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020
NEUREM3: Neural Representations in Multi-modal and Multi-lingual Modelling 2019-2024 GAČR
Universal morphosyntactic annotation of language data 2017-2019 GAUK
Coreference
Duration Provider
EVALD (Evaluator of Discourse): Automatic Evaluation of Text Coherence in Czech 1. 3. 2016 – 31. 12. 2019 Ministry of Culture
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
Information Structure
Duration Provider
EVALD (Evaluator of Discourse): Automatic Evaluation of Text Coherence in Czech 1. 3. 2016 – 31. 12. 2019 Ministry of Culture
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
Annotations
Duration Provider
CzeDParse: Automatic Analysis of Discourse Relations in Czech 2019-2021 GAČR
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
CzEngClass: Contextually-based synonymy and valency of verbs in a bilingual setting 2017-2019 GAČR
Global Coherence: Global Coherence of Czech Texts in the Corpus-Based Perspective 2020 - 2022 GAČR
Implicit Relations in Text Coherence 2017-2019 GAČR
OP VVV LINDAT: LINDAT/CLARIN - Research infrastructure for language technologies – extension of the repository and its computational power 2017–2019 MŠMT
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
ForFun: Subcategorization of adverbial meanings based on corpus data 2017-2019 GAČR
LAPPS-CLARIN: Transatlantic Collaboration between LAPPS and CLARIN: Semantic, Technical and Infrastructural Interoperability of Services 2016-2018, 2019-2020 Mellon Foundation (USA)
Universal morphosyntactic annotation of language data 2017-2019 GAUK
Corpora
Duration Provider
CzeDParse: Automatic Analysis of Discourse Relations in Czech 2019-2021 GAČR
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
CzEngClass: Contextually-based synonymy and valency of verbs in a bilingual setting 2017-2019 GAČR
Global Coherence: Global Coherence of Czech Texts in the Corpus-Based Perspective 2020 - 2022 GAČR
Implicit Relations in Text Coherence 2017-2019 GAČR
OP VVV LINDAT: LINDAT/CLARIN - Research infrastructure for language technologies – extension of the repository and its computational power 2017–2019 MŠMT
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
ForFun: Subcategorization of adverbial meanings based on corpus data 2017-2019 GAČR
LAPPS-CLARIN: Transatlantic Collaboration between LAPPS and CLARIN: Semantic, Technical and Infrastructural Interoperability of Services 2016-2018, 2019-2020 Mellon Foundation (USA)
Universal morphosyntactic annotation of language data 2017-2019 GAUK
NomVallex II.: Valency of Non-verbal Predicates. An Extension of Valency Studies to Adjectives and Deadjectival Nouns. 2019-2021 GAČR
Data
Duration Provider
CzeDParse: Automatic Analysis of Discourse Relations in Czech 2019-2021 GAČR
VALLEX: Between Reciprocity and Reflexivity: The Case of Czech Reciprocal Constructions 2018-2020 GAČR
UNCE VITRI: Center for the Transdisciplinary Research of Violence, Trauma and Justice 2018-2023 UK
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
CzEngClass: Contextually-based synonymy and valency of verbs in a bilingual setting 2017-2019 GAČR
Global Coherence: Global Coherence of Czech Texts in the Corpus-Based Perspective 2020 - 2022 GAČR
OP VVV LINDAT: LINDAT/CLARIN - Research infrastructure for language technologies – extension of the repository and its computational power 2017–2019 MŠMT
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
ForFun: Subcategorization of adverbial meanings based on corpus data 2017-2019 GAČR
LAPPS-CLARIN: Transatlantic Collaboration between LAPPS and CLARIN: Semantic, Technical and Infrastructural Interoperability of Services 2016-2018, 2019-2020 Mellon Foundation (USA)
Word-formation structure of Czech words: a data-based research 2019-2021 GAČR
Monolingual
Duration Provider
VALLEX: Between Reciprocity and Reflexivity: The Case of Czech Reciprocal Constructions 2018-2020 GAČR
Implicit Relations in Text Coherence 2017-2019 GAČR
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
ForFun: Subcategorization of adverbial meanings based on corpus data 2017-2019 GAČR
Semantics
Duration Provider
VALLEX: Between Reciprocity and Reflexivity: The Case of Czech Reciprocal Constructions 2018-2020 GAČR
UNCE VITRI: Center for the Transdisciplinary Research of Violence, Trauma and Justice 2018-2023 UK
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
CzEngClass: Contextually-based synonymy and valency of verbs in a bilingual setting 2017-2019 GAČR
Global Coherence: Global Coherence of Czech Texts in the Corpus-Based Perspective 2020 - 2022 GAČR
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020
ForFun: Subcategorization of adverbial meanings based on corpus data 2017-2019 GAČR
Syntax
Duration Provider
VALLEX: Between Reciprocity and Reflexivity: The Case of Czech Reciprocal Constructions 2018-2020 GAČR
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
Valency
Duration Provider
VALLEX: Between Reciprocity and Reflexivity: The Case of Czech Reciprocal Constructions 2018-2020 GAČR
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
CzEngClass: Contextually-based synonymy and valency of verbs in a bilingual setting 2017-2019 GAČR
NomVallex II.: Valency of Non-verbal Predicates. An Extension of Valency Studies to Adjectives and Deadjectival Nouns. 2019-2021 GAČR
Machine Translation
Duration Provider
Bergamot: Browser-based Multilingual Translation 2019-2021 H2020
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
ELITR: European Live Translator 2019-2021 H2020
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR
Multilingual Machine Translation 2018-2020 GAČR
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020
Multi-modality
Duration Provider
CEMI: Center for large-scale multi-modal data interpretation 2012 - 2019 GAČR
UNCE VITRI: Center for the Transdisciplinary Research of Violence, Trauma and Justice 2018-2023 UK
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020
NEUREM3: Neural Representations in Multi-modal and Multi-lingual Modelling 2019-2024 GAČR
Dialog
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020
NaMuDDiS: Natural multi-domain dialogue systems 2019-2021 UK
THEaiTRE: Artificial Intelligence as the Author of a Theatre Play? April 2020 - April 2022 TAČR
Linked data
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
Machine Learning
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR
Multilingual Machine Translation 2018-2020 GAČR
LangTech: Modernization of the Mathematical Linguistics Field of Study MŠMT
NEUREM3: Neural Representations in Multi-modal and Multi-lingual Modelling 2019-2024 GAČR
Universal morphosyntactic annotation of language data 2017-2019 GAUK
Morphology
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR
Word-formation structure of Czech words: a data-based research 2019-2021 GAČR
Parsers
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020
Universal morphosyntactic annotation of language data 2017-2019 GAUK
Publications
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
Implicit Relations in Text Coherence 2017-2019 GAČR
Speech Recognition
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
ELITR: European Live Translator 2019-2021 H2020
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020
Taggers
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR
Tools
Duration Provider
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT
OP VVV LINDAT: LINDAT/CLARIN - Research infrastructure for language technologies – extension of the repository and its computational power 2017–2019 MŠMT
THEaiTRE: Artificial Intelligence as the Author of a Theatre Play? April 2020 - April 2022 TAČR
LAPPS-CLARIN: Transatlantic Collaboration between LAPPS and CLARIN: Semantic, Technical and Infrastructural Interoperability of Services 2016-2018, 2019-2020 Mellon Foundation (USA)
Duration Provider
ELG: European Language Grid 2019-2021 H2020
SSHOC: Social Sciences & Humanities Open Cloud 2019-30/04/2022 H2020
Teaching
Duration Provider
LCT: European Masters Program Language and Communication Technologies 2006-2011, 2013-2018, 2020-2025 EU ERASMUS MUNDUS
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT
LangTech: Modernization of the Mathematical Linguistics Field of Study MŠMT
NaMuDDiS: Natural multi-domain dialogue systems 2019-2021 UK
Generation
Duration Provider
NaMuDDiS: Natural multi-domain dialogue systems 2019-2021 UK
Provider: H2020
Duration Provider Grant ID PI Area
WELCOME: Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals. 2020-2023 H2020 870930 Pavel Pecina Dialog, Machine Translation, Multi-modality, Multilingual, Parsers, Semantics, Speech Recognition
SSHOC: Social Sciences & Humanities Open Cloud 2019-30/04/2022 H2020 823782 Jan Hajič
ELITR: European Live Translator 2019-2021 H2020 825460 Ondřej Bojar Machine Translation, Speech Recognition
Bergamot: Browser-based Multilingual Translation 2019-2021 H2020 825303 Ondřej Bojar Machine Translation
ELG: European Language Grid 2019-2021 H2020 825627 Jan Hajič
Provider: Mellon Foundation (USA)
Duration Provider Grant ID PI Area
LAPPS-CLARIN: Transatlantic Collaboration between LAPPS and CLARIN: Semantic, Technical and Infrastructural Interoperability of Services 2016-2018, 2019-2020 Mellon Foundation (USA) G-1901-06505 Jan Hajič Annotations, Corpora, Data, Tools

Institutional support for research at the Charles University

Duration Provider Grant ID PI Area
NaMuDDiS: Natural multi-domain dialogue systems 2019-2021 UK PRIMUS 19/SCI/10 Ondřej Dušek Dialog, Discourse, Generation, Teaching
UNCE VITRI: Center for the Transdisciplinary Research of Violence, Trauma and Justice 2018-2023 UK UNCE/HUM/009 Jakub Mlynář Data, Discourse, Multi-modality, Semantics

EU ERASMUS MUNDUS

Duration Provider Grant ID PI Area
LCT: European Masters Program Language and Communication Technologies 2006-2011, 2013-2018, 2020-2025 EU ERASMUS MUNDUS 610622-EPP-1-2019-1-DE-EPPKA1-JMD-MOB Vladislav Kuboň Teaching

Ministry of Education, Youth and Sport (Czech Republic)

Duration Provider Grant ID PI Area
INTERCOST-Readability: Modelling the Complexity of Czech Literary Texts VI 2018 - X 2021 MŠMT LTC18020 Silvie Cinková Annotations, Corpora, Data, Discourse, Information Structure, Semantics, Syntax, Teaching
OP VVV LINDAT: LINDAT/CLARIN - Research infrastructure for language technologies – extension of the repository and its computational power 2017–2019 MŠMT CZ.02.1.01/0.0/0.0/16_013/0001781 Jan Hajič Annotations, Corpora, Data, Tools
LINDAT/CLARIN: Centre for Language Research Infrastructure in the Czech Republic 2016 - 2019 MŠMT LM2015071 Jan Hajič Annotations, Coreference, Corpora, Data, Dialog, Discourse, Lexicons, Linked data, Machine Learning, Machine Translation, Morphology, Multi-modality, Parsers, Publications, Semantics, Speech Recognition, Taggers, Tools, Valency
LangTech: Modernization of the Mathematical Linguistics Field of Study MŠMT CZ.02.2.69/0.0/0.0/16_018/0002373 Zdeněk Žabokrtský Machine Learning, Multilingual, Teaching

Ministry of Culture

Duration Provider Grant ID PI Area
EVALD (Evaluator of Discourse): Automatic Evaluation of Text Coherence in Czech 1. 3. 2016 – 31. 12. 2019 Ministry of Culture DG16P02B016 Kateřina Rysová Coreference, Discourse, Information Structure

Czech Science Foundation

Duration Provider Grant ID PI Area
Global Coherence: Global Coherence of Czech Texts in the Corpus-Based Perspective 2020 - 2022 GAČR 20-09853S Lucie Poláková Annotations, Corpora, Data, Discourse, Semantics
NEUREM3: Neural Representations in Multi-modal and Multi-lingual Modelling 2019-2024 GAČR 19-26934X Ondřej Bojar Machine Learning, Multi-modality, Multilingual
CzeDParse: Automatic Analysis of Discourse Relations in Czech 2019-2021 GAČR 19-03490S Jiří Mírovský Annotations, Corpora, Data, Discourse, Lexicons
LiFR: Linguistic Factors of Readability in Czech Administrative and Educational Texts 2019-2021 GAČR 19-19191S Silvie Cinková Annotations, Corpora, Data, Discourse, Information Structure, Monolingual, Semantics, Syntax
NomVallex II.: Valency of Non-verbal Predicates. An Extension of Valency Studies to Adjectives and Deadjectival Nouns. 2019-2021 GAČR 19-16633S Veronika Kolářová Corpora, Lexicons, Valency
Word-formation structure of Czech words: a data-based research 2019-2021 GAČR 19-14534S Magda Ševčíková Data, Morphology
VALLEX: Between Reciprocity and Reflexivity: The Case of Czech Reciprocal Constructions 2018-2020 GAČR 18-03984S Markéta Lopatková Data, Lexicons, Monolingual, Semantics, Syntax, Valency
LSD: Linguistic Structure Representation in Neural Networks 2018-2020 GAČR 18-02196S David Mareček Machine Learning, Machine Translation, Morphology, Multilingual, Parsers, Syntax, Taggers
Multilingual Machine Translation 2018-2020 GAČR 18-24210S Ondřej Bojar Machine Learning, Machine Translation, Multilingual
AnaConn: Anaphoricity in Connectives: Lexical Description and Bilingual Corpus Analysis 2017–2019 GAČR GA17-06123S Kateřina Rysová Discourse, Lexicons, Multilingual
CzEngClass: Contextually-based synonymy and valency of verbs in a bilingual setting 2017-2019 GAČR GA17-07313S Zdeňka Urešová Annotations, Corpora, Data, Lexicons, Semantics, Valency
Implicit Relations in Text Coherence 2017-2019 GAČR GA 17-03461S Šárka Zikánová Annotations, Corpora, Discourse, Monolingual, Publications
ForFun: Subcategorization of adverbial meanings based on corpus data 2017-2019 GAČR GA17-12624S Marie Mikulová Annotations, Corpora, Data, Monolingual, Semantics
CEMI: Center for large-scale multi-modal data interpretation 2012 - 2019 GAČR GAP103/12/G084 Pavel Pecina Multi-modality

Technology Agency (Czech Republic)

Duration Provider Grant ID PI Area
THEaiTRE: Artificial Intelligence as the Author of a Theatre Play? April 2020 - April 2022 TAČR TL03000348 Rudolf Rosa Dialog, Tools

Grant Agency of the Charles University

Duration Provider Grant ID PI Area
Universal morphosyntactic annotation of language data 2017-2019 GAUK 794417 Kira Droganova Annotations, Corpora, Machine Learning, Multilingual, Parsers
National Science Foundation
Duration Provider Area
PIRE: Partnership for International Research and Education till 2014 NSF Machine Translation, Semantics, Speech Recognition, Teaching
Provider: H2020
Duration Provider Area
CLARIN-PLUS September 2015 – August 2017 H2020
QT21: Quality Translation 21 II.2015-I.2018 H2020 Data, Lexicons, Linked data, Machine Learning, Machine Translation, Tools
KConnect: Khresmoi Multilingual Medical Text Analysis, Search and Machine Translation Connected in a Thriving Data-Value Chain 2015-2017 H2020 Information Retrieval, Machine Translation, Semantics
HimL: Health in my Language 2.2015–1.2018 H2020 Data, Lexicons, Machine Translation, Morphology
CRACKER: Cracking the Language Barrier: Coordination, Evaluation and Resources for European MT Research 1.2015-12.2017 H2020 Data, Machine Translation

Research - European Commission

Duration Provider Area
EuroMatrix IX.2006-II.2009 FP6 Annotations, Corpora, Machine Translation, Tools, Valency
Provider: EU OP PPR
Duration Provider Area
MTviet: Machine Translation from Vietnamese into Czech for the Purposes of the Police of the Czech Republic 2017-2018 EU OP PPR Machine Translation

Grant Agency of the Charles University

Duration Provider Area
DeepSynt: Deep Syntactic Representation across Languages 2017-2018 GAUK Corpora, Data, Multilingual
Open domain dialog management with knowledge graphs 2016-2018 GAUK Data, Dialog, Machine Learning
open-domain SLU: Spoken Language Understanding in open-domain environment 2016-2018 GAUK Dialog, Information Retrieval, Linked data, Machine Learning, Semantics
ANNMT: Utilization of artificial neural networks in machine translation 2016-2018 GAUK Machine Translation
Using Language Knowledge in Scene Text Recognition 2015-2017 GAUK Multi-modality
cross-coref: Cross-lingual approaches to coreference resolution 2015-2017 GAUK Annotations, Coreference, Corpora, Data, Machine Learning, Machine Translation, Multilingual
DiaMine: Information mining from spoken dialogue 2015-2017 GAUK Data, Dialog, Machine Learning, Speech Recognition
Čapek GAUK: An alternative way of getting more annotated linguistic data 2014-2016 GAUK Annotations, Tools
AdaNLG: An adaptive natural language generator 2014-2016 GAUK Dialog, Generation, Multilingual, Semantics
croSSSynt: Modelling dependency syntax across languages 2014-2016 GAUK Annotations, Corpora, Data, Multilingual, Parsers
MSDS: Modern Spoken Dialog Systems 2014, 2015, 2016 GAUK Data, Dialog, Machine Learning, Speech Recognition
DepRefSet: Utilizing a Multitude of References in Machine Translation 2013-2015 GAUK Data, Machine Translation
Interactive information retrieval in audiovisual dialogue corpora 2013-2015 GAUK Information Retrieval, Speech Retrieval
Tools and data for Machine Translation between Related Languages 2012-2013 GAUK Corpora, Data, Machine Translation, Tools, Valency
Utilization of coreference in MT: Utilization of coreference in Machine Translation 2011-2013 GAUK Linked data, Machine Translation
Sentence-Level Polarity Detection in a Computer Corpus 2011-2013 GAUK Annotations, Corpora, Data, Lexicons, Tools

Ministry of Culture

Duration Provider Area
ÚSTR: System for the Permanent Preservation of Documentation and the Presentation of Historical Sources from the Period of Totalitarian Regimes 2016-2019 Ministry of Culture
VIADAT: Virtual Assistant for Accessing Historical Audiovisual Data 2016-2019 Ministry of Culture Annotations, Speech Recognition, Tools
AMALACH 2012-2015 Ministry of Culture Information Retrieval, Machine Translation, Multi-modality, Speech Recognition, Speech Retrieval, Teaching

Czech Science Foundation

Duration Provider Area
CorefChains: Structure of coreferential chains in parallel language data 2016-2018 GAČR Annotations, Coreference, Corpora, Data
NomVallex: Corpus-based Valency Lexicon of Czech Nouns 2016-2018 GAČR Corpora, Lexicons, Valency
DerInfMorph: An Integrated Approach to Derivational and Inflectional Morphology of Czech 2016-2018 GAČR Data, Monolingual, Morphology
Manyla: Morphologically and Syntactically Annotated Corpora of Many Languages 2015–2017 GAČR Annotations, Corpora, Data, Morphology, Multilingual, Parsers, Taggers
zelligharris: Reviving Zellig S. Harris: More linguistic information for distributional lexical analysis of English and Czech 2015-2017 GAČR Annotations, Corpora, Data, Semantics, Taggers
On Linguistic Structure of Evaluative Meaning in Czech 2015-2017 GAČR Annotations, Corpora, Data, Lexicons, Semantics
Combining Words: Syntactic Properties of Czech Multiword Expressions with Light Verbs 2015-2017 GAČR Annotations, Data, Lexicons, Multiword Expressions, Valency
LiStr: Sentence structure induction without annotated corpora 2014 - 2016 GAČR Machine Learning, Multilingual, Parsers
CzEngVallex: A comparison of Czech and English verbal valency based on corpus material (theory and practice) 2013-2015 GAČR Annotations, Corpora, Data, Lexicons
Selected Derivational Relations for Automatic Processing of Czech 2012–2014 GAČR Morphology
VALLEX: Delving Deeper: Lexicographic Description of Syntactic and Semantic Properties of Czech Verbs 2012-2015 GAČR Annotations, Data, Lexicons, Semantics, Syntax, Valency
Systematic, economical and corpus-based description of valency properties of Czech deverbal nouns (theory and practice) 2012-2014 GAČR Lexicons, Valency
CorefDisk: Coreference, Discourse Relations and Information Structure in a Contrastive Perspective 2012 - 2015 GAČR Annotations, Coreference, Corpora, Data, Discourse, Information Structure
CZECHMATE: Czech in the Age of Machine Translation 2011 – 2013 GAČR Annotations, Corpora, Data, Machine Translation, Morphology, Parsers
NoSCoM: Non-Standard Computational Models and Their Applications in Complexity, Linguistics, and Learning 2010-2014 GAČR
Computational Linguistics: Explicit Description of Language and Annotated Data with a Focus on Czech 2010-2013 GAČR Annotations, Coreference, Corpora, Data, Discourse, Information Structure

Ministry of Education, Youth and Sport (Czech Republic)

Duration Provider Area
Multilingual Corpus Annotation as a Support for Language Technologies 2014-2016 MŠMT Annotations, Coreference, Corpora, Data, Discourse
MOBAme: Modern Bayesian methods in machine learning 2013-2013 MŠMT Teaching
VYSTADIAL: Development of statistical methods for spoken dialogue systems 2012-2016 MŠMT Corpora, Dialog, Speech Recognition, Tools
KontaktII: Machine Translation with Semantic Information 2012-2014 MŠMT Annotations, Corpora, Data, Lexicons, Machine Translation, Semantics, Valency
LINDAT/Clarin: Establishing and operating the Czech node of pan-European infrastructure for research 2010-2015 MŠMT Annotations, Coreference, Corpora, Data, Dialog, Discourse, Lexicons, Linked data, Machine Learning, Machine Translation, Morphology, Multi-modality, Parsers, Publications, Semantics, Speech Recognition, Taggers, Tools, Valency
Kontakt: Towards a Computational Analysis of Text Structure 2010 - 2012 MŠMT Annotations, Coreference, Corpora, Data, Discourse
TextLink-cz: TextLink: Discourse Structure in European Languages 1.11.2015 - 31.12.2017 MŠMT Annotations, Corpora, Data, Discourse, Lexicons, Linked data, Monolingual
LD-Parseme: PARSEME: Parsing and Multiword Expressions: Towards Linguistic Precision and Computational Efficiency in Natural Language Processing 04-2014 – 03-2017 MŠMT Lexicons, Multiword Expressions, Semantics, Valency
FP7 - Research - Europa
Duration Provider Area
TextLink: Structuring Discourse in Multilingual Europe 2014 - 2017 FP7 Coreference, Corpora, Discourse, Linked data, Multilingual
QTLeap: Quality Translation by Deep Language Engineering Approaches 2013–2016 FP7 Linked data, Machine Translation
PARSEME: Parsing and Multiword Expressions 2013-2017 FP7 Lexicons, Multiword Expressions, Semantics, Valency
MosesCore 2012-2015 FP7 Data, Machine Translation, Teaching, Tools
EUDAT: European Data Infrastructure 2011–2014 FP7 Data
FAUST: Feedback Analysis for User adaptive Statistical Translation 2010–2013 FP7 Machine Translation
KHRESMOI: Medical information analysis and retrieval 2010-2014 FP7 Information Retrieval, Machine Translation
CLARA: Common Language Resources and their Applications - a Marie Curie ITN 2009-2013 FP7 Annotations, Corpora, Data, Machine Translation, Teaching
EuroMatrixPlus 2009-2012 FP7 Machine Translation

Institutional support for research at the Charles University

Duration Provider Area
PRVOUK: Programmes for the Development of Scientific Fields at Charles University - Computer Science 2012-2016 UK

Technology Agency (Czech Republic)

Duration Provider Area
INTLIB: Intelligent library 2012-2015 TAČR Data, Linked data, Tools

EU Lifelong Learning Programme

Duration Provider Area
Merlin 2012-2014 LLP Annotations, Corpora, Data
Provider: Inspire
Duration Provider Area
INSPIRE: INSPIRE in Pocket Inspire Machine Translation