Universal Dependencies - project providing the annotation for treebanks in 18 languages. The annotation scheme is based on (universal) Stanford dependencies (de Marneffe et al., 2006, 2008, 2014), Google universal part-of-speech tags (Petrov et al., 2012), and the Interset interlingua for morphosyntactic tagsets (Zeman, 2008). The project page with general description of data and tagsets are here.
Search among monolingual corpora, click Universal Dependencies and choose the language you like:
The general and full information on UI and search in KonText can be found here, though note that the attributes and metainformation to search are different from the UD tagset, so use it just as a manual on how to search in general. The attributes for search in UD correspond to those in CoNLL-U.
In order to make some experiments in comparative linguistics, we compiled the - 5,000 first
sentences for each language from UD. There is no sense in searching for some lexical issues, but the grammar attributes can be used to compare certain linguistic phenomena in several languages. The frequency distribution in the languages can be viewed with the function Frequency->Doc IDs (the user should be logged in to access this function), where Doc ID stand for a concrete language.
examples of queries: