3.2.1. Countries, cities, rivers, mountains

Main noun. The main word (head) in a multi-word name of a city is always a noun; the same holds for a one-word city name. If it is homonymous with an adjective, a new noun lemma is created for the name. Thus Hluboká is lemmatized as Hluboká_;G / NNFS1-----A---- rather than hluboký / AAFS1----1A---- (lit. deep). Nouns that are frequently used in names (such as Újezd, Ústí) may have their own geographical lemmas even if they are homonymous with a normal word. For homonymous pairs where the non-geographical usage is much more common (such as voda (water), ves (village), město (city)), it is recommended to stick with the non-geographical lemma even in geographical usages.

Modifiers in multi-word names. Attributive adjectives, prepositions, conjunctions etc. should be lemmatized as normal words. Other nouns may be lemmatized as geographical if they are nested geographical names (e.g. names of rivers or mountains in names of cities).

Part of speech of foreign words. The original part of speech of the word in the source language is used unless there is a good reason not to do so. Besides not knowing the original part of speech, a very good reason is that the word behaves as a different part of speech in Czech texts. For instance, blanc is an adjective in French Mont Blanc, but it behaves as a noun in na Mont Blanku. Mont can be annotated as an undeclined noun. See Chapter 6, Foreign words and phrases, for more information on foreign words.

Table 3.2. Examples of geographical names

| Name | Type | Morphological annotation |
| --- | --- | --- |
| Česká republika | country | český / AAFS1----1A---- // republika / NNFS1-----A---- |
| Ústí nad Labem | city | Ústí_;G / NNNS1-----A---- // nad-1 / RR--7---------- // Labe_;G / NNNS7-----A---- |
| Karlovy Vary | city | Karlův_;Y_^(*3el) / AUIP1M--------- // Vary_;G_^(Karlovy_Vary) / NNIP1-----A---- |
| Dobrá Voda | city | dobrý / AAFS1----1A---- // voda / NNFS1-----A---- |
| Odolena Voda | city | Odolena_;G_^(Odolena_Voda) / AAXXX----1A---- // voda / NNFS1-----A---- |
| Černá v Pošumaví | city | Černá_;G / NNFS1-----A---- // v-1 / RR--6---------- // Pošumaví_;G / NNNS6-----A---- |
| Ohrada u Hluboké | city | ohrada / NNFS1-----A---- // u-1 / RR--2---------- // Hluboká_;G / NNFS2-----A---- |
| Hradec Králové | city | Hradec_;G / NNIS1-----A---- // králová_^(královna) / NNFS2-----A---- |
| Kostelec nad Černými Lesy | city | Kostelec_;G / NNIS1-----A---- // nad-1 / RR--7---------- // černý_;o / AAIP7----1A---- // les / NNIP7-----A---- |
| New York | city | new_,t_^(angl._nový) / AAXXX----1A---- // York_;G / NNIS1-----A---- |
| A Coruña | city | o-10_,t_^(port._člen) / AAFSX----1A---- // Coruña_;G / NNFS1-----A---- |
| São Paulo | city | são_,t_^(port._svatý) / AAMSX----1A---- // Paulo_;Y / NNMS1-----A---- |
| Rio de Janeiro | city | Rio_;G / NNNS1-----A---- // de_,t / RR--X---------- // Janeiro_;G / NNNS1-----A---- |
| Le Havre | city | le_,t_^(fr._člen) / AAISX----1A---- // Havre_;G / NNIS1-----A---- |
| Krems an der Donau | city | Krems_;G / NNIS1-----A---- // an_,t / RR--3---------- // der_,t_^(něm._člen) / AAFS3----1A---- // Donau_;G / NNFSX-----A---- |
| San Juan de la Rambla | city | san_,t_^(šp._a_it._svatý) / AAMSX----1A---- // Juan_;Y / NNMS1-----A---- // de_,t / RR--X---------- // el_,t_^(šp._člen) / AAFSX----1A---- // Rambla_;G / NNFSX-----A---- |
| Kao-hsiung | city | Kao_;G / AAXXX----1A---- // - / Z:------------- // hsiung_;G_^(př._Kao-hsiung) / NNXXX-----A---- |
| Wu-lu-mu-čchi | city | Wu_;G / NNXXX-----A---- // - / Z:------------- // lu_;G / NNXXX-----A---- // - / Z:------------- // mu_;G / NNXXX-----A---- // - / Z:------------- // čchi_;G / NNXXX-----A---- |
| Gerlachovský štít | mountain | gerlachovský / AAIS1----1A---- // štít / NNIS1-----A---- |
| Divoká Orlice | river | divoký / AAFS1----1A---- // Orlice_;G / NNFS1-----A---- |
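
The annotation strings in Table 3.2 follow a regular pattern: tokens are separated by " // ", and each token is written as "lemma / tag". The following Python sketch shows one way such a string could be split and checked. It is an illustration only, not part of any annotation tool: the names Token, parse_annotation and malformed_tags are hypothetical, and the expected tag length of 15 is read off the tags shown in the table. The "base" field simply cuts the lemma at the first "_;", "_," or "_^" marker, so numeric suffixes such as the "-1" in nad-1 are kept.

```python
import re
from typing import NamedTuple

TAG_LENGTH = 15  # assumption: length of the positional tags shown in the table


class Token(NamedTuple):
    lemma: str             # full lemma, e.g. "Vary_;G_^(Karlovy_Vary)"
    base: str              # lemma up to the first "_;", "_," or "_^" marker
    categories: list[str]  # letters that follow "_;", e.g. ["G"]
    tag: str               # positional tag, e.g. "NNIP1-----A----"


def parse_annotation(annotation: str) -> list[Token]:
    """Split 'lemma / tag // lemma / tag ...' into tokens."""
    tokens = []
    for part in annotation.split(" // "):
        lemma, tag = part.rsplit(" / ", 1)
        base = re.split(r"_[;,^]", lemma, maxsplit=1)[0]
        categories = re.findall(r"_;(\w)", lemma)
        tokens.append(Token(lemma, base, categories, tag))
    return tokens


def malformed_tags(tokens: list[Token]) -> list[Token]:
    """Return the tokens whose tag does not have the expected length."""
    return [t for t in tokens if len(t.tag) != TAG_LENGTH]


name = parse_annotation(
    "Ústí_;G / NNNS1-----A---- // nad-1 / RR--7---------- // Labe_;G / NNNS7-----A----"
)
geographical = [t.base for t in name if "G" in t.categories]
print(geographical)          # ['Ústí', 'Labe']
print(malformed_tags(name))  # []
```

A check of this kind is mainly useful for catching copy-and-paste slips, such as a tag with one dash too many; it says nothing about whether the chosen lemma or part of speech follows the guidelines above.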