As for the nodes representing words that are present at the surface level, their t-lemma is usually identical to their m-lemma.
However, some words have a special t-lemma that has no counterpart among morphological lemmas (the so-called t-lemma substitute, see Section 4, "T-lemma substitutes"; cf. a), b) and j) in the list). Others have a t-lemma that corresponds to the m-lemma of a different word (cf. c) through i) in the list), or a multi-word t-lemma that corresponds to two or more m-lemmas (cf. k) in the list). In still other cases, the t-lemma corresponds to the surface form of a given word (cf. l) in the list). Paratactic structure root nodes have so-called representative (i.e. typical) t-lemmas (cf. m) in the list).
The relevant cases are the following:
a) personal pronouns (including the reflexive si and se) have the t-lemma substitute #PersPron (see Section 4, "T-lemma substitutes").
tobě (=you.DAT/LOC) is represented by the t-lemma #PersPron;
oni (=they) → #PersPron;
sobě (=self.DAT/LOC) → #PersPron.
b) possessive pronouns (including the reflexive svůj) are also represented by the t-lemma substitute #PersPron (see Section 4, "T-lemma substitutes").
náš (=our) is assigned the t-lemma #PersPron;
její (=her) → #PersPron;
svoje (=self's) → #PersPron.
c) matčin (=mother's) is represented by the t-lemma matka (=mother);
Pavlova (=Pavel's) → Pavel.
d) zklamán (=disappointed) is represented by the t-lemma zklamaný;
spokojena (=satisfied.fem., short form) → spokojený (=satisfied.masc., long form);
ochoten (=willing) → ochotný.
NB! Passive participles are represented by the infinitive; for example, pozván (=invited) is represented by a node with the t-lemma pozvat (=invite).
e) pěkně (=nicely) is represented by a node with the t-lemma pěkný (=nice).
f) tudy (=this_way) is represented by a node with the t-lemma tady (=here);
sem (=here.directional) → tady (=here.locative).
g) doteď (=until_now) has the t-lemma teď (=now);
doposud (=until_now) → teď (=now).
h) trojí (=three_kinds_of) is represented by a node with the t-lemma tři (=three);
třetina (=one_third) → tři (=three);
kolikátý (=how_many.ordinal) → kolik (=how_many.cardinal);
pětkrát (=five_times) → pět (=five).
See Section 1, "Syntactic and lexical derivation", Section 6.1.5, "Definite quantificational semantic nouns", Section 6.2.4, "Definite quantificational semantic adjectives" and Section 6.2.5, "Indefinite quantificational semantic adjectives".
i) někdo (=someone) has the t-lemma kdo (=who);
nic (=nothing) → co (=what);
všechen (=all) → co (=what);
žádný (=none) → který (=which).
See Section 1, "Syntactic and lexical derivation", Section 6.1.4, "Indefinite pronominal semantic nouns", Section 6.2.3, "Indefinite pronominal semantic adjectives", Section 6.3.5, "Definite pronominal semantic adverbs" and Section 6.3.6, "Indefinite pronominal semantic adverbs".
j) punctuation marks and other symbols are assigned t-lemma substitutes (similarly to personal and possessive pronouns). See Section 4, "T-lemma substitutes".
the comma has the t-lemma
k) expressions that consist of several parts (words) but have a single meaning are in some cases represented by a single node with a single t-lemma in which the parts are joined together. Such a t-lemma is called a multi-word t-lemma; for more details see Section 3, "T-lemmas of multi-word (complex) lexical units".
smát se (=laugh; lit. laugh REFL) is represented by a single node whose t-lemma is smát_se;
a nebo (=or; literally and_or) → a_nebo;
van Beethoven → van_Beethoven.
l) frozen verbal forms (finite forms, as well as transgressives (gerunds) and infinitives, i.e. forms having adverbial functions) are represented by nodes whose t-lemmas are identical to the surface form of the expression, e.g. myslím, soudě (=I_think, judging). Similarly, foreign-language expressions (with the FPHR functor) are assigned t-lemmas identical to the corresponding surface forms.
m) different variants of conjunctions and other connectives and operators are represented by a node (coap) whose t-lemma corresponds to the m-lemma of one of the variants (this is the so-called representative t-lemma). The representative t-lemma may also be complex; cf. k) in the list and Section 3.1, "Multi-word t-lemma".
both buď (=either) - nebo (=or) and buďto - nebo are represented by a single node with the representative t-lemma buď_nebo;
od (=from) - přes (=via) - do (=to), as well as od - přes - po (=to) and od - přes - k (=to) → od_przes_do.
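The multi-word and representative t-lemmas described in the list can be pictured as a small lookup. The following is only an illustrative sketch: the function and table names are hypothetical and not part of any annotation tool, and the data is limited to the examples quoted above.

```python
# Illustrative sketch only; all names here are assumptions made for this example.

def multi_word_t_lemma(parts):
    """Join the parts of a multi-word unit with underscores, as in smát_se."""
    return "_".join(parts)

# Variants of connectives mapped to one representative t-lemma
# (only the examples given in the list above).
REPRESENTATIVE = {
    ("buď", "nebo"): "buď_nebo",
    ("buďto", "nebo"): "buď_nebo",
    ("od", "přes", "do"): "od_přes_do",
    ("od", "přes", "po"): "od_přes_do",
    ("od", "přes", "k"): "od_přes_do",
}

def representative_t_lemma(parts):
    """Return the representative t-lemma for a known variant;
    otherwise fall back to plain underscore joining."""
    key = tuple(parts)
    return REPRESENTATIVE.get(key, multi_word_t_lemma(key))

print(multi_word_t_lemma(["smát", "se"]))          # smát_se
print(representative_t_lemma(["buďto", "nebo"]))   # buď_nebo
print(representative_t_lemma(["od", "přes", "k"])) # od_przes_do
```

The table is keyed by the tuple of parts so that every variant of a connective resolves to the same node label, while unlisted multi-word units keep their own joined form.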
The choice of the t-lemma described in b), c) and e) through i) is a result of taking the derivational processes into account. In principle, derived expressions have the same t-lemma as the base expressions. For the information regarding the relevant derivation types, see Section 1, "Syntactic and lexical derivation".
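The principle that derived expressions share the t-lemma of their base can be sketched as a table lookup with the m-lemma as the default. This is a hedged illustration, not the annotation procedure itself: the names `BASE_T_LEMMA`, `t_lemma` and `group_by_t_lemma` are hypothetical, and the table contains only derivational examples quoted in the list.

```python
from collections import defaultdict

# Derived word -> t-lemma of its base word (examples from the list only).
BASE_T_LEMMA = {
    "matčin": "matka", "Pavlova": "Pavel",
    "pěkně": "pěkný",
    "tudy": "tady", "sem": "tady",
    "doteď": "teď", "doposud": "teď",
    "trojí": "tři", "třetina": "tři", "kolikátý": "kolik", "pětkrát": "pět",
    "někdo": "kdo", "nic": "co", "všechen": "co", "žádný": "který",
}

def t_lemma(word, m_lemma):
    """Use the base word's t-lemma for a listed derived word;
    otherwise the t-lemma is identical to the m-lemma."""
    return BASE_T_LEMMA.get(word, m_lemma)

def group_by_t_lemma(words):
    """Group words that end up with the same t-lemma (word used as its own m-lemma)."""
    groups = defaultdict(list)
    for word in words:
        groups[t_lemma(word, word)].append(word)
    return dict(groups)

print(t_lemma("matčin", "matčin"))               # matka
print(group_by_t_lemma(["sem", "tudy", "tady"]))  # {'tady': ['sem', 'tudy', 'tady']}
```

Grouping by t-lemma makes the effect of the rule visible: directional and locative adverbs, or a numeral and its derivatives, collapse onto a single base entry.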
Newly established nodes may be assigned one of the t-lemma substitutes, which do not correspond to any m-lemma; see Section 4, "T-lemma substitutes". As for determining the appropriate t-lemma, copied nodes are subject to the same rules as the nodes present at the surface level.