t

The morphological tag of the current token (which can be found in the text part of <f> or <d>), manually disambiguated. The tagset is defined by the morphological dictionary used for preprocessing the data.

In the Prague Dependency Treebank (PDT), the following tagset system is currently in use. For more information, please refer to the PDT documentation.

Each tag is a string of 15 symbols (mostly uppercase letters and digits, though many lowercase letters and special symbols are used as well). Each single-character position holds the value of one morphological category. Thirteen of the positions are actually in use; positions 13 and 14 are reserved:
Position  Category name  Description
1         POS            Part of Speech
2         SUBPOS         Detailed Part of Speech
3         GENDER         Grammatical Gender (for agreement)
4         NUMBER         Grammatical Number (for agreement)
5         CASE           Morphological Case
6         POSSGENDER     Gender of Possessor
7         POSSNUMBER     Number of Possessor
8         PERSON         Person
9         TENSE          Tense
10        GRADE          Degree of Comparison
11        NEGATION       Negation
12        VOICE          Voice
13        RESERVE1       Reserved
14        RESERVE2       Reserved
15        VAR            Variant, Style, Register
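
Because the tag is positional, it can be split into named categories by pairing each character with its position. The following is a minimal sketch in Python, not part of the PDT tools. The names CATEGORIES and parse_tag are hypothetical, and the sample tag "NNFS1-----A----" is an assumed illustrative value, as is the reading of "-" as "not applicable"; consult the tagset documentation for the actual value sets.

```python
# Category names in positional order, taken from the table above.
CATEGORIES = [
    "POS", "SUBPOS", "GENDER", "NUMBER", "CASE",
    "POSSGENDER", "POSSNUMBER", "PERSON", "TENSE", "GRADE",
    "NEGATION", "VOICE", "RESERVE1", "RESERVE2", "VAR",
]


def parse_tag(tag):
    """Map each of the 15 tag positions to its category name.

    Hypothetical helper: only the tag length is checked, not whether
    each symbol is a legal value for its category.
    """
    if len(tag) != len(CATEGORIES):
        raise ValueError(
            "expected %d symbols, got %d" % (len(CATEGORIES), len(tag))
        )
    return dict(zip(CATEGORIES, tag))


# Assumed sample tag, for illustration only.
fields = parse_tag("NNFS1-----A----")
print(fields["POS"], fields["GENDER"], fields["NUMBER"], fields["CASE"])
```

A dictionary keeps the category names visible at the point of use; code that only ever needs one or two categories could index the string directly instead (position 5, CASE, is tag[4] in zero-based indexing).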

For details on the individual categories, in particular the sets of possible values, see the full tagset documentation (available as PostScript and PDF) or the quick tagset reference (available as HTML and PDF).


Content



Tag Minimization
Open Tag: REQUIRED
Close Tag: OPTIONAL

Parent Elements




csts DTD