Manual for Morphological Annotation

Revision for the Prague Dependency Treebank 2.0

ÚFAL Technical Report No. 2005-27

Jiří Hana

Daniel Zeman

Jan Hajič
Hana Hanová
Barbora Hladká
Emil Jeřábek

Table of Contents

Preface to Version 2.0
Preface to Version 1.0
1. Introduction
2. Lemma and tag structure
2.1. Lemma structure
2.1.1. Base form and number
2.1.2. Reference
2.1.3. Category
2.1.4. Term
2.1.5. Style
2.1.6. Explanational comment
2.1.7. Comment on derivation
2.2. Tag Structure
2.2.1. Positional tags
2.2.2. Compact tags
2.2.3. Informal abbreviations
3. Names
3.1. Personal names
3.1.1. von, van, etc.
3.1.2. Chinese and Korean names
3.1.3. Foreignized Czech names
3.2. Geographical names
3.2.1. Countries, cities, rivers, mountains
3.2.2. Streets
3.3. Companies and institutions
3.3.1. Restaurants
3.3.2. Sport clubs
3.4. Horses, DJs, etc.
3.5. Products
3.6. Sporting and other events
3.7. Other
3.7.1. Buildings
3.7.2. Televisions
3.7.3. News and magazines
3.7.4. Song names
3.8. Adjectives derived from names
4. Abbreviations
4.1. Gender
4.2. Isolated letters
4.3. Units of measurement
4.4. Authors' signatures
4.5. Academic titles
5. Colloquial Czech
5.1. Cos, kdys, jaks...
5.2. Suffix in plural of neuter
6. Foreign words and phrases
6.1. Articles
6.2. English noun clusters
6.3. Nouns
6.4. Verbs
6.4.1. English verbs
6.5. Slavic languages and Czech dialects
7. Errors
7.1. Characters
7.2. Separators
8. Hard to decide
8.1. až
8.2. jak
8.3. málo
8.4. moc
8.5. proto
8.6. svůj
8.7. tak
9. Selected words
10. Date and time
11. Numbers, numerals and quantifiers
12. Hyphenated composites
13. Insertion
13.1. Possessive adjectives
13.2. Words ending with -ismus, -izmus
13.3. Transcription of pronunciation
13.4. Crippled forms
13.5. Isolated morphemes
13.6. Geometry
13.7. Chess codes

List of Tables

2.1. Lemma examples
2.2. Lemma categories
2.3. Term types
2.4. Style flags
2.5. SUBPOS
2.6. Gender
2.7. Number
2.8. CASE
2.9. Possessor's Gender
2.10. Possessor's Number
2.11. PERSON
2.12. Tense
2.13. GRADE
2.14. NEGATION
2.15. Voice
2.16. VAR
3.1. Name types
3.2. Examples of geographical names
3.3. Examples of company names
3.4. Examples of restaurant names
3.5. Examples of sport club names
3.6. Examples of event names
4.1. Examples of abbreviations
4.2. Gender of abbreviations
4.3. Examples of isolated letters
4.4. Examples of units
5.1. Colloquial examples
6.1. Examples of foreign phrases
6.2. Articles in common foreign languages
6.3. Number and case of English nouns
6.4. Examples of English verbs

List of Examples

2.1. Following examples illustrate this:
2.2. Other examples:
3.1. Personal names with von, van etc.
3.2. Chinese and Korean names
3.3. Street names
3.4. Names of horses
3.5. TV company names
3.6. Names of periodicals
11.1. Case agreement in counted phrases
12.1. Hyphenated composites
13.1. -ismus, -izmus
13.2. Transcription of pronunciation
13.3. Crippled forms