Archive – academic year 2020/21
Lecture 1 (March 3, 2021): Introduction, trees, dependencies
- Osborne, T. (2019) A Dependency Grammar of English. John Benjamins Publishing Company, Amsterdam/Philadelphia (available in my office)
- Hajičová, E., Panevová, J., Sgall, P. (2002) Úvod do teoretické a počítačové lingvistiky, sv. I. Karolinum, Praha (available in the secretariat)
- Štekauer, P., ed. (2000) Rudiments of English Linguistics. Slovacontact, Prešov. (available in my office)
- Štěpánek, J. (2006) Závislostní zachycení větné struktury v anotovaném syntaktickém korpusu. PhD Thesis, MFF UK (link)
- Wikipedia - basic articles on dependency grammar are consistent with Timothy Osborne's approach
- Universal dependencies (intro):
- Prague Dependency Treebank (intro):
Lecture 2 (March 10, 2021): Non-dependency relations and their representation; Word order and (non-)projectivity
- Kuhlmann, M., Nivre, J. (2006): Mildly Non-Projective Dependency Structures. In COLING/ACL Main Conference Poster Sessions, 507–514 (link).
- Petkevič, V. (1995) A New Formal Specification of Underlying Structure. Theoretical Linguistics, vol. 21, No. 1
- Štěpánek, J. (2006) Závislostní zachycení větné struktury v anotovaném syntaktickém korpusu. PhD Thesis, MFF UK (link)
- Havelka, J. (2007): Mathematical Properties of Dependency Trees and their Application to Natural Language Syntax. PhD Thesis, MFF UK (link)
- Universal Dependencies:
- Prague Dependency Treebank:
Lecture 3 (March 17, 2021): Intro to Stratificational Approach to Language Description (stratificational grammar, FGD, MTT)
- Hajičová, E., Panevová, J., Sgall, P. (2002) Úvod do teoretické a počítačové lingvistiky, sv. I. Karolinum, Praha (available in the secretariat)
- Štekauer, P., ed. (2000) Rudiments of English Linguistics. Slovacontact, Prešov. (available in my office)
- Sgall, P. (1967) Generativní popis jazyka a česká deklinace. Academia, Praha (available in my office)
- Žabokrtský, Z. (2006) Resemblances between Meaning-Text Theory and Functional Generative Description. In Proceedings of the 2nd International Conference of Meaning-Text Theory, Slavic Culture Languages Publishers House, Moskva, pp. 549-557. (link)
- Sgall, P., Hajičová, E., Panevová, J. (1986) The Meaning of the Sentence in Its Semantic and Pragmatic Aspects. Reidel, Dordrecht.
Lecture 4 (March 24, 2021):
TOPIC 1: Intro to Prague Dependency Treebank
- Hajič, J., Hajičová, E., Mírovský, J., Panevová, J.: Linguistically Annotated Corpus as an Invaluable Resource for Advancements in Linguistic Research: A Case Study. The Prague Bulletin of Mathematical Linguistics, No. 106, ISSN 0032-6585, pp. 69-124, 2016 (link)
- Hajičová, E., Panevová, J., Sgall, P. (2002) Úvod do teoretické a počítačové lingvistiky, sv. I. Karolinum, Praha (available in the secretariat)
- Hajič, J. (1998) Building a Syntactically Annotated Corpus: The Prague Dependency Treebank. In E. Hajičová (ed.): Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová, Karolinum, Charles University Press, Prague, Czech Republic, pp. 106-132 (link)
- Štekauer, P., ed. (2000) Rudiments of English Linguistics. Slovacontact, Prešov. (available in my office)
- PDT-C webpage
- PDT 2.0 guide
TOPIC 2: PDT and its morphological annotation
- note: You are not expected to memorize the tag structure, but you might be asked to provide examples (using the following table pdf)
- Matthews, H. (1997) The Concise Oxford Dictionary of Linguistics. Oxford University Press, Oxford
- Filipec, J. (1994) Lexicology and Lexicography: Development and State of the Research. In Luelsdorff, P.A. (ed.) The Prague School of Structural and Functional Linguistics, Amsterdam-Philadelphia, John Benjamins, p.163–183
- Hajič, J. (2004) Disambiguation of Rich Inflection (Computational Morphology of Czech). Karolinum, Charles University Press, Prague.
- Straková Jana, Straka Milan and Hajič Jan. (2014) Open-Source Tools for Morphology, Lemmatization, POS Tagging and Named Entity Recognition. In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 13-18, Baltimore, Maryland, June 2014. Association for Computational Linguistics.
- PDT documentation: Manual for morphological annotation
- Table with morphological tags in PDT 2.0 (pdf)
Lecture 5 (March 31, 2021): Intro to UD, morphology
- Nivre Joakim et al. (2020) Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection. In: Proceedings of LREC 2020. ELRA, Marseille, France, p. 4034-4043, 2020 (link).
Lecture 6 (April 7, 2021): Surface syntactic annotation in PDT (a-layer)
- Hajič, J. (1998) Building a Syntactically Annotated Corpus: The Prague Dependency Treebank. In E. Hajičová (ed.): Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová, Karolinum, Charles University Press, Prague, Czech Republic, pp. 106-132 (link)
- Štekauer, P., ed. (2000) Rudiments of English Linguistics. Slovacontact, Prešov (chapter 4, Syntax)
- Quirk, R., Greenbaum, S., Leech, G., Svartvik, J. (1985) A Comprehensive Grammar of the English Language, Longman, London.
- PDT documentation: Manual for Analytical Annotation (link)
- Table with analytical functions in PDT 2.0 (pdf)
Lectures 7, 8 and 9 (April 14, 21 and 28, 2021): Syntax in UD
Lecture 10 (May 5, 2021):
- TOPIC 1: UD: Enhanced dependencies
- TOPIC 2: PropBank
- May 12, 2021 - Rector's Day (lecture cancelled)
Lecture 11 (May 19, 2021):
TOPIC 1: Intro to t-layer
- Hajič, J., Hajičová, E., Mikulová, M., Mírovský, J.: Prague Dependency Treebank. Chapter in Ide, N., Pustejovsky, J. (eds.) Handbook of Linguistic Annotation, Springer, Berlin, pp. 555-594, 201
- Hajič, J., Hajičová, E., Mírovský, J., Panevová, J.: Linguistically Annotated Corpus as an Invaluable Resource for Advancements in Linguistic Research: A Case Study. The Prague Bulletin of Mathematical Linguistics, No. 106, ISSN 0032-6585, pp. 69-124, 2016 (link)
- Sgall, P., Panevová, J., Hajičová, E. (2004) Deep Syntactic Annotation: Tectogrammatical Annotation and Beyond. In A. Meyers (ed.) Proceedings of the HLT-NAACL 2004 Workshop: Frontiers in Corpus Annotation, ACL, Boston, Massachusetts, USA, pp. 32-38. (link)
- Table with T-nodes attributes in PDT 2.0 (pdf);
- PDT documentation: PDT-C link
TOPIC 2: Valency in FGD
- Fillmore, C.J. (1968) The Case for Case. In (Bach, E., Harms, R.T., eds.) Universals in Linguistic Theory, Holt, Rinehart and Winston, p. 1-88
- Hajič, J., Hajičová, E., Mírovský, J., Panevová, J.: Linguistically Annotated Corpus as an Invaluable Resource for Advancements in Linguistic Research: A Case Study. The Prague Bulletin of Mathematical Linguistics, No. 106, ISSN 0032-6585, pp. 69-124, 2016 (link)
- Panevová, J. (1994) Valency Frames and the Meaning of the Sentence. In Luelsdorff, P. A. (ed.) The Prague School of Structural and Functional Linguistics, Amsterdam, Philadelphia, John Benjamins Publishing Company, p. 223-243
- Lopatková et al. (2016) Valenční slovník českých sloves. Praha, Karolinum
Lecture 12 (May 26, 2021): Valency in the PDT family
- Hajič, J. et al. (2003) PDT-VALLEX: Creating a Large-coverage Valency Lexicon for Treebank Annotation. In Proceedings of The Second Workshop on Treebanks and Linguistic Theories, Växjö University Press, Växjö, Sweden, p. 57-68 (link)
- PDT documentation: PDT-C link
- A bit of history - see the PDT 2.0 Guide - link
Lecture 13 (June 2, 2021):
TOPIC 1: Lexical information in the PDT family (t-layer)
- PDT documentation: PDT-C link
TOPIC 2: Morphological information in the PDT family (t-layer)
Other Useful Links and Other Materials
- Table with Czech positional morphological tags (pdf);
- Table with analytical functions in PDT 2.0 (pdf);
- Table with T-nodes attributes in PDT 2.0 (pdf);
- PDT 2.0 Guide or here pdf
- PDT documentation (PDT 3.5, 3.0, 2.0), PDT-C link
- Universal Dependencies (link)
- Udapi tutorial
Some of the lectures were recorded and are available for viewing. These are old recordings from 2020 (remote teaching during the Covid-19 lockdown), but their content still largely overlaps with today's course, so they may help you if you missed a class. The recordings that are still available do not cover the whole course, though: only those related to Universal Dependencies remain.