Archive – academic year 2020/21

Lectures

  • Lecture 1 (March 3, 2021): Introduction, trees, dependencies 
    • reading:
      • Osborne, T. (2019) A Dependency Grammar of English. John Benjamins Publishing Company, Amsterdam/Philadelphia (available in my office)
      • Hajičová, E., Panevová, J., Sgall, P. (2002) Úvod do teoretické a počítačové lingvistiky, sv. I. Karolinum, Praha (available in the secretariat)
      • Štekauer, P., ed. (2000) Rudiments of English Linguistics. Slovacontact, Prešov. (available in my office)
      • Štěpánek, J. (2006) Závislostní zachycení větné struktury v anotovaném syntaktickém korpusu. PhD Thesis, MFF UK (link)
      • Wikipedia - basic articles on dependency grammar are consistent with Timothy Osborne's approac
      • Universal dependencies (intro): https://universaldependencies.org/
      • Prague Dependency Treebank (intro): https://ufal.mff.cuni.cz/pdt3.5/
  • Lecture 2 (March 10, 2021): Non-dependency relations and their representation; Word order and (non-)projectivity
    • reading:
      • Kuhlmann, M., Nivre, J. (2006): Mildly Non-Projective Dependency Structures. In COLING/ACL Main Conference Poster Sessions, 507–514 (link).
      • Petkevič, V. (1995) A New Formal Specification of Underlying Structure. Theoretical Linguistics, vol. 21, No.1
      • Štěpánek, J. (2006) Závislostní zachycení větné struktury v anotovaném syntaktickém korpusu. PhD Thesis, MFF UK (link)
      • Havelka, J. (2007): Mathematical Properties of Dependency Trees and their Application to Natural Language Syntax. PhD Thesis, MFF UK (link)
      • Universal Dependencies:  https://universaldependencies.org/
      • Prague Dependency Treebank:   https://ufal.mff.cuni.cz/pdt3.5/
  • Lecture 3 (March 17, 2021): Intro to Stratificational Approach to Language Description (stratificational grammar, FGD, MTT)
    • reading:
      • Hajičová, E., Panevová, J., Sgall, P. (2002) Úvod do teoretické a počítačové lingvistiky, sv. I. Karolinum, Praha (available in the secretariat)
      • Štekauer, P., ed. (2000) Rudiments of English Linguistics. Slovacontact, Prešov. (available in my office)
      • Sgall, P. (1967) Generativní popis jazyka a česká deklinace. Academia, Praha (available in my office)
      • Žabokrtský, Z. (2006) Resemblances between Meaning - Text Theory and Functional Generative Description. In Proceedings of the 2nd International   Conference of Meaning-Text Theory, Slavic Culture Languages Publishers House, Moskva, pp. 549-557. (link)
      • https://www.britannica.com/science/linguistics/Stratificational-grammar
      • advanced:
        Sgall, P., Hajičová, E., Panevová, J. (1986) The Meaning of the Sentence in Its Semantic and Pragmatic Aspects. Reidel, Dordrecht.
  • Lecture 4 (March 24, 2021):
    • TOPIC 1: Intro to Prague Dependency Treebank 
      • reading:
        • Hajič, J., Hajičová, E., Mírovský, J., Panevová, J.: Linguistically Annotated Corpus as an Invaluable Resource for Advancements in Linguistic Research: A Case Study. The Prague Bulletin of Mathematical Linguistics, No. 106, ISSN 0032-6585, pp. 69-124, 2016 (link)
        • Hajičová, E., Panevová, J., Sgall, P. (2002) Úvod do teoretické a   počítačové lingvistiky, sv. I. Karolinum, Praha (available in the secretariat)
        • Hajič, J. (1998) Building a Syntactically Annotated Corpus: The Prague Dependency Treebank. In E. Hajičová (ed.): Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová, Karolinum, Charles University Press, Prague, Republic, pp. 106-132 (link)
        • Štekauer, P., ed. (2000) Rudiments of English Linguistics. Slovacontact, Prešov. (available in my office)
        • PDT-C webpage
        • PDT 2.0 guide 
    • TOPIC 2: PDT and its morphological annotation 
      • note: You are not supposed to memorize the tag structure but you might be ask to provide examples (using the following table pdf)
      • reading:
        • Matthews, H. (1997) The Concise Oxford Dictionary of Linguistics. Oxford University Press, Oxford
        • Filipec, J. (1994) Lexicology and Lexicography: Development and State of the Research. In Luelsdorff, P.A. (ed.) The Prague School of Structural and Functional  Linguistics, Amsterdam-Philadelphia, John Benjamins, p.163–183
        • Hajič, J. (2004) Disambiguation of Rich Inflection (Computational Morphology of  Czech). Karolinum, Charles Univeristy Press, Prague.
        • http://wiki.korpus.cz/doku.php/seznamy:tagy
        • http://ufal.mff.cuni.cz/pdt/Morphology_and_Tagging/Doc/hmptagqr.html
        • Straková Jana, Straka Milan and Hajič Jan. (2014) Open-Source Tools for Morphology, Lemmatization, POS Tagging and Named Entity Recognition. In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 13-18, Baltimore, Maryland, June 2014. Association for Computational Linguistics.  
        • DEMO: http://lindat.mff.cuni.cz/services/morphodita/
        • PDT documentation: Manual for morphological annotation http://ufal.mff.cuni.cz/pdt2.0/doc/pdt-guide/en/html/ch05.html
      • Table with morphological tags in PDT 2.0 (pdf)
  • Lecture 5 (March 31, 2021): Intro to UD, morphology
    • reading:
      • Nivre Joakim et al. (2020) Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection. In: Proceedings of LREC 2020. ELRA, Marseille, France, p. 4034-4043, 2020 (link).
      • https://universaldependencies.org/
  • Lecture 6 (April 7, 2021): Surface syntactic annotation in PDT (a-layer)
    • reading:
      • Hajič, J. (1998) Building a Syntactically Annotated Corpus: The Prague Dependency Treebank. In E. Hajičová (ed.): Issues of Valency and Meaning. Studies in Honour of Jarmila Panevová, Karolinum, Charles University Press, Prague, Republic, pp. 106-132 (link)
      • Štekauer, P., ed. (2000) Rudiments of English Linguistics.Slovacontact, Prešov (chapter 4, Syntax)
      • Quirk, R., Greenbaum, S., Leech, G., Svartvik, J. (1985) A Comprehensive Grammar of the English Language, Longman, London.
      • PDT documentation: Manual for Analytical Annotation (link)
    • Table with analytical functions in PDT 2.0 (pdf)
  • Lectures 7, 8 and 9 (April 14, 21 and 28, 2021): Syntax in UD
  • Lecture 10 (May 5, 2021):
    • TOPIC 1: UD: Enghanced dependencies
    • TOPIC 2: PropBank
  • May 12, 2021 -  Rector's Day (lecture cancelled)
  • Lecture 11 (May 19, 2021):
    • TOPIC 1: Intro to t-layer
      • reading:
        • Hajič, J, Hajičová, E., Mikulová, M., Mírovský, J.: Prague Dependency Treebank. Chapter in Ide, N., Pustejovsky, J. (eds.) Handbook of Linguistic Annotation, Springer, Berlin, pp. 555-594, 201
        • Hajič, J., Hajičová, E., Mírovský, J., Panevová, J.: Linguistically Annotated Corpus as an Invaluable Resource for Advancements in Linguistic Research: A Case Study. The Prague Bulletin of Mathematical Linguistics, No. 106, ISSN 0032-6585, pp. 69-124, 2016 (link)
        • Sgall, P., Panevová, J., Hajičová, E. (2004)  Deep Syntactic Annotation: Tectogrammatical Annotation and Beyond. In A. Meyers (ed.) Proceedings of the HLT-NAACL 2004 Workshop: Frontiers in Corpus Annotation, ACL, Boston, Massachusetts, USA, pp. 32-38. (link)
        • Table with T-nodes attributes in PDT 2.0 (pdf);
        • PDT documentation: PDT-C link
    • TOPIC 2: Valency in FGD
      • reading:
        • Fillmore, C.J. (1968) The Case for Case. In (Bach, E., Harms, R.T., eds.) Universals in Linguistic Theory, Holt, Rinehart and Winston, p. 1-88
        • Hajič, J., Hajičová, E., Mírovský, J., Panevová, J.: Linguistically Annotated Corpus as an Invaluable Resource for Advancements in Linguistic Research: A Case Study. The Prague Bulletin of Mathematical Linguistics, No. 106, ISSN 0032-6585, pp. 69-124, 2016 (link)
        • Panevová, J. (1994) Valency Frames and the Meaning of the Sentence, In Luelsfdorff, P. A. (ed.) The Prague School of Structural and Functional Linguistics, Amsterdam, Philadelphia, John Benjamins Publishing Company, p. 223-243
        • Lopatková et al. (2016) Valenční slovník českých sloves. Praha, Karolinum
  • Lecture 12 (May 26, 2021): Valency in the PDT family
    • reading:
      • Hajič, J. et al (2003) PDT-VALLEX: Creating a Large-coverage Valency Lexicon for Treebank Annotation. In Proceedings of The Second Workshop on Treebanks and Linguistic Theories, Vaxjo University Press, Vaxjo, Sweden, p. 57-68 (link)
      • PDT documentation: PDT-C link
      • A bit of history - see the PDT 2.0 Guide - link
  • Lecture 13 (June 2, 2021):
    • TOPIC 1: Lexical information in the PDT family (t-layer)
      • reading:
        • PDT documentation: PDT-C link
    • TOPIC 2: Morphological information in the PDT family (t-layer)
      • reading:
        • Razímová Magda, Žabokrtský Zdeněk: Annotation of Grammatemes in the Prague Dependency Treebank 2.0. In: Proceedings of the LREC Workshop on Annotation Science, Genova, Italy, ISBN 2-9517408-2-4, pp. 12-19, 2006 - link
        • PDT documentation: PDT-C link

Other Useful Links and Other Materials

Recordings

A subset of the lectures have been recorded and are available for viewing. These are old recordings from 2020 (remote teaching during Covid-19 lockdown) but their contents still largely overlaps with today's course, so they may help you if you missed a class. The recordings that are still available do not cover the whole course, though. Only those related to Universal Dependencies are available.