Up

TECTOGRAMMATICAL LAYER


In general       TGTS        Data sample       References       What is PDT


General Characterization of Tectogrammatical Layer
The tagging scheme on the deep structure level is based on the dependency-based theoretical framework of the Functional Generative Description (FGD), namely on its level of the tectogrammatical representations (for motivating discussions and for more details, see e.g. Sgall 1967 and 1992; Sgall et al. 1986; a formalization can be found in Petkevic 1987 and 1995, see References).
The tectogrammatical layer can be characterized as the level of linguistic (literal) meaning, i.e. as the structuring of the cognitive content proper to a particular language. On this level, the irregularities of the outer shape of sentences are absent (including synonymy and at least the prototypical cases of ambiguity) and it can thus serve as a useful interface between linguistics in the narrow sense (as the theory of language systems) on one side and such interdisciplinary domains as that of semantic interpretation (logical analysis of language, reference assignment based on inferencing using contextual and other knowledge, further metaphorical and other figurative meanings), that of discourse analysis or text linguistics, and so on, on the other.


Characterization of the Tectogrammatical Tree Structure (TGTS)
(for detailed information see:  E. Hajicova. Dependency-Based Underlying-Structure Tagging of a Very Large Czech Corpus, see References)
  1. A node of the TGTS represents an occurrence of an autosemantic (lexical, meaningful) word; the correlates of function words (i.e. synsemantic, auxiliary word forms) are attached as indices to the autosemantic words to which they belong (i.e. auxiliary verbs and subordinating conjunctions to the verbs, prepositions to nouns, etc.); coordinating conjunctions remain as nodes of their own (similarly as in the ATSs...odkaz).
  2. In cases of deletions in the surface shapes of sentences nodes for the deleted autosemantic words are added to the tree structure.
  3. Non-projective structures are not allowed on the tectogrammatical layer of tagging; the relevant asymmetries are accounted for as differences between underlying and morphemic word order.
  4. Not only the direction of the dependency relation (dependent from the right - dependent from the left), but also the ordering of the sister nodes is specified in the TGTSs.
  5.   Each TGTS has the form of a dependency tree with the verb of the main clause as its root (to be more precise, the root of the TGTS is a special node identifying the sentence of which the given structure is the TGTS, and the node of the main verb is the only node immediately depending on this identifier). In case of nominal 'sentences' (i.e. of constructions without a finite verb), three possibilities obtain: (i) the governing verb is added (in case of surface deletions, which is relatively rare), or (ii) a symbol for 'empty verb' ('EV') is added as the governor (e.g. Od našeho washingtonského zpravodaje 'From our correspondent from Washington', with the node for 'correspondent' depending on 'EV'), or (iii) the governing nominal node acts as the governor (e.g. with author names).
     Each label of a node consists of the following parts:


    See complete list of TGTS attributes and their values (available in: pdffile, psfile).



Documentation of the annotation style on tectogrammatical layer is available in PDT references. All papers related to the  theoretical background are listed bellow (alphabetically by the last name of the first author).

References
The file pdt.bib contains a collection of books, papers, technical reports related to the PDT.
  1. Eva Hajicova. (1993). Issues of Sentence Structure and Discourse Patterns. Prague: Charles University.
    Available in: BibTex item
  2. Eva Hajicová  (1998a). Movement Rules Revisited. In: Processing of Dependency-Based Grammars, Proceedings from the Workshop COLING/ACL, Montreal, ed. S. Kahane and A. Polguere, 49-57.
    Available in: BibTex item
  3. Eva Hajicova, Marketa Ceplova. (2000). Deletions and Their Reconstruction in Tectogrammatical Syntactic Tagging of Very Large Corpora. In Proceedings of COLING'2000, pp. 228-284, Saarbruecken, Germany.
    Available in: BibTex item
  4. Eva Hajicova, Jarmila Panevova. (1984). Valency (case) frames of verbs. In: Sgall (1984:147-188).
    Available in: BibTex item
  5. Eva Hajicova, Barbara Partee, Petr Sgall (1998): Topic-focus articulation, tripartite structures, and semantic content. Amsterdam:Kluwer
    Available in: BibTex item
  6. Marcus M. P., Kim G., Marcinkiewicz M. A. et al. (1994). The Penn Treebank: Annotating Predicate Argument Structure. Proceedings of the ARPA Human Language Technology Workshop. San Francisco: Morgan Kaufmann.
  7. Marcus M. P., Santorini B. and Marcinkiewicz M. A. (1993). Building a Large Annotated Corpus of English: the Penn Treebank. Computational Linguistics, 19(2), 313-330.
  8. Jarmila Panevova.  1974. "On verbal frames in Functional Generative Description". Prague Bulletin of Mathematical Linguistics 22:3-40; 23(1975):17-52.
    Available in: BibTex item
  9. Jarmila Panevova. 1980. Formy a funkce ve stavbe ceske vety. [Forms and Functions in the Structure of the Czech Sentence]. Prague: Academia.
    Available in: BibTex item
  10. Vladimir Petkevic (1987). A New Dependency Based Specification of Underlying Representations of Sentences. Theoretical Linguistics 14:143-172.
    Available in: BibTex item
  11. Vladimir Petkevic. (1995). A New Formal Specification of Underlying Representations. Theoretical Linguistics 21:7-61.
    Available in: BibTex item
  12. Petr Sgall. (1967). Generativni popis jazyka a ceska deklinace. [Generative Description of Czech and Czech Declension.] Prague: Academia.
    Available in: BibTex item
  13. Petr Sgall ed. (1984). Contributions to Functional Syntax, Semantics and Language Comprehension. Amsterdam: Benjamins - Prague: Academia.
    Available in: BibTex item
  14. Petr Sgall. 1992. Underlying Structure of Sentences and Its Relations to Semantics. Wiener Slawistischer Almanach. Sonderband 33. Ed. by T. Reuther. Wien: Gesellschaft zur Förderung slawistischer Studien, 273-282.
    Available in: BibTex item
  15. Petr Sgall. (1997a). Valency and Underlying Structure. An Alternative View on Dependency. In: L. Wanner (ed.): Recent Trends in Meaning-Text Theory. Amsterdam/Philadelphia: Benjamins, 149-166.
    Available in: BibTex item
  16. Petr Sgall (1997b). On the Usefulness of Movement Rules. In: Caron B. (ed.), Actes du 16e Congres International des Linguistes (Paris 20-25 juillet 1997), Oxford: Elsevier Sciences.
    Available in: BibTex item
  17. Petr Sgall (in press). The Freedom of Language. To appear in Prague Linguistic Circle Papers 4.
  18. Petr Sgall, Eva Hajicova, Jarmila Panevova (1986): The Meaning of the Sentence in Its Semantic and Pragmatic Aspects, ed. by J. L. Mey, Dordrecht:Reidel - Prague: Academia.
    Available in: BibTex item