[ Skip to the content ]

Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


[ Back to the navigation ]

Publication


Year 2016
Type in proceedings
Status published
Language English
Author(s) Lapshinova-Koltunski, Ekaterina Nedoluzhko, Anna Kunz, Kerstin Anna
Title An interoperable approach to the analysis of discourse
Czech title Interoperabilní přístup k analýze diskurzu
Proceedings 2016: Paris, France: LREC 2016: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
Pages range 991-997
How published online
URL https://www.google.cz/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0ahUKEwih0Kry4ebPAhVGK8AKHcqoBAkQFggfMAA&url=http%3A%2F%2Fwww.lrec-conf.org%2Fproceedings%2Flrec2016%2Fpdf%2F66_Paper.pdf&usg=AFQjCNE26lZJXN1aFlf19vgDtOrdFoMkLg&sig2=oEYIVbKAY8PDOlwgqwIznA&bvm=bv.135974163,d.ZGg&cad=rja
Supported by 2016-2018 GA16-05394S (Structure of coreferential chains in parallel language data) 2016-2019 LM2015071 (LINDAT-CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat)
Czech abstract Rozšířená a přepracovaná verze Lapshinova et al. (2016). V článku provádíme srovnávací analýzu diskurzních jevů v němčině a češtině na základě dvou korpusových zdrojů pro tyto jazyky. Korpusy jsou anotovány v rámci různých teoretických přístupů: FGP (Sgall et al. 1986) pro češtinu a teorie textové koheze (Halliday & Hasan 1976) pro němčinu. Hledáme společné rysy, automaticky je identifikujeme v textech a tím se dostáváme k srovnatelným korpusům anotovaných stejnými typy vztahů, které dále rozebíráme. (viz hlavně angl. abstrakt a text)
English abstract In the present paper, we analyse variation of discourse phenomena in two typologically different languages, i.e. in German and Czech. The novelty of our approach lies in the nature of the resources we are using. Advantage is taken of existing resources, which are, however, annotated on the basis of two different frameworks. We use an interoperable scheme unifying discourse phenomena in both frameworks into more abstract categories and considering only those phenomena that have a direct match in German and Czech. The discourse properties we focus on are relations of identity, semantic similarity, ellipsis and discourse relations. Our study shows that the application of interoperable schemes allows an exploitation of discourse-related phenomena analysed in different projects and on the basis of different frameworks. As corpus compilation and annotation is a time-consuming task, positive results of this experiment open up new paths for contrastive linguistics, translation studies and NLP, including machine translation.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
Editor(s)* Nicoletta Calzolari; Khalid Choukri; Thierry Declerck; Marko Grobelnik; Bente Maegaard; Joseph Mariani; Asunción Moreno; Jan Odijk; Stelios Piperidis
ISBN* 978-2-9517408-9-1
Address* Paris, France
Month* May
Venue* Grand Hotel Bernardin Conference Center
Publisher* European Language Resources Association
Creator: Common Account
Created: 10/19/16 1:34 PM
Modifier: Almighty Admin
Modified: 2/25/17 10:07 PM
***

LREC_clanek_NedoLapshpubliclrec2016.pdfapplication/postscript
LREC_poster_NedoLapshpublicLREC_poster1.pdfapplication/postscript
Content, Design & Functionality: ÚFAL, 2006–2016. Page generated: Wed Sep 20 18:39:36 CEST 2017

[ Back to the navigation ] [ Back to the content ]

100% OpenAIRE compliant