[ Skip to the content ]

Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


[ Back to the navigation ]

Publication


Year 2018
Type in proceedings
Status submitted
Language English
Author(s) Dwivedi, Puneet Zeman, Daniel
Title Universal Dependencies for Sanskrit: A Pilot Study
Czech title Universal Dependencies pro sanskrt: pilotní studie
Proceedings 2018: Paris, France: LREC 2018: Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018)
Note Submitted to LREC (after rejection from WSSANLP at COLING 2016, EACL and LAW/EACL 2017.
How published online
Supported by 2015-2017 GA15-10472S (Morfologicky a syntakticky anotované korpusy mnoha jazyků) 2017-2021 PROGRES Q48 (Informatika)
Czech abstract Popisujeme první kroky k syntakticky anotovanému korpusu pro sanskrt v rámci formalismu Universal Dependencies. Naše data jsou zatím velmi malá, obsahují pouze necelých 200 vět — jde o výsledek projektu během letní stáže. Nicméně, pokud víme, toto je první veřejně dostupný syntakticky anotovaný text v sanskrtu. Popisujeme také experiment s automatickou syntaktickou analýzou (parsingem), s výsledky lepšími než u delexikalizovaného parsingu.
English abstract We present the first steps towards a treebank of Sanskrit within the Universal Dependencies framework. Our dataset is tiny at the moment, consisting of less than 200 sentences—a result of a summer internship project. Nevertheless, this seems to be, to the best of our knowledge, the first publicly available piece of syntactically annotated Sanskrit text. We also present a parsing experiment, with results surpassing delexicalized parsing.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
Editor(s)* Nicoletta Calzolari; Khalid Choukri; Thierry Declerck; Marko Grobelnik; Bente Maegaard; Joseph Mariani; Asunción Moreno; Jan Odijk; Stelios Piperidis
Address* Paris, France
Month* May
Venue* Phoenix Seagaia Conference Center
Publisher* European Language Resources Association
Organization* European Language Resource Association
Creator: Common Account
Created: 9/26/16 8:32 AM
Modifier: Common Account
Modified: 10/3/17 12:59 PM
***

Submitted PDFpublicSUBMITTED.pdfapplication/pdf
Content, Design & Functionality: ÚFAL, 2006–2016. Page generated: Sun Nov 19 00:43:32 CET 2017

[ Back to the navigation ] [ Back to the content ]

100% OpenAIRE compliant