[ Skip to the content ]

Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic

[ Back to the navigation ]


Year 2016
Type in proceedings
Status published
Language English
Author(s) Cinková, Silvie Krejčová, Ema Vernerová, Anna Baisa, Vít
Title Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
Czech title Škálované posouzení vzoru užití vs. desambiguace významu: Pilotní studie
Proceedings 2016: Paris, France: LREC 2016: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
Pages range 848-854
How published online
URL http://www.lrec-conf.org/proceedings/lrec2016/pdf/506_Paper.pdf
Supported by 2015-2017 GA15-20031S (Odkaz Zelliga S. Harrise: více lingvistické informace pro distribuční lexikální analýzu angličtiny a češtiny) 2016 SVV 260 333 (Teoretické základy informatiky a výpočetní lingvistiky) 2016-2019 LM2015071 (Jazyková výzkumná infrastruktura v České republice) 2012-2016 PRVOUK P46 (Informatika)
Czech abstract Představujeme pilotní analýzu nového lingvistického zdroje, VPS-GradeUp, který je dostupný z http://hdl.handle.net/11234/1-1585.
English abstract We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014) based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
Article no. 506
Editor(s)* Nicoletta Calzolari; Khalid Choukri; Thierry Declerck; Marko Grobelnik; Bente Maegaard; Joseph Mariani; Asunción Moreno; Jan Odijk; Stelios Piperidis
ISBN* 978-2-9517408-9-1
Address* Paris, France
Month* May
Venue* Grand Hotel Bernardin Conference Center
Publisher* European Language Resources Association
Creator: Common Account
Created: 6/2/16 2:44 PM
Modifier: Almighty Admin
Modified: 2/25/17 10:06 PM

Content, Design & Functionality: ÚFAL, 2006–2018. Page generated: Mon Feb 18 06:11:33 CET 2019

[ Back to the navigation ] [ Back to the content ]

100% OpenAIRE compliant