Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


Publication


Year 2010
Type in proceedings
Status published
Language English
Author(s) Cinková, Silvie; Holub, Martin; Rychlý, Pavel; Smejkalová, Lenka; Šindlerová, Jana
Title Can Corpus Pattern Analysis Be Used in NLP?
Czech title Může být Corpus Pattern Analysis použita v NLP?
Proceedings 2010: Berlin / Heidelberg: TSD 2010: Text, Speech and Dialogue. 13th International Conference, TSD 2010, Brno, Czech Republic, September 6-10, 2010. Proceedings
Pages range 67-74
How published print
Supported by 2010-2013 GAP406/10/0875 (Komputační lingvistika: Explicitní popis jazyka a anotovaná data se zřetelem na češtinu); 2005-2010 MSM 0021620838 (Moderní metody, struktury a systémy informatiky); 2010 SVV 261 314 (Specifický vysokoškolský výzkum); 2009-2012 FP7-ICT-2007-3-231720 (EuroMatrix Plus); 2012-2016 PRVOUK P46 (Informatika)
Czech abstract (translated) This paper is a pilot study on validating PDEV, an electronic dictionary of English verbs, for NLP purposes.
English abstract Corpus Pattern Analysis (CPA) [4], coined and implemented by Hanks as the Pattern Dictionary of English Verbs (PDEV) [3], appears to be the only deliberate and consistent implementation of Sinclair’s concept of Lexical Item [12]. In his theoretical inquiries [5], Hanks hypothesizes that the pattern repository produced by CPA can also support the word sense disambiguation task. Although more than 670 verb entries have already been compiled in PDEV, no systematic evaluation of this ambitious project has been reported yet. Assuming that the Sinclairian concept of the Lexical Item is correct, we started to closely examine PDEV with its possible NLP application in mind. Our experiments presented in this paper have been performed on a pilot sample of English verbs to provide a first reliable view on whether humans can agree in assigning PDEV patterns to verbs in a corpus. As a conclusion we suggest procedures for future development of PDEV.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
WOS Code 000288619400010
Editor(s)* Petr Sojka; Aleš Horák; Ivan Kopeček; Karel Pala
ISBN* 978-3-642-15759-2
ISSN* 0302-9743
Address* Berlin / Heidelberg
Month* September
Venue* Hotel Continental
Publisher* Springer
Institution* Masarykova univerzita
Journal* Lecture Notes in Computer Science
Creator: Common Account
Created: 9/24/10 12:37 PM
Modifier: Common Account
Modified: 11/23/15 11:24 AM
***

Can Corpus Pattern Analysis Be Used in NLP? (public; TSD-2010.pdf; application/pdf)