[ Skip to the content ]

Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


[ Back to the navigation ]

Publication


Year 2011
Type article
Status published
Language English
Author(s) Bojar, Ondřej
Title Analyzing Error Types in English-Czech Machine Translation
Czech title Analýza typů chyb v anglicko-českém strojovém překladu
Journal The Prague Bulletin of Mathematical Linguistics
Volume 95
Pages range 63-76
Month March
URL http://ufal.mff.cuni.cz/pbml/95/art-bojar.pdf
Supported by 2011-2013 GAP406/11/1499 (Čeština ve věku strojového překladu) 2010-2012 GPP406/10/P259 (Hybridní frázový a hloubkově-syntaktický strojový překlad) 2009-2012 FP7-ICT-2007-3-231720 (EuroMatrix Plus) 2009-2012 7E09003 (EuroMatrixPlus – Bringing Machine Translation for European Languages to the User)
Czech abstract Článek zkoumá dvě metody ručního vyhodnocení kvality překladu, které mohou pomoci identifikovat nejčastější typy chyb.
English abstract This paper examines two techniques of manual evaluation that can be used to identify error types of individual machine translation systems. The first technique of “blind post-editing” is being used in WMT evaluation campaigns since 2009 and manually constructed data of this type are available for various language pairs. The second technique of explicit marking of errors has been used in the past as well. We propose a method for interpreting blind post-editing data at a finer level and compare the results with explicit marking of errors. While the human annotation of either of the techniques is not exactly reproducible (relatively low agreement), both techniques lead to similar observations of differences of the systems. Specifically, we are able to suggest which errors in MT output are easy and hard to correct with no access to the source, a situation experienced by users who do not understand the source language.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
DOI 10.2478/v10108-011-0005-2
ISSN* 0032-6585
Institution* Univerzita Karlova v Praze
Creator: Common Account
Created: 8/31/11 10:50 AM
Modifier: Almighty Admin
Modified: 3/5/12 4:17 PM
***

paperpublic2011-FILE-bojar_pbml_2011-PUBLISHED.pdfapplication/pdf
Content, Design & Functionality: ÚFAL, 2006–2016. Page generated: Mon Jan 22 20:55:45 CET 2018

[ Back to the navigation ] [ Back to the content ]

100% OpenAIRE compliant