[ Skip to the content ]

Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


[ Back to the navigation ]

Publication


Year 2011
Type in proceedings
Status published
Language English
Author(s) Bojar, Ondřej Tamchyna, Aleš
Title Improving Translation Model by Monolingual Data
Czech title Zlepšení překladového modelu pomocí jednojazyčných dat
Proceedings 2011: Edinburgh, UK: WMT 2011 (EMNLP): Proceedings of the Sixth Workshop on Statistical Machine Translation
Pages range 330-336
URL http://www.aclweb.org/anthology/W11-2138
Supported by 2009-2012 FP7-ICT-2007-3-231720 (EuroMatrix Plus) 2009-2012 7E09003 (EuroMatrixPlus – Bringing Machine Translation for European Languages to the User) 2010-2012 GPP406/10/P259 (Hybridní frázový a hloubkově-syntaktický strojový překlad) 2005-2010 MSM 0021620838 (Moderní metody, struktury a systémy informatiky)
Czech abstract Používáme jednojazyčná data na cílové straně k tomu, abychom obohatili překladový model ve statistickém strojovém překladu.
English abstract We use target-side monolingual data to extend the vocabulary of the translation model in statistical machine translation. This method called “reverse self-training” improves the decoder’s ability to produce grammatically correct translations into languages with morphology richer than the source language esp. in small-data setting. We empirically evaluate the gains for several pairs of European languages and discuss some approaches of the underlying back-off techniques needed to translate unseen forms of known words. We also provide a description of the systems we submitted to WMT11 Shared Task.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
Editor(s)* Chris Callison-Burch; Philipp Koehn; Christof Monz; Omar F. Zaidan
ISBN* 978-1-937284-12-1
Address* Edinburgh, UK
Month* July
Venue* Informatics Forum, Edinburgh
Publisher* Association for Computational Linguistics
Institution* University of Edinburgh
Creator: Common Account
Created: 8/31/11 10:08 AM
Modifier: Almighty Admin
Modified: 3/5/12 4:18 PM
***

paper-as-publishedpublicWMT38.pdfapplication/pdf
Content, Design & Functionality: ÚFAL, 2006–2016. Page generated: Sat Jun 23 19:23:57 CEST 2018

[ Back to the navigation ] [ Back to the content ]

100% OpenAIRE compliant