
Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic



Publication


Year 2016
Type oral presentation
Status published
Language English
Author(s) Hajič, Jan
Title The Chimera-TectoMT architecture of Machine Translation with Deep Linguistic Analysis
Czech title Chimera-TectoMT architektura strojového překladu s hlubokou jazykovou analýzou
Publisher's city and country Stroudsburg, PA, USA
Venue San Diego Sheraton hotel
Month June
Czech abstract (in English translation) Presentation of the winning system from the WMT 2013-2015 Shared Task competition. The talk covered the architecture, the essential modules and principles, and the statistical components.
English abstract The TectoMT system is the result of long-term development that began in the pre-statistical era at Charles University in Prague and has since grown to include state-of-the-art tools for POS tagging, morphological feature disambiguation, lemmatization, parsing, and some aspects of semantic analysis. It follows the usual Analysis – Transfer – Generation workflow, with transfer trained on a large parallel corpus using a Hidden Markov Tree Model. Generation is partly rule-based (at the syntax level) and partly statistical (at the inflection/morphology level). Chimera is a hybrid system that uses a specific combination of TectoMT and a standard phrase-based SMT system (Moses), complemented by the “Depfix” automatic post-editing system. The combination as a whole improves on the individual systems, as documented in the results of the recent WMT Shared Tasks. The system was originally developed for English-Czech and has recently been transferred to several other languages within the EU QTLeap project (qtleap.eu), where it has been used successfully in the IT domain for both question and answer translation in a Q&A context. Both the TectoMT and Chimera systems will be presented, together with a discussion of the language (in)dependence of such a hybrid solution.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Event NAACL SedMT 2016 Workshop
Presentation type invited talk at conference/workshop
Open access no
Creator: Common Account
Created: 10/20/16 6:10 PM
Modifier: Common Account
Modified: 10/20/16 6:10 PM

