[ Skip to the content ]

Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


[ Back to the navigation ]

Publication


Year 2016
Type in proceedings without ISBN
Status published
Language English
Author(s) Saleh, Shadi Pecina, Pavel
Title Task3 Patient-Centred Information Retrieval: Team CUNI
Czech title Úloha 3: vyhledávání informací se zaměřením na uživatele: Tým Univerzity Karlovy
Proceedings 2016: : CLEF 2016: CLEF 2016 Working Notes
Pages range 123-129
How published online
Supported by 2015-2017 H2020-ICT-2014-1-644753 (KConnect (Khresmoi Multilingual Medical Text Analysis, Search and Machine Translation Connected in a Thriving Data-Value Chain)) 2012-2016 PRVOUK P46 (Informatika) 2016 SVV 260 333 (Teoretické základy informatiky a výpočetní lingvistiky) 2012-2018 GBP103/12/G084 (Centrum pro multi-modální interpretaci dat velkého rozsahu)
Czech abstract Zpráva o účasti týmu Univerzity Karlovy v soutěži vyhledávání zdravotních informací CLEF eHealth Evaluation Lab 2016.
English abstract In this paper we present our participation as the team of the Charles University at Task3 Patient-Centred Information Retrieval. In the monolingual task and its subtasks, we submitted two runs: one is based on language model approach and the second one is based on vector space model. For the multilingual task, Khresmoi translator, a Statistical Machine Translation (SMT) system, is used to translate the queries into English and get the n-best-list. For the baseline system, we take 1-best-list translation and use it for the retrieval, while for other runs, we use a machine learning model to rerank the n-best-list translations and predict the translation that gives the best CLIR performance in terms of P@10. We present set of features to train the model, these features are generated from the SMT verbose output, different resources like UMLS Metathesaurus, MetaMap, document collection and from the Wikipedia articles. Experiments on previous CLEF eHealth IR tasks test set show significant improvement brought by the reranker over the baseline system.
Specialization linguistics ("jazykověda")
Confidentiality default – not confidential
Open access no
Editor(s)* Kristzian Balog; Linda Cappellato; Nicola Ferro; Craig Macdonald
ISSN* 1613-0073
Month* September
Publisher* CEUR-WS
Organization* School of Sciences and Technology of the University
Creator: Common Account
Created: 7/4/16 12:34 PM
Modifier: Almighty Admin
Modified: 2/25/17 10:07 PM
***

Content, Design & Functionality: ÚFAL, 2006–2016. Page generated: Sun Sep 24 00:06:23 CEST 2017

[ Back to the navigation ] [ Back to the content ]

100% OpenAIRE compliant