Institute of Formal and Applied Linguistics

at Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic


Publication


Year 2017
Type in proceedings
Status published
Language English
Author(s) Rosa, Rudolf; Zeman, Daniel; Mareček, David; Žabokrtský, Zdeněk
Title Slavic Forest, Norwegian Wood
Czech title Slavkovský les, norské dřevo
Proceedings 2017: Stroudsburg, PA, USA: EACL 2017 workshop: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial4)
Pages range 210-219
Note A croSSSynt paper
How published online
URL http://web.science.mq.edu.au/~smalmasi/vardial4/pdf/VarDial26.pdf
Supported by 2015-2017 GA15-10472S (Morphologically and syntactically annotated corpora of many languages); 2016 SVV 260 333 (Theoretical foundations of computer science and computational linguistics); 2015-2018 H2020-ICT-2014-1-644402 (Himl (Health in my Language)); 2016-2019 LM2015071 (LINDAT-CLARIN: Institute for analysis, processing and distribution of linguistic data); 2017-2021 PROGRES Q48 (Computer science)
Czech abstract (translated into English) We had a corpus, or else the corpus had us.
It had nice tags, simple and harmonious.
When they invited us, they said to use whatever we wanted.
But there is nothing to use anywhere, whichever way we turn.
So we translated foreign languages and made use of them.
Two weeks of work, we toiled hard, and here is the test.
Training ran overnight until morning, before the deadline came.
We did not sleep; we waited and hoped it would turn out fine.
Morning came and, look at that, we had gold.
So we write on paper born of Norwegian wood.
English abstract We once had a corp,
or should we say, it once had us
They showed us its tags,
isn’t it great, unified tags
They asked us to parse
and they told us to use everything
So we looked around
and we noticed there was near nothing
We took other langs,
bitext aligned: words one-to-one
We played for two weeks,
and then they said, here is the test
The parser kept training till morning,
just until deadline
So we had to wait and hope what we get
would be just fine
And, when we awoke,
the results were done, we saw we’d won
So, we wrote this paper,
isn’t it good, Norwegian wood.
Specialization computer science ("informatika")
Confidentiality default – not confidential
Open access yes
Editor(s)* Preslav Nakov; Marcos Zampieri; Nikola Ljubešić; Jörg Tiedemann; Shervin Malmasi; Ahmed Ali
ISBN* 978-1-945626-43-2
Address* Stroudsburg, PA, USA
Month* April
Venue* Valencia Conference Center
Publisher* Association for Computational Linguistics
Organization* Association for Computational Linguistics
Creator: Common Account
Created: 2/11/17 4:28 AM
Modifier: Common Account
Modified: 9/18/17 6:28 PM

Final paper (public): VarDial26.pdf (application/pdf)
Slides (public): 2017-04-03-vardial-valencia.pdf (application/pdf)