Federica Gamba

office
409
email
gamba@ufal.mff.cuni.cz
address
Malostranské náměstí 25
118 00 Praha 1
Czech Republic

Projects

PhD topic: Dealing with Latin variability in parsing (supervisor: Daniel Zeman)

GAUK (104924, 2024-2026): Adapting Uniform Meaning Representation (UMR) for the Italic/Romance languages.

Curriculum Vitae

  • since March 2022: PhD student in Computational Linguistics at ÚFAL MFF UK.
  • 2021-2022: Research assistant at the Institute for Computational Linguistics (ILC-CNR), Pisa, Italy.
  • 2018-2021: Graduate degree in Humanities at IUSS Pavia (University School for Advanced Studies), Italy. Final thesis: 'More data and new tools. Advances in parsing the Index Thomisticus Treebank'.
  • 2018-2020: Master's degree in Theoretical and Applied Linguistics at University of Pavia, Italy. Final thesis: 'Including a new textual resource into the LiLa Knowledge Base. Lemmatization, PoS tagging and linking of Querolus'.
  • 2015-2019: Undergraduate degree in Humanities at IUSS Pavia (University School for Advanced Studies), Italy.
  • 2015-2018: Bachelor's degree in Classics at University of Pavia, Italy.

Selected Bibliography

  1. Markéta Lopatková, Eva Fučíková, Federica Gamba, Jan Hajič, Hana Hledíková, Marie Mikulová, Michal Novák, Jan Štěpánek, Daniel Zeman, Šárka Zikánová (2025): UMR 2.0 - Czech: Release Notes (technical report). In: (pdf, bibtex)
  2. Federica Gamba (2024): Predicate Sense Disambiguation for UMR Annotation of Latin: Challenges and Insights. In: Proceedings of the 1st Workshop on Machine Learning for Ancient Languages, pp. 19-29, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-144-5 (url, bibtex)
  3. Federica Gamba, Marco Passarotti, Paolo Ruffolo (2024): Publishing the Dictionary of Medieval Latin in the Czech Lands as Linked Data in the LiLa Knowledge Base. In: Italian Journal of Computational Linguistics, ISSN 2499-4553, vol. 10, no. 1, pp. 95-116 (url, bibtex)
  4. Federica Gamba, Abishek Stephen, Zdeněk Žabokrtský (2024): Universal Feature-based Morphological Trees. In: Proceedings of the LREC-COLING 2024 Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), pp. 125-137, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-20-3 (pdf, local PDF, bibtex)
  5. Markéta Lopatková, Eva Fučíková, Federica Gamba, Jan Štěpánek, Daniel Zeman, Šárka Zikánová (2024): Towards a Conversion of the Prague Dependency Treebank Data to the Uniform Meaning Representation. In: Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024), pp. 62-76, CEUR-WS.org, Košice, Slovakia (url, local PDF, bibtex)
  6. Milan Straka, Jana Straková, Federica Gamba (2024): ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic Analysis of Latin. In: Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, pp. 207-214, ELRA and ICCL, Torino, Italia, ISBN 978-2-493814-46-3 (pdf, local PDF, bibtex)
  7. Federica Gamba, Marco Passarotti, Paolo Ruffolo (2023): Linking the Dictionary of Medieval Latin in the Czech Lands to the LiLa Knowledge Base. In: Proceedings of the Ninth Italian Conference on Computational Linguistics, pp. 1-8, CEUR Workshop Proceedings, Venice, Italy (pdf, bibtex)
  8. Federica Gamba, Daniel Zeman (2023): Latin Morphology through the Centuries: Ensuring Consistency for Better Language Processing. In: Proceedings of the Ancient Language Processing Workshop, pp. 59-67, INCOMA, Varna, Bulgaria, ISBN 978-954-452-087-8 (pdf, local PDF, local PDF, bibtex)
  9. Federica Gamba, Daniel Zeman (2023): Universalising Latin Universal Dependencies: a harmonisation of Latin treebanks in UD. In: Proceedings of the Sixth Workshop on Universal Dependencies (UDW, GURT/SyntaxFest 2023), pp. 7-16, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-34-0 (pdf, local PDF, local PDF, bibtex)
  10. Federica Gamba, Francesca Frontini, Daan Broeder, Monica Monachini (2022): Language Technologies for the Creation of Multilingual Terminologies. Lessons Learned from the SSHOC Project. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 154-163, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, bibtex)
  11. Francesca Frontini, Federica Gamba, Monica Monachini, Daan Broeder, Kea Tijdens, Irena Vipavc Brvar (2021): D3.9 Report on Ontology and Vocabulary Collection and Publication (technical report). In: (url, bibtex)
  12. Federica Gamba, Marco Passarotti, Paolo Ruffolo (2021): More Data and New Tools. Advances in Parsing the Index Thomisticus Treebank. In: Proceedings of the Conference on Computational Humanities Research 2021, pp. 108-122, CEUR Workshop Proceedings (CEUR-WS.org) (pdf, bibtex)