Federica Gamba

office
409
email
gamba@ufal.mff.cuni.cz
address
Malostranské náměstí 25
118 00 Praha 1
Czech Republic

Projects

PhD topic: Exploring the Syntax-Semantic Interface in Computational Models (supervisor: Daniel Zeman)

GAUK No. 104924 (2024-2026): Adapting Uniform Meaning Representation (UMR) for the Italic/Romance languages.

Curriculum Vitae

  • since March 2022: PhD student in Computational Linguistics at ÚFAL MFF UK.
  • 2021-2022: Research assistant at the Institute for Computational Linguistics (ILC-CNR), Pisa, Italy.
  • 2018-2021: Graduate degree in Humanities at IUSS Pavia (University School for Advanced Studies), Italy. Final thesis: 'More data and new tools. Advances in parsing the Index Thomisticus Treebank'.
  • 2018-2020: Master's degree in Theoretical and Applied Linguistics at University of Pavia, Italy. Final thesis: 'Including a new textual resource into the LiLa Knowledge Base. Lemmatization, PoS tagging and linking of Querolus'.
  • 2015-2019: Undergraduate degree in Humanities at IUSS Pavia (University School for Advanced Studies), Italy.
  • 2015-2018: Bachelor's degree in Classics at University of Pavia, Italy.

Selected Bibliography

  1. Federica Gamba, Alexis Palmer, Daniel Zeman (2025): Bootstrapping UMRs from Universal Dependencies for Scalable Multilingual Annotation. In: Proceedings of the 19th Linguistic Annotation Workshop, pp. 126-136, Association for Computational Linguistics, Wien, Austria, ISBN 979-8-89176-262-6 (url, local PDF, bibtex)
  2. Markéta Lopatková, Eva Fučíková, Federica Gamba, Jan Hajič, Hana Hledíková, Marie Mikulová, Michal Novák, Jan Štěpánek, Daniel Zeman, Šárka Zikánová (2025): UMR 2.0 - Czech: Release Notes (technical report). In: (pdf, bibtex)
  3. Aman Sinha, Federica Gamba (2025): Fossils at SemEval-2025 Task 9: Tasting Loss Functions for Food Hazard Detection in Text Reports. In: Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pp. 1515-1521, Association for Computational Lingustics, Kerrville, TX, USA, ISBN 979-8-89176-273-2 (pdf, bibtex)
  4. Jan Štěpánek, Daniel Zeman, Markéta Lopatková, Federica Gamba, Hana Hledíková (2025): Comparing Manual and Automatic UMRs for Czech and Latin. In: Proceedings of the 6th International Workshop on Designing Meaning Representations (DMR 2025), pp. 1-12, Association for Computational Lingustics, Stroudsburg, PA, USA, ISBN 979-8-89176-296-1 (url, local PDF, bibtex)
  5. Federica Gamba (2024): Predicate Sense Disambiguation for UMR Annotation of Latin: Challenges and Insights. In: Proceedings of the 1st Workshop on Machine Learning for Ancient Languages, pp. 19-29, Association for Computational Linguistics, Kerrville, TX, USA, ISBN 979-8-89176-144-5 (url, bibtex)
  6. Federica Gamba, Marco Passarotti, Paolo Ruffolo (2024): Publishing the Dictionary of Medieval Latin in the Czech Lands as Linked Data in the LiLa Knowledge Base. In: Italian Journal of Computational Linguistics, ISSN 2499-4553, vol. 10, no. 1, pp. 95-116 (url, bibtex)
  7. Federica Gamba, Abishek Stephen, Zdeněk Žabokrtský (2024): Universal Feature-based Morphological Trees. In: Proceedings of the LREC-COLING 2024 Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD 2024), pp. 125-137, European Language Resources Association (ELRA), Torino, Italy, ISBN 978-2-493814-20-3 (pdf, local PDF, bibtex)
  8. Markéta Lopatková, Eva Fučíková, Federica Gamba, Jan Štěpánek, Daniel Zeman, Šárka Zikánová (2024): Towards a Conversion of the Prague Dependency Treebank Data to the Uniform Meaning Representation. In: Proceedings of the 24th Conference Information Technologies – Applications and Theory (ITAT 2024), pp. 62-76, CEUR-WS.org, Košice, Slovakia (url, local PDF, bibtex)
  9. Milan Straka, Jana Straková, Federica Gamba (2024): ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic Analysis of Latin. In: Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, pp. 207-214, ELRA and ICCL, Torino, Italia, ISBN 978-2-493814-46-3 (pdf, local PDF, bibtex)
  10. Federica Gamba, Marco Passarotti, Paolo Ruffolo (2023): Linking the Dictionary of Medieval Latin in the Czech Lands to the LiLa Knowledge Base. In: Proceedings of the Ninth Italian Conference on Computational Linguistics, pp. 1-8, CEUR Workshop Proceedings, Venice, Italy (pdf, bibtex)
  11. Federica Gamba, Daniel Zeman (2023): Latin Morphology through the Centuries: Ensuring Consistency for Better Language Processing. In: Proceedings of the Ancient Language Processing Workshop, pp. 59-67, INCOMA, Varna, Bulgaria, ISBN 978-954-452-087-8 (pdf, local PDF, local PDF, bibtex)
  12. Federica Gamba, Daniel Zeman (2023): Universalising Latin Universal Dependencies: a harmonisation of Latin treebanks in UD. In: Proceedings of the Sixth Workshop on Universal Dependencies (UDW, GURT/SyntaxFest 2023), pp. 7-16, Association for Computational Linguistics, Stroudsburg, PA, USA, ISBN 978-1-959429-34-0 (pdf, local PDF, local PDF, bibtex)
  13. Federica Gamba, Francesca Frontini, Daan Broeder, Monica Monachini (2022): Language Technologies for the Creation of Multilingual Terminologies. Lessons Learned from the SSHOC Project. In: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pp. 154-163, European Language Resources Association, Marseille, France, ISBN 979-10-95546-72-6 (pdf, bibtex)
  14. Francesca Frontini, Federica Gamba, Monica Monachini, Daan Broeder, Kea Tijdens, Irena Vipavc Brvar (2021): D3.9 Report on Ontology and Vocabulary Collection and Publication (technical report). In: (url, bibtex)
  15. Federica Gamba, Marco Passarotti, Paolo Ruffolo (2021): More Data and New Tools. Advances in Parsing the Index Thomisticus Treebank. In: Proceedings of the Conference on Computational Humanities Research 2021, pp. 108-122, CEUR Workshop Proceedings (CEUR-WS.org) (pdf, bibtex)