Coreference in Universal Dependencies (CorefUD) is an initiative to collect coreference corpora in various languages and harmonize them to the same scheme and data format (CoNLL-U).
CorefUD 1.1, the current version of the collection, can be downloaded from http://hdl.handle.net/11234/1-5053.
We organize a CRAC 2023 Shared Task on Multilingual Coreference Resolution, which follows the previous edition of the shared task in 2022.
If you want to learn more about the collection, please have a look at
Feel free to write us if you have any questions: Anna Nedoluzhko, Michal Novák, Martin Popel, Zdeněk Žabokrtský, and Daniel Zeman.
When using CorefUD, please cite the following LREC paper:
@inproceedings{nedoluzhko-etal-2022-corefud, title = "{C}oref{UD} 1.0: Coreference Meets {U}niversal {D}ependencies", author = "Nedoluzhko, Anna and Nov{\'a}k, Michal and Popel, Martin and {\v{Z}}abokrtsk{\'y}, Zden{\v{e}}k and Zeldes, Amir and Zeman, Daniel", booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference", month = jun, year = "2022", address = "Marseille, France", publisher = "European Language Resources Association", url = "https://aclanthology.org/2022.lrec-1.520", pages = "4859--4872", }