Number of Empty Nodes in Each Deep UD Treebank

Empty nodes are used to represent elided predicates in gapping and stripping in the enhanced graphs.

Treebank Tokens Empty nodes Empty per 10k tokens
Afrikaans AfriBooms 49276 0 0
Akkadian PISANDUB 1852 0 0
Amharic ATT 10010 0 0
Ancient Greek Perseus 202989 0 0
Ancient Greek PROIEL 213999 617 29
Arabic PADT 282384 82 3
Arabic PUD 20751 2 1
Armenian ArmTDP 36549 0 0
Assyrian AS 453 0 0
Basque BDT 121443 0 0
Belarusian HSE 13325 0 0
Breton KEB 10054 0 0
Bulgarian BTB 156149 1 0
Buryat BDT 10185 0 0
Catalan AnCora 531971 4 0
Classical Chinese Kyoto 55026 0 0
Coptic Scriptorium 25756 0 0
Croatian SET 199441 132 7
Czech CAC 494383 1339 27
Czech CLTT 35630 20 6
Czech FicTree 167056 214 13
Czech PDT 1506484 2649 18
Czech PUD 18610 13 7
Danish DDT 100733 0 0
Dutch Alpino 208633 64 3
Dutch LassySmall 98107 64 7
English EWT 254829 26 1
English GUM 97697 0 0
English LinES 82815 9 1
English ParTUT 49634 16 3
English PUD 21176 8 4
Erzya JR 15790 0 0
Estonian EDT 434245 443 10
Estonian EWT 27246 36 13
Faroese OFT 10002 0 0
Finnish FTB 159612 0 0
Finnish PUD 15813 5 3
Finnish TDT 202194 172 9
French FQB 24135 1 0
French GSD 400387 53 1
French ParTUT 28594 5 2
French Sequoia 70567 44 6
French Spoken 34972 1 0
Galician CTG 138837 0 0
Galician TreeGal 25548 1 0
German GSD 292788 6 0
German HDT 3055010 0 0
German LIT 40456 39 10
German PUD 21329 8 4
Gothic PROIEL 55336 0 0
Greek GDT 63441 39 6
Hebrew HTB 161417 0 0
Hindi HDTB 351704 0 0
Hungarian Szeged 42032 30 7
Chinese CFL 7256 0 0
Chinese GSD 123283 3 0
Indonesian GSD 121923 0 0
Irish IDT 23964 0 0
Italian ISDT 298343 51 2
Italian ParTUT 55558 13 2
Italian PoSTWITA 124445 51 4
Italian PUD 23731 4 2
Italian VIT 279839 3 0
Japanese GSD 184118 0 0
Japanese Modern 14494 0 0
Japanese PUD 26707 0 0
Karelian KKPP 3094 0 0
Kazakh KTB 10536 2 2
Komi Zyrian IKDP 1287 0 0
Komi Zyrian Lattice 2017 0 0
Korean GSD 80322 0 0
Korean Kaist 350090 0 0
Kurmanji MG 10260 0 0
Latin ITTB 353035 1151 33
Latin Perseus 29138 0 0
Latin PROIEL 200163 404 20
Latvian LVTB 208438 236 11
Lithuanian ALKSNIS 39754 0 0
Lithuanian HSE 5356 0 0
Marathi UFAL 3849 0 0
Mbya Guarani Thomas 1318 0 0
Naija NSC 12863 0 0
North Sami Giella 26845 0 0
Norwegian Bokmaal 310221 0 0
Norwegian Nynorsk 301353 0 0
Norwegian NynorskLIA 55410 0 0
Old Church Slavonic PROIEL 57563 146 25
Old Russian RNC 14472 0 0
Old Russian TOROT 149780 0 0
Polish LFG 130967 0 0
Polish PDB 351406 82 2
Polish PUD 18389 5 3
Portuguese Bosque 227799 10 0
Romanian Nonstandard 241714 130 5
Romanian RRT 218511 60 3
Russian GSD 98000 114 12
Russian PUD 19355 19 10
Russian SynTagRus 1106296 903 8
Russian Taiga 38555 41 11
Sanskrit UFAL 1843 0 0
Serbian SET 97673 0 0
Slovak SNK 106043 80 8
Slovenian SSJ 140670 0 0
Slovenian SST 29488 8 3
Spanish AnCora 549569 5 0
Spanish GSD 431587 0 0
Swedish LinES 79811 9 1
Swedish PUD 19076 8 4
Swedish Talbanken 96819 43 4
Tagalog TRG 292 0 0
Tamil TTB 9581 0 0
Turkish GB 16879 0 0
Turkish IMST 57859 0 0
Ukrainian IU 122091 151 12
Upper Sorbian UFAL 11196 0 0
Urdu UDTB 138077 0 0
Vietnamese VTB 43754 0 0
Warlpiri UFAL 314 0 0
Welsh CCG 10662 0 0
Wolof WTB 44258 0 0
Yoruba YTB 2664 0 0