Number of External Subjects of Open Complements in Each Deep UD Treebank

A node is counted as an external subject if it has two or more parents and all of the following hold: it is attached to one parent via a subject relation; it is attached to another parent via either a subject relation or an object relation; and the first parent is attached to the second parent via an xcomp relation.
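The counting rule above can be sketched in code. This is a minimal illustration and not the script used to produce the table: the graph representation (a dict from node id to a list of `(parent, relation)` pairs), the function names, and the choice of which labels count as "subject" (`nsubj`, `csubj`) and "object" (`obj`, `iobj`) relations are all assumptions made for the example. Relation subtypes such as `nsubj:xsubj` are reduced to their base label before comparison.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: node id -> list of (parent id, relation label).
Graph = Dict[int, List[Tuple[int, str]]]

# Assumed label sets; the source only says "subject relation" and "object relation".
SUBJECT_RELS = {"nsubj", "csubj"}
OBJECT_RELS = {"obj", "iobj"}


def base(rel: str) -> str:
    """Strip a subtype, e.g. 'nsubj:xsubj' -> 'nsubj'."""
    return rel.split(":", 1)[0]


def is_external_subject(node: int, graph: Graph) -> bool:
    """Apply the three conditions of the definition to one node."""
    parents = graph.get(node, [])
    if len(parents) < 2:
        return False
    for p1, r1 in parents:
        # The node must be a subject of the first parent.
        if base(r1) not in SUBJECT_RELS:
            continue
        for p2, r2 in parents:
            # The node must be a subject or object of a different, second parent.
            if p2 == p1 or base(r2) not in SUBJECT_RELS | OBJECT_RELS:
                continue
            # The first parent must be attached to the second parent via xcomp.
            if any(gp == p2 and base(r) == "xcomp" for gp, r in graph.get(p1, [])):
                return True
    return False


def count_external_subjects(graph: Graph) -> int:
    """Count the nodes in one sentence graph that satisfy the definition."""
    return sum(is_external_subject(n, graph) for n in graph)


# "She wants to sleep": 1=She, 2=wants, 3=to, 4=sleep (0 is the artificial root).
subject_control = {
    1: [(2, "nsubj"), (4, "nsubj:xsubj")],
    2: [(0, "root")],
    3: [(4, "mark")],
    4: [(2, "xcomp")],
}
print(count_external_subjects(subject_control))
```

In the example sentence, "She" is the subject of both "wants" and "sleep", and "sleep" is attached to "wants" via xcomp, so exactly one node is counted.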

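| Treebank | Tokens | External subjects | Xsubj per 10k tokens |
| --- | --- | --- | --- |
| Afrikaans AfriBooms | 49276 | 99 | 20 |
| Akkadian PISANDUB | 1852 | 8 | 43 |
| Amharic ATT | 10010 | 60 | 60 |
| Ancient Greek Perseus | 202989 | 2199 | 108 |
| Ancient Greek PROIEL | 213999 | 2303 | 108 |
| Arabic PADT | 282384 | 1319 | 47 |
| Arabic PUD | 20751 | 32 | 15 |
| Armenian ArmTDP | 36549 | 315 | 86 |
| Assyrian AS | 453 | 1 | 22 |
| Basque BDT | 121443 | 617 | 51 |
| Belarusian HSE | 13325 | 147 | 110 |
| Breton KEB | 10054 | 27 | 27 |
| Bulgarian BTB | 156149 | 523 | 33 |
| Buryat BDT | 10185 | 64 | 63 |
| Catalan AnCora | 531971 | 2413 | 45 |
| Classical Chinese Kyoto | 55026 | 129 | 23 |
| Coptic Scriptorium | 25756 | 156 | 61 |
| Croatian SET | 199441 | 2010 | 101 |
| Czech CAC | 494383 | 2549 | 52 |
| Czech CLTT | 35630 | 209 | 59 |
| Czech FicTree | 167056 | 880 | 53 |
| Czech PDT | 1506484 | 10532 | 70 |
| Czech PUD | 18610 | 187 | 100 |
| Danish DDT | 100733 | 539 | 54 |
| Dutch Alpino | 208633 | 2165 | 104 |
| Dutch LassySmall | 98107 | 502 | 51 |
| English EWT | 254829 | 3285 | 129 |
| English GUM | 97697 | 1038 | 106 |
| English LinES | 82815 | 935 | 113 |
| English ParTUT | 49634 | 429 | 86 |
| English PUD | 21176 | 229 | 108 |
| Erzya JR | 15790 | 163 | 103 |
| Estonian EDT | 434245 | 4480 | 103 |
| Estonian EWT | 27246 | 321 | 118 |
| Faroese OFT | 10002 | 39 | 39 |
| Finnish FTB | 159612 | 1012 | 63 |
| Finnish PUD | 15813 | 170 | 108 |
| Finnish TDT | 202194 | 1728 | 85 |
| French FQB | 24135 | 218 | 90 |
| French GSD | 400387 | 3721 | 93 |
| French ParTUT | 28594 | 141 | 49 |
| French Sequoia | 70567 | 922 | 131 |
| French Spoken | 34972 | 331 | 95 |
| Galician CTG | 138837 | 458 | 33 |
| Galician TreeGal | 25548 | 152 | 59 |
| German GSD | 292788 | 1570 | 54 |
| German HDT | 3055010 | 15748 | 52 |
| German LIT | 40456 | 503 | 124 |
| German PUD | 21329 | 153 | 72 |
| Gothic PROIEL | 55336 | 596 | 108 |
| Greek GDT | 63441 | 427 | 67 |
| Hebrew HTB | 161417 | 773 | 48 |
| Hindi HDTB | 351704 | 746 | 21 |
| Hungarian Szeged | 42032 | 140 | 33 |
| Chinese CFL | 7256 | 99 | 136 |
| Chinese GSD | 123283 | 1229 | 100 |
| Indonesian GSD | 121923 | 1155 | 95 |
| Irish IDT | 23964 | 711 | 297 |
| Italian ISDT | 298343 | 1620 | 54 |
| Italian ParTUT | 55558 | 327 | 59 |
| Italian PoSTWITA | 124445 | 521 | 42 |
| Italian PUD | 23731 | 187 | 79 |
| Italian VIT | 279839 | 573 | 20 |
| Japanese GSD | 184118 | 0 | 0 |
| Japanese Modern | 14494 | 0 | 0 |
| Japanese PUD | 26707 | 0 | 0 |
| Karelian KKPP | 3094 | 22 | 71 |
| Kazakh KTB | 10536 | 4 | 4 |
| Komi Zyrian IKDP | 1287 | 6 | 47 |
| Komi Zyrian Lattice | 2017 | 18 | 89 |
| Korean GSD | 80322 | 0 | 0 |
| Korean Kaist | 350090 | 1682 | 48 |
| Kurmanji MG | 10260 | 14 | 14 |
| Latin ITTB | 353035 | 3772 | 107 |
| Latin Perseus | 29138 | 367 | 126 |
| Latin PROIEL | 200163 | 2146 | 107 |
| Latvian LVTB | 208438 | 1717 | 82 |
| Lithuanian ALKSNIS | 39754 | 390 | 98 |
| Lithuanian HSE | 5356 | 41 | 77 |
| Marathi UFAL | 3849 | 17 | 44 |
| Mbya Guarani Thomas | 1318 | 1 | 8 |
| Naija NSC | 12863 | 148 | 115 |
| North Sami Giella | 26845 | 138 | 51 |
| Norwegian Bokmaal | 310221 | 3148 | 101 |
| Norwegian Nynorsk | 301353 | 2844 | 94 |
| Norwegian NynorskLIA | 55410 | 374 | 67 |
| Old Church Slavonic PROIEL | 57563 | 514 | 89 |
| Old Russian RNC | 14472 | 72 | 50 |
| Old Russian TOROT | 149780 | 1011 | 67 |
| Polish LFG | 130967 | 875 | 67 |
| Polish PDB | 351406 | 2369 | 67 |
| Polish PUD | 18389 | 149 | 81 |
| Portuguese Bosque | 227799 | 1240 | 54 |
| Romanian Nonstandard | 241714 | 931 | 39 |
| Romanian RRT | 218511 | 847 | 39 |
| Russian GSD | 98000 | 769 | 78 |
| Russian PUD | 19355 | 284 | 147 |
| Russian SynTagRus | 1106296 | 8814 | 80 |
| Russian Taiga | 38555 | 244 | 63 |
| Sanskrit UFAL | 1843 | 12 | 65 |
| Serbian SET | 97673 | 887 | 91 |
| Slovak SNK | 106043 | 442 | 42 |
| Slovenian SSJ | 140670 | 565 | 40 |
| Slovenian SST | 29488 | 94 | 32 |
| Spanish AnCora | 549569 | 2891 | 53 |
| Spanish GSD | 431587 | 1036 | 24 |
| Swedish LinES | 79811 | 983 | 123 |
| Swedish PUD | 19076 | 206 | 108 |
| Swedish Talbanken | 96819 | 923 | 95 |
| Tagalog TRG | 292 | 0 | 0 |
| Tamil TTB | 9581 | 10 | 10 |
| Turkish GB | 16879 | 67 | 40 |
| Turkish IMST | 57859 | 0 | 0 |
| Ukrainian IU | 122091 | 1057 | 87 |
| Upper Sorbian UFAL | 11196 | 63 | 56 |
| Urdu UDTB | 138077 | 441 | 32 |
| Vietnamese VTB | 43754 | 1306 | 298 |
| Warlpiri UFAL | 314 | 4 | 127 |
| Welsh CCG | 10662 | 289 | 271 |
| Wolof WTB | 44258 | 693 | 157 |
| Yoruba YTB | 2664 | 13 | 49 |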
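
The "Xsubj per 10k tokens" figures appear to be the external-subject count divided by the token count, scaled to 10,000 tokens and rounded to the nearest integer. The sketch below reproduces that normalization under this assumption; the function name `xsubj_per_10k` is hypothetical and the exact rounding rule used for the published figures is not stated in the source.

```python
def xsubj_per_10k(external_subjects: int, tokens: int) -> int:
    """External subjects per 10,000 tokens, rounded to the nearest integer."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    return round(external_subjects / tokens * 10_000)


# Afrikaans AfriBooms: 99 external subjects in 49276 tokens.
print(xsubj_per_10k(99, 49276))
```

The normalized rate makes treebanks of very different sizes comparable: for example, Czech PDT has far more external subjects in absolute terms than Welsh CCG, but a much lower rate per 10,000 tokens.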