Hausa Visual Genome
Hausa Visual Genome is a multimodal dataset consisting of text and images suitable for English-to-Hausa multimodal machine translation tasks and multimodal research. We have selected short English segments (captions) from Visual Genome along with associated images and automatically translated them to Hausa with manual post-editing, taking the associated images into account. The training set contains 29K segments. Further 1K and 1.6K segments are provided in development and test sets, respectively, which follow the same (random) sampling from the original Hindi Visual Genome.