computer vision, machine learning, machine translation, natural language processing
MMID is a large-scale, massively multilingual dataset of images paired with the words they represent, collected at the University of Pennsylvania. The dataset is doubly parallel: for each language, words are stored parallel to images that represent the word, and parallel to the word's translation into English (and its corresponding images).
Language data is added as it becomes ready for distribution.
See citation instructions at http://multilingual-images.org
https://multilingual-images.org/doc.html
The Massively Multilingual Image Dataset (MMID) was accessed on DATE from https://registry.opendata.aws/mmid.
arn:aws:s3:::mmid-pds
us-east-1
aws s3 ls --no-sign-request s3://mmid-pds/
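The same anonymous listing can be sketched in Python using only the standard library. This is a minimal illustration, not an official client. It assumes the bucket answers unsigned ListObjectsV2 requests over the regional virtual-hosted endpoint, which the `--no-sign-request` command above suggests but does not guarantee. The helper names (`bucket_from_arn`, `list_url`, `parse_listing`, `fetch_listing`) are hypothetical and introduced here only for the example.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

ARN = "arn:aws:s3:::mmid-pds"  # bucket ARN as listed above
REGION = "us-east-1"           # region as listed above
# XML namespace used by S3 list responses
S3_NS = "{http://s3.amazonaws.com/doc/2006-03-01/}"


def bucket_from_arn(arn):
    """Extract the bucket name from an S3 bucket ARN."""
    prefix = "arn:aws:s3:::"
    if not arn.startswith(prefix):
        raise ValueError(f"not an S3 ARN: {arn!r}")
    return arn[len(prefix):].split("/", 1)[0]


def list_url(bucket, region, prefix="", max_keys=100):
    """Build an unsigned ListObjectsV2 URL (assumed to be publicly readable)."""
    query = urllib.parse.urlencode(
        {
            "list-type": "2",
            "delimiter": "/",  # group keys into top-level "folders"
            "prefix": prefix,
            "max-keys": str(max_keys),
        }
    )
    return f"https://{bucket}.s3.{region}.amazonaws.com/?{query}"


def parse_listing(xml_data):
    """Return (folders, keys) from a ListObjectsV2 XML response."""
    root = ET.fromstring(xml_data)
    folders = [e.text for e in root.findall(f"{S3_NS}CommonPrefixes/{S3_NS}Prefix")]
    keys = [e.text for e in root.findall(f"{S3_NS}Contents/{S3_NS}Key")]
    return folders, keys


def fetch_listing(prefix=""):
    """Fetch and parse the first page of results under the given prefix."""
    url = list_url(bucket_from_arn(ARN), REGION, prefix)
    with urllib.request.urlopen(url, timeout=30) as resp:
        return parse_listing(resp.read())
```

Calling `fetch_listing()` performs a network request and returns only the first page of results. A complete listing would need to follow the continuation token in the response, which this sketch omits.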