deep learning machine learning natural language processing
Some of the most important datasets for NLP, with a focus on classification, including IMDb, AG-News, Amazon Reviews (polarity and full), Yelp Reviews (polarity and full), Dbpedia, Sogou News (Pinyin), Yahoo Answers, Wikitext 2 and Wikitext 103, and ACL-2010 French-English 10^9 corpus. This is part of the fast.ai datasets collection hosted by AWS for convenience of fast.ai students. See documentation link for citation and license details for each dataset.
As required
Varies by dataset - see documentation link
http://course.fast.ai/datasets
See all datasets managed by fast.ai.
NLP - fast.ai datasets was accessed on DATE
from https://registry.opendata.aws/fast-ai-nlp.
arn:aws:s3:::fast-ai-nlp
us-east-1
aws s3 ls --no-sign-request s3://fast-ai-nlp/