The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial data sets. Learn more about AWS Data Exchange

Usage examples for all datasets listed in the Registry of Open Data on AWS tagged with machine learning.


The Human Sleep Project

Tools & Applications
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5, 3.7, 4.0, and 4.2

Tutorials
Tools & Applications
Publications

Allen Cell Imaging Collections

Tutorials
Tools & Applications
Publications

NASA Prediction of Worldwide Energy Resources (POWER)

Tutorials
Tools & Applications
Publications

ESA WorldCover

Tutorials
Tools & Applications
Publications

SpaceNet

Tutorials
Tools & Applications
Publications

2021 Amazon Last Mile Routing Research Challenge Dataset

Tools & Applications
Publications

Low Altitude Disaster Imagery (LADI) Dataset

Tutorials
Tools & Applications
Publications

Radiant MLHub

Tutorials
Tools & Applications
Publications

Materials Project Data

Tutorials
Tools & Applications
Publications

10m Annual Land Use Land Cover (9-class)

Tools & Applications
Publications

Pacific Ocean Sound Recordings

Tutorials
Publications

RarePlanes

Tutorials
Tools & Applications
Publications

Solar Dynamics Observatory (SDO) Machine Learning Dataset

Tutorials
Tools & Applications
Publications

ESA WorldCover Sentinel-1 and Sentinel-2 10m Annual Composites

Tutorials
Tools & Applications
Publications

High resolution, annual cropland and landcover maps for selected African countries

Tutorials
Publications

MONKEY

Tools & Applications

A region-wide, multi-year set of crop field boundary labels for Africa

Tutorials
Publications

High Resolution Canopy Height Maps by WRI and Meta

Tools & Applications
Publications

OpenCell on AWS

Tools & Applications
Publications

Sentinel-2 L2A 120m Mosaic

Tutorials
Tools & Applications
Publications

iSDAsoil

Tutorials
Tools & Applications
Publications

Allen Ivy Glioblastoma Atlas

Tutorials
Tools & Applications
Publications

I-CARE:International Cardiac Arrest REsearch consortium Electroencephalography Database

Tools & Applications
Publications

NASA SOHO/LASCO2 comet challenge on AWS

Publications

National Cancer Institute Imaging Data Commons (IDC) Collections

Tutorials
Tools & Applications
Publications

PD12M

Tutorials
Tools & Applications
Publications

SPaRCNet data:Seizures, Rhythmic and Periodic Patterns in ICU Electroencephalography

Tools & Applications
Publications

Sophos/ReversingLabs 20 Million malware detection dataset

Tutorials
Tools & Applications
Publications

Wind AI Bench

Tutorials

Africa Soil Information Service (AfSIS) Soil Chemistry

Tutorials
Publications

AgricultureVision

Publications

Astrophysics Division Galaxy Segmentation Benchmark Dataset

Publications

Aurora Multi-Sensor Dataset

Tutorials
Publications

Consented Activities of People

Tools & Applications

CryoET Data Portal

Tutorials
Tools & Applications
Publications

DigitalCorpora

Publications

EEGDash on AWS

Tutorials
Tools & Applications

Emory Knee Radiograph (MRKR) dataset

Tutorials
Publications

Harvard Electroencephalography Database

Tools & Applications
Publications

Harvard-Emory ECG Database

Tools & Applications
Publications

Neural Encoding Simulation Toolkit (NEST)

Tutorials
Tools & Applications
Publications

SeeFar V0

Tutorials
Publications

3DCoMPaT: Composition of Materials on Parts of 3D Things

Publications

A2D2: Audi Autonomous Driving Dataset

Tutorials

AI2 Diagram Dataset (AI2D)

Publications

AI2 Meaningful Citations Data Set

Publications

AI2 Reasoning Challenge (ARC) 2018

Publications

Astrophysics Division Galaxy Morphology Benchmark Dataset

Publications

CHIMERA

Tools & Applications

Corn Kernel Counting Dataset

Publications

Discrete Reasoning Over the content of Paragraphs (DROP)

Publications

High Resolution Population Density Maps + Demographic Estimates by CIESIN and Meta

Tutorials

Image classification - fast.ai datasets

Tools & Applications

Longitudinal Nutrient Deficiency

Publications

MAN TruckScenes

Tutorials
Tools & Applications
Publications

Mars Spectrometry 2: Gas Chromatography for the Sample Analysis at Mars Data (SAM) Instrument

Publications

Mars Spectrometry: Detect Evidence for Past Habitability

Publications

Multi-robot, Multi-Sensor, Multi-Environment Event Dataset (M3ED)

Publications

NYUMets Brain Dataset

Publications

Orcasound - bioacoustic data for marine conservation

Tools & Applications

Quoref

Publications

RSNA Abdominal Trauma Detection (RSNA-ABT)

Publications

RSNA Cervical Spine Fracture Detection (RSNA-CSF) Dataset

Publications

RSNA Intracranial Hemorrhage Detection

Publications

RSNA Pulmonary Embolism Detection

Publications

Reasoning Over Paragraph Effects in Situations (ROPES)

Publications

Voices Obscured in Complex Environmental Settings (VOiCES)

Tutorials

Gretel Synthetic Safety Alignment Dataset

Tutorials
Tools & Applications

Amazon Bin Image Dataset

Publications

YouTube 8 Million - Data Lakehouse Ready

Tutorials
Publications

Amazon-PQA

Publications

Answer Reformulation

Publications

Automatic Speech Recognition (ASR) Error Robustness

Publications

DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue

Publications

Enriched Topical-Chat Dataset for Knowledge-Grounded Dialogue Systems

Publications

Humor Detection from Product Question Answering Systems

Publications

Humor patterns used for querying Alexa traffic

Publications

Learning to Rank and Filter - community question answering

Publications

Multi Token Completion

Publications

Pre- and post-purchase product questions

Publications

Product Comparison Dataset for Online Shopping

Publications

PyEnvs and CallArgs

Publications

WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation

Publications

Wizard of Tasks

Publications

If you want to add a dataset or usage example to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository or tell us about your project.

Home