The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial datasets.

Usage examples for all datasets listed in the Registry of Open Data on AWS.


The Human Sleep Project

Tools & Applications
Publications

Common Crawl

Tutorials
Tools & Applications
Publications

The Cancer Genome Atlas

Tools & Applications
Publications

Foldingathome COVID-19 Datasets

Tutorials
Tools & Applications
Publications

Therapeutically Applicable Research to Generate Effective Treatments (TARGET)

Tools & Applications
Publications

Sentinel-2

Tutorials
Tools & Applications
Publications

USGS Landsat

Tutorials
Tools & Applications
Publications

Sudachi Language Resources

Tutorials
Tools & Applications
Publications

CZ CELLxGENE Discover Census

Tutorials
Tools & Applications
Publications

Gabriella Miller Kids First Pediatric Research Program (Kids First)

Tools & Applications
Publications

NASA Prediction of Worldwide Energy Resources (POWER)

Tutorials
Tools & Applications
Publications

NOAA Geostationary Operational Environmental Satellites (GOES) 16, 17 & 18

Tutorials
Tools & Applications
Publications

NEXRAD on AWS

Tutorials
Tools & Applications
Publications

Sentinel-2 Cloud-Optimized GeoTIFFs

Tutorials
Tools & Applications
Publications

Terrain Tiles

Tutorials
Tools & Applications
Publications

Allen Cell Imaging Collections

Tutorials
Tools & Applications
Publications

Inter-mission Time Series of Land Ice Velocity and Elevation (ITS_LIVE)

Tutorials
Tools & Applications
Publications

ESA WorldCover

Tutorials
Tools & Applications
Publications

Genome Aggregation Database (gnomAD)

Tools & Applications
Publications

GeoNet Aotearoa New Zealand Data

Tutorials
Publications

SpaceNet

Tutorials
Tools & Applications
Publications

The Singapore Nanopore Expression Data Set

Tutorials
Tools & Applications
Publications

2021 Amazon Last Mile Routing Research Challenge Dataset

Tools & Applications
Publications

Digital Earth Africa Global Mangrove Watch

Tutorials
Tools & Applications
Publications

Digital Earth Africa Landsat Collection 2 Level 2

Tutorials
Tools & Applications
Publications

Fly Brain Anatomy: FlyLight Gen1 and Split-GAL4 Imagery

Tutorials
Tools & Applications
Publications

Digital Earth Africa CHIRPS Rainfall

Tutorials
Tools & Applications
Publications

Digital Earth Africa Coastlines

Tutorials
Tools & Applications
Publications

Digital Earth Africa GeoMAD

Tutorials
Tools & Applications
Publications

Digital Earth Africa Water Observations from Space

Tutorials
Tools & Applications
Publications

International Neuroimaging Data-Sharing Initiative (INDI)

Tutorials
Tools & Applications
Publications

Low Altitude Disaster Imagery (LADI) Dataset

Tutorials
Tools & Applications
Publications

NOAA Joint Polar Satellite System (JPSS)

Tutorials
Tools & Applications
Publications

NOAA Operational Forecast System (OFS)

Tools & Applications
Publications

RADARSAT-1

Tutorials
Tools & Applications

CBERS on AWS

Tutorials
Tools & Applications
Publications

Digital Earth Africa Sentinel-1 Radiometrically Terrain Corrected

Tutorials
Tools & Applications
Publications

Digital Earth Africa Sentinel-2 Level-2A

Tutorials
Tools & Applications
Publications

Maxar Open Data Program

Tutorials
Tools & Applications
Publications

Multi-Scale Ultra High Resolution (MUR) Sea Surface Temperature (SST)

Tutorials
Tools & Applications
Publications

New Zealand Imagery

Tutorials
Tools & Applications
Publications

Catalina Sky Survey (CSS) subset data on AWS

Tools & Applications
Publications

Department of Energy's Open Energy Data Initiative (OEDI)

Tools & Applications
Publications

Digital Earth Africa ALOS PALSAR, ALOS-2 PALSAR-2 and JERS-1

Tutorials
Tools & Applications
Publications

Digital Earth Africa Cropland Extent Map (2019)

Tutorials
Tools & Applications
Publications

Digital Earth Africa Fractional Cover

Tutorials
Tools & Applications
Publications

Garvan Institute Long Read Sequencing Benchmark Data

Tutorials
Tools & Applications
Publications

Open NeuroData

Tutorials
Tools & Applications
Publications

PubSeq - Public Sequence Resource

Tutorials
Tools & Applications
Publications

Southern California Earthquake Data

Tutorials
Publications

USGS 3DEP LiDAR Point Clouds

Tutorials
Tools & Applications
Publications

nuScenes

Tutorials
Tools & Applications
Publications

Boreas Autonomous Driving Dataset

Tutorials
Publications

Cancer Cell Line Encyclopedia (CCLE)

Tools & Applications
Publications

DOE's Water Power Technology Office's (WPTO) US Wave dataset

Tools & Applications
Publications

GEOGLOWS Hydrological Model Version 2

Tutorials
Tools & Applications
Publications

IBL Neuropixels Brainwide Map on AWS

Tutorials
Tools & Applications

NASA Earth Exchange (NEX) Data Collection

Tutorials
Tools & Applications
Publications

NOAA Water-Column Sonar Data Archive

Tutorials
Tools & Applications
Publications

NREL Wind Integration National Dataset

Tutorials
Tools & Applications
Publications

New Zealand Elevation

Tutorials
Tools & Applications
Publications

Northern California Earthquake Data

Tutorials
Publications

Radiant MLHub

Tutorials
Tools & Applications
Publications

Toxicant Exposures and Responses by Genomic and Epigenomic Regulators of Transcription (TaRGET)

Tutorials
Tools & Applications
Publications

World Bank - Light Every Night

Tutorials
Tools & Applications
Publications

ASTER L1T Cloud-Optimized GeoTIFFs

Tutorials
Tools & Applications
Publications

ArcticDEM

Tools & Applications
Publications

Clinical Proteomic Tumor Analysis Consortium 2 (CPTAC-2)

Tools & Applications
Publications

Coupled Model Intercomparison Project 6

Tutorials
Publications

Earth Observation Data Cubes for Brazil

Tutorials
Tools & Applications
Publications

Global Database of Events, Language and Tone (GDELT)

Tutorials
Tools & Applications

ICGC on AWS

Tutorials
Publications

Materials Project Data

Tutorials
Tools & Applications
Publications

NOAA National Water Model CONUS Retrospective Dataset

Tutorials
Publications

OpenAQ

Tutorials
Tools & Applications
Publications

Reference Elevation Model of Antarctica (REMA)

Tools & Applications
Publications

Scottish Public Sector LiDAR Dataset

Tutorials
Tools & Applications
Publications

nuPlan

Tutorials
Tools & Applications
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5 and 3.7

Tutorials
Tools & Applications
Publications

10m Annual Land Use Land Cover (9-class)

Tools & Applications
Publications

Argoverse

Tutorials
Tools & Applications
Publications

BossDB Open Neuroimagery Datasets

Tutorials
Tools & Applications
Publications

CMIP6 GCMs downscaled using WRF

Tutorials
Publications

Capella Space Synthetic Aperture Radar (SAR) Open Dataset

Tutorials
Tools & Applications
Publications

Clinical Proteomic Tumor Analysis Consortium 3 (CPTAC-3)

Tools & Applications
Publications

HYbrid Coordinate Ocean Model Global Ocean Forecast System Reanalysis

Tutorials
Tools & Applications
Publications

IBL Neuropixels Reproducible Ephys Data on AWS

Tutorials
Tools & Applications
Publications

NOAA Rapid Refresh Forecast System (RRFS) [Prototype]

Publications

NYU Langone & FAIR FastMRI Dataset

Tutorials
Publications

New York City Taxi and Limousine Commission (TLC) Trip Record Data

Tutorials

Open Bioinformatics Reference Data for Galaxy

Tutorials
Tools & Applications
Publications

OpenEEW

Tutorials
Tools & Applications
Publications

Pacific Ocean Sound Recordings

Tutorials
Publications

PoroTomo

Tutorials
Publications

RarePlanes

Tutorials
Tools & Applications
Publications

Serratus: Ultra-deep Search for Novel Viruses - Versioned Data Release

Tools & Applications
Publications

Solar Dynamics Observatory (SDO) Machine Learning Dataset

Tutorials
Tools & Applications
Publications

The MIT Supercloud Dataset

Tutorials
Tools & Applications
Publications

2010 Census Production Settings Demographic and Housing Characteristics (DHC) Demonstration Noisy Measurement File

Publications

2020 Census Demographic and Housing Characteristics (DHC) Noisy Measurement File

Publications

3000 Rice Genomes Project

Tools & Applications
Publications

Amazonia EO satellite on AWS

Tutorials
Tools & Applications

Argo marine floats data and metadata from Global Data Assembly Centre (Argo GDAC)

Tutorials
Tools & Applications
Publications

Automated Segmentation of Intracellular Substructures in Electron Microscopy (ASEM) on AWS

Tutorials
Tools & Applications
Publications

CAM6 Data Assimilation Research Testbed (DART) Reanalysis: Cloud-Optimized Dataset

Tutorials
Tools & Applications
Publications

CAncer MEtastases in LYmph nOdes challeNge (CAMELYON) Dataset

Tools & Applications
Publications

CESM-HR

Tutorials
Tools & Applications
Publications

CoMMpass from the Multiple Myeloma Research Foundation

Tools & Applications
Publications

Community Earth System Model Large Ensemble (CESM LENS)

Tutorials
Tools & Applications
Publications

Daylight Map Distribution of OpenStreetMap

Tutorials
Publications

ESA WorldCover Sentinel-1 and Sentinel-2 10m Annual Composites

Tutorials
Tools & Applications
Publications

End-Use Load Profiles for the U.S. Building Stock

Tutorials
Tools & Applications
Publications

Global 30m Height Above Nearest Drainage (HAND)

Tutorials
Tools & Applications
Publications

Global Seasonal Sentinel-1 Interferometric Coherence and Backscatter Data Set

Tutorials
Publications

IBL Behavioral Data on AWS

Tutorials
Tools & Applications
Publications

JMA Himawari-8/9

Publications

NASA / USGS Lunar Orbiter Laser Altimeter Cloud Optimized Point Cloud

Tutorials
Tools & Applications

NIH NCBI Sequence Read Archive (SRA) on AWS

Tutorials
Tools & Applications
Publications

Normalized Difference Urban Index (NDUI)

Tutorials
Tools & Applications
Publications

Open-Meteo Weather API Database

Tutorials
Tools & Applications
Publications

OpenStreetMap on AWS

Tutorials
Tools & Applications

Overture Maps Foundation Open Map Data

Tutorials
Publications

Ozone Monitoring Instrument (OMI) / Aura NO2 Tropospheric Column Density

Tutorials
Tools & Applications

Prefeitura Municipal de São Paulo (PMSP) LiDAR Point Cloud

Tools & Applications
Publications

RACECAR Dataset

Tutorials
Tools & Applications
Publications

Sea Around Us Global Fisheries Catch Data

Tutorials
Tools & Applications
Publications

Sofar Spotter Archive

Tutorials
Publications

SondeHub Radiosonde Telemetry

Tutorials
Tools & Applications
Publications

The Human Connectome Project

Tutorials
Tools & Applications
Publications

Basic Local Alignment Sequences Tool (BLAST) Databases

Tools & Applications
Publications

CMAS Data Warehouse

Tutorials

Chalmers Cloud Ice Climatology

Tutorials
Tools & Applications
Publications

Community Earth System Model v2 Large Ensemble (CESM2 LENS)

Tools & Applications
Publications

ECMWF real-time forecasts

Tutorials
Tools & Applications
Publications

Earth Radio Occultation

Tutorials
Tools & Applications
Publications

Encyclopedia of DNA Elements (ENCODE)

Tutorials
Publications

GEOS-Chem Input Data

Tutorials
Publications

Genome in a Bottle on AWS

Tools & Applications
Publications

HYCOM-OceanTrack Integrated HYCOM Eulerian Fields and Lagrangian Trajectories Dataset

Tutorials
Tools & Applications
Publications

High Resolution Canopy Height Maps by WRI and Meta

Tools & Applications
Publications

Indiana Statewide Digital Aerial Imagery Catalog

Tools & Applications
Publications

JAXA / USGS / NASA Kaguya/SELENE Terrain Camera Observations

Tutorials
Tools & Applications

Molecular Profiling to Predict Response to Treatment (phs001965)

Tools & Applications
Publications

Mouse Brain Anatomy: MouseLight Imagery

Tools & Applications
Publications

NA-CORDEX - North American component of the Coordinated Regional Downscaling Experiment

Tools & Applications
Publications

NAIP on AWS

Tutorials
Tools & Applications
Publications

NASA / USGS Controlled Europa DTMs

Tutorials
Tools & Applications

NASA / USGS Controlled THEMIS Mosaics

Tutorials
Tools & Applications

NASA / USGS Europa Controlled Observation Mosaics

Tutorials
Tools & Applications

NASA / USGS Europa Controlled Observations

Tutorials
Tools & Applications

NASA / USGS Mars Reconnaissance Orbiter (MRO) Context Camera (CTX) Targeted DTMs

Tutorials
Tools & Applications

NASA / USGS Released HiRISE Digital Terrain Models

Tutorials
Tools & Applications

NASA / USGS Uncontrolled HiRISE RDRs

Tutorials
Tools & Applications

NOAA Global Historical Climatology Network Daily (GHCN-D)

Tutorials
Publications

NOAA High-Resolution Rapid Refresh (HRRR) Model

Tutorials
Publications

NREL National Solar Radiation Database

Tools & Applications
Publications

OpenAlex dataset

Tutorials
Publications

OpenCell on AWS

Tools & Applications
Publications

Refgenie reference genome assets

Tutorials
Tools & Applications
Publications

SILO climate data on AWS

Tutorials
Tools & Applications

Sea Surface Temperature Daily Analysis: European Space Agency Climate Change Initiative product version 2.1

Tutorials
Publications

Sentinel-1

Tools & Applications

Sentinel-2 L2A 120m Mosaic

Tutorials
Tools & Applications
Publications

Sentinel-3

Tutorials
Tools & Applications
Publications

Sentinel-5P Level 2

Tutorials
Tools & Applications
Publications

SiPeCaM (Sitios Permanentes de la Calibración y Monitoreo de la Biodiversidad)

Tutorials
Publications

Speedtest by Ookla Global Fixed and Mobile Network Performance Maps

Tutorials

Storm EVent ImageRy (SEVIR)

Tutorials
Tools & Applications
Publications

Synthea synthetic patient generator data in OMOP Common Data Model

Tutorials
Tools & Applications

UK Biobank Linkage Disequilibrium Matrices

Tutorials
Tools & Applications
Publications

UK Biobank Pan-Ancestry Summary Statistics

Tutorials
Tools & Applications
Publications

Virginia Coastal Resilience Master Plan, Phase 1 - December 2021

Tools & Applications
Publications

Yale-CMU-Berkeley (YCB) Object and Model Set

Publications

iSDAsoil

Tutorials
Tools & Applications
Publications

real-changesets

Tools & Applications
Publications

ASF SAR Data Products for Disaster Events

Tools & Applications
Publications

Allen Ivy Glioblastoma Atlas

Tutorials
Tools & Applications
Publications

Allen Mouse Brain Atlas

Tutorials
Tools & Applications
Publications

Beat Acute Myeloid Leukemia (AML) 1.0

Tools & Applications
Publications

Blended TROPOMI+GOSAT Satellite Data Product for Atmospheric Methane

Tutorials
Publications

Broad Genome References

Tutorials
Tools & Applications
Publications

COBRA

Tools & Applications

COVID-19 Harmonized Data

Tutorials
Tools & Applications

Cell Organelle Segmentation in Electron Microscopy (COSEM) on AWS

Publications

CitrusFarm Dataset

Tools & Applications
Publications

Clinical Trial Sequencing Project - Diffuse Large B-Cell Lymphoma

Tools & Applications
Publications

Department of Energy’s Geothermal Data Repository (GDR) Data Lake

Tutorials
Publications

Distributed Archives for Neurophysiology Data Integration (DANDI)

Tools & Applications

EURO-CORDEX - European component of the Coordinated Regional Downscaling Experiment

Tools & Applications
Publications

Exceptional Responders Initiative

Tools & Applications
Publications

Finnish Meteorological Institute Weather Radar Data

Tutorials

Foundation Medicine Adult Cancer Clinical Dataset (FM-AD)

Tools & Applications
Publications

Golden Retriever Lifetime Study: Whole genome genotyping of Golden Retrievers on Axiom HD Arrays

Tutorials
Publications

I-CARE: International Cardiac Arrest REsearch consortium Electroencephalography Database

Tools & Applications
Publications

Japanese Tokenizer Dictionaries

Tutorials
Tools & Applications
Publications

Kraken2 NCBI RefSeq Complete V205 database on AWS

Tutorials
Tools & Applications
Publications

KyFromAbove on AWS

Publications

MIMIC-III (‘Medical Information Mart for Intensive Care’)

Tutorials
Tools & Applications

Medical Segmentation Decathlon

Tutorials
Tools & Applications
Publications

Multiview Extended Video with Activities (MEVA)

Publications

NASA Earth Exchange Global Daily Downscaled Projections (NEX-GDDP-CMIP6)

Tools & Applications
Publications

NASA High Energy Astrophysics Mission Data

Tutorials
Tools & Applications
Publications

NASA Legacy Archive for Microwave Background Data Analysis (LAMBDA)

Tutorials
Tools & Applications
Publications

NASA SOHO/LASCO2 comet challenge on AWS

Publications

NASA Space Biology Open Science Data Repository (OSDR)

Publications

NIFS Large Helical Device (LHD) Experiment

Tutorials
Tools & Applications
Publications

NOAA - hourly position, current, and sea surface temperature from drifters

Tutorials
Publications

NOAA Emergency Response Imagery

Tutorials
Publications

NOAA Global Ensemble Forecast System (GEFS) Re-forecast

Tutorials
Publications

NapierOne Mixed File Dataset

Tutorials
Publications

National Cancer Institute Imaging Data Commons (IDC) Collections

Tutorials
Tools & Applications
Publications

National Herbarium of NSW

Tutorials
Publications

Open City Model (OCM)

Tutorials

Open VLF: Scientific Open Data Initiative for CRAAM's SAVNET and AWESOME VLF Data.

Tools & Applications
Publications

OpenProteinSet

Tutorials
Publications

Pohang Canal Dataset: A Multimodal Maritime Dataset for Autonomous Navigation in Restricted Waters

Tools & Applications
Publications

SPaRCNet data: Seizures, Rhythmic and Periodic Patterns in ICU Electroencephalography

Tools & Applications
Publications

STOIC2021 Training

Tools & Applications
Publications

Sophos/ReversingLabs 20 Million malware detection dataset

Tutorials
Tools & Applications
Publications

The Human Microbiome Project

Publications

Variant Effect Predictor (VEP) and the Loss-Of-Function Transcript Effect Estimator (LOFTEE) Plugin

Tools & Applications

VirtualFlow Ligand Libraries

Tutorials
Tools & Applications
Publications

Wind AI Bench

Tutorials

1940 Census Population Schedules, Enumeration District Maps, and Enumeration District Descriptions

Tutorials
Tools & Applications

1950 Census Population Schedules, Enumeration District Maps, and Enumeration District Descriptions

Tutorials
Tools & Applications

2010 Census Production Settings Redistricting Data (P.L. 94-171) Demonstration Noisy Measurement File

Publications

2020 Census Redistricting Data (P.L. 94-171) Noisy Measurement File

Publications

4D Nucleome (4DN)

Tutorials

A Global Drought and Flood Catalogue from 1950 to 2016

Tutorials
Publications

Africa Soil Information Service (AfSIS) Soil Chemistry

Tutorials
Publications

AgricultureVision

Publications

Allen Institute for Brain Science - Synaptic Physiology Public Data Set

Tools & Applications
Publications

Allen Institute for Neural Dynamics - Extracellular Electrophysiology Compression Benchmark

Tutorials
Publications

Astrophysics Division Galaxy Segmentation Benchmark Dataset

Publications

Atmospheric Models from Météo-France

Tools & Applications

Aurora Multi-Sensor Dataset

Tutorials
Publications

Binding DB - Data Lakehouse Ready

Tutorials
Publications

Biological and Physical Sciences (BPS) Microscopy Benchmark Training Dataset

Publications

Biological and Physical Sciences (BPS) RNA Sequencing Benchmark Training Dataset

Publications

COVID-19 Data Lake

Tutorials
Tools & Applications

Cancer Genome Characterization Initiatives - Burkitt Lymphoma, HIV+ Cervical Cancer

Tools & Applications
Publications

Cell Painting Image Collection

Tools & Applications
Publications

Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge

Tutorials
Publications

Collection of open nation-scale LiDAR datasets

Tutorials
Tools & Applications

Consented Activities of People

Tools & Applications

Copernicus Digital Elevation Model (DEM)

Tools & Applications

CoversBR

Tutorials

Covid Job Impacts - US Hiring Data Since March 1 2020

Tutorials
Tools & Applications

DNAStack COVID19 SRA Data

Tutorials
Tools & Applications

DigitalCorpora

Publications

Downscaled Climate Data for Alaska (v1.1, August 2023)

Publications

EMory BrEast Imaging Dataset (EMBED)

Tutorials
Publications

Ford Multi-AV Seasonal Dataset

Tutorials

GATK Structural Variation (SV) Data

Tutorials
Tools & Applications

Genomic Characterization of Metastatic Castration Resistant Prostate Cancer

Tools & Applications
Publications

Harvard Electroencephalography Database

Tools & Applications
Publications

Harvard-Emory ECG Database

Tools & Applications
Publications

Hecatomb Databases

Tutorials
Publications

Indexes for Kaiju

Tutorials
Publications

Integrative Analysis of Lung Adenocarcinoma in Environment and Genetics Lung cancer Etiology (Phase 2)

Tools & Applications

NASA Physical Sciences Informatics (PSI)

Publications

NOAA Analysis of Record for Calibration (AORC) Dataset

Tutorials
Publications

NOAA Climate Forecast System (CFS)

Publications

NOAA Multi-Radar/Multi-Sensor System (MRMS)

Publications

NOAA Unified Forecast System Subseasonal to Seasonal Prototypes

Publications

NOAA World Ocean Database (WOD)

Publications

National Archives Catalog

Tutorials
Tools & Applications

National Cancer Institute Center for Cancer Research - Diffuse Large B Cell Lymphoma (DLBCL) Genomics and Expression

Tools & Applications
Publications

Nighttime-Fire-Flare

Publications

OpenCRAVAT

Tutorials
Tools & Applications

Oregon Health & Science University Chronic Neutrophilic Leukemia Dataset

Tools & Applications
Publications

PALSAR-2 ScanSAR Turkey & Syria Earthquake (L2.1 & L1.1)

Publications

Pancreatic Cancer Organoid Profiling

Tools & Applications
Publications

Protein Data Bank 3D Structural Biology Data

Publications

RAPID NRT Flood Maps

Publications

REDASA COVID-19 Open Data

Tools & Applications
Publications

Reference data for HiFi human WGS

Tutorials
Tools & Applications

Sentinel-1 SLC dataset for South and Southeast Asia, Taiwan, Korea and Japan

Tutorials
Publications

Sounds of Central African landscapes

Publications

Sub-Meter Canopy Tree Height of California in 2020 by CTrees.org

Publications

TIGER Training

Tools & Applications

Terra Fusion Data Sampler

Tutorials
Tools & Applications

Transiting Exoplanet Survey Satellite (TESS)

Tools & Applications
Publications

USGS COAWST (Coupled Ocean Atmosphere Wave and Sediment Transport) Forecast Model Archive, US East and Gulf Coasts

Tutorials
Publications

UniProt

Tutorials

Whiffle WINS50 Open Data on AWS

Tutorials
Publications

1000 Genomes

Publications

3-Band Cryo Data | Wide-field Infrared Survey Explorer (WISE)

Tutorials

3DCoMPaT: Composition of Materials on Parts of 3D Things

Publications

A2D2: Audi Autonomous Driving Dataset

Tutorials

AI2 Diagram Dataset (AI2D)

Publications

AI2 Meaningful Citations Data Set

Publications

AI2 Reasoning Challenge (ARC) 2018

Publications

ARPA-E PERFORM Forecast data

Tools & Applications

All-Sky Data | Wide-field Infrared Survey Explorer (WISE)

Tutorials

AllWISE Data | Wide-field Infrared Survey Explorer (WISE)

Tutorials

Allen Brain Observatory - Visual Coding AWS Public Data Set

Tutorials

Allen Institute for Neural Dynamics - Mouse Neuroanatomy and Physiology Data

Tutorials

Analysis Ready Sentinel-1 Backscatter Imagery

Tutorials

Astrophysics Division Galaxy Morphology Benchmark Dataset

Publications

CIViC (Clinical Interpretation of Variants in Cancer)

Publications

CMS 2008-2010 Data Entrepreneurs’ Synthetic Public Use File (DE-SynPUF) in OMOP Common Data Model

Tutorials
Tools & Applications

COVID-19 Genome Sequence Dataset

Tools & Applications

COVID-19 Open Research Dataset (CORD-19)

Tools & Applications

Co-Produced Climate Data to Support California's Resilience Investments

Tutorials

Common Screens

Tutorials

Community Earth System Model v2 ARISE (CESM2 ARISE)

Tutorials

Conformational Space of Short Peptides

Tutorials

Corn Kernel Counting Dataset

Publications

Coupled Model Intercomparison Project Phase 5 (CMIP5) University of Wisconsin-Madison Probabilistic Downscaling Dataset

Publications

Crowdsourced Bathymetry

Tutorials

Defense Meteorology Satellite Program (DMSP) Auroral Particle Flux

Tools & Applications

Discrete Reasoning Over the content of Paragraphs (DROP)

Publications

End of Term Web Archive Dataset

Publications

Ensemble Meteorological Dataset for Planet Earth, EM-Earth

Publications

GATK Test Data

Tools & Applications

Geosnap Data, Center for Geospatial Sciences

Tools & Applications

Global Biodiversity Information Facility (GBIF) Species Occurrences

Tutorials

High Resolution Population Density Maps + Demographic Estimates by CIESIN and Meta

Tutorials

High-Order Accurate Direct Numerical Simulation of Flow over a MTU-T161 Low Pressure Turbine Blade

Publications

Human Cancer Models Initiative (HCMI) Cancer Model Development Center

Tools & Applications

Human PanGenomics Project

Publications

IDEAM - Colombian Radar Network

Tutorials

Image classification - fast.ai datasets

Tools & Applications

Korea Meteorological Administration (KMA) GK-2A Satellite Data

Publications

LOFAR ELAIS-N1 cycle 2 observations on AWS

Publications

Legal Entity Identifier (LEI) and Legal Entity Reference Data (LE-RD)

Publications

Longitudinal Nutrient Deficiency

Publications

MODIS MYD13A1, MOD13A1, MYD11A1, MOD11A1, MCD43A4

Tools & Applications

Mars Spectrometry 2: Gas Chromatography for the Sample Analysis at Mars Data (SAM) Instrument

Publications

Mars Spectrometry: Detect Evidence for Past Habitability

Publications

Multi-robot, Multi-Sensor, Multi-Environment Event Dataset (M3ED)

Publications

MultiCoNER Datasets

Publications

My School Today

Tutorials

NEOWISE Post-Cryo Data | Wide-field Infrared Survey Explorer (WISE)

Tutorials

NEOWISE Reactivation Data | Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE)

Tutorials

NIH NCBI PubMed Central (PMC) Article Datasets - Full-Text Biomedical and Life Sciences Journal Articles on AWS

Tutorials

NOAA Coastal Lidar Data

Tools & Applications

NOAA Global Forecast System (GFS)

Publications

NOAA Global Surface Summary of Day

Tutorials

NOAA Integrated Surface Database (ISD)

Tutorials

NOAA Multi-Year Reanalysis of Remotely Sensed Storms (MYRORSS)

Publications

NOAA National Digital Forecast Database (NDFD)

Publications

NOAA National Water Model Short-Range Forecast

Publications

NOAA S-111 Surface Water Currents Data

Tutorials

NOAA U.S. Climate Normals

Tutorials

NOAA Wave Ensemble Reforecast

Tutorials

NOAA/PMEL Ocean Climate Stations Moorings

Publications

NYUMets Brain Dataset

Publications

Natural Earth

Publications

New Jersey Statewide Digital Aerial Imagery Catalog

Tutorials

New Jersey Statewide LiDAR

Tutorials

Ohio State Cardiac MRI Raw Data (OCMR)

Tutorials

OpenSurfaces

Publications

OpenUniverse Matched Rubin and Roman Simulations: Data Preview (Troxel et al. 2024)

Tutorials

Orcasound - bioacoustic data for marine conservation

Tools & Applications

Oxford Nanopore Technologies Benchmark Datasets

Tutorials

PALSAR-2 ScanSAR CARD4L (L2.2)

Publications

PALSAR-2 ScanSAR Flooding in Rwanda (L2.1)

Publications

PALSAR-2 ScanSAR Tropical Cyclone Mocha (L2.1)

Publications

Public Utility Data Liberation Project

Tutorials

QIIME 2 User Tutorial Datasets

Tutorials

Quoref

Publications

Reasoning Over Paragraph Effects in Situations (ROPES)

Publications

SILAM Air Quality

Tutorials

Safecast

Tools & Applications

Seattle Alzheimer's Disease Brain Cell Atlas (SEA-AD)

Tools & Applications

Sentinel-1 SLC dataset for Germany

Tutorials

Single-Cell Atlas of Human Blood During Healthy Aging

Publications

Spitzer Enhanced Imaging Products (SEIP) Super Mosaics

Tutorials

Sup3rCC

Tutorials

Swiss Public Transport Stops

Tools & Applications

Synthea Coherent Data Set

Publications

Tabula Muris

Publications

Tabula Muris Senis

Tutorials

Tabula Sapiens

Publications

Tropical Cyclone Precipitation, Infrared, Microwave, and Environmental Dataset (TC PRIMED)

Tutorials

U.S. Census ACS PUMS

Tutorials

UK Biobank Pharma Proteomics Project (UKB-PPP)

Publications

UK Earth System Model (UKESM1) ARISE-SAI geoengineering experiment data

Tutorials

VitalDB

Publications

Voices Obscured in Complex Environmental Settings (VOiCES)

Tutorials

World Bank Climate Change Knowledge Portal (CCKP)

Publications

Xiph.Org Test Media

Tutorials

ZINC Database

Publications

iHART Whole Genome Sequencing Data Set

Publications

recount3

Tutorials

Baby Open Brains (BOBs) Repository on AWS

Tutorials
Tools & Applications
Publications

MegaScenes

Tutorials
Tools & Applications
Publications

Amazon Bin Image Dataset

Publications

ChEMBL - Data Lakehouse Ready

Tutorials
Publications

ClinVar - Data Lakehouse Ready

Tutorials
Publications

Estimating Confidence Intervals for 2020 Census Statistics Using an Approximate Monte Carlo Simulation

Tools & Applications
Publications

Open Targets - Data Lakehouse Ready

Tutorials
Publications

YouTube 8 Million - Data Lakehouse Ready

Tutorials
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5 - Data Lakehouse Ready

Tutorials

AWS Public Blockchain Data

Publications

AWS iGenomes

Tools & Applications

Amazon-PQA

Publications

Answer Reformulation

Publications

Automatic Speech Recognition (ASR) Error Robustness

Publications

BodyM Dataset

Publications

DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue

Publications

Enriched Topical-Chat Dataset for Knowledge-Grounded Dialogue Systems

Publications

Genome Aggregation Database (gnomAD) - Data Lakehouse Ready

Tutorials

Google Brain Genomics Sequencing Dataset for Benchmarking and Development

Publications

Helpful Sentences from Reviews

Publications

Humor Detection from Product Question Answering Systems

Publications

Humor patterns used for querying Alexa traffic

Publications

Learning to Rank and Filter - community question answering

Publications

Low Context Name Entity Recognition (NER) Datasets with Gazetteer

Publications

Multi Token Completion

Publications

Multilingual Name Entity Recognition (NER) Datasets with Gazetteer

Publications

PASS: Perturb-and-Select Summarizer for Product Reviews

Publications

PersonPath22

Publications

Phrase Clustering Dataset (PCD)

Publications

Pre- and post-purchase product questions

Publications

Product Comparison Dataset for Online Shopping

Publications

PyEnvs and CallArgs

Publications

Shopping Humor Generation

Publications

Visual Anomaly (VisA)

Publications

WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation

Publications

Wizard of Tasks

Publications

If you want to add a dataset or usage example to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository or tell us about your project.
