The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial data sets. Learn more about AWS Data Exchange

Usage examples for all datasets listed in the Registry of Open Data on AWS tagged with genomic.


The Cancer Genome Atlas

Tools & Applications
Publications

Therapeutically Applicable Research to Generate Effective Treatments (TARGET)

Tools & Applications
Publications

Gabriella Miller Kids First Pediatric Research Program (Kids First)

Tools & Applications
Publications

Genome Aggregation Database (gnomAD)

Tools & Applications
Publications

The Singapore Nanopore Expression Data Set

Tutorials
Tools & Applications
Publications

Garvan Institute Long Read Sequencing Benchmark Data

Tutorials
Tools & Applications
Publications

PubSeq - Public Sequence Resource

Tutorials
Tools & Applications
Publications

Cancer Cell Line Encyclopedia (CCLE)

Tools & Applications
Publications

Toxicant Exposures and Responses by Genomic and Epigenomic Regulators of Transcription (TaRGET)

Tutorials
Tools & Applications
Publications

Clinical Proteomic Tumor Analysis Consortium 2 (CPTAC-2)

Tools & Applications
Publications

ICGC on AWS

Tutorials
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5 and 3.7

Tutorials
Tools & Applications
Publications

Clinical Proteomic Tumor Analysis Consortium 3 (CPTAC-3)

Tools & Applications
Publications

Open Bioinformatics Reference Data for Galaxy

Tutorials
Tools & Applications
Publications

Serratus: Ultra-deep Search for Novel Viruses - Versioned Data Release

Tools & Applications
Publications

3000 Rice Genomes Project

Tools & Applications
Publications

CoMMpass from the Multiple Myeloma Research Foundation

Tools & Applications
Publications

NIH NCBI Sequence Read Archive (SRA) on AWS

Tutorials
Tools & Applications
Publications

Basic Local Alignment Sequences Tool (BLAST) Databases

Tools & Applications
Publications

Encyclopedia of DNA Elements (ENCODE)

Tutorials
Publications

Genome in a Bottle on AWS

Tools & Applications
Publications

Molecular Profiling to Predict Response to Treatment (phs001965)

Tools & Applications
Publications

Refgenie reference genome assets

Tutorials
Tools & Applications
Publications

UK Biobank Linkage Disequilibrium Matrices

Tutorials
Tools & Applications
Publications

UK Biobank Pan-Ancestry Summary Statistics

Tutorials
Tools & Applications
Publications

Beat Acute Myeloid Leukemia (AML) 1.0

Tools & Applications
Publications

Broad Genome References

Tutorials
Tools & Applications
Publications

Clinical Trial Sequencing Project - Diffuse Large B-Cell Lymphoma

Tools & Applications
Publications

Exceptional Responders Initiative

Tools & Applications
Publications

Foundation Medicine Adult Cancer Clinical Dataset (FM-AD)

Tools & Applications
Publications

NASA Space Biology Open Science Data Repository (OSDR)

Publications

The Human Microbiome Project

Publications

Variant Effect Predictor (VEP) and the Loss-Of-Function Transcript Effect Estimator (LOFTEE) Plugin

Tools & Applications

4D Nucleome (4DN)

Tutorials

Binding DB - Data Lakehouse Ready

Tutorials
Publications

Cancer Genome Characterization Initiatives - Burkitt Lymphoma, HIV+ Cervical Cancer

Tools & Applications
Publications

Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge

Tutorials
Publications

DNAStack COVID19 SRA Data

Tutorials
Tools & Applications

GATK Structural Variation (SV) Data

Tutorials
Tools & Applications

Genomic Characterization of Metastatic Castration Resistant Prostate Cancer

Tools & Applications
Publications

Hecatomb Databases

Tutorials
Publications

Indexes for Kaiju

Tutorials
Publications

Integrative Analysis of Lung Adenocarcinoma in Environment and Genetics Lung cancer Etiology (Phase 2)

Tools & Applications

National Cancer Institute Center for Cancer Research - Diffuse Large B Cell Lymphoma (DLBCL) Genomics and Expression

Tools & Applications
Publications

OpenCRAVAT

Tutorials
Tools & Applications

Oregon Health & Science University Chronic Neutrophilic Leukemia Dataset

Tools & Applications
Publications

Pancreatic Cancer Organoid Profiling

Tools & Applications
Publications

1000 Genomes

Publications

CIViC (Clinical Interpretation of Variants in Cancer)

Publications

COVID-19 Genome Sequence Dataset

Tools & Applications

GATK Test Data

Tools & Applications

Human Cancer Models Initiative (HCMI) Cancer Model Development Center

Tools & Applications

Human PanGenomics Project

Publications

Oxford Nanopore Technologies Benchmark Datasets

Tutorials

QIIME 2 User Tutorial Datasets

Tutorials

Synthea Coherent Data Set

Publications

Tabula Muris

Publications

Tabula Muris Senis

Tutorials

Tabula Sapiens

Publications

iHART Whole Genome Sequencing Data Set

Publications

recount3

Tutorials

ChEMBL - Data Lakehouse Ready

Tutorials
Publications

ClinVar - Data Lakehouse Ready

Tutorials
Publications

Open Targets - Data Lakehouse Ready

Tutorials
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5 - Data Lakehouse Ready

Tutorials

AWS iGenomes

Tools & Applications

Genome Aggregation Database (gnomAD) - Data Lakehouse Ready

Tutorials

Google Brain Genomics Sequencing Dataset for Benchmarking and Development

Publications

If you want to add a dataset or usage example to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository or tell us about your project.

Home