The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial data sets. Learn more about AWS Data Exchange

Usage examples for all datasets listed in the Registry of Open Data on AWS tagged with whole genome sequencing.


The Cancer Genome Atlas

Tools & Applications
Publications

Therapeutically Applicable Research to Generate Effective Treatments (TARGET)

Tools & Applications
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5, 3.7, 4.0, and 4.2

Tutorials
Tools & Applications
Publications

Gabriella Miller Kids First Pediatric Research Program (Kids First)

Tools & Applications
Publications

Genome Aggregation Database (gnomAD)

Tools & Applications
Publications

Cancer Cell Line Encyclopedia (CCLE)

Tools & Applications
Publications

Logan Unitigs and Contigs of the Sequence Read Archive (SRA) on AWS

Tutorials
Tools & Applications

CoMMpass from the Multiple Myeloma Research Foundation

Tools & Applications
Publications

NIH NCBI Sequence Read Archive (SRA) on AWS

Tutorials
Tools & Applications
Publications

Molecular Profiling to Predict Response to Treatment (phs001965)

Tools & Applications
Publications

Refgenie reference genome assets

Tutorials
Tools & Applications
Publications

Clinical Trial Sequencing Project - Diffuse Large B-Cell Lymphoma

Tools & Applications
Publications

Exceptional Responders Initiative

Tools & Applications
Publications

1000 Genomes

Tools & Applications
Publications

Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge

Tutorials
Publications

DNAStack COVID19 SRA Data

Tutorials
Tools & Applications

Genomic Characterization of Metastatic Castration Resistant Prostate Cancer

Tools & Applications
Publications

Hecatomb Databases

Tutorials
Publications

Indexes for Kaiju

Tutorials
Publications

Integrative Analysis of Lung Adenocarcinoma in Environment and Genetics Lung cancer Etiology (Phase 2)

Tools & Applications

Pancreatic Cancer Organoid Profiling

Tools & Applications
Publications

Reference data for HiFi human WGS

Tutorials
Tools & Applications

COVID-19 Genome Sequence Dataset

Tools & Applications

Human Cancer Models Initiative (HCMI) Cancer Model Development Center

Tools & Applications

Oxford Nanopore Technologies Benchmark Datasets

Tutorials

iHART Whole Genome Sequencing Data Set

Publications

MetaGraph Sequence Indexes

Tutorials
Tools & Applications
Publications

Platinum Pedigree

Publications

1KG-ONT-VIENNA panel

Publications

AllTheBacteria

Publications

Google Brain Genomics Sequencing Dataset for Benchmarking and Development

Publications

If you want to add a dataset or usage example to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository or tell us about your project.

Home