Usage examples for all datasets listed in the Registry of Open Data on AWS tagged with genomic.


The Cancer Genome Atlas

Tools & Applications
Publications

Therapeutically Applicable Research to Generate Effective Treatments (TARGET)

Tools & Applications
Publications

Gabriella Miller Kids First Pediatric Research Program (Kids First)

Tools & Applications
Publications

Genome Aggregation Database (gnomAD)

Tools & Applications
Publications

PubSeq - Public Sequence Resource

Tutorials
Tools & Applications
Publications

Cancer Cell Line Encyclopedia (CCLE)

Tools & Applications
Publications

Toxicant Exposures and Responses by Genomic and Epigenomic Regulators of Transcription (TaRGET)

Tutorials
Tools & Applications
Publications

Clinical Proteomic Tumor Analysis Consortium 2 (CPTAC-2)

Tools & Applications
Publications

ICGC on AWS

Tutorials
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5 and 3.7

Tutorials
Tools & Applications
Publications

Clinical Proteomic Tumor Analysis Consortium 3 (CPTAC-3)

Tools & Applications
Publications

Open Bioinformatics Reference Data for Galaxy

Tutorials
Tools & Applications
Publications

3000 Rice Genomes Project

Tools & Applications
Publications

CoMMpass from the Multiple Myeloma Research Foundation

Tools & Applications
Publications

NIH NCBI Sequence Read Archive (SRA) on AWS

Tutorials
Tools & Applications
Publications

Basic Local Alignment Sequences Tool (BLAST) Databases

Tools & Applications
Publications

Encyclopedia of DNA Elements (ENCODE)

Tutorials
Publications

Genome in a Bottle on AWS

Tools & Applications
Publications

Refgenie reference genome assets

Tutorials
Tools & Applications
Publications

UK Biobank Pan-Ancestry Summary Statistics

Tutorials
Tools & Applications
Publications

Beat Acute Myeloid Leukemia (AML) 1.0

Tools & Applications
Publications

Broad Genome References

Tutorials
Tools & Applications
Publications

Clinical Trial Sequencing Project - Diffuse Large B-Cell Lymphoma

Tools & Applications
Publications

Foundation Medicine Adult Cancer Clinical Dataset (FM-AD)

Tools & Applications
Publications

Serratus: Ultra-deep Search for Novel Viruses - Versioned Data Release

Tools & Applications
Publications

The Human Microbiome Project

Publications

Variant Effect Predictor (VEP) and the Loss-Of-Function Transcript Effect Estimator (LOFTEE) Plugin

Tools & Applications

4D Nucleome (4DN)

Tutorials

Cancer Genome Characterization Initiatives - Burkitt Lymphoma, HIV+ Cervical Cancer

Tools & Applications
Publications

Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge

Tutorials
Publications

DNAStack COVID19 SRA Data

Tutorials
Tools & Applications

Hecatomb Databases

Tutorials
Publications

National Cancer Institute Center for Cancer Research - Diffuse Large B Cell Lymphoma (DLBCL) Genomics and Expression

Tools & Applications
Publications

Oregon Health & Science University Chronic Neutrophilic Leukemia Dataset

Tools & Applications
Publications

Pancreatic Cancer Organoid Profiling

Tools & Applications
Publications

1000 Genomes

Publications

AWS iGenomes

Tools & Applications

CIViC (Clinical Interpretation of Variants in Cancer)

Publications

COVID-19 Genome Sequence Dataset

Tools & Applications

GATK Test Data

Tools & Applications

Human Cancer Models Initiative (HCMI) Cancer Model Development Center

Tools & Applications

Human PanGenomics Project

Publications

Oxford Nanopore Technologies Benchmark Datasets

Tutorials

QIIME 2 User Tutorial Datasets

Tutorials

Tabula Muris

Publications

iHART Whole Genome Sequencing Data Set

Publications

Binding DB - Data Lakehouse Ready

Tutorials
Publications

ChEMBL - Data Lakehouse Ready

Tutorials
Publications

ClinVar - Data Lakehouse Ready

Tutorials
Publications

GATK Structural Variation (SV) Data

Tutorials
Tools & Applications

Open Targets - Data Lakehouse Ready

Tutorials
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5 - Data Lakehouse Ready

Tutorials

Genome Aggregation Database (gnomAD) - Data Lakehouse Ready

Tutorials

Google Brain Genomics Sequencing Dataset for Benchmarking and Development

Publications

If you want to add a dataset or usage example to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository or tell us about your project.

Home