The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial data sets. Learn more about AWS Data Exchange

Usage examples for all datasets listed in the Registry of Open Data on AWS tagged with bioinformatics.


The Human Sleep Project

Tools & Applications
Publications

1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5, 3.7, 4.0, and 4.2

Tutorials
Tools & Applications
Publications

CZ CELLxGENE Discover Census

Tutorials
Tools & Applications
Publications

Genome Aggregation Database (gnomAD)

Tools & Applications
Publications

The Singapore Nanopore Expression Data Set

Tutorials
Tools & Applications
Publications

Garvan Institute Long Read Sequencing Benchmark Data

Tutorials
Tools & Applications
Publications

PubSeq - Public Sequence Resource

Tutorials
Tools & Applications
Publications

Toxicant Exposures and Responses by Genomic and Epigenomic Regulators of Transcription (TaRGET)

Tutorials
Tools & Applications
Publications

Open Bioinformatics Reference Data for Galaxy

Tutorials
Tools & Applications
Publications

Caenorabditis Diversity Natural Resource

Tutorials
Publications

Basic Local Alignment Sequences Tool (BLAST) Databases

Tools & Applications
Publications

Encyclopedia of DNA Elements (ENCODE)

Tutorials
Publications

Refgenie reference genome assets

Tutorials
Tools & Applications
Publications

Synthea synthetic patient generator data in OMOP Common Data Model

Tutorials
Tools & Applications

Broad Genome References

Tutorials
Tools & Applications
Publications

I-CARE:International Cardiac Arrest REsearch consortium Electroencephalography Database

Tools & Applications
Publications

Kraken2 NCBI RefSeq Complete V205 database on AWS

Tutorials
Tools & Applications
Publications

MIMIC-III (‘Medical Information Mart for Intensive Care’)

Tutorials
Tools & Applications

NASA Space Biology Open Science Data Repository (OSDR)

Publications

QIIME 2 Tutorial Data

Tutorials

SPaRCNet data:Seizures, Rhythmic and Periodic Patterns in ICU Electroencephalography

Tools & Applications
Publications

VirtualFlow Ligand Libraries

Tutorials
Tools & Applications
Publications

4D Nucleome (4DN)

Tutorials

Biodiversity Heritage Library Metadata and Page Images

Tools & Applications
Publications

COVID-19 Data Lake

Tutorials
Tools & Applications

Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge

Tutorials
Publications

DNAStack COVID19 SRA Data

Tutorials
Tools & Applications

Emory Knee Radiograph (MRKR) dataset

Tutorials
Publications

GATK Structural Variation (SV) Data

Tutorials
Tools & Applications

Harvard Electroencephalography Database

Tools & Applications
Publications

Harvard-Emory ECG Database

Tools & Applications
Publications

Hecatomb Databases

Tutorials
Publications

Indexes for Kaiju

Tutorials
Publications

Protein Data Bank 3D Structural Biology Data

Publications

UniProt

Tutorials

CMS 2008-2010 Data Entrepreneurs’ Synthetic Public Use File (DE-SynPUF) in OMOP Common Data Model

Tutorials
Tools & Applications

COVID-19 Genome Sequence Dataset

Tools & Applications

Conformational Space of Short Peptides

Tutorials

GATK Test Data

Tools & Applications

Global Biodiversity Information Facility (GBIF) Species Occurrences

Tutorials

Oxford Nanopore Technologies Benchmark Datasets

Tutorials

Synthea Coherent Data Set

Publications

recount3

Tutorials

SocialGene RefSeq Databases

Tutorials
Tools & Applications
Publications

GenomeKit genomic data

Tutorials
Tools & Applications

Platinum Pedigree

Publications

Google Brain Genomics Sequencing Dataset for Benchmarking and Development

Publications

OceanOmics

Tutorials

If you want to add a dataset or usage example to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository or tell us about your project.

Home