NIH NCBI Sequence Research Archive (SRA) on AWS

genetic genomic life sciences

Description

The Sequence Read Archive (SRA) stores raw sequencing data from the next generation of sequencing platforms including Roche 454 GS System®, Illumina Genome Analyzer®, Applied Biosystems SOLiD® System, Helicos Heliscope®, Complete Genomics®, and Pacific Biosciences SMRT®. SRA on AWS hosts a subset of SRA accession codes in two public S3 buckets.

Update Frequency

Bucket contents are expected to update daily as new datasets are submitted to SRA and older datasets are cycled based on age and usage.

License

CC BY-NC 2.5

Documentation

https://catalog.data.gov/dataset/sequence-read-archive-sra

Managed By

NCBI SRA

See all datasets managed by NCBI SRA.

Contact

SRA Staff

Usage Examples

Tutorials
Tools & Applications
Publications

Resources on AWS

  • Description
    .bam, .cram, and .fastq files in a public S3 bucket
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::sra-pub-src-1
    AWS Region
    us-east-1
  • Description
    .bam, .cram, and .fastq files in a public S3 bucket
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::sra-pub-src-2
    AWS Region
    us-east-1

Edit this dataset entry on GitHub

Home