Name: NCBI SRA Gene Feature RNA-Seq counts
License: NCBI Policy and NIH Genomic Data Sharing Policy

Description

The NIH Sequence Read Archive (SRA), hosted by the [National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM) stores sequencing data and alignment information from high-throughput next-generation sequencing platforms. SRA has conducted gene expression analysis of publicly released human and mouse RNA-Seq experiments to process raw RNA-seq reads into concise formats that summarize the expression results. The un-normalized feature counts for each SRA record are available in tab-delimited (*.tsv) format. The tsv files include two columns, the gene id and count. These counts facilitate differential gene expression analyses, particularly across studies and the entirety of the SRA corpus.

Update Frequency

Daily

License

NCBI Policy and NIH Genomic Data Sharing Policy

Documentation

https://www.ncbi.nlm.nih.gov/geo/info/rnaseqcounts.html

Managed By

NCBI at NLM

See all datasets managed by NCBI at NLM.

Contact

sra@ncbi.nlm.nih.gov

How to Cite

NCBI SRA Gene Feature RNA-Seq counts was accessed on DATE from https://registry.opendata.aws/ncbi-sra-rnaseq.

Usage Examples

Publications

The Sequence Read Archive by Leinonen et al (2011)

Resources on AWS

Description

*.tsv files in a public s3 public. The files contain un-normalized feature counts for the SRA run accession listed in the file name. The scope of these records is human and mouse, bulk RNA-Seq data released in the non-controlled access portion of SRA.

Resource type

S3 Bucket

Amazon Resource Name (ARN)

arn:aws:s3:::sra-rnaseq-analysis

AWS Region

us-east-1

AWS CLI Access (No AWS account required)

aws s3 ls --no-sign-request s3://sra-rnaseq-analysis/