NCBI SRA Gene Feature RNA-Seq counts

genetic life sciences RNA-seq transcriptomics

Description

The NIH Sequence Read Archive (SRA), hosted by the [National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM) stores sequencing data and alignment information from high-throughput next-generation sequencing platforms. SRA has conducted gene expression analysis of publicly released human and mouse RNA-Seq experiments to process raw RNA-seq reads into concise formats that summarize the expression results. The un-normalized feature counts for each SRA record are available in tab-delimited (*.tsv) format. The tsv files include two columns, the gene id and count. These counts facilitate differential gene expression analyses, particularly across studies and the entirety of the SRA corpus.

Update Frequency

Daily

License

NCBI Policy and NIH Genomic Data Sharing Policy

Documentation

https://www.ncbi.nlm.nih.gov/geo/info/rnaseqcounts.html

Managed By

NCBI at NLM

See all datasets managed by NCBI at NLM.

Contact

sra@ncbi.nlm.nih.gov

How to Cite

NCBI SRA Gene Feature RNA-Seq counts was accessed on DATE from https://registry.opendata.aws/ncbi-sra-rnaseq.

Usage Examples

Publications

Resources on AWS

  • Description
    *.tsv files in a public s3 public. The files contain un-normalized feature counts for the SRA run accession listed in the file name. The scope of these records is human and mouse, bulk RNA-Seq data released in the non-controlled access portion of SRA.
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::sra-rnaseq-analysis
    AWS Region
    us-east-1
    AWS CLI Access (No AWS account required)
    aws s3 ls --no-sign-request s3://sra-rnaseq-analysis/

Edit this dataset entry on GitHub

Tell us about your project

Home