Name: The Singapore Nanopore Expression Data Set
License: CC BY-NC 4.0

Tags: bam, bioinformatics, fast5, fasta, fastq, genomic, life sciences, long read sequencing, short read sequencing, transcriptomics

Description

The Singapore Nanopore Expression (SG-NEx) project is an international collaboration to generate reference transcriptomes and a comprehensive benchmark data set for long-read Nanopore RNA-Seq. Transcriptomes are profiled using PCR-cDNA sequencing (PCR-cDNA), amplification-free cDNA sequencing (direct cDNA), direct sequencing of native RNA (direct RNA), and short-read RNA-Seq. The SG-NEx core data set covers 5 of the most commonly used cell lines and is extended with additional cell lines and samples that cover a broad range of human tissues. All core samples are sequenced with at least 3 high-quality replicates. For a subset of samples, spike-in RNAs are used and matched m6A profiling data is available.

Update Frequency

Datasets will be updated periodically as additional data are generated.

Documentation

https://github.com/GoekeLab/sg-nex-data

Managed By

The Genome Institute of Singapore (https://www.a-star.edu.sg/gis)

Contact

SG-NEx team

How to Cite

The Singapore Nanopore Expression Data Set was accessed on DATE from https://registry.opendata.aws/sgnex. When referencing the SG-NEx data in publications, please also cite: Chen et al. A systematic benchmark of Nanopore long read RNA sequencing for transcript level analysis in human cell lines. bioRxiv (2021). doi: https://doi.org/10.1101/2021.04.21.440736

Usage Examples

Resources on AWS

Description

Nanopore long-read RNA-Seq data and matched short-read RNA-Seq data from the Singapore Nanopore Expression (SG-NEx) project. The data includes raw signal data (fast5), basecalled reads (fastq), aligned reads (bam), processed data for RNA modification detection (json), reference genome and annotation files (fa and gtf), and sample metadata (txt).

Resource type

S3 Bucket

Amazon Resource Name (ARN)

arn:aws:s3:::sg-nex-data

AWS Region

ap-southeast-1

AWS CLI Access (No AWS account required)

aws s3 ls --no-sign-request s3://sg-nex-data/
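
The listing command above extends naturally to browsing a prefix or mirroring it locally. The following is a minimal shell sketch, not an official SG-NEx script: the helper names `sgnex_uri`, `sgnex_ls`, and `sgnex_sync` are hypothetical, and no key prefix is assumed because the bucket layout is described in the GitHub documentation rather than here. It uses only the `aws s3 ls` and `aws s3 sync` subcommands with the `--no-sign-request` and `--region` options.

```shell
#!/usr/bin/env bash
# Hypothetical helpers for anonymous access to the SG-NEx bucket.
SGNEX_BUCKET="sg-nex-data"
SGNEX_REGION="ap-southeast-1"

# Build an s3:// URI from a key or prefix (a leading slash is tolerated).
sgnex_uri() {
  printf 's3://%s/%s\n' "$SGNEX_BUCKET" "${1#/}"
}

# List the objects under a prefix without AWS credentials.
# With no argument, the top level of the bucket is listed.
sgnex_ls() {
  aws s3 ls --no-sign-request --region "$SGNEX_REGION" "$(sgnex_uri "${1:-}")"
}

# Mirror a prefix into a local directory; sync copies only new or updated files.
sgnex_sync() {
  aws s3 sync --no-sign-request --region "$SGNEX_REGION" "$(sgnex_uri "$1")" "$2"
}
```

After sourcing the snippet, `sgnex_ls` with no argument reproduces the command above, and `sgnex_sync <prefix> <local-dir>` downloads everything under a prefix taken from that listing. Raw signal files can be very large, so it is worth listing a prefix before syncing it.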

Description

Nanopore long-read RNA-Seq data from the Singapore Nanopore Expression (SG-NEx) project. This bucket holds the raw signal data in blow5 format, converted from the original fast5 files.

Resource type

S3 Bucket

Amazon Resource Name (ARN)

arn:aws:s3:::sg-nex-data-blow5

AWS Region

ap-southeast-1

AWS CLI Access (No AWS account required)

aws s3 ls --no-sign-request s3://sg-nex-data-blow5/
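
Because the bucket is readable without an account, a single object can be fetched with `aws s3 cp`, and its address can also be written as an HTTPS URL. The sketch below assumes the standard virtual-hosted S3 URL form (`https://<bucket>.s3.<region>.amazonaws.com/<key>`) and that objects are served over plain HTTPS, which this page does not state explicitly. The function names `blow5_url` and `blow5_get` are hypothetical, and the object key must come from a bucket listing.

```shell
#!/usr/bin/env bash
# Hypothetical helpers for the blow5 bucket.
BLOW5_BUCKET="sg-nex-data-blow5"
BLOW5_REGION="ap-southeast-1"

# HTTPS URL for an object key, assuming the virtual-hosted S3 URL form.
# Keys containing spaces or other special characters would need URL-encoding.
blow5_url() {
  printf 'https://%s.s3.%s.amazonaws.com/%s\n' "$BLOW5_BUCKET" "$BLOW5_REGION" "${1#/}"
}

# Copy one object into a local directory (default: current directory)
# without AWS credentials.
blow5_get() {
  aws s3 cp --no-sign-request --region "$BLOW5_REGION" \
    "s3://$BLOW5_BUCKET/${1#/}" "${2:-.}/"
}
```

Calling `blow5_get <key> <local-dir>` downloads one file, while `blow5_url <key>` prints an address that can be passed to tools such as `curl` or `wget`.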

