The goal of the project is to identify recurrent genetic alterations (mutations, deletions, amplifications, rearrangements) and/or gene expression signatures. National Cancer Institute (NCI) utilized whole genome sequencing and/or whole exome sequencing in conjunction with transcriptome sequencing. The samples were processed and submitted for genomic characterization using pipelines and procedures established within The Cancer Genome Analysis (TCGA) project.
Genomic Data Commons (GDC) is source of truth for this dataset; GDC offers monthly data releases, although this dataset may not be updated at every release.
NIH Genomic Data Sharing Policy: https://gdc.cancer.gov/access-data/data-access-policies
See all datasets managed by Center for Translational Data Science at The University of Chicago.
Clinical Trial Sequencing Project - Diffuse Large B-Cell Lymphoma was accessed on
DATE from https://registry.opendata.aws/ctsp-dlbcl.
aws s3 ls --no-sign-request s3://gdc-ctsp-phs001175-2-open/