A centralized sequence repository for all strains of novel corona virus (SARS-CoV-2) submitted to the National Center for Biotechnology Information (NCBI). Included are both the original sequences submitted by the principal investigator as well as SRA-processed sequences that require the SRA Toolkit for analysis.
See all datasets managed by National Library of Medicine (NLM).
sra-srcfolder are in FASTQ, BAM, or CRAM format (original submission); files in the
runfolder are in .sra format and require the SRA Toolkit
aws s3 ls s3://sra-pub-sars-cov2/ --no-sign-request
aws s3 ls s3://sra-pub-sars-cov2-metadata-us-east-1/ --no-sign-request