The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial data sets. Learn more about AWS Data Exchange

MetaGraph Sequence Indexes

analysis ready data biodiversity bioinformatics biology fasta genome genomic graph information retrieval life sciences medicine metagenomics microbiome transcriptomics whole exome sequencing whole genome sequencing

Description

The MetaGraph Sequence Indexes dataset comprises full-text searchable index files for raw sequencing data hosted in major public repositories. These include the European Nucleotide Archive (ENA) managed by the European Bioinformatics Institute (EMBL-EBI), the Sequence Read Archive (SRA) maintained by the National Center for Biotechnology Information (NCBI), and the DNA Data Bank of Japan (DDBJ) Sequence Read Archive (DRA).All index files can be used with the MetaGraph framework for sequence search. Indexes can be jointly used for aggregated search in the cloud or can be individually downloaded for search using local hardware.

Update Frequency

Continuously as new sequencing data becomes available.

License

CC BY-SA 4.0

Documentation

Documentation of the dataset available under https://github.com/ratschlab/metagraph-open-data Documentation of the MetaGraph framework is available under https://metagraph.ethz.ch/

Managed By

Biomedical Informatics Lab, ETH Zurich, Switzerland

See all datasets managed by Biomedical Informatics Lab, ETH Zurich, Switzerland.

Contact

Please open an issue under https://github.com/ratschlab/metagraph-open-data/issues

How to Cite

MetaGraph Sequence Indexes was accessed on DATE from https://registry.opendata.aws/metagraph. Karasikov M, Mustafa H, Danciu D, Zimmermann M, Barber C, Raetsch G, Kahles A. Indexing All Life’s Known Biological Sequences. Preprint (2024). doi: 10.1101/2020.10.01.322164

Usage Examples

Tutorials
Tools & Applications
Publications

Resources on AWS

  • Description
    MetaGraph Sequence Indexes
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::metagraph
    AWS Region
    eu-west-1
    AWS CLI Access (No AWS account required)
    aws s3 ls --no-sign-request s3://metagraph/
    Explore
    Content Browser

Edit this dataset entry on GitHub

Tell us about your project

Home