The Registry of Open Data on AWS is now available on AWS Data Exchange
All datasets on the Registry of Open Data are now discoverable on AWS Data Exchange alongside 3,000+ existing data products from category-leading data providers across industries. Explore the catalog to find open, free, and commercial data sets. Learn more about AWS Data Exchange

BUSCO Datasets

assembly bacteria bioinformatics genomic life sciences metagenomics open source software protein virus

Description

Lineage datasets for use with BUSCO software package. Each dataset contains HMM profiles for clade specific, universal, single-copy marker genes. Datasets are available across archaea, bacteria, eukaryota and virus domains. The repository also includes necessary data files for phylogenetic placement of an input assembly.

Update Frequency

New datasets are released to correspond with updates in OrthoDB versions. Maintenance updates occur a few times a year if necessary to fix any bugs or update metadata.

License

The BUSCO datasets are licensed under the Creative Commons Attribution-NoDerivatives 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nd/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA. Any use of these datasets for analyses in a publication or product must include the citation of the corresponding paper - https://doi.org/10.1093/molbev/msab199

Documentation

https://busco.ezlab.org/busco_userguide.html#lineage-datasets

Managed By

Computational Evolutionary Genomics Group, University of Geneva

See all datasets managed by Computational Evolutionary Genomics Group, University of Geneva.

Contact

https://gitlab.com/ezlab/busco/-/issues

How to Cite

BUSCO Datasets was accessed on DATE from https://registry.opendata.aws/busco-data.

Usage Examples

Tutorials
Publications

Resources on AWS

  • Description
    BUSCO datasets and companion files for use with BUSCO pipeline
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::busco-data
    AWS Region
    us-east-1
    AWS CLI Access (No AWS account required)
    aws s3 ls --no-sign-request s3://busco-data/
  • Description
    Notifications for new BUSCO data
    Resource type
    SNS Topic
    Amazon Resource Name (ARN)
    arn:aws:sns:us-east-1:622022425660:my-dataset-object_created
    AWS Region
    us-east-1

Edit this dataset entry on GitHub

Tell us about your project

Home