assembly bacteria bioinformatics genomic life sciences metagenomics open source software protein virus
Lineage datasets for use with BUSCO software package. Each dataset contains HMM profiles for clade specific, universal, single-copy marker genes. Datasets are available across archaea, bacteria, eukaryota and virus domains. The repository also includes necessary data files for phylogenetic placement of an input assembly.
New datasets are released to correspond with updates in OrthoDB versions. Maintenance updates occur a few times a year if necessary to fix any bugs or update metadata.
The BUSCO datasets are licensed under the Creative Commons Attribution-NoDerivatives 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nd/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA. Any use of these datasets for analyses in a publication or product must include the citation of the corresponding paper - https://doi.org/10.1093/molbev/msab199
https://busco.ezlab.org/busco_userguide.html#lineage-datasets
Computational Evolutionary Genomics Group, University of Geneva
See all datasets managed by Computational Evolutionary Genomics Group, University of Geneva.
https://gitlab.com/ezlab/busco/-/issues
BUSCO Datasets was accessed on DATE
from https://registry.opendata.aws/busco-data.
arn:aws:s3:::busco-data
us-east-1
aws s3 ls --no-sign-request s3://busco-data/
arn:aws:sns:us-east-1:622022425660:my-dataset-object_created
us-east-1