bioinformatics genome genomic Homo sapiens life sciences Mus musculus non-human primate open source software Rattus norvegicus variant annotation
GenomeKit is Deep Genomics’ Python library for fast and easy access to genomic resources such as sequence, data tracks, and annotations. The goal is to let machine learning researchers build data sets easily, and to be creative about how those data sets are designed. Out of the box, GenomeKit provides access to pre-built optimized genomic data files that are required for its operation.
Data is updated when popular new genome versions (assemblies or annotations) are released
Apache License Version 2.0 https://www.apache.org/licenses/LICENSE-2.0
https://deepgenomics.github.io/GenomeKit/data_org.html
Deep Genomics
See all datasets managed by Deep Genomics.
https://github.com/deepgenomics/GenomeKit/issues
GenomeKit genomic data was accessed on DATE
from https://registry.opendata.aws/genomekit.
arn:aws:s3:::genomekit-public-dg
us-east-1
aws s3 ls --no-sign-request s3://genomekit-public-dg/