Genome Aggregation Database (gnomAD)

bioinformatics genetic genomic life sciences population population genetics short read sequencing whole genome sequencing


The Genome Aggregation Database (gnomAD) is a resource developed by an international coalition of investigators that aggregates and harmonizes both exome and genome data from a wide range of large-scale human sequencing projects. The summary data provided here are released for the benefit of the wider scientific community without restriction on use. The v2 data set (GRCh37) spans 125,748 exome sequences and 15,708 whole-genome sequences from unrelated individuals. The v3 data set (GRCh38) spans 71,702 genomes, selected as in v2. Sign up for the gnomAD mailing list here.

Update Frequency

Data from new releases are made public as soon as they are available. New releases, including both minor and major versions, have historically been issued on the order of once per year.


MIT; terms of use


Managed By

gnomAD Production Team at the Broad Institute

See all datasets managed by gnomAD Production Team at the Broad Institute.


Usage Examples

Tools & Applications

Resources on AWS

  • Description
    gnomAD summary data aggregated from large-scale human genome and exome sequencing projects.
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    AWS Region
    AWS CLI Access (No AWS account required)
    aws s3 ls s3://gnomad-public-us-east-1/ --no-sign-request

Edit this dataset entry on GitHub