bioinformatics genomic genotyping Homo sapiens life sciences long read sequencing whole genome sequencing
The Platinum Pedigree Consortium (PCC) is a collaborative project to create a comprehensive reference for human genetic variation using a four-generation, 28-member family (CEPH-1463). We employed five different short and long-read sequencing technologies to generate phased assemblies and characterize both inherited and de novo variation, including at some of the most difficult to genotype genomic regions such as tandem repeats, centromeres, and the Y chromosome. This extensive "truth set" is publicly available and can be used to test and benchmark new algorithms and technologies to better understand human genetic variation.
As needed
https://github.com/Platinum-Pedigree-Consortium
Platinum Pedigree Consortium
See all datasets managed by Platinum Pedigree Consortium.
https://github.com/Platinum-Pedigree-Consortium/Platinum-Pedigree-Datasets/issues
Platinum Pedigree was accessed on DATE
from https://registry.opendata.aws/platinum-pedigree.
arn:aws:s3:::platinum-pedigree-data
us-west-1
aws s3 ls --no-sign-request s3://platinum-pedigree-data/