bioinformatics health life sciences natural language processing us
The Synthea generated data is provided here as a 1,000 person (1k), 100,000 person (100k), and 2,800,000 persom (2.8m) data sets in the OMOP Common Data Model format. SyntheaTM is a synthetic patient generator that models the medical history of synthetic patients. Our mission is to output high-quality synthetic, realistic but not real, patient data and associated health records covering every aspect of healthcare. The resulting data is free from cost, privacy, and security restrictions. It can be used without restriction for a variety of secondary uses in academia, research, industry, and government (although a citation would be appreciated). You can read our first academic paper here: https://doi.org/10.1093/jamia/ocx079
Not updated
https://github.com/synthetichealth/synthea/blob/master/LICENSE
https://github.com/synthetichealth/synthea/wiki
See all datasets managed by Amazon Web Sevices.
Post any questions to re:Post and use the AWS Open Data
tag.
Synthea synthetic patient generator data in OMOP Common Data Model was accessed on DATE
from https://registry.opendata.aws/synthea-omop.
arn:aws:s3:::synthea-omop
us-east-1
aws s3 ls --no-sign-request s3://synthea-omop/