COVID-19 Data Lake

bioinformatics biology coronavirus COVID-19 health life sciences medicine MERS SARS

Description

A centralized repository of up-to-date and curated datasets on or related to the spread and characteristics of the novel corona virus (SARS-CoV-2) and its associated illness, COVID-19. Globally, there are several efforts underway to gather this data, and we are working with partners to make this crucial data freely available and keep it up-to-date. Hosted on the AWS cloud, we have seeded our curated data lake with COVID-19 case tracking data from Johns Hopkins and The New York Times, hospital bed availability from Definitive Healthcare, and over 45,000 research articles about COVID-19 and related coronaviruses from the Allen Institute for AI.

Update Frequency

Periodically

License

Varies by dataset

Documentation

https://aws.amazon.com/blogs/big-data/a-public-data-lake-for-analysis-of-covid-19-data/

Managed By

AWS Data Lake Team

See all datasets managed by AWS Data Lake Team.

Contact

aws-covid-19-data-lake@amazon.com

Usage Examples

Tutorials
Tools & Applications

Resources on AWS

  • Description
    Collected COVID-19 related datasets
    Resource type
    S3 Bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::covid19-lake
    AWS Region
    us-east-2

Edit this dataset entry on GitHub

Home