
Cornell EAS Data Lake

agriculture climate earth observation elevation environmental geospatial mapping meteorological weather

Deprecated

The provider of this dataset no longer provides this data through this mechanism. If there are any questions, reach out through the contact mechanism listed below.


Description

Earth & Atmospheric Sciences at Cornell University has created a public data lake of climate data. The data is stored in a columnar storage format (ORC) so that it is straightforward to query with standard tools such as Amazon Athena or Apache Spark. The data was originally intended for building decision support tools for farmers and digital agriculture.

The first dataset is the historical NDFD / NDGD data distributed by NCEP / NOAA / NWS. The NDFD (National Digital Forecast Database) and NDGD (National Digital Guidance Database) contain gridded forecasts and observations at 2.5 km resolution for the Contiguous United States (CONUS). There are also 5 km grids for several smaller US regions and non-contiguous states and territories, such as Hawaii, Guam, Puerto Rico and Alaska. NOAA distributes archives of the NDFD/NDGD via its NOAA Operational Model Archive and Distribution System (NOMADS) in GRIB2 format. The data has been converted to ORC to optimize storage space and, more importantly, to simplify data access via standard data analytics tools.
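As a rough illustration of the Apache Spark route mentioned above, the following Python sketch shows one way ORC files in a public bucket could be read without AWS credentials. It is a sketch under assumptions, not the provider's documented method: the prefix is a hypothetical placeholder (the bucket layout is not described on this page), the function names are made up for illustration, PySpark and the hadoop-aws connector are assumed to be installed, and since this dataset is marked deprecated the bucket may no longer be readable.

```python
# Bucket name taken from the S3 ARN listed on this page.
BUCKET = "cornell-eas-data-lake"
# Hypothetical placeholder: the real key layout is not documented here.
HYPOTHETICAL_PREFIX = "path/to/orc/"


def s3a_uri(bucket: str, prefix: str) -> str:
    """Build an s3a:// URI, the scheme used by Hadoop's S3A connector."""
    return f"s3a://{bucket}/{prefix.lstrip('/')}"


def anonymous_s3a_conf() -> dict:
    """Spark settings asking S3A to send unsigned (anonymous) requests."""
    return {
        "spark.hadoop.fs.s3a.aws.credentials.provider":
            "org.apache.hadoop.fs.s3a.AnonymousAWSCredentialsProvider",
    }


def read_orc(uri: str):
    """Open the ORC files under `uri` as a Spark DataFrame."""
    # Imported lazily so the helpers above work without PySpark installed.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName("eas-data-lake-sketch")
    for key, value in anonymous_s3a_conf().items():
        builder = builder.config(key, value)
    spark = builder.getOrCreate()
    return spark.read.orc(uri)
```

Calling `read_orc(s3a_uri(BUCKET, HYPOTHETICAL_PREFIX))` would return a DataFrame whose schema can be inspected with `printSchema()`, provided the placeholder prefix is replaced with a real one. Because ORC is columnar, selecting only the needed columns before any aggregation lets Spark skip the rest of each file.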

Update Frequency

Hourly

License

https://datalake.eas.cornell.edu/license.txt

Documentation

https://datalake.eas.cornell.edu/

Managed By

Not currently managed


Contact

digitalag@cornell.edu

How to Cite

Cornell EAS Data Lake was accessed on DATE from https://registry.opendata.aws/cornell-eas-data-lake.

Resources on AWS

  • Description: Cornell EAS Data Lake
    Resource type: S3 Bucket
    Amazon Resource Name (ARN): arn:aws:s3:::cornell-eas-data-lake
    AWS Region: us-east-2
    AWS CLI Access (No AWS account required): aws s3 ls --no-sign-request s3://cornell-eas-data-lake/
  • Description: Cornell EAS Data Lake Notifications. Used to send human readable information about updates to the EAS Data Lake.
    Resource type: SNS Topic
    Amazon Resource Name (ARN): arn:aws:sns:us-east-2:003709786761:cornell-eas-data-lake-human
    AWS Region: us-east-2
  • Description: Cornell EAS Data Lake Automation Notifications. Used to send JSON notifications to automated build pipelines and ETL jobs when the EAS Data Lake is updated.
    Resource type: SNS Topic
    Amazon Resource Name (ARN): arn:aws:sns:us-east-2:003709786761:cornell-eas-data-lake
    AWS Region: us-east-2
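
The ARNs listed above and the JSON notifications sent on the automation topic can be handled with the Python standard library alone. The sketch below splits an ARN into its fields and unwraps an SNS notification envelope as delivered to an HTTP/S or SQS subscription without raw message delivery. The function names are illustrative, and the schema of the inner message is an assumption: this page does not document what the data lake publishes, so the payload in the usage note is purely hypothetical.

```python
import json

# ARN of the automation topic, copied from the resource list on this page.
AUTOMATION_TOPIC_ARN = "arn:aws:sns:us-east-2:003709786761:cornell-eas-data-lake"


def parse_arn(arn: str) -> dict:
    """Split an ARN of the form arn:partition:service:region:account:resource."""
    parts = arn.split(":", 5)
    if len(parts) != 6 or parts[0] != "arn":
        raise ValueError(f"not a valid ARN: {arn!r}")
    keys = ("partition", "service", "region", "account_id", "resource")
    return dict(zip(keys, parts[1:]))


def decode_notification(raw_body: str):
    """Return the inner JSON payload of an SNS notification, or None.

    Envelopes that are not of type "Notification" (for example subscription
    confirmations) or that come from a different topic are ignored.
    """
    envelope = json.loads(raw_body)
    if envelope.get("Type") != "Notification":
        return None
    if envelope.get("TopicArn") != AUTOMATION_TOPIC_ARN:
        return None
    # SNS carries the published message as a string; the automation topic is
    # described as sending JSON, so it is decoded a second time here.
    return json.loads(envelope["Message"])
```

For example, `parse_arn("arn:aws:s3:::cornell-eas-data-lake")` yields an empty region and account ID, which is normal for S3 bucket ARNs. An ETL job could pass each received body to `decode_notification` and act only when it returns a payload, for instance an envelope whose `Message` field holds a made-up string such as `{"example_key": "example_value"}`.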
