About

This registry exists to help people discover and share datasets that are available via AWS resources. See recent additions and learn more about sharing data on AWS.

See all usage examples for datasets listed in this registry tagged with environmental.


Search datasets (currently 13 matching datasets)

You are currently viewing a subset of data tagged with environmental.


Add to this registry

If you want to add a dataset or example of how to use a dataset to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository.

Unless specifically stated in the applicable dataset documentation, datasets available through the Registry of Open Data on AWS are not provided and maintained by AWS. Datasets are provided and maintained by a variety of third parties under a variety of licenses. Please check dataset licenses and related documentation to determine if a dataset may be used for your application.


Tell us about your project

If you have a project using a listed dataset, please tell us about it. We may work with you to feature your project in a blog post.

NASA Prediction of Worldwide Energy Resources (POWER)

agricultureair qualityanalyticsarchivesatmosphereclimateclimate modeldata assimilationdeep learningearth observationenergyenvironmentalforecastgeosciencegeospatialglobalhistoryimagingindustrymachine learningmachine translationmetadatameteorologicalmodelnetcdfopendapradiationsatellite imagerysolarstatisticssustainabilitytime series forecastingwaterweatherzarr

NASA's goal in Earth science is to observe, understand, and model the Earth system to discover how it is changing, to better predict change, and to understand the consequences for life on Earth. The Applied Sciences Program, within the Earth Science Division of the NASA Science Mission Directorate, serves individuals and organizations around the globe by expanding and accelerating societal and economic benefits derived from Earth science, information, and technology research and development.

The Prediction Of Worldwide Energy Resources (POWER) Project, funded through the Applied Sciences Program at ...

Details →

Usage examples

See 18 usage examples →

NOAA Operational Forecast System (OFS)

climatecoastaldisaster responseenvironmentalmeteorologicaloceanswaterweather

ANNOUNCEMENTS: [NOS OFS Version Updates and Implementation of Upgraded Oceanographic Forecast Modeling Systems for Lakes Superior and Ontario; Effective October 25, 2022}(https://www.weather.gov/media/notification/pdf2/scn22-91_nos_loofs_lsofs_v3.pdf)

For decades, mariners in the United States have depended on NOAA's Tide Tables for the best estimate of expected water levels. These tables provide accurate predictions of the astronomical tide (i.e., the change in water level due to the gravitational effects of the moon and sun and the rotation of the Earth); however, they cannot predict water-level changes due to wind, atmospheric pressure, and river flow, which are often significan...

Details →

Usage examples

See 11 usage examples →

Multi-Scale Ultra High Resolution (MUR) Sea Surface Temperature (SST)

climateearth observationenvironmentalnatural resourceoceanssatellite imagerywaterweather

A global, gap-free, gridded, daily 1 km Sea Surface Temperature (SST) dataset created by merging multiple Level-2 satellite SST datasets. Those input datasets include the NASA Advanced Microwave Scanning Radiometer-EOS (AMSR-E), the JAXA Advanced Microwave Scanning Radiometer 2 (AMSR-2) on GCOM-W1, the Moderate Resolution Imaging Spectroradiometers (MODIS) on the NASA Aqua and Terra platforms, the US Navy microwave WindSat radiometer, the Advanced Very High Resolution Radiometer (AVHRR) on several NOAA satellites, and in situ SST observations from the NOAA iQuam project. Data are available fro...

Details →

Usage examples

See 10 usage examples →

Department of Energy's Open Energy Data Initiative (OEDI)

energyenvironmentalgeospatiallidarmodelsolar

Data released under the Department of Energy's (DOE) Open Energy Data Initiative (OEDI). The Open Energy Data Initiative aims to improve and automate access of high-value energy data sets across the U.S. Department of Energy’s programs, offices, and national laboratories. OEDI aims to make data actionable and discoverable by researchers and industry to accelerate analysis and advance innovation.

Details →

Usage examples

See 9 usage examples →

NREL Wind Integration National Dataset

environmentalgeospatialmeteorological

Released to the public as part of the Department of Energy's Open Energy Data Initiative, the Wind Integration National Dataset (WIND) is an update and expansion of the Eastern Wind Integration Data Set and Western Wind Integration Data Set. It supports the next generation of wind integration studies.

Details →

Usage examples

See 9 usage examples →

Radiant MLHub

cogearth observationenvironmentalgeospatiallabeledmachine learningsatellite imagerystac

Radiant MLHub is an open library for geospatial training data that hosts datasets generated by Radiant Earth Foundation's team as well as other training data catalogs contributed by Radiant Earth’s partners. Radiant MLHub is open to anyone to access, store, register and/or share their training datasets for high-quality Earth observations. All of the training datasets are stored using a SpatioTemporal Asset Catalog (STAC) compliant catalog and exposed through a common API. Training datasets include pairs of imagery and labels for different types of machine learning problems including image ...

Details →

Usage examples

See 9 usage examples →

CMIP6 GCMs downscaled using WRF

agricultureatmosphereclimateearth observationenvironmentalmodeloceanssimulationsweather

High-resolution historical and future climate simulations from 1980-2100

Details →

Usage examples

See 8 usage examples →

NOAA Water-Column Sonar Data Archive

biodiversityearth observationecosystemsenvironmentalgeospatialmappingoceans

Water-column sonar data archived at the NOAA National Centers for Environmental Information.

Details →

Usage examples

See 8 usage examples →

Toxicant Exposures and Responses by Genomic and Epigenomic Regulators of Transcription (TaRGET)

bioinformaticsbiologyenvironmentalepigenomicsgeneticgenomiclife sciences

The TaRGET (Toxicant Exposures and Responses by Genomic and Epigenomic Regulators of Transcription) Program is a research consortium funded by the National Institute of Environmental Health Sciences (NIEHS). The goal of the collaboration is to address the role of environmental exposures in disease pathogenesis as a function of epigenome perturbation, including understanding the environmental control of epigenetic mechanisms and assessing the utility of surrogate tissue analysis in mouse models of disease-relevant environmental exposures.

Details →

Usage examples

See 8 usage examples →

Coupled Model Intercomparison Project 6

agricultureatmosphereclimateearth observationenvironmentalmodeloceanssimulationsweather

The sixth phase of global coupled ocean-atmosphere general circulation model ensemble.

Details →

Usage examples

See 7 usage examples →

Materials Project Data

chemistrycloud computingdata assimilationdigital assetsdigital preservationenergyenvironmentalfree softwaregenomeHPCinformation retrievalinfrastructurejsonmachine learningmaterials sciencemolecular dynamicsmoleculeopen source softwarephysicspost-processingx-ray crystallography

Materials Project is an open database of computed materials properties aiming to accelerate materials science research. The resources in this OpenData dataset contain the raw, parsed, and build data products.

Details →

Usage examples

See 7 usage examples →

NOAA National Water Model CONUS Retrospective Dataset

agricultureagricultureclimatedisaster responseenvironmentaltransportationweather

The NOAA National Water Model Retrospective dataset contains input and output from multi-decade CONUS retrospective simulations. These simulations used meteorological input fields from meteorological retrospective datasets. The output frequency and fields available in this historical NWM dataset differ from those contained in the real-time operational NWM forecast model. Additionally, note that no streamflow or other data assimilation is performed within any of the NWM retrospective simulations

One application of this dataset is to provide historical context to current near real-time streamflow, soil moisture and snowpack conditions. The retrospective data can be used to infer flow frequencies and perform tempor...

Details →

Usage examples

See 7 usage examples →

OpenAQ

air qualitycitiesenvironmentalgeospatial

Global, aggregated physical air quality data from public data sources provided by government, research-grade and other sources. These awesome groups do the hard work of measuring these data and publicly sharing them, and our community makes them more universally-accessible to both humans and machines.

Details →

Usage examples

See 7 usage examples →

Scottish Public Sector LiDAR Dataset

citiescoastalcogelevationenvironmentallidarurban

This dataset is Lidar data that has been collected by the Scottish public sector and made available under the Open Government Licence. The data are available as point cloud (LAS format or in LAZ compressed format), along with the derived Digital Terrain Model (DTM) and Digital Surface Model (DSM) products as Cloud optimized GeoTIFFs (COG) or standard GeoTIFF. The dataset contains multiple subsets of data which were each commissioned and flown in response to different organisational requirements. The details of each can be found at https://remotesensingdata.gov.scot/data#/list

Details →

Usage examples

See 7 usage examples →

10m Annual Land Use Land Cover (9-class)

cogearth observationenvironmentalgeospatialland coverland usemachine learningmappingplanetarysatellite imagerystacsustainability

This dataset, produced by Impact Observatory, Microsoft, and Esri, displays a global map of land use and land cover (LULC) derived from ESA Sentinel-2 imagery at 10 meter resolution for the years 2017 - 2023. Each map is a composite of LULC predictions for 9 classes throughout the year in order to generate a representative snapshot of each year. This dataset was generated by Impact Observatory, which used billions of human-labeled pixels (curated by the National Geographic Society) to train a deep learning model for land classification. Each global map was produced by applying this model to ...

Details →

Usage examples

See 6 usage examples →

Pacific Ocean Sound Recordings

acousticsbiodiversitybiologyclimatecoastaldeep learningecosystemsenvironmentalmachine learningmarine mammalsoceansopen source software

This project offers passive acoustic data (sound recordings) from a deep-ocean environment off central California. Recording began in July 2015, has been nearly continuous, and is ongoing. These resources are intended for applications in ocean soundscape research, education, and the arts.

Details →

Usage examples

See 6 usage examples →

Public Utility Data Liberation Project

economicselectricityenergyenergy modelingenvironmentalgeospatialgovernment recordsindustrialindustryinfrastructuremarket dataparquetregulatorysolarsqliteusutilities

The Public Utility Data Liberation Project (PUDL) provides analysis-ready U.S. energy system data in bulk for programmatic use. Sources include the U.S. Energy Information Administration (EIA), the Environmental Protection Agency (EPA), the Federal Energy Regulatory Commission (FERC), the Pipeline and Hazardous Materials Safety Administration (PHMSA), the Securities and Exchange Commission (SEC). The primary focus is on the electricity sector, with additional data on the natural gas system and energy company financial reporting.

Details →

Usage examples

See 6 usage examples →

Wildfire Projections to Support Climate Resilience

agricultureclimateclimate modelclimate projectionsdisaster responseelectricityenergyenvironmentalgeospatialmeteorologicalsolarsustainabilityweather

Wildfire projections for California and her environs in support of California's Fifth Climate Assessment supported with historical weather observations and renewable energy capacity profiles for grid operations.

Details →

Usage examples

See 6 usage examples →

CMAS Data Warehouse

air qualityclimatecloud computingenvironmentalgeospatialmeteorologicalmodelMPASnetcdfopen source software

CMAS Data Warehouse on AWS collects and disseminates meteorology, emissions and air quality model input and output for Community Multiscale Air Quality (CMAQ) Model Applications. This dataset is available as part of the AWS Open Data Program, therefore egress fees are not charged to either the host or the person downloading the data. This S3 bucket is maintained as a public service by the University of North Carolina's CMAS Center, the US EPA’s Office of Research and Development, and the US EPA’s Office of Air and Radiation. Metadata and DOIs for datasets included in the CMAS Data Wareho...

Details →

Usage examples

See 5 usage examples →

Global Seasonal Sentinel-1 Interferometric Coherence and Backscatter Data Set

agriculturecogearth observationearthquakesecosystemsenvironmentalgeologygeophysicsgeospatialglobalinfrastructuremappingnatural resourcesatellite imagerysynthetic aperture radarurban

This data set is the first-of-its-kind spatial representation of multi-seasonal, global SAR repeat-pass interferometric coherence and backscatter signatures. Global coverage comprises all land masses and ice sheets from 82 degrees northern to 79 degrees southern latitude. The data set is derived from high-resolution multi-temporal repeat-pass interferometric processing of about 205,000 Sentinel-1 Single-Look-Complex data acquired in Interferometric Wide-Swath mode (Sentinel-1 IW mode) from 1-Dec-2019 to 30-Nov-2020. The data set was developed by Earth Big Data LLC and Gamma Remote Sensing AG, ...

Details →

Usage examples

See 5 usage examples →

NOAA National Air Quality Forecast Capability (NAQFC) Regional Model Guidance

agricultureclimatedisaster responseenvironmentalmeteorologicalweather

The National Air Quality Forecasting Capability (NAQFC) dataset contains model-generated air quality (AQ) forecast guidance from three different prediction systems. The first system is a coupled weather and atmospheric chemistry numerical forecast model, known as the Air Quality Model (AQM). It is used to produce forecast guidance for ozone (O3) and particulate matter that is less than or equal to 2.5 micrometers in diameter (PM2.5). Prior to May 14, 2024, AQM predictions were derived using the EPA’s Community Multiscale Air Quality (CMAQ) model, driven by meteorological fields from NCEP’s operational weather forecast models, ...

Details →

Usage examples

See 5 usage examples →

Ozone Monitoring Instrument (OMI) / Aura NO2 Tropospheric Column Density

air qualityatmosphereearth observationenvironmentalgeospatialsatellite imagery

NO2 tropospheric column density, screened for CloudFraction < 30% global daily composite at 0.25 degree resolution for the temporal range of 2004 to May 2020. Original archive data in HDF5 has been processed into a Cloud-Optimized GeoTiff (COG) format. Quality Assurance - This data has been validated by the NASA Science Team at Goddard Space Flight Center.Cautionary Note: https://airquality.gsfc.nasa.gov/caution-interpretation.

Details →

Usage examples

See 5 usage examples →

SPARTAN Data

air qualityenvironmental

SPARTAN (Surface PARTiculate mAtter Network) measures and provides surface ambient particulate matter (PM2.5 and PM10) concentration and the chemical composition around the world, with the purpose of connecting ground-based PM2.5 and satellite remote sensing.

Details →

Usage examples

See 5 usage examples →

Sofar Spotter Archive

climateenvironmentalmeteorologicaloceansoceanssustainabilityweather

This dataset includes archival hourly data from the [Sofar Spotter buoy global network] (https://weather.sofarocean.com/) from 2019 to March 2022.

Details →

Usage examples

See 5 usage examples →

SondeHub Radiosonde Telemetry

climateenvironmentalGPSweather

SondeHub Radiosonde telemetry contains global radiosonde (weather balloon) data captured by SondeHub from our participating radiosonde_auto_rx receiving stations. radiosonde_auto_rx is a open source project aimed at receiving and decoding telemetry from airborne radiosondes using software-defined-radio techniques, enabling study of the telemetry and sometimes recovery of the radiosonde itself. Currently 313 receiver stations are providing data for an average of 384 radiosondes a day. The data within this repository contains received telemetry frames, including radiosonde type, gps position, a...

Details →

Usage examples

See 5 usage examples →

Chalmers Cloud Ice Climatology

atmosphereclimatedeep learningenvironmentalexplorationgeophysicsgeosciencegeospatialglobaliceplanetarysatellite imageryzarr

The Chalmers Cloud Ice Climatology (CCIC) is a novel, deep-learning-based climate record of ice-particle concentrations in the atmosphere. CCIC results are available at high spatial and temporal resolution (0.07° / 3 h from 1983, 0.036° / 30 min from 2000) and thus ideally suited for evaluating high-resolution weather and climate models or studying individual weather systems.

Details →

Usage examples

See 4 usage examples →

FoMo - A Multi-Season Dataset for Robot Navigation in Forêt Montmorency

autonomous vehiclesbenchmarkcomputer visionenvironmentalextreme weathergeospatialGNSSIMUlidarlocalizationmappingmeteorologicalperceptionradarRINEXroboticssignal processing

The FoMo dataset is a multi-season collection recorded in a boreal forest environment, featuring deep snow, off-road terrain, steep slopes, and highly variable weather. It provides synchronized multi-modal sensor data—including two lidars (RoboSense and Leishen), an FMCW radar (Navtech), stereo and monocular cameras, dual IMUs, wheel odometry, power data, calibration sequences, and precise ground-truth trajectories via GNSS-PPK fusion. Designed to support research on robust robot autonomy under adverse conditions, FoMo includes repeated traversals of six trajectories of varying complexity for ...

Details →

Usage examples

See 4 usage examples →

NOAA Global Forecast System (GFS)

agricultureclimatedisaster responseenvironmentalmeteorologicalweather

NOTE - Upgrade NCEP Global Forecast System to v16.3.0 - Effective November 29, 2022 See notification HERE

The Global Forecast System (GFS) is a weather forecast model produced by the National Centers for Environmental Prediction (NCEP). Dozens of atmospheric and land-soil variables are available through this dataset, from temperatures, winds, and precipitation to soil moisture and atmospheric ozone concentration. The entire globe is covered by the GFS at a base horizontal resolution of 18 miles (28 kilometers) between grid points, which is used by the operational forecasters who predict weather out to 16...

Details →

Usage examples

See 4 usage examples →

NOAA High-Resolution Rapid Refresh (HRRR) Model

agricultureclimatedisaster responseenvironmentalweather

The HRRR is a NOAA real-time 3-km resolution, hourly updated, cloud-resolving, convection-allowing atmospheric model, initialized by 3km grids with 3km radar assimilation. Radar data is assimilated in the HRRR every 15 min over a 1-h period adding further detail to that provided by the hourly data assimilation from the 13km radar-enhanced Rapid Refresh.

The HRRR ZARR formatted data was originally generated by the University of Utah under a grant provided by NOAA. They are are continuing to publish ZARR versions of HRRR data. For information about data in the s3://hrrrzarr/ please contact &#x...

Details →

Usage examples

See 4 usage examples →

NOAA's Coastal Ocean Reanalysis (CORA) Dataset: 1979-2022

agricultureagricultureclimatedisaster responseenvironmentaloceanstransportationweather

NOAA's Coastal Ocean Reanalysis (CORA) for the Gulf, East Coast/Atlantic, and Caribbean (GEC) is produced using verified hourly water levels from the National Ocean Service’s Center of Operational Oceanographic Products & Services (CO-OPS). ADvanced CIRCulation Model (ADCIRC) and Simulating WAves Nearshore (SWAN) models are coupled to model coastal water levels and nearshore waves. Hourly water level observations are used for data assimilation and validation to improve the accuracy of modeled water levels and wave datasets.

Additional Details:
Metadata associated with model domain and time span:

  • Timeseries - 1979 to 2022
  • Size - Approx. 44.6 TB
  • Domain - Lat 5.8 to 45.8 ; Long -98.0 to -53.8
...

Details →

Usage examples

See 4 usage examples →

SILO climate data on AWS

agricultureclimateearth observationenvironmentalmeteorologicalmodelsustainabilitywaterweather

SILO is a database of Australian climate data from 1889 to the present. It provides continuous, daily time-step data products in ready-to-use formats for research and operational applications. SIL...

Details →

Usage examples

See 4 usage examples →

Sea Surface Temperature Daily Analysis: European Space Agency Climate Change Initiative product version 2.1

climateearth observationenvironmentalgeospatialglobaloceans

Global daily-mean sea surface temperatures, presented on a 0.05° latitude-longitude grid, with gaps between available daily observations filled by statistical means, spanning late 1981 to recent time. Suitable for large-scale oceanographic meteorological and climatological applications, such as evaluating or constraining environmental models or case-studies of marine heat wave events. Includes temperature uncertainty information and auxiliary information about land-sea fraction and sea-ice coverage. For reference and citation see: www.nature.com/articles/s41597-019-0236-x.

Details →

Usage examples

See 4 usage examples →

Sentinel-3

cogearth observationenvironmentalgeospatiallandoceanssatellite imagerystac

This data set consists of observations from the Sentinel-3 satellite of the European Commission’s Copernicus Earth Observation Programme. Sentinel-3 is a polar orbiting satellite that completes 14 orbits of the Earth a day. It carries the Ocean and Land Colour Instrument (OLCI) for medium resolution marine and terrestrial optical measurements, the Sea and Land Surface Temperature Radiometer (SLSTR), the SAR Radar Altimeter (SRAL), the MicroWave Radiometer (MWR) and the Precise Orbit Determination (POD) instruments. The satellite was launched in 2016 and entered routine operational phase in 201...

Details →

Usage examples

See 4 usage examples →

Sentinel-5P Level 2

air qualityatmospherecogearth observationenvironmentalgeospatialsatellite imagerystac

This data set consists of observations from the Sentinel-5 Precursor (Sentinel-5P) satellite of the European Commission’s Copernicus Earth Observation Programme. Sentinel-5P is a polar orbiting satellite that completes 14 orbits of the Earth a day. It carries the TROPOspheric Monitoring Instrument (TROPOMI) which is a spectrometer that senses ultraviolet (UV), visible (VIS), near (NIR) and short wave infrared (SWIR) to monitor ozone, methane, formaldehyde, aerosol, carbon monoxide, nitrogen dioxide and sulphur dioxide in the atmosphere. The satellite was launched in October 2017 and entered ro...

Details →

Usage examples

See 4 usage examples →

Blended TROPOMI+GOSAT Satellite Data Product for Atmospheric Methane

climateenvironmentalsatellite imagery

A dataset of satellite retrievals of atmospheric methane that extends from 30 April 2018 to present.

Details →

Usage examples

See 3 usage examples →

Canopy Tree Height Map for the Amazon Forest (mean height composite 2020-2024) by CTrees.org

cogconservationdeep learningearth observationenvironmentalgeospatialimage processingland coverlidarsatellite imagery

Mean canopy Tree Height for the Amazon Forest on the period 2020-2024 at 4.78 m of spatial resolution. Created using a deep learning model on high-resolution Planet imagery from the Norway's International Climate and Forest Initiative (NICFI) Satellite Data Program. From the original research paper https://doi.org/10.48550/arXiv.2501.10600

Details →

Usage examples

See 3 usage examples →

Global aboveground biomass (AGB), 100m by CTrees.org

cogconservationdeep learningearth observationenvironmentalgeospatialimage processingland coversatellite imageryzarr

CTrees' global aboveground biomass (AGB) estimates, annually for 2000-2025 at a standard spatial resolution of 100 meters.

Details →

Usage examples

See 3 usage examples →

NASA Earth Exchange Global Daily Downscaled Projections (NEX-GDDP-CMIP6)

air temperatureclimateclimate modelclimate projectionsCMIP6cogearth observationenvironmentalglobalmodelNASA Center for Climate Simulation (NCCS)near-surface relative humiditynear-surface specific humiditynetcdfprecipitation

The NEX-GDDP-CMIP6 dataset is comprised of global downscaled climate scenarios derived from the General Circulation Model (GCM) runs conducted under the Coupled Model Intercomparison Project Phase 6 (CMIP6) and across two of the four "Tier 1" greenhouse gas emissions scenarios known as Shared Socioeconomic Pathways (SSPs). The CMIP6 GCM runs were developed in support of the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (IPCC AR6). This dataset includes downscaled projections from ScenarioMIP model runs for which daily scenarios were produced and distributed...

Details →

Usage examples

See 3 usage examples →

NOAA - hourly position, current, and sea surface temperature from drifters

climateenvironmentalmeteorologicaloceanssustainabilityweather

This dataset includes hourly sea surface temperature and current data collected by satellite-tracked surface drifting buoys ("drifters") of the NOAA Global Drifter Program. The Drifter Data Assembly Center (DAC) at NOAA’s Atlantic Oceanographic and Meteorological Laboratory (AOML) has applied quality control procedures and processing to edit these observational data and obtain estimates at regular hourly intervals. The data include positions (latitude and longitude), sea surface temperatures (total, diurnal, and non-diurnal components) and velocities (eastward, northward) with accompanying uncertainty estimates. Metadata include identification numbe...

Details →

Usage examples

See 3 usage examples →

NOAA IOOS MARACOOS Regional Ocean Modeling System (ROMS) "Doppio" Data Assimilative Reanalysis

coastalenvironmentalmarineoceans

This dataset, identified as Doppio Analysis Version 3 Release 3 (DopAnV3R3-ini2007) initialized January 2007, comprises outputs from a Regional Ocean Modeling System (ROMS) data assimilative reanalysis of ocean circulation in the Mid-Atlantic Bight and Gulf of Maine for 2007-2024.

A multi-year reanalysis (2007-2024) of circulation in the coastal ocean and adjacent deep sea of the northeast U.S. continental shelf has been computed using the Regional Ocean Modeling System (ROMS) with four-dimensional variational (4D-Var) data assimilation (DA) of observations from satellites, land-based ocean surface current measuring radar, and all available in situ observations from the MARACOOS ...

Details →

Usage examples

See 3 usage examples →

National Herbarium of NSW

agriculturebiodiversitybiologyclimatedigital preservationecosystemsenvironmental

The National Herbarium of New South Wales is one of the most significant scientific, cultural and historical botanical resources in the Southern hemisphere. The 1.43 million preserved plant specimens have been captured as high-resolution images and the biodiversity metadata associated with each of the images captured in digital form. Botanical specimens date from year 1770 to today, and form voucher collections that document the distribution and diversity of the world's flora through time, particularly that of NSW, Austalia and the Pacific.The data is used in biodiversity assessment, syste...

Details →

Usage examples

See 3 usage examples →

OPERA Land Surface Disturbance Annual from Harmonized Landsat Sentinel-2 product (Version 1)

cogearth observationenvironmentalgloballandland coverland use

The Observational Products for End-Users from Remote Sensing Analysis (OPERA) Land Surface Disturbance Annual from Harmonized Landsat Sentinel-2 (HLS) product Version 1 summarizes the DIST-ALERT data product into an annual vegetation disturbance data product. Vegetation disturbance is mapped when there is an indicated decrease in vegetation cover within an HLS Version 2 pixel. The product also provides auxiliary generic disturbance information as determined from the variations of the reflectance through the DIST-ALERT scenes to provide information about more general disturbance trends. The DIS...

Details →

Usage examples

See 3 usage examples →

QIIME 2 Tutorial Data

bioinformaticsbiologyecosystemsenvironmentalgeneticgenomichealthlife sciencesmetagenomicsmicrobiome

QIIME 2 (pronounced “chime two”) is a microbiome multi-omics bioinformatics and data science platform that is trusted, free, open source, extensible, and community developed and supported.

Details →

Usage examples

See 3 usage examples →

Africa Soil Information Service (AfSIS) Soil Chemistry

agricultureenvironmentalfood securitymachine learning

This dataset contains soil infrared spectral data and paired soil property reference measurements for georeferenced soil samples that were collected through the Africa Soil Information Service (AfSIS) project, which lasted from 2009 through 2018. In this release, we include data collected during Phase I (2009-2013.) Georeferenced samples were collected from 19 countries in Sub-Saharan African using a statistically sound sampling scheme, and their soil properties were analyzed using both conventional soil testing methods and spectral methods (infrared diffuse reflectance spectroscopy). The two ...

Details →

Usage examples

See 2 usage examples →

Atmospheric Models from Météo-France

agricultureclimatedisaster responseearth observationenvironmentalmeteorologicalmodelweather

Global and high-resolution regional atmospheric models from Météo-France.

  • ARPEGE World covers the entire world at a base horizontal resolution of 0.5° (~55km) between grid points, it predicts weather out up to 114 hours in the future.
  • ARPEGE Europe covers Europe and North-Africa at a base horizontal resolution of 0.1° (~11km) between grid points, it predicts weather out up to 114 hours in the future.
  • AROME France covers France at a base horizontal resolution of 0.025° (~2.5km) between grid points, it predicts weather out up to 42 hours in the future.
  • AROME France HD covers France and neighborhood a
...

Details →

Usage examples

See 2 usage examples →

Downscaled Climate Data for Alaska (v1.1, August 2023)

agricultureclimatecoastalearth observationenvironmentalsustainabilityweather

This dataset contains historical and projected dynamically downscaled climate data for the State of Alaska and surrounding regions at 20km spatial resolution and hourly temporal resolution. Select variables are also summarized into daily resolutions. This data was produced using the Weather Research and Forecasting (WRF) model (Version 3.5). We downscaled both ERA-Interim historical reanalysis data (1979-2015) and both historical and projected runs from 2 GCM’s from the Coupled Model Inter-comparison Project 5 (CMIP5): GFDL-CM3 and NCAR-CCSM4 (historical run: 1970-2005 and RCP 8.5: 2006-2100)....

Details →

Usage examples

See 2 usage examples →

NOAA Analysis of Record for Calibration (AORC) Dataset

agricultureagricultureclimatedisaster responseenvironmentaltransportationweather

...

Details →

Usage examples

See 2 usage examples →

NOAA Global Forecast System (GFS) netCDF Formatted Data

agricultureclimatedisaster responseenvironmentalmeteorologicalweather

The Global Forecast System (GFS) is a weather forecast model produced by the National Centers for Environmental Prediction (NCEP). Dozens of atmospheric and land-soil variables are available through this dataset, from temperatures, winds, and precipitation to soil moisture and atmospheric ozone concentration. The GFS data files stored here can be immediately used for OAR/ARL’s NOAA-EPA Atmosphere-Chemistry Coupler Cloud (NACC-Cloud) tool, and are in a Network Common Data Form (netCDF), which is a very common format used across the scientific community. These particular GFS files contain a comprehensive number of global atmosphere/land variables at a relatively high spati...

Details →

Usage examples

See 2 usage examples →

NOAA Unified Forecast System Subseasonal to Seasonal Prototypes

agricultureclimatedisaster responseenvironmentalmeteorologicaloceansweather

The Unified Forecast System Subseasonal to Seasonal prototypes consist of reforecast data from the UFS atmosphere-ocean coupled model experimental prototype version 5, 6, 7, and 8 produced by the Medium Range and Subseasonal to Seasonal Application team of the UFS-R2O project. The UFS prototypes are the first dataset released to the broader weather community for analysis and feedback as part of the development of the next generation operational numerical weather prediction system from NWS. The datasets includes all the major weather variables for atmosphere, land, ocean, sea ice, and ocean wav...

Details →

Usage examples

See 2 usage examples →

Nighttime-Fire-Flare

anomaly detectionclassificationdisaster responseearth observationenvironmentalNASA SMD AIsatellite imagerysocioeconomicurban

Detection of nighttime combustion (fire and gas flaring) from daily top of atmosphere data from NASA's Black Marble VNP46A1 product using VIIRS Day/Night Band and VIIRS thermal bands.

Details →

Usage examples

See 2 usage examples →

OPERA Land Surface Disturbance Alert from Harmonized Landsat Sentinel-2 product (Version 1)

cogearth observationenvironmentalgloballandland coverland usesatellite imagery

The Observational Products for End-Users from Remote Sensing Analysis (OPERA) Land Surface Disturbance Alert from Harmonized Landsat Sentinel-2 (HLS) product Version 1 maps vegetation disturbance alerts that are derived from data collected by Landsat 8 and Landsat 9 Operational Land Imager (OLI) and Sentinel-2A, Sentinel-2B, and Sentinel-2C Multi-Spectral Instrument (MSI). A vegetation disturbance alert is detected at 30 meter (m) spatial resolution when there is an indicated decrease in vegetation cover within an HLS pixel. The Level-3 data product also provides additional information about more ...

Details →

Usage examples

See 2 usage examples →

OPERA Land Surface Disturbance Alert from Harmonized Landsat Sentinel-2 provisional product (Version 0)

cogearth observationenvironmentalgloballandland coverland use

The OPERA_L3_DIST-ALERT-HLS Version 0 data product was decommissioned on April 25, 2025. Users are encouraged to use the OPERA_L3_DIST-ALERT-HLS V1 data product which was released on March 14, 2024, and has achieved stage 1 validation.The Observational Products for End-Users from Remote Sensing Analysis (OPERA) Land Surface Disturbance Alert from Harmonized Landsat Sentinel-2 (HLS) provisional data product Version 0 maps vegetation disturbance alerts from data collected by Landsat 8 and Landsat 9 Operational Land Imager (OLI) and Sentinel-2A, Sentinel-2B, and Sentinel-2C Multi-Spectral Instrum...

Details →

Usage examples

See 2 usage examples →

RAPID NRT Flood Maps

agriculturedisaster responseearth observationenvironmentalwater

Near Real-time and archival data of High-resolution (10 m) flood inundation dataset over the Contiguous United States, developed based on the Sentinel-1 SAR imagery (2016-current) archive, using an automated Radar Produced Inundation Diary (RAPID) algorithm.

Details →

Usage examples

See 2 usage examples →

SatPM2.5

air qualityatmosphereenvironmentalhealthnetcdf

Fine particulate matter (PM2.5) concentrations are estimated using information from satellite-, simulation- and monitor-based sources. Aerosol optical depth from multiple satellites (MODIS, VIIRS, MISR, SeaWiFS, and VIIRS) and their respective retrievals (Dark Target, Deep Blue, MAIAC) is combined with simulation (GEOS-Chem) based upon their relative uncertainties as determined using ground-based sun photometer (AERONET) observations to produce geophysical estimates that explain most of the variance in ground-based PM2.5 measurements. A subsequent statistical fusion incorporates additional inf...

Details →

Usage examples

See 2 usage examples →

SeeFar V0

biodiversityclimatecoastalearth observationenvironmentalgeospatialglobalmachine learningmappingnatural resourcesatellite imagerysustainability

A collection of multi-resolution satellite images from both public and commercial satellites. The dataset is specifically curated for training geospatial foundation models.

Details →

Usage examples

See 2 usage examples →

Sentinel-1 SLC dataset for South and Southeast Asia, Taiwan, Korea and Japan

disaster responseearth observationenvironmentalgeospatialsatellite imagerysynthetic aperture radar

The S1 Single Look Complex (SLC) dataset contains Synthetic Aperture Radar (SAR) data in the C-Band wavelength. The SAR sensors are installed on a two-satellite (Sentinel-1A and Sentinel-1B) constellation orbiting the Earth with a combined revisit time of six days, operated by the European Space Agency. The S1 SLC data are a Level-1 product that collects radar amplitude and phase information in all-weather, day or night conditions, which is ideal for studying natural hazards and emergency response, land applications, oil spill monitoring, sea-ice conditions, and associated climate change effec...

Details →

Usage examples

See 2 usage examples →

Sub-Meter Canopy Tree Height of California in 2020 by CTrees.org

aerial imagerycogconservationdeep learningearth observationenvironmentalgeospatialimage processingland cover

Canopy Tree Height maps for California in 2020. Created using a deep learning model on very-high-resolution airborne imagery from the National Agriculture Imagery Program (NAIP) by United States Department of Agriculture (USDA).

Details →

Usage examples

See 2 usage examples →

Tropical Cyclone Precipitation, Infrared, Microwave, and Environmental Dataset (TC PRIMED)

atmosphereearth observationenvironmentalgeophysicsgeoscienceglobalmeteorologicalmodelnetcdfprecipitationsatellite imageryweather

The Tropical Cyclone Precipitation, Infrared, Microwave and Environmental Dataset (TC PRIMED) is a dataset centered around passive microwave observations of global tropical cyclones from low-Earth-orbiting satellites. TC PRIMED is a compilation of tropical cyclone data from various sources, including 1) tropical cyclone information from the National Oceanic and Atmospheric Administration (NOAA) National Weather Service National Hurricane Center (NHC) and Central Pacific Hurricane Center (CPHC) and the U.S. Department of Defense Joint Typhoon Warning Center, 2) low-Earth-orbiting satellite obse...

Details →

Usage examples

See 2 usage examples →

ARPA-E PERFORM Forecast data

energyenvironmentalgeospatialmodelsolar

The ARPA-E PERFORM Program is an ARPA-E funded program that aim to use time-coincident power and load seeks to develop innovative management systems that represent the relative delivery risk of each asset and balance the collective risk of all assets across the grid. A risk-driven paradigm allows operators to: (i) fully understand the true likelihood of maintaining a supply-demand balance and system reliability, (ii) optimally manage the system, and (iii) assess the true value of essential reliability services. This paradigm shift is critical for all power systems and is essential for grids wi...

Details →

Usage examples

See 1 usage example →

Analysis Ready Sentinel-1 Backscatter Imagery

agriculturecogdisaster responseearth observationenvironmentalgeospatialsatellite imagerystacsynthetic aperture radar

The Sentinel-1 mission is a constellation of C-band Synthetic Aperature Radar (SAR) satellites from the European Space Agency launched since 2014. These satellites collect observations of radar backscatter intensity day or night, regardless of the weather conditions, making them enormously valuable for environmental monitoring. These radar data have been processed from original Ground Range Detected (GRD) scenes into a Radiometrically Terrain Corrected, tiled product suitable for analysis. This product is available over the Contiguous United States (CONUS) since 2017 when Sentinel-1 data becam...

Details →

Usage examples

See 1 usage example →

Coupled Model Intercomparison Project Phase 5 (CMIP5) University of Wisconsin-Madison Probabilistic Downscaling Dataset

climatecoastaldisaster responseenvironmentalmeteorologicaloceanssustainabilitywaterweather

The University of Wisconsin Probabilistic Downscaling (UWPD) is a statistically downscaled dataset based on the Coupled Model Intercomparison Project Phase 5 (CMIP5) climate models. UWPD consists of three variables, daily precipitation and maximum and minimum temperature. The spatial resolution is 0.1°x0.1° degree resolution for the United States and southern Canada east of the Rocky Mountains.

The downscaling methodology is not deterministic. Instead, to properly capture unexplained variability and extreme events, the methodology predicts a spatially and temporally varying Probability Density Function (PDF) for each variable. Statistics such as the mean, me...

Details →

Usage examples

See 1 usage example →

East Coast Community Ocean Forecast System (ECCOFS)

coastalenvironmentalforecastmarineoceansweather

The East Coast Community Ocean Forecast System (ECCOFS) is a data assimilating ocean analysis and forecast system being developed by Rutgers University, the University of California Santa Cruz, Fathom Science Inc., and the National Ocean Service (NOS) of NOAA for transition to operations at NCEP in 2028. The ECCOFS domain spans the eastern seaboard of North America and Intra-Americas Seas from the Grand Banks of Newfoundland in the north to the mouth of the Orinoco River, Venezuela, in the south. ECCOFS will complement the existing WCOFS (West Coast Operational Forecast System) to achieve complete forecast coverage of U.S. territori...

Details →

Usage examples

See 1 usage example →

IGP Brick Kilns Bangladesh

air qualityenergyenvironmentalinfrastructure

This dataset includes detailed information about brick kilns, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

IGP Brick Kilns India

air qualityenergyenvironmentalinfrastructure

This dataset includes detailed information about brick kilns, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

IGP Brick Kilns Pakistan

air qualityenergyenvironmentalinfrastructure

This dataset includes detailed information about brick kilns, their locations, capacities, emissions, and other relevant attributes around the Pakistann Gangetic Plain.

Details →

Usage examples

See 1 usage example →

IGP Cement Plants

air qualityenvironmentalindustrialinfrastructure

This dataset includes detailed information about cement plants, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

IGP Paper and Pulp Plant

air qualityenvironmentalindustrialinfrastructure

This dataset includes detailed information about paper and pulp plants, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

IGP Power Generation Plant

air qualityenergyenvironmentalinfrastructure

This dataset includes detailed information about power generation plants, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

IGP Steel Plants

air qualityenvironmentalindustrialinfrastructure

This dataset includes detailed information about steel plants, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

IGP Waste Management Data

air qualityenvironmentalinfrastructuresustainability

This dataset includes detailed information about waste management sites, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

MODIS/Terra Calibrated Radiances 5-Min L1B Swath 500m

atmospheredatacenterearth observationenvironmentalglobalhdfmetadataopendaporbit

The MODIS/Terra Calibrated Radiances 5Min L1B Swath 500m data set contains calibrated and geolocated at-aperture radiances for 7 discrete bands located in the 0.45 to 2.20 micron region of the electromagnetic spectrum. These data are generated from the MODIS Level 1A scans of raw radiance and in the process converted to geophysical units of W/(m^2 um sr). Additional data are provided including quality flags, error estimates and calibration data.Visible, shortwave infrared, and near infrared measurements are only made during the daytime (except band 26), while radiances for the thermal infrared region (bands 20-25, 27-36) are measured continuously.Channels 1 and 2 have 250 m resolution, channels 3 through 7 have 500 m resolution. However, for the MODIS L1B 500 m product, ...

Details →

Usage examples

See 1 usage example →

Marginal Build Emissions Rates (MBERs) for Electricity

carbonclimatecsvelectricityenergyenergy modelingenvironmental

The Climate TRACE coalition has developed and maintains free global hourly Build Margin data, also known as MBERs, that are compliant with the Greenhouse Gas Protocol's Project Protocol electricity sector guidance, Guidelines for Grid-Connected Electricity Projects ("GHGP Guidelines").

Details →

Usage examples

See 1 usage example →

NOAA Global Surface Summary of Day

agricultureclimateenvironmentalnatural resourceregulatoryweather

Global Surface Summary of the Day is derived from The Integrated Surface Hourly (ISH) dataset. The ISH dataset includes global data obtained from the USAF Climatology Center, located in the Federal Climate Complex with NCDC. The latest daily summary data are normally available 1-2 days after the date-time of the observations used in the daily summaries. The online data files begin with 1929 and are at the time of this writing at the Version 8 software level. Over 9000 stations' data are typically available. The daily elements included in the dataset (as available from each station) are:
Mean t...

Details →

Usage examples

See 1 usage example →

NOAA HYSPLIT-compatible meteorological data archives

agricultureclimatedisaster responseenvironmentalmeteorologicalweather

The HYSPLIT model is a complete system for computing simple air parcel trajectories, as well as complex transport, dispersion, chemical transformation, and deposition simulations. HYSPLIT continues to be one of the most extensively used atmospheric transport and dispersion models in the atmospheric sciences community. A common application is a back trajectory analysis to determine the origin of air masses and establish source-receptor relationships. HYSPLIT has also been used in a variety of simulations describing the atmospheric transport, dispersion, and deposition of pollutants and hazardou...

Details →

Usage examples

See 1 usage example →

NOAA National Water Model Short-Range Forecast

agricultureagricultureclimatedisaster responseenvironmentaltransportationweather

The National Water Model (NWM) is a water resources model that simulates and forecasts water budget variables, including snowpack, evapotranspiration, soil moisture and streamflow, over the entire continental United States (CONUS). The model, launched in August 2016, is designed to improve the ability of NOAA to meet the needs of its stakeholders (forecasters, emergency managers, reservoir operators, first responders, recreationists, farmers, barge operators, and ecosystem and floodplain managers) by providing expanded accuracy, detail, and frequency of water information. It is operated by NOA...

Details →

Usage examples

See 1 usage example →

NOAA nClimGrid and Livneh Gridded Historical Climate Observation Thresholds

agricultureclimateenvironmentalmeteorologicalweather

Livneh and nClimGrid are gridded observed historical climatology data that were used in the LOCA2 and STAR-ESDM downscaling process of global climate models as part of the 5th National Climate Assessment. The original Livneh and nClimGrid daily temperature and precipitation observations have been converted to a series of decision-relevant thresholds as part of the (U.S. Climate Resilience Information System (CRIS)). These thresholds, such as days with extreme heat or precipitation, have been calculated to match the future projections from LOCA2 and STAR, also available in CRIS.

Details →

Usage examples

See 1 usage example →

NOAA's Observations for Global Workflow (GW) Test Cases

environmentalforecastweather

A collection of observations necessary to run National Weather Service’s (NWS) GW test cases. These may include incomplete or otherwise immature data or data formats that are subject to change as development proceeds.

  • dump_nr: non-restricted versions of observations from NWS Observation Processing (ObsProc) in Binary Universal Form for the Representation of meteorological data (BUFR) format
  • dump_ioda_nr: non-restricted observations converted to the Interface for Observation Data Access (IODA) format
  • experimental_obs: Additional atmospheric and marine observations
  • GEFS_ExtData: Assorted aerosol emission data
...

Details →

Usage examples

See 1 usage example →

NOAA/PMEL Ocean Climate Stations Moorings

climateenvironmentaloceansweather

The mission of the Ocean Climate Stations (OCS) Project is to make meteorological and oceanic measurements from autonomous platforms. Calibrated, quality-controlled, and well-documented climatological measurements are available on the OCS webpage and the OceanSITES Global Data Assembly Centers (GDACs), with near-realtime data available prior to release of the complete, downloaded datasets.

OCS measurements served through the Big Data Program come from OCS high-latitude moored buoys located in the Kuroshio Extension (32°N 145°E) and the Gulf of Alaska (50°N 145°W). Initiated in 2004 and 20...

Details →

Usage examples

See 1 usage example →

National Herbarium of Israel

biodiversitybiologyclimatedigital preservationenvironmentalimage processingimaginglife sciences

Our collection encompasses approximately one million vascular plant specimens from the Mediterranean and Middle East biodiversity hotspot, representing flora from Israel, Jordan, Hermon, Sinai, Egypt, the Caucasus, Arabia, North Africa, and throughout the Mediterranean basin. This scientifically significant repository includes published voucher specimens, original specimens used for "Flora Palaestina" illustrations, and critical references for the Israeli gene bank collections. The ongoing digitization process captures high-resolution images of each specimen while systematically inco...

Details →

Usage examples

See 1 usage example →

Orcasound - bioacoustic data for marine conservation

biodiversitybiologycoastalconservationdeep learningecosystemsenvironmentalgeospatiallabeledlife sciencesmachine learningmappingoceansopen source softwaresignal processing

Live-streamed and archived audio data (~2018-present) from underwater microphones (hydrophones) containing marine biological signals as well as ambient ocean noise. Hydrophone placement and passive acoustic monitoring effort prioritizes detection of orca sounds (calls, clicks, whistles) and potentially harmful noise. Geographic focus is on the US/Canada critical habitat of Southern Resident killer whales (northern CA to central BC) with initial focus on inland waters of WA. In addition to the raw lossy or lossless compressed data, we provide a growing archive of annotated bioacoustic bouts.

Details →

Usage examples

See 1 usage example →

Safecast

air qualityclimateenvironmentalgeospatialradiation

An ongoing collection of radiation and air quality measurements taken by devices involved in the Safecast project.

Details →

Usage examples

See 1 usage example →

Sentinel-1 SLC dataset for Germany

disaster responseearth observationenvironmentalgeospatialsatellite imagerysustainabilitysynthetic aperture radar

The Sentinel1 Single Look Complex (SLC) unzipped dataset contains Synthetic Aperture Radar (SAR) data from the European Space Agency’s Sentinel-1 mission. Different from the zipped data provided by ESA, this dataset allows direct access to individual swaths required for a given study area, thus drastically minimizing the storage and downloading time requirements of a project. Since the data is stored on S3, users can utilize the boto3 library and s3 get_object method to read the entire content of the object into the memory for processing, without actually having to download it. The Sentinel-1 ...

Details →

Usage examples

See 1 usage example →

(EXPERIMENTAL) NOAA FourCastNet Global Forecast System (FourCastNetGFS) (EXPERIMENTAL)

agricultureclimatedisaster responseenvironmentalmeteorologicalweather

The FourCastNet Global Forecast System (FourCastNetGFS) is an experimental system set up by the National Centers for Environmental Prediction (NCEP) to produce medium range global forecasts. The model runs on a 0.25 degree latitude-longitude grid (about 28 km) and 13 pressure levels. The model produces forecasts 4 times a day at 00Z, 06Z, 12Z and 18Z cycles. Major atmospheric and surface fields including temperature, wind components, geopotential height, relative humidity and 2 meter temperature and 10 meter winds are available. The products are 6 hourly forecasts up to 10 days. The data format is ...

Details →

CarbonPDF

csvenvironmentalindustryinformation retrievalproduct comparison

A carbon question-answering (QA) dataset specifically designed to facilitate the extraction and analysis of data from real-world carbon reports of computing products. The dataset features annotated metadata, a variety of numerical reasoning tasks, and structured derivations to ensure accurate processing of fragmented and inconsistent information.

Details →

EMBER Modeling Files

air qualityenvironmentalregulatory

These data are being released to support states in wildfire Exceptional Events demonstrations. EPA released fire impacts modeling data for the 2023 fire season called the Expedited Modeling of Burn Events Results (EMBER) in December 2024. EPA has updated the EMBER Dataset Tool on the EPA website to include summary tables and graphics four new years of EMBER data: 2021, 2022, 2024, and 2025. As part of the effort to update the EMBER data, we are posting the full model input and output files publicly here to aid state, local, and Tribal air agencies in constructing any potential Exceptional Even...

Details →

EPA Dynamically Downscaled Ensemble (EDDE) Version 1

agricultureair qualityair temperatureatmosphereclimateclimate modelclimate projectionsCMIP5CMIP6ecosystemselevationenvironmentalEulerianeventsfloodsfluid dynamicsgeosciencegeospatialhdf5healthHPChydrologyinfrastructureland coverland usemeteorologicalmodelnear-surface air temperaturenear-surface relative humiditynear-surface specific humiditynetcdfopen source softwarephysicspost-processingprecipitationradiationsimulationsuswaterweather

The data are a subset of the EPA Dynamically Downscaled Ensemble (EDDE), Version 1. EDDE is a collection of physics-based modeled data that represent 3D atmospheric conditions for historical and future periods under different scenarios. The EDDE Version 1 datasets cover the contiguous United States at a horizontal grid spacing of 36 kilometers at hourly increments. EDDE Version 1 includes simulations that have been dynamically downscaled from multiple global climate models (GCMs) under both mid- and high-emission scenarios from the Fifth Coupled Model Intercomparison Project (CMIP5) using the...

Details →

EPA Dynamically Downscaled Ensemble (EDDE) Version 2

agricultureair qualityair temperatureatmosphereclimateclimate modelclimate projectionsCMIP5CMIP6ecosystemselevationenvironmentalEulerianeventsfloodsfluid dynamicsgeosciencegeospatialhdf5healthHPChydrologyinfrastructureland coverland usemeteorologicalmodelnear-surface air temperaturenear-surface relative humiditynear-surface specific humiditynetcdfopen source softwarephysicspost-processingprecipitationradiationsimulationsuswaterweather

The data are a subset of the EPA Dynamically Downscaled Ensemble (EDDE), Version 2. EDDE is a collection of physics-based modeled data that represent 3D atmospheric conditions for historical and future periods under different scenarios. The EDDE Version 2 datasets cover the contiguous United States at a horizontal grid spacing of 12 kilometers at hourly increments. EDDE Version 2 will include simulations that have been dynamically downscaled from multiple global climate models (GCMs) under multiple emission scenarios from the Sixth Coupled Model Intercomparison Project (CMIP6) using the Weath...

Details →

EPA Hourly Prognostic Meteorological Data

air qualityenvironmentalmeteorologicalregulatoryweather

The data are hourly outputs from the Weather Research and Forecasting (WRF) model generated by the EPA's Office of State Air Partnerships (OSAP), Air Quality Assessment Division, Air Quality Modeling Branch. These data were generated at a 12-km resolution over the Continental United States (12US), beginning for the year 2021 and continuing annually through 2023. These files are intended for use in a broad range of air quality applications, but specifically may be used in dispersion modeling applications that would benefit from the use of the Mesoscale Model Interface (MMIF) tool (https:/...

Details →

EPA Risk-Screening Environmental Indicators

environmental

Detailed air model results from EPA’s Risk-Screening Environmental Indicators (RSEI) model.

Details →

Grid Algorithms and Data Analytics Library (GADAL)

energyenvironmentalmodelsustainability

The aim of this project is to create an easy-to-use platform where various types of analytics can be performed on a wide range of electrical grid datasets. The aim is to establish an open-source library of algorithms that universities, national labs and other developers can contribute to which can be used on both open-source and proprietary grid data to improve the analysis of electrical distribution systems for the grid modeling community. OEDI Systems Integration (SI) is a grid algorithms and data analytics API created to standardize how data is sent between different modules that are run as...

Details →

Gulfwide Avian Colony Monitoring Survey Photos

biologyconservationecosystemsenvironmentallabeledlife sciencesobject detection

For this project, The Water Institute (the Institute) and subcontractor Colibri Ecological Consulting, LLC (Colibri) utilized established methods and protocols capable of assessing changes of colonial waterbird populations and their important habitats within individual states and the broader northern Gulf of Mexico region. Data collection activities included: Aerial Photographic Nest Surveys: Implementation of fixed-wing aircraft surveys intended to assess waterbird colonies and document associated nesting within select portions of the northern Gulf of Mexico. Additional detail is provide...

Details →

High Resolution Downscaled Climate Data for Southeast Alaska

agricultureclimatecoastalearth observationenvironmentalsustainabilityweather

This dataset contains historical and projected dynamically downscaled climate data for the Southeast region of the State of Alaska at 1 and 4km spatial resolution and hourly temporal resolution. Select variables are also summarized into daily resolutions. This data was produced using the Weather Research and Forecasting (WRF) model (Version 4.0). We downscaled both Climate Forecast System Reanalysis (CFSR) historical reanalysis data (1980-2019) and both historical and projected runs from two GCM’s from the Coupled Model Inter-comparison Project 5 (CMIP5): GFDL-CM3 and NCAR-CCSM4 (historical ru...

Details →

ISERV

earth observationenvironmentalgeospatialsatellite imagery

ISS SERVIR Environmental Research and Visualization System (ISERV) was a fully-automated prototype camera aboard the International Space Station that was tasked to capture high-resolution Earth imagery of specific locations at 3-7 frames per second. In the course of its regular operations during 2013 and 2014, ISERV's camera acquired images that can be used primaliry in use is environmental and disaster management.

Details →

NOAA / NGA Satellite Computed Bathymetry Assessment-SCuBA

agricultureagriculturebathymetryclimatedisaster responseenvironmentaloceanstransportationweather

One of the National Geospatial-Intelligence Agency’s (NGA) and the National Oceanic and Atmospheric Administration’s (NOAA) missions is to ensure the safety of navigation on the seas by maintaining the most current information and the highest quality services for U.S. and global transport networks. To achieve this mission, we need accurate coastal bathymetry over diverse environmental conditions. The SCuBA program focused on providing critical information to improve existing bathymetry resources and techniques with two specific objectives. The first objective was to validate National Aeronautics and Space Administration’...

Details →

NOAA 3-D Surge and Tide Operational Forecast System for the Atlantic Basin (STOFS-3D-Atlantic)

climatecoastaldisaster responseenvironmentalglobalmarine navigationmeteorologicaloceanssustainabilitywaterweather

NOTICE - The Coast Survey Development Laboratory (CSDL) in NOAA/National Ocean Service (NOS)/Office of Coast Survey is upgrading the Surge and Tide Operational Forecast System (STOFS, formerly ESTOFS) to Version 2.1. A Service Change Notice (SCN) has been issued and can be found "HERE"

NOAA's Surge and Tide Operational Forecast System: Three-Dimensional Component for the Atlantic Basin (STOFS-3D-Atlantic). STOFS-3D-Atlantic runs daily (at 12 UTC) to provide users with 24-hour nowcasts (analyses of near present conditions) and up to 96-hour forecast guidance of water level conditions, and 2- and 3...

Details →

NOAA Cloud Optimized Zarr Reference Files (Kerchunk)

climatecoastaldisaster responseenvironmentalmeteorologicaloceanswaterweather

This repository contains references to datasets published to the NOAA Open Data Dissemination Program. These reference datasets serve as index files to the original data by mapping to the Zarr V2 specification. When multidimensional model output is read through zarr, data can be lazily loaded (i.e. retrieving only the data chunks needed for processing) and data reads can be scaled horizontally to optimize object storage read performance.

The process used to optimize the data is called kerchunk. RPS runs the workflow in their AWS cloud environment every time a new data notification is received from a relevant source data bucket.

These are the current datasets being cloud-optimized. Refer to those pages for file naming conventions and other information regarding the specific model implementations:
Details →

NOAA Global Data Assimilation (DA) Test Data

agricultureclimatedisaster responseenvironmentalmeteorologicalweather

The Unified Forecast System (UFS) is a community-based, coupled, comprehensive Earth Modeling System. It supports multiple applications with different forecast durations and spatial domains. The Global Data Assimilation System (GDAS) Application (App) is being used as the basis for uniting the Global Workflow and Global Forecast System (GFS) model with Joint Effort for Data assimilation Integration (JEDI) capabilities.

The National Centers for Environmental Prediction (NCEP) use GDAS to interpolate data from various observing systems and instruments onto a three-dimensional grid. GDAS obtain...

Details →

NOAA Global Real-Time Ocean Forecast System (Global RTOFS)

climatecoastaldisaster responseenvironmentalglobalmeteorologicaloceanswaterweather

NOAA is soliciting public comment on petential changes to the Real Time Ocean Forecast System (RTOFS) through March 27, 2024. Please see Public Notice at (https://www.weather.gov/media/notification/pdf_2023_24/pns24-12_rtofs_v2.4.0.pdf)

NOAA's Global Real-Time Ocean Forecast System (Global RTOFS) provides users with nowcasts (analyses of near present conditions) and forecast guidance up to eight days of ocean temperature and salinity, water velocity, sea surface elevation, sea ice coverage and sea ice thickness.

The Global Operational Real-Time Ocean Forecast System (Global RTOFS) is based on an eddy resolving 1/12° global HYCOM (HYbrid Coor...

Details →

NOAA Global Surge and Tide Operational Forecast System 2-D (STOFS-2D-Global)

climatecoastaldisaster responseenvironmentalglobalmeteorologicaloceanswaterweather

NOTICE - The Coast Survey Development Laboratory (CSDL) in NOAA/National Ocean Service (NOS)/Office of Coast Survey has upgraded the Surge and Tide Operational Forecast System (STOFS, formerly ESTOFS) to Version 2.1. A Service Change Notice (SCN) has been issued and can be found "HERE"

NOAA's Global Surge and Tide Operational Forecast System 2-D (STOFS-2D-Global) provides users with nowcasts (analyses of near present conditions) and forecast guidance of water level conditions for the entire globe. STOFS-2D-Global has been developed to serve the marine navigation, weather forecasting, an...

Details →

NOAA National Blend of Models (NBM) Parallel

agricultureclimatedisaster responseenvironmentalmeteorologicalweather

The National Blend of Models (NBM) is a nationally consistent and skillful suite of calibrated forecast guidance based on a blend of both NWS and non-NWS numerical weather prediction model data and post-processed model guidance. The goal of the NBM is to create a highly accurate, skillful and consistent starting point for the gridded forecast. This dataset contains data from the current parallel version of the NBM which is a test version, featuring many changes, that is a candidate to be implemented into operations following a careful vetting process.

Details →

NOAA Unified Forecast System (UFS) Hierarchical Testing Framework (HTF)

agricultureclimatedisaster responseenvironmentalmeteorologicaloceansweather

The "Unified Forecast System" (UFS) is a community-based, coupled, comprehensive Earth Modeling System. The Hierarchical Testing Framework (HTF) serves as a comprehensive toolkit designed to enhance the testing capabilities within UFS "repositories". It aims to standardize and simplify the testing process across various "UFS Weather Model" (WM) components and associated modules, aligning with the Hierarchical System Development (HSD) approach and NOAA baseline operational metrics.

The HTF provides a structured methodology for test case design and execution, which enh...

Details →

OSAP 2022 Modeling Platform

air qualityenvironmentalmeteorologicalregulatoryweather

The data are part of the 2022 Modeling Platform used to support regulatory actions and technical analyses conducted by the EPA's Office of State Air Partnerships (OSAP). Specifically, this data includes Weather Research and Forecasting Model (v4.4.2) conducted at a 12-km resolution over the Continental United States (12US). MCIP-processed files and wrfcamx-processed (12US1 domain) are also available as part of this dataset to assist in the use of emissions processing and photochemical modeling. These files may be used in downstream applications to generate emissions, photochemical mode...

Details →

VENUS L2A Cloud-Optimized GeoTIFFs

activity detectionagriculturecogdisaster responseearth observationenvironmentalgeospatialimage processingland covernatural resourcesatellite imagerystac

The Venµs science mission is a joint research mission undertaken by CNES and ISA, the Israel Space Agency. It aims to demonstrate the effectiveness of high-resolution multi-temporal observation optimised through Copernicus, the global environmental and security monitoring programme. Venµs was launched from the Centre Spatial Guyanais by a VEGA rocket, during the night from 2017, August 1st to 2nd. Thanks to its multispectral camera (12 spectral bands in the visible and near-infrared ranges, with spectral characteristics provided here), it acquires imagery every 1-2 days over 100+ areas at...

Details →

Digital Earth Pacific Mangroves Extent and Density

climateearth observationenvironmentalgeosciencegeospatial

Pacific Mangroves beta version product is an extension of the Global Mangrove Watch (GMV v3, 2020). which shows the extent of mangrove ecosystems across Pacific Island Countries and Territories (PICTs). The changes in mangroves extent was further classified into three categories of closed (high-density), open (lower density) and non-mangrove. This was used as the baseline training layer where mangrove categories between 2016 and 2022 were analysed.

Details →

Usage examples

See 2 usage examples →

Digital Earth Pacific Water Observatins from Space (WOfS)

earth observationenvironmentalgeosciencegeospatialwater

Water Observations from Space (WOfS) beta version product for Water Observations from Space (WOfS) is an annual summary of the temporal and spatial extent of surface water over landscapes. In essence, this highlights where water is usually or where it is rarely. The results are visualised to compare points in time spanning over a year, a season or multiple years. The dataset extends back historically to 2013.

Details →

Usage examples

See 2 usage examples →

Pacific Coastlines Change

coastalearth observationenvironmentalgeosciencegeospatial

Pacific Coastlines beta version product includes coastline change detection since the year 2000 for Pacific Island Country and Territories (PICTs). This product will provide ongoing monitoring of coastline change detection. This provides insights into processes including erosion (where landmass area decreases) and accretion or deposition (where landmass area increases).

Details →

Usage examples

See 2 usage examples →

Sentinel-1 Mean and Median Annual Mosaic

climateearth observationenvironmentalgeosciencegeospatial

Sentinel-1 carries a Synthetic Aperture RADAR (SAR) that operates on the C-band. This platform offers SAR data day and night and in all-weather conditions.

Details →

Usage examples

See 2 usage examples →

IGP Coal Plant

air qualityenergyenvironmentalinfrastructure

This dataset includes detailed information about coal power plants, their locations, capacities, emissions, and other relevant attributes around the Indian Gangetic Plain.

Details →

Usage examples

See 1 usage example →

Ocean Biodiversity Information System (OBIS) species occurrence data

biodiversitycoastalconservationecosystemsenvironmentalgeospatiallife sciencesoceanswater

The Ocean Biodiversity Information System (OBIS) was founded in 2000 under the Census of Marine Life. It is now a programme component of the International Oceanographic Data and Information Exchange (IODE) programme of the Intergovernmental Oceanographic Commission (IOC) of UNESCO. OBIS aims to be the most comprehensive data and information gateway on the diversity, distribution and abundance of marine life to support its Member States in achieving a healthy and resilient ocean ecosystem. The OBIS network consists of over 30 regional and thematic nodes, and provides access to more than 5,000 d...

Details →

Usage examples

See 1 usage example →

AI Weather Prediction (AIWP) Model Reforecasts

environmentalmeteorologicalweather



This is an archive of pure AI-based weather prediction reforecasts produced collaboratively between the Cooperative Institute for Research in the Atmosphere (CIRA) and the NOAA Global Systems Laboratory (NOAA-GSL).

Currently, FourCastNetv2-small, Pangu-Weather, and GraphCast are included, with more models to come. Each of these models has been initialized with both NOAA GFS (directories with no extension) and ECMWF IFS initial conditions (directories ending in "_IFS"). The datasets are updated with near-real-time data twice per day (00Z and 12Z initializations).

FourCastNetv2-small and Pangu-Weather are available from 10/2020 to present...

Details →

Clay v1.5 NAIP-2

aerial imageryagricultureenvironmentalland usenatural resource

National Agriculture Imagery Program (NAIP) dataset providing high-resolution aerial imagery for agricultural monitoring, land use analysis, and natural resource management.

Details →

Clay v1.5 Sentinel-2

agricultureearth observationenvironmentalland usesatellite imagery

Sentinel-2 satellite imagery dataset providing high-resolution optical data for land monitoring, agriculture, and environmental applications.

Details →