This registry exists to help people discover and share datasets that are available via AWS resources. See recent additions and learn more about sharing data on AWS.
See all usage examples for datasets listed in this registry tagged with government records.
You are currently viewing a subset of data tagged with government records.
If you want to add a dataset or example of how to use a dataset to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository.
Unless specifically stated in the applicable dataset documentation, datasets available through the Registry of Open Data on AWS are not provided and maintained by AWS. Datasets are provided and maintained by a variety of third parties under a variety of licenses. Please check dataset licenses and related documentation to determine if a dataset may be used for your application.
If you have a project using a listed dataset, please tell us about it. We may work with you to feature your project in a blog post.
archivesgovernment recordsnaranational archives catalog
The National Archives Catalog dataset contains all of the descriptions; authority records; digitized and electronic records; and tags, transcriptions and comments for NARA’s archival holdings available in the Catalog.
climateclimate modelelectricityenergyenergy modelingenvironmentalgovernment recordsinfrastructureopen source softwareutilities
The Public Utility Data Liberation Project (PUDL) provides analysis-ready energy system data to climate advocates,
researchers, policymakers, and journalists.
PUDL is an open source data processing pipeline
that makes US energy data easier to access and use programmatically. Hundreds of gigabytes of valuable data
are published by US government agencies, but it's often difficult to work with.
PUDL takes the original spreadsheets, CSV files, and databases and turns them into a unified resource. This allows users to
spend more time on novel analysis and less time on data preparation.
This...
government records
The regulations.gov website allows users to view proposed rules and supporting documents for the federal rule-making process. In addition, users can post and view comments about those proposed rules. The site contains about 27 million pieces of text and binary data, but the API that provides access only allows a user to obtain one thousand items per hour. As a result, it would take approximately 3 years to download all the data. Mirrulations (MIRRor of regULATIONS.gov) is a system that uses a collection of donated API keys to create a mirror of the data. In addition, for each pdf in the da...