archives internet natural language processing web archive
The End of Term Web Archive (EOT) captures and saves U.S. Government websites at the end of presidential administrations. The EOT has thus far preserved websites from administration changes in 2008, 2012, 2016, and 2020. Data from these web crawls have been made openly available in several formats in this dataset.
Every four years after a US Presidentaial Election
There are no restrictions on the use, access, and/or download of data from the End of Term Web Archive Dataset. We request that you cite the End of Term Web Archive project when using the data provided from this dataset.
Creative Commons Zero
See all datasets managed by End of Term Web Archive.
Mark Phillips firstname.lastname@example.org, Sawood Alam email@example.com
End of Term Web Archive Dataset was accessed on
DATE from https://registry.opendata.aws/eot-web-archive.
aws s3 ls --no-sign-request s3://eotarchive/
Edit this dataset entry on GitHub