life sciences machine learning structural biology
The “Cryo-EM SPA Workflow Records” contains all outputs of all processing steps involved in cryogenic electron microscopy (cryo-EM) single particle analysis (SPA), including both intermediate and final output data. The primary focus will be on data generated by RELION and CryoSPARC, two widely used software packages for :Cryo-EM SPA. These records will be archived systematically. To ensure the data remains reproducible while minimizing storage demands, large-sized files that can be regenerated will be excluded prior to registration. The aim is to retain only the essential metadata, processing parameters, and representative outputs that allow for full reconstruction of the analysis pipeline. This approach balances the need for long-term data preservation with practical considerations for storage capacity. Through this effort, we seek to enhance transparency and reproducibility in cryo-EM research by providing a structured and accessible record of the analysis process. Importantly, the use of this dataset is intended to facilitate the development of future AI algorithms in the field of Cryo-EM.
New releases are published on a rolling basis. Please contact the team via email for any questions.
CC0
https://github.com/KEK-SBRC-CryoEM/cryoem-spa-workflow-records
High Energy Accelerator Research Organization (KEK) Structural Biology Research Center (SBRC)
See all datasets managed by High Energy Accelerator Research Organization (KEK) Structural Biology Research Center (SBRC).
Cryo-EM SPA Workflow Records was accessed on DATE from https://registry.opendata.aws/cryoem-spa-workflow-records.