Usage examples for all datasets listed in the Registry of Open Data on AWS tagged with protein.
Tutorials
Tools & Applications
-
SARS-CoV-2 spike RBD with N501Y mutation bound to human ACE2 (953.7 µs) by The Chodera lab at the Memorial Sloan Kettering Cancer Center
-
SARS-CoV-2 COVID Moonshot absolute free energy calculations by The Voelz lab at Temple University
-
SARS-CoV-2 main viral protease (Mpro, 3CLPro, nsp5) dimer simulations: A 2.9 ms dataset of the SARS-CoV-2 main viral protease (apo, dimer) in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 main viral protease (Mpro, 3CLPro, nsp5) monomer simulations: A 2.6 ms equilibrium dataset of the SARS-CoV-2 main viral protease (apo, monomer) by The Chodera lab at MSKCC
-
SARS-CoV-2 main viral protease (Mpro, 3CLPro, nsp5) monomer simulations: A 6.4 ms dataset of the SARS-CoV-2 main viral protease (apo, monomer) in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 nsp10 dataset: A 6.1 ms dataset of the SARS-CoV-2 nsp10 protein in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 nsp3 macrodomain dataset: An 11 ms dataset of the SARS-CoV-2 nsp3 macrodomain in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 nsp3 pl2pro domain dataset: An 731 µs dataset of the SARS-CoV-2 nsp3 pl2pro domain in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 nsp7 simulations: A 3.7 ms dataset of the SARS-CoV-2 nsp7 protein in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 nsp8 simulations: A 1.8 ms dataset of the SARS-CoV-2 nsp8 protein in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 nsp9 simulations: A 9 ms dataset of the SARS-CoV-2 nsp9 protein in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 RNA polymerase (nsp12, RdRP) dataset: A 3.4 ms dataset of the SARS-CoV-2 nsp12 protein in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 spike protein dataset: A 1.2 ms dataset of the SARS-CoV-2 spike protein in search of cryptic pockets by The Bowman lab at Washington University in St. Louis
-
SARS-CoV-2 spike RBD (with glycosylation) (1.8 ms) by The Chodera lab at the Memorial Sloan Kettering Cancer Center
-
SARS-CoV-2 spike RBD (without glycosylation) (1.9 ms) by The Chodera lab at the Memorial Sloan Kettering Cancer Center
-
SARS-CoV-2 spike RBD bound to human ACE2 receptor (173.8 us): Wild-type and mutant simulations by The Chodera lab at the Memorial Sloan Kettering Cancer Center
-
SARS-CoV-2 spike RBD bound to monoclonal antibody S2H97 (623.7 us) by The Chodera lab at the Memorial Sloan Kettering Cancer Center
-
SARS-CoV-2 spike RBD bound to monoclonal antibody S309 (1.1 ms) by The Chodera lab at the Memorial Sloan Kettering Cancer Center
-
SARS-CoV-2 spike RBD with P337A mutation bound to monoclonal antibody S309 (907.0 µs) by The Chodera lab at the Memorial Sloan Kettering Cancer Center
-
SARS-CoV-2 spike RBD with P337L mutation bound to monoclonal antibody S309 (923.2 µs) by The Chodera lab at the Memorial Sloan Kettering Cancer Center
Publications
-
Circulating SARS-CoV-2 spike N439K variants maintain fitness while evading antibody-mediated immunity by Emma C. Thomson, Laura E, Rosen, James G. Shepherd, et al.
-
SARS-CoV-2 RBD antibodies that maximize breadth and resistance to escape by Tyler N. Starr, Nadine Czudnochowski, Zhuoming Liu, et al.
-
SARS-CoV-2 Simulations Go Exascale to Capture Spike Opening and Reveal Cryptic Pockets Across the Proteome by Maxwell I. Zimmerman, Justin R. Porter, Michael D. Ward, Sukrit Singh, Neha Vithani, Artur Meller, Upasana L. Mallimadugula, Catherine E. Kuhn, Jonathan H. Borowsky, View ORCID ProfileRafal P. Wiewiora, Matthew F. D. Hurley, Aoife M Harbison, Carl A Fogarty, Joseph E. Coffland, Elisa Fadda, Vincent A. Voelz, John D. Chodera, Gregory R. Bowman
Tutorials
Tools & Applications
Publications
Tutorials
Tools & Applications
Publications
Tutorials
Tools & Applications
Publications
-
ColabFold: Making protein folding accessible to all by Mirdita M, Schütze K, Moriwaki Y, Heo L, Ovchinnikov S and Steinegger M
-
Fast and accurate protein structure search with Foldseek by van Kempen M, Kim S, Tumescheit C, Mirdita M, Lee J, Gilchrist CLM, et al.
-
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets by Steinegger M and Söding J
Tutorials
Tools & Applications
Publications
-
A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3 by Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM
-
Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift by Cingolani P, Patel VM, Coon M, Nguyen T, Land SJ, Ruden DM, Lu X
Tutorials
Publications
-
BUSCO - assessing genomic data quality and beyond. by Mosè Manni, Matthew R. Berkeley, Mathieu Seppey, Evgeny M. Zdobnov
-
BUSCO Update - Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. by Mosè Manni, Matthew R Berkeley, Mathieu Seppey, Felipe A Simão, Evgeny M Zdobnov
-
OrthoDB and BUSCO update - annotation of orthologs with wider sampling of genomes. by Fredrik Tegenfeldt, Dmitry Kuznetsov, Mosè Manni, Matthew Berkeley, Evgeny M Zdobnov, Evgenia V Kriventseva
Tools & Applications
Publications
-
BLAST+: Architecture and Applications by Christiam Camacho 1 , George Coulouris, Vahram Avagyan, Ning Ma, Jason Papadopoulos, Kevin Bealer, Thomas L Madden
-
Gapped BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs by S F Altschul, T L Madden, A A Schäffer, J Zhang, Z Zhang, W Miller, D J Lipman
Tutorials
Publications
-
CELL-Diff: Unified diffusion modeling for protein sequences and microscopy images by Zheng Dihan, Bo Huang
-
SubCell: Vision foundation models for microscopy capture single-cell biology by Ankit Gupta, Zoe Wefers, Konstantin Kahnert, Jan N Hansen, William D. Leineweber, Anthony Cesnik, Dan Lu, Ulrika Axelsson, Frederic Ballllosera Navarro, Theofanis Karaletsos, Emma Lundberg
Tutorials
Tools & Applications
Publications
Tutorials
Publications
-
OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization by Ahdritz, Gustaf; Bouatta, Nazim; Kadyan, Sachin; Xia, Qinghui; Gerecke, William; O'Donnell, Timothy J, et al
-
OpenProteinSet: Training data for structural biology at scale by Ahdritz, Gustaf; Bouatta, Nazim; Kadyan, Sachin; Jarosch, Lukas; Berenberg, Daniel; Fisk, Ian, et al
Tutorials
Tools & Applications
Publications
Tutorials
Tools & Applications
Tutorials
Tools & Applications
Publications
If you want to add a dataset or usage example to this registry, please follow the instructions on the Registry of Open Data on AWS GitHub repository or tell us about your project.
Home
SocialGene RefSeq Databases
Tutorials
Amazon EC2
Tools & Applications
Publications