| Galactica: A Large Language Model for Science | Nov 16, 2022 | AnachronismsBias Detection | CodeCode Available | 4 | 5 |
| OpenProteinSet: Training data for structural biology at scale | Aug 10, 2023 | Protein DesignProtein Structure Prediction | CodeCode Available | 4 | 5 |
| Robust deep learning based protein sequence design using ProteinMPNN | Jun 4, 2022 | Deep LearningDrug Discovery | CodeCode Available | 3 | 5 |
| Highly accurate protein structure prediction with AlphaFold | Jul 15, 2021 | PredictionProtein Folding | CodeCode Available | 3 | 5 |
| MotifBench: A standardized protein design benchmark for motif-scaffolding problems | Feb 18, 2025 | Protein DesignProtein Structure Prediction | CodeCode Available | 2 | 5 |
| ProteinBERT: a universal deep-learning model of protein sequence and function | Feb 10, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| State-specific protein-ligand complex structure prediction with a multi-scale deep generative model | Sep 30, 2022 | BenchmarkingBlind Docking | CodeCode Available | 2 | 5 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine LearningClassification | CodeCode Available | 2 | 5 |
| FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours | Mar 2, 2022 | Protein Structure PredictionTranslation | CodeCode Available | 2 | 5 |
| MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models | Jun 24, 2025 | GPUProtein Folding | CodeCode Available | 2 | 5 |