| The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits | Feb 27, 2024 | All | CodeCode Available | 4 |
| All You May Need for VQA are Image Captions | May 4, 2022 | AllImage Captioning | CodeCode Available | 3 |
| FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language | Jun 26, 2025 | All | CodeCode Available | 3 |
| Emu3: Next-Token Prediction is All You Need | Sep 27, 2024 | All | CodeCode Available | 3 |
| Patches Are All You Need? | Jan 24, 2022 | AllImage Classification | CodeCode Available | 3 |
| One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion | Sep 10, 2024 | AllDeep Reinforcement Learning | CodeCode Available | 3 |
| NdLinear Is All You Need for Representation Learning | Mar 21, 2025 | AllRepresentation Learning | CodeCode Available | 3 |
| One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale | Mar 12, 2023 | AllImage Generation | CodeCode Available | 3 |
| RAP-SAM: Towards Real-Time All-Purpose Segment Anything | Jan 18, 2024 | AllDecoder | CodeCode Available | 3 |
| Class Symbolic Regression: Gotta Fit 'Em All | Dec 4, 2023 | AllDeep Reinforcement Learning | CodeCode Available | 3 |
| BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response | Jan 10, 2025 | AllBuilding change detection for remote sensing images | CodeCode Available | 3 |
| MoAI: Mixture of All Intelligence for Large Language and Vision Models | Mar 12, 2024 | AllMixture-of-Experts | CodeCode Available | 3 |
| Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment | Jan 23, 2024 | AllInstruction Following | CodeCode Available | 3 |
| Local All-Pair Correspondence for Point Tracking | Jul 22, 2024 | AllPoint Tracking | CodeCode Available | 3 |
| All-atom Diffusion Transformers: Unified generative modelling of molecules and materials | Mar 5, 2025 | AllUnconditional Crystal Generation | CodeCode Available | 3 |
| All are Worth Words: A ViT Backbone for Diffusion Models | Sep 25, 2022 | AllConditional Image Generation | CodeCode Available | 3 |
| MAD-ICP: It Is All About Matching Data -- Robust and Informed LiDAR Odometry | May 9, 2024 | All | CodeCode Available | 3 |
| GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks | Sep 20, 2024 | AllSinging Voice Synthesis | CodeCode Available | 3 |
| How Far Are We From AGI: Are LLMs All We Need? | May 16, 2024 | All | CodeCode Available | 3 |
| AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation | Mar 26, 2024 | 3D Multi-Person Mesh RecoveryAll | CodeCode Available | 3 |
| A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends | Oct 19, 2024 | AllImage Restoration | CodeCode Available | 3 |
| GraphStorm: all-in-one graph machine learning framework for industry applications | Jun 10, 2024 | Allgraph construction | CodeCode Available | 3 |
| AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One | Dec 10, 2023 | AllBenchmarking | CodeCode Available | 3 |
| Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 | Aug 9, 2024 | All | CodeCode Available | 3 |
| General-Reasoner: Advancing LLM Reasoning Across All Domains | May 20, 2025 | AllMath | CodeCode Available | 3 |