SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 326350 of 659983 papers

TitleStatusHype
DoMINO: A Decomposable Multi-scale Iterative Neural Operator for Modeling Large Scale Engineering SimulationsCode7
Rethinking the Sample Relations for Few-Shot ClassificationCode7
Kimi k1.5: Scaling Reinforcement Learning with LLMsCode7
EvoGP: A GPU-accelerated Framework for Tree-based Genetic ProgrammingCode7
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language ModelsCode7
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented GenerationCode7
Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and BeyondCode7
FoundationStereo: Zero-Shot Stereo MatchingCode7
MiniMax-01: Scaling Foundation Models with Lightning AttentionCode7
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep ThinkingCode7
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-SlidesCode7
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech InteractionCode7
Simulating 500 million years of evolution with a language modelCode7
Revisiting PCA for time series reduction in temporal dimensionCode7
Efficient MedSAMs: Segment Anything in Medical Images on LaptopCode7
Align Anything: Training All-Modality Models to Follow Instructions with Language FeedbackCode7
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio SynthesisCode7
3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian SplattingCode7
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction PriorsCode7
AniSora: Exploring the Frontiers of Animation Video Generation in the Sora EraCode7
Byte Latent Transformer: Patches Scale Better Than TokensCode7
A Library for Learning Neural OperatorsCode7
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning SystemsCode7
Large Concept Models: Language Modeling in a Sentence Representation SpaceCode7
Flow Matching Guide and CodeCode7
Show:102550
← PrevPage 14 of 26400Next →