SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers · 258,216 code links · 4,818 tasks

Papers

Showing 301-350 of 658,356 papers

Title | Status | Hype
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning | Code | 7
VACE: All-in-One Video Creation and Editing | Code | 7
HuixiangDou2: A Robustly Optimized GraphRAG Approach | Code | 7
AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems | Code | 7
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test | Code | 7
Visual-RFT: Visual Reinforcement Fine-Tuning | Code | 7
DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion | Code | 7
LLM Post-Training: A Deep Dive into Reasoning Large Language Models | Code | 7
Muon is Scalable for LLM Training | Code | 7
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning | Code | 7
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models | Code | 7
S*: Test Time Scaling for Code Generation | Code | 7
YOLOv12: Attention-Centric Real-Time Object Detectors | Code | 7
MoBA: Mixture of Block Attention for Long-Context LLMs | Code | 7
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction | Code | 7
pySLAM: An Open-Source, Modular, and Extensible Framework for SLAM | Code | 7
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model | Code | 7
Large Language Diffusion Models | Code | 7
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! | Code | 7
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile | Code | 7
Goku: Flow Based Video Generative Foundation Models | Code | 7
Fast Video Generation with Sliding Tile Attention | Code | 7
VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos | Code | 7
LLM-AutoDiff: Auto-Differentiate Any LLM Workflow | Code | 7
Training AI to be Loyal | Code | 7
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning | Code | 7
Rethinking the Sample Relations for Few-Shot Classification | Code | 7
DoMINO: A Decomposable Multi-scale Iterative Neural Operator for Modeling Large Scale Engineering Simulations | Code | 7
Kimi k1.5: Scaling Reinforcement Learning with LLMs | Code | 7
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models | Code | 7
EvoGP: A GPU-accelerated Framework for Tree-based Genetic Programming | Code | 7
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation | Code | 7
Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond | Code | 7
FoundationStereo: Zero-Shot Stereo Matching | Code | 7
MiniMax-01: Scaling Foundation Models with Lightning Attention | Code | 7
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking | Code | 7
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides | Code | 7
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction | Code | 7
Simulating 500 million years of evolution with a language model | Code | 7
Revisiting PCA for time series reduction in temporal dimension | Code | 7
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback | Code | 7
Efficient MedSAMs: Segment Anything in Medical Images on Laptop | Code | 7
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis | Code | 7
3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting | Code | 7
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors | Code | 7
A Library for Learning Neural Operators | Code | 7
Byte Latent Transformer: Patches Scale Better Than Tokens | Code | 7
AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era | Code | 7
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems | Code | 7
Large Concept Models: Language Modeling in a Sentence Representation Space | Code | 7