SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 4926-4950 of 661,570 papers

| Title | Status | Hype |
| Bolmo: Byteifying the Next Generation of Language Models | | 2 |
| How to Correctly Report LLM-as-a-Judge Evaluations | | 2 |
| Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems | | 2 |
| InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models | | 2 |
| MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE | | 2 |
| ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation | | 2 |
| Learning to Continually Learn via Meta-learning Agentic Memory Designs | | 2 |
| RAP: 3D Rasterization Augmented End-to-End Planning | | 2 |
| RealPDEBench: A Benchmark for Complex Physical Systems with Real-World Data | | 2 |
| Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics | | 2 |
| Learning a Generative Meta-Model of LLM Activations | | 2 |
| compar:IA: The French Government's LLM arena to collect French-language human prompts and preference data | | 2 |
| FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation | | 2 |
| Context Forcing: Consistent Autoregressive Video Generation with Long Context | | 2 |
| EEG Foundation Models: Progresses, Benchmarking, and Open Problems | | 2 |
| Sparse Video Generation Propels Real-World Beyond-the-View Vision-Language Navigation | | 2 |
| Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration? | | 2 |
| Rethinking the Trust Region in LLM Reinforcement Learning | | 2 |
| Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis | | 2 |
| Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory | | 2 |
| Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation | | 2 |
| Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models | | 2 |
| A Survey on Efficient Vision-Language-Action Models | | 2 |
| SERA: Soft-Verified Efficient Repository Agents | | 2 |
| RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents | | 2 |
Page 198 of 26,463