SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 321330 of 177339 papers

TitleStatusHype
On the Vulnerability of LLM/VLM-Controlled RoboticsCode7
Grounding Image Matching in 3D with MASt3RCode7
PPTAgent: Generating and Evaluating Presentations Beyond Text-to-SlidesCode7
VACE: All-in-One Video Creation and EditingCode7
Revisiting PCA for time series reduction in temporal dimensionCode7
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image AnalysisCode7
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement LearningCode7
Flow-GRPO: Training Flow Matching Models via Online RLCode7
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language ReasoningCode7
Pre^3: Enabling Deterministic Pushdown Automata for Faster Structured LLM GenerationCode7
Show:102550
← PrevPage 33 of 17734Next →