SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 23762400 of 661570 papers

TitleStatusHype
LLM-in-Sandbox Elicits General Agentic Intelligence3
HY3D-Bench: Generation of 3D Assets3
MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources3
SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation3
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing3
GEM: A Gym for Agentic LLMs3
LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination3
ReactMotion: Generating Reactive Listener Motions from Speaker Utterance3
LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels3
SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents3
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence3
pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation3
Grounding World Simulation Models in a Real-World Metropolis3
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction3
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering3
Geometry-Grounded Gaussian Splatting3
FireRed-OCR Technical Report3
Scaling Multiagent Systems with Process Rewards3
Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars3
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows3
PartUV: Part-Based UV Unwrapping of 3D Meshes3
Much Ado About Noising: Dispelling the Myths of Generative Robotic Control3
EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering3
RLP: Reinforcement as a Pretraining Objective3
SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis3
Show:102550
← PrevPage 96 of 26463Next →