SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 2376 to 2400 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation | | 3 |
| LLM-in-Sandbox Elicits General Agentic Intelligence | | 3 |
| SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes | | 3 |
| Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks | | 3 |
| Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making | | 3 |
| Simulating the Visual World with Artificial Intelligence: A Roadmap | | 3 |
| Scaling Multiagent Systems with Process Rewards | | 3 |
| SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents | | 3 |
| HY3D-Bench: Generation of 3D Assets | | 3 |
| CL-bench: A Benchmark for Context Learning | | 3 |
| MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents | | 3 |
| Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars | | 3 |
| A Survey of Token Compression for Efficient Multimodal Large Language Models | | 3 |
| EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling | | 3 |
| LongCat-Flash-Thinking-2601 Technical Report | | 3 |
| DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion | | 3 |
| MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources | | 3 |
| Deep Delta Learning | | 3 |
| JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion | | 3 |
| TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows | | 3 |
| Geometry-Grounded Gaussian Splatting | | 3 |
| Self-Distillation Enables Continual Learning | | 3 |
| VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency | | 3 |
| AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security | | 3 |
| EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience | | 3 |