SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 226250 of 659983 papers

TitleStatusHype
Fine-mixing: Mitigating Backdoors in Fine-tuned Language ModelsCode8
DocLayNet: A Large Human-Annotated Dataset for Document-Layout AnalysisCode8
Attention Residuals7
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning7
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem7
Pretraining Large Language Models with NVFP47
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning7
dLLM: Simple Diffusion Language Modeling7
SAM 3D Body: Robust Full-Body Human Mesh Recovery7
Qwen3-ASR Technical Report7
Advancing Open-source World Models7
Skywork-R1V3 Technical ReportCode7
Is Diversity All You Need for Scalable Robotic Manipulation?Code7
EvoAgentX: An Automated Framework for Evolving Agentic WorkflowsCode7
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement LearningCode7
OmniGen2: Exploration to Advanced Multimodal GenerationCode7
From Bytes to Ideas: Language Modeling with Autoregressive U-NetsCode7
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning AttentionCode7
AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task SolvingCode7
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and PlanningCode7
ComfyUI-R1: Exploring Reasoning Models for Workflow GenerationCode7
Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language ModelCode7
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling LibraryCode7
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow DevelopmentCode7
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
Show:102550
← PrevPage 10 of 26400Next →