SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2551–2560 of 474,278 papers

Title | Status | Hype
CoMotion: Concurrent Multi-person 3D Motion | Code | 3
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning | Code | 3
Elucidating the Design Space of Multimodal Protein Language Models | Code | 3
REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites | Code | 3
DataDecide: How to Predict Best Pretraining Data with Small Experiments | Code | 3
REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers | Code | 3
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL | Code | 3
A Clean Slate for Offline Reinforcement Learning | Code | 3
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks | Code | 3
Evaluation Report on MCP Servers | Code | 3