SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 3981-3990 of 177340 papers

Title | Status | Hype
AdaWorld: Learning Adaptable World Models with Latent Actions | Code | 3
SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Code | 3
cmaes : A Simple yet Practical Python Library for CMA-ES | Code | 3
Emu: Generative Pretraining in Multimodality | Code | 3
BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement | Code | 3
Automatically Interpreting Millions of Features in Large Language Models | Code | 3
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks | Code | 3
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache | Code | 3
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents | Code | 3
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents | Code | 3