SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 40214030 of 177340 papers

TitleStatusHype
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network FabricsCode3
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image GenerationCode3
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion PriorsCode3
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning TasksCode3
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-DistillationCode3
Model-based Asynchronous Hyperparameter and Neural Architecture SearchCode3
ContextCite: Attributing Model Generation to ContextCode3
Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials ScienceCode3
Language Model InversionCode3
Evalverse: Unified and Accessible Library for Large Language Model EvaluationCode3
Show:102550
← PrevPage 403 of 17734Next →