SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2130 of 177340 papers

TitleStatusHype
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUsCode14
FLUX that Plays MusicCode14
Chatbot Arena: An Open Platform for Evaluating LLMs by Human PreferenceCode14
UI-TARS: Pioneering Automated GUI Interaction with Native AgentsCode14
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200kCode14
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria RerankingCode14
Autonomous Agents for Collaborative Task under Information AsymmetryCode14
Qwen3 Technical ReportCode14
Qwen2.5 Technical ReportCode13
Qwen2 Technical ReportCode13
Show:102550
← PrevPage 3 of 17734Next →