SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 3811-3820 of 177340 papers

Title | Status | Hype
Neural Ordinary Differential Equations | Code | 3
LEADS: Lightweight Embedded Assisted Driving System | Code | 3
Fine-Tuning Language Models with Just Forward Passes | Code | 3
USB: A Unified Semi-supervised Learning Benchmark for Classification | Code | 3
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks | Code | 3
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning | Code | 3
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities | Code | 3
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives | Code | 3
ResumeFlow: An LLM-facilitated Pipeline for Personalized Resume Generation and Refinement | Code | 3
SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification | Code | 3
Page 382 of 17734