SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers258,164 code links4,818 tasks

Papers

Showing 5160 of 658356 papers

TitleStatusHype
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V TrustworthinessCode11
Qwen2.5-VL Technical ReportCode11
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precisionCode11
KAN: Kolmogorov-Arnold NetworksCode11
Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language ModelsCode11
ROMAS: A Role-Based Multi-Agent System for Database monitoring and PlanningCode11
Agent S: An Open Agentic Framework that Uses Computers Like a HumanCode11
The AI Scientist: Towards Fully Automated Open-Ended Scientific DiscoveryCode11
WebLLM: A High-Performance In-Browser LLM Inference EngineCode11
Introduction to Reinforcement LearningCode11
Show:102550
← PrevPage 6 of 65836Next →