SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers258,216 code links4,818 tasks

Papers

Showing 91100 of 658356 papers

TitleStatusHype
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech SystemCode11
Scaling Synthetic Data Creation with 1,000,000,000 PersonasCode11
AutoDev: Automated AI-Driven DevelopmentCode11
SWE-agent: Agent-Computer Interfaces Enable Automated Software EngineeringCode11
HybridFlow: A Flexible and Efficient RLHF FrameworkCode11
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code IntelligenceCode11
Qwen2.5-Coder Technical ReportCode11
EAP4EMSIG -- Experiment Automation Pipeline for Event-Driven Microscopy to Smart Microfluidic Single-Cells AnalysisCode11
AgentScope: A Flexible yet Robust Multi-Agent PlatformCode11
NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive SecurityCode11
Show:102550
← PrevPage 10 of 65836Next →