SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 10911100 of 661570 papers

TitleStatusHype
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and MaintenanceCode5
DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZCode5
Noisereduce: Domain General Noise Reduction for Time Series SignalsCode5
Evaluating Real-World Robot Manipulation Policies in SimulationCode5
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction ModelCode5
Orbit: A Unified Simulation Framework for Interactive Robot Learning EnvironmentsCode5
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language ModelsCode5
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-InstructCode5
Break the Sequential Dependency of LLM Inference Using Lookahead DecodingCode5
Allegro: Open the Black Box of Commercial-Level Video Generation ModelCode5
Show:102550
← PrevPage 110 of 66157Next →