SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers | 258,216 code links | 4,818 tasks

Papers

Showing 101-110 of 658356 papers

Title | Status | Hype
WebWalker: Benchmarking LLMs in Web Traversal | Code | 11
SAM 2: Segment Anything in Images and Videos | Code | 11
Gymnasium: A Standard Interface for Reinforcement Learning Environments | Code | 11
PaperBanana: Automating Academic Illustration for AI Scientists | - | 9
Qwen3-TTS Technical Report | - | 9
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second | Code | 9
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models | Code | 9
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism | Code | 9
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Code | 9
Sapiens: Foundation for Human Vision Models | Code | 9