SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 17211730 of 177340 papers

TitleStatusHype
pgmpy: A Python Toolkit for Bayesian NetworksCode4
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and ReasoningCode4
Rethinking Inductive Biases for Surface Normal EstimationCode4
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image AnimationCode4
InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and WriteCode4
Long-form factuality in large language modelsCode4
Molecular-driven Foundation Model for Oncologic PathologyCode4
Natural Language GenerationCode4
Medical SAM 2: Segment medical images as video via Segment Anything Model 2Code4
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning AgentsCode4
Show:102550
← PrevPage 173 of 17734Next →