SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 19762000 of 661570 papers

TitleStatusHype
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and ReasoningCode4
Rethinking Inductive Biases for Surface Normal EstimationCode4
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image AnimationCode4
InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and WriteCode4
Long-form factuality in large language modelsCode4
Molecular-driven Foundation Model for Oncologic PathologyCode4
Natural Language GenerationCode4
Medical SAM 2: Segment medical images as video via Segment Anything Model 2Code4
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning AgentsCode4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language ModelingCode4
3D-aware Conditional Image SynthesisCode4
NeuPAN: Direct Point Robot Navigation with End-to-End Model-based LearningCode4
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One DayCode4
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding BenchmarkCode4
Pen and Paper Exercises in Machine LearningCode4
RewardBench: Evaluating Reward Models for Language ModelingCode4
Zero-Shot Image Restoration Using Denoising Diffusion Null-Space ModelCode4
Taming Rectified Flow for Inversion and EditingCode4
A Foundation Model for Zero-shot Logical Query ReasoningCode4
DoRA: Weight-Decomposed Low-Rank AdaptationCode4
Blind Image Deblurring with Unknown Kernel Size and Substantial NoiseCode4
Human Motion Diffusion ModelCode4
Fast Inference of Mixture-of-Experts Language Models with OffloadingCode4
Zero123++: a Single Image to Consistent Multi-view Diffusion Base ModelCode4
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-DistillationCode4
Show:102550
← PrevPage 80 of 26463Next →