SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1726-1750 of 177339 papers

Title | Status | Hype
Natural Language Generation | Code | 4
Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Code | 4
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents | Code | 4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Code | 4
3D-aware Conditional Image Synthesis | Code | 4
NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning | Code | 4
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | Code | 4
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark | Code | 4
Pen and Paper Exercises in Machine Learning | Code | 4
RewardBench: Evaluating Reward Models for Language Modeling | Code | 4
Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model | Code | 4
Taming Rectified Flow for Inversion and Editing | Code | 4
A Foundation Model for Zero-shot Logical Query Reasoning | Code | 4
DoRA: Weight-Decomposed Low-Rank Adaptation | Code | 4
Blind Image Deblurring with Unknown Kernel Size and Substantial Noise | Code | 4
Human Motion Diffusion Model | Code | 4
Fast Inference of Mixture-of-Experts Language Models with Offloading | Code | 4
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model | Code | 4
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation | Code | 4
TerraTorch: The Geospatial Foundation Models Toolkit | Code | 4
Video-R1: Reinforcing Video Reasoning in MLLMs | Code | 4
SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement | Code | 4
SpatialTrackerV2: 3D Point Tracking Made Easy | Code | 4
Proactive Detection of Voice Cloning with Localized Watermarking | Code | 4
Eliciting Latent Predictions from Transformers with the Tuned Lens | Code | 4