The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1726–1750 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
Natural Language Generation	Feb 20, 2025	Text Generation	CodeCode Available	4	5
Medical SAM 2: Segment medical images as video via Segment Anything Model 2	Aug 1, 2024	Image SegmentationInteractive Segmentation	CodeCode Available	4	5
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents	Jun 23, 2025	Information RetrievalRetrieval	CodeCode Available	4	5
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Jun 11, 2024	4kLanguage Modeling	CodeCode Available	4	5
3D-aware Conditional Image Synthesis	Feb 16, 2023	Image Generation	CodeCode Available	4	5
NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning	Mar 11, 2024	Collision AvoidanceMotion Generation	CodeCode Available	4	5
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day	Jun 1, 2023	Image ClassificationInstruction Following	CodeCode Available	4	5
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark	Sep 4, 2024	Optical Character Recognition (OCR)	CodeCode Available	4	5
Pen and Paper Exercises in Machine Learning	Jun 27, 2022	BIG-bench Machine Learning	CodeCode Available	4	5
RewardBench: Evaluating Reward Models for Language Modeling	Mar 20, 2024	Instruction FollowingLanguage Modeling	CodeCode Available	4	5
Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model	Dec 1, 2022	Colorizationcompressed sensing	CodeCode Available	4	5
Taming Rectified Flow for Inversion and Editing	Nov 7, 2024	Image GenerationText-to-Image Generation	CodeCode Available	4	5
A Foundation Model for Zero-shot Logical Query Reasoning	Apr 10, 2024	Complex Query AnsweringKnowledge Graph Completion	CodeCode Available	4	5
DoRA: Weight-Decomposed Low-Rank Adaptation	Feb 14, 2024	parameter-efficient fine-tuning	CodeCode Available	4	5
Blind Image Deblurring with Unknown Kernel Size and Substantial Noise	Aug 18, 2022	Blind Image DeblurringDeblurring	CodeCode Available	4	5
Human Motion Diffusion Model	Sep 29, 2022	3D Generationmodel	CodeCode Available	4	5
Fast Inference of Mixture-of-Experts Language Models with Offloading	Dec 28, 2023	Mixture-of-ExpertsQuantization	CodeCode Available	4	5
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model	Oct 23, 2023		CodeCode Available	4	5
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation	Feb 16, 2024	Knowledge DistillationQuantization	CodeCode Available	4	5
TerraTorch: The Geospatial Foundation Models Toolkit	Mar 26, 2025	BenchmarkingDecoder	CodeCode Available	4	5
Video-R1: Reinforcing Video Reasoning in MLLMs	Mar 27, 2025	MVBenchReinforcement Learning (RL)	CodeCode Available	4	5
SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement	Jun 9, 2025	Music Generation	CodeCode Available	4	5
SpatialTrackerV2: 3D Point Tracking Made Easy	Jul 16, 2025	3D ReconstructionCamera Pose Estimation	CodeCode Available	4	5
Proactive Detection of Voice Cloning with Localized Watermarking	Jan 30, 2024	Voice Cloning	CodeCode Available	4	5
Eliciting Latent Predictions from Transformers with the Tuned Lens	Mar 14, 2023	Language Modelling	CodeCode Available	4	5