The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1951–2000 of 659,983 papers

Title	Date	Tasks	Status	Hype
PufferLib: Making Reinforcement Learning Libraries and Environments Play Nice	Jun 11, 2024	NetHack, reinforcement-learning	Code Available	4
Latent Swap Joint Diffusion for 2D Long-Form Latent Generation	Feb 7, 2025	Audio Generation, Denoising	Code Available	4
Elucidating the Design Space of Diffusion-Based Generative Models	Jun 1, 2022	Image Generation	Code Available	4
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models	Jun 9, 2022	Common Sense Reasoning, Math	Code Available	4
BitNet a4.8: 4-bit Activations for 1-bit LLMs	Nov 7, 2024	Quantization	Code Available	4
A Survey on Vision-Language-Action Models for Embodied AI	May 23, 2024	Image Captioning, Instruction Following	Code Available	4
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation	Jun 25, 2025	Code Generation, Denoising	Code Available	4
DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio	May 11, 2022	CPU, Data Augmentation	Code Available	4
Efficient Few-Shot Learning Without Prompts	Sep 22, 2022	Few-Shot Learning, Few-Shot Text Classification	Code Available	4
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents	May 23, 2024	Benchmarking	Code Available	4
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning	Nov 18, 2023	Transfer Learning	Code Available	4
Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering	Jan 12, 2024	3D Panoptic Segmentation, 3D Semantic Segmentation	Code Available	4
Generalizable and Animatable Gaussian Head Avatar	Oct 10, 2024		Code Available	4
Deep Industrial Image Anomaly Detection: A Survey	Jan 27, 2023	Anomaly Detection, Deep Learning	Code Available	4
PharMolixFM: All-Atom Foundation Models for Molecular Modeling and Generation	Mar 12, 2025	All, Denoising	Code Available	4
Transformer for Object Re-Identification: A Survey	Jan 13, 2024	Object, Survey	Code Available	4
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation	Mar 19, 2024	Translation, valid	Code Available	4
FLEX: FLEXible Federated Learning Framework	Apr 9, 2024	Federated Learning	Code Available	4
Deep Multi-Frame Filtering for Hearing Aids	May 14, 2023	Speech Enhancement	Code Available	4
Neuralangelo: High-Fidelity Neural Surface Reconstruction	Jun 5, 2023	Neural Rendering, Surface Reconstruction	Code Available	4
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond	May 6, 2024	Autonomous Driving, Decision Making	Code Available	4
PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor	Jan 1, 2024	Object	Code Available	4
Training Software Engineering Agents and Verifiers with SWE-Gym	Dec 30, 2024	Language Modeling, Language Modelling	Code Available	4
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control	Mar 7, 2025	Image Inpainting, Optical Flow Estimation	Code Available	4
pgmpy: A Python Toolkit for Bayesian Networks	Apr 17, 2023	Causal Discovery, Causal Identification	Code Available	4
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning	Dec 31, 2024	Benchmarking, Logical Reasoning	Code Available	4
Rethinking Inductive Biases for Surface Normal Estimation	Mar 1, 2024	Surface Normal Estimation	Code Available	4
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation	Jun 3, 2024	Image Animation, Video Generation	Code Available	4
InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write	Feb 8, 2024	Derendering	Code Available	4
Long-form factuality in large language models	Mar 27, 2024	16k, Form	Code Available	4
Molecular-driven Foundation Model for Oncologic Pathology	Jan 28, 2025	Benchmarking, Diagnostic	Code Available	4
Natural Language Generation	Feb 20, 2025	Text Generation	Code Available	4
Medical SAM 2: Segment medical images as video via Segment Anything Model 2	Aug 1, 2024	Image Segmentation, Interactive Segmentation	Code Available	4
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents	Jun 23, 2025	Information Retrieval, Retrieval	Code Available	4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling	Jun 11, 2024	4k, Language Modeling	Code Available	4
3D-aware Conditional Image Synthesis	Feb 16, 2023	Image Generation	Code Available	4
NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning	Mar 11, 2024	Collision Avoidance, Motion Generation	Code Available	4
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day	Jun 1, 2023	Image Classification, Instruction Following	Code Available	4
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark	Sep 4, 2024	Optical Character Recognition (OCR)	Code Available	4
Pen and Paper Exercises in Machine Learning	Jun 27, 2022	BIG-bench Machine Learning	Code Available	4
RewardBench: Evaluating Reward Models for Language Modeling	Mar 20, 2024	Instruction Following, Language Modeling	Code Available	4
Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model	Dec 1, 2022	Colorization, compressed sensing	Code Available	4
Taming Rectified Flow for Inversion and Editing	Nov 7, 2024	Image Generation, Text-to-Image Generation	Code Available	4
A Foundation Model for Zero-shot Logical Query Reasoning	Apr 10, 2024	Complex Query Answering, Knowledge Graph Completion	Code Available	4
DoRA: Weight-Decomposed Low-Rank Adaptation	Feb 14, 2024	parameter-efficient fine-tuning	Code Available	4
Blind Image Deblurring with Unknown Kernel Size and Substantial Noise	Aug 18, 2022	Blind Image Deblurring, Deblurring	Code Available	4
Human Motion Diffusion Model	Sep 29, 2022	3D Generation, model	Code Available	4
Fast Inference of Mixture-of-Experts Language Models with Offloading	Dec 28, 2023	Mixture-of-Experts, Quantization	Code Available	4
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model	Oct 23, 2023		Code Available	4
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation	Feb 16, 2024	Knowledge Distillation, Quantization	Code Available	4