The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2501–2550 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
A Common Interface for Automatic Differentiation	May 8, 2025		CodeCode Available	3	5
GameGen-X: Interactive Open-world Game Video Generation	Nov 1, 2024	Text-to-Video GenerationVideo Generation	CodeCode Available	3	5
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design	Jan 10, 2025	Image CaptioningLanguage Modeling	CodeCode Available	3	5
Measuring AI Ability to Complete Long Tasks	Mar 18, 2025	Logical Reasoning	CodeCode Available	3	5
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis	Jun 5, 2024	MambaMedical Image Analysis	CodeCode Available	3	5
InterpretML: A Unified Framework for Machine Learning Interpretability	Sep 19, 2019	Additive modelsBIG-bench Machine Learning	CodeCode Available	3	5
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models	Jun 10, 2025	3D Lane Detection3D Object Detection	CodeCode Available	3	5
AlphaMath Almost Zero: Process Supervision without Process	May 6, 2024	Mathematical ReasoningMath Word Problem Solving	CodeCode Available	3	5
Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action Models	May 29, 2025	Autonomous DrivingDiagnostic	CodeCode Available	3	5
Normalizing Flows are Capable Generative Models	Dec 9, 2024	Conditional Image GenerationDensity Estimation	CodeCode Available	3	5
3D Photography using Context-aware Layered Depth Inpainting	Apr 9, 2020	Novel View Synthesis	CodeCode Available	3	5
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models	Aug 19, 2024	image-classificationImage Classification	CodeCode Available	3	5
Focused Transformer: Contrastive Training for Context Scaling	Jul 6, 2023	Contrastive Learning	CodeCode Available	3	5
Deep Neural Networks for Encrypted Inference with TFHE	Feb 13, 2023	Privacy Preserving	CodeCode Available	3	5
Foundations of Large Language Models	Jan 16, 2025		CodeCode Available	3	5
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network	Nov 24, 2024	GPUMamba	CodeCode Available	3	5
Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography	Mar 26, 2024	Anomaly DetectionLarge Language Model	CodeCode Available	3	5
EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation	May 11, 2024	Computational EfficiencyDecoder	CodeCode Available	3	5
Expanding Language-Image Pretrained Models for General Video Recognition	Aug 4, 2022	Action ClassificationAction Recognition	CodeCode Available	3	5
WavChat: A Survey of Spoken Dialogue Models	Nov 15, 2024	speech-recognitionSpeech Recognition	CodeCode Available	3	5
PirateNets: Physics-informed Deep Learning with Residual Adaptive Networks	Feb 1, 2024	Deep Learning	CodeCode Available	3	5
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization	Mar 24, 2023	3D Hand Pose EstimationGPU	CodeCode Available	3	5
Style Aligned Image Generation via Shared Attention	Dec 4, 2023	Image Generation	CodeCode Available	3	5
Camera Calibration via Circular Patterns: A Comprehensive Framework with Measurement Uncertainty and Unbiased Projection Model	Jun 20, 2025	Camera Calibration	CodeCode Available	3	5
emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation	Dec 2, 2024	AnatomyHand Pose Estimation	CodeCode Available	3	5
Separable Self-attention for Mobile Vision Transformers	Jun 6, 2022	Image ClassificationObject Detection	CodeCode Available	3	5
Safety at Scale: A Comprehensive Survey of Large Model Safety	Feb 2, 2025	Autonomous DrivingData Poisoning	CodeCode Available	3	5
ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks	Sep 1, 2018	Face HallucinationGenerative Adversarial Network	CodeCode Available	3	5
CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph	Jun 16, 2024	Drug DesignFairness	CodeCode Available	3	5
A Declarative System for Optimizing AI Workloads	May 23, 2024		CodeCode Available	3	5
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction	Feb 17, 2025	2kAutonomous Driving	CodeCode Available	3	5
How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection	Aug 25, 2023	Object Detection	CodeCode Available	3	5
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models	Mar 8, 2020	Face HallucinationHallucination	CodeCode Available	3	5
DETRs with Collaborative Hybrid Assignments Training	Nov 22, 2022	DecoderInstance Segmentation	CodeCode Available	3	5
Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension	Nov 20, 2024	GPUMME	CodeCode Available	3	5
SparseTSF: Modeling Long-term Time Series Forecasting with 1k Parameters	May 2, 2024	Time SeriesTime Series Forecasting	CodeCode Available	3	5
SelfCodeAlign: Self-Alignment for Code Generation	Oct 31, 2024	Code GenerationHumanEval	CodeCode Available	3	5
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model	May 29, 2025	Large Language Modelscientific discovery	CodeCode Available	3	5
Scientific Large Language Models: A Survey on Biological & Chemical Domains	Jan 26, 2024	scientific discoverySurvey	CodeCode Available	3	5
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation	Feb 27, 2024	Protein Design	CodeCode Available	3	5
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought	Oct 3, 2022	Mathematical ReasoningQuestion Answering	CodeCode Available	3	5
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors	Jan 14, 2025	Image to Video GenerationVideo Generation	CodeCode Available	3	5
Dopamine: A Research Framework for Deep Reinforcement Learning	Dec 14, 2018	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	3	5
ModelScope Text-to-Video Technical Report	Aug 12, 2023	DenoisingImage Generation	CodeCode Available	3	5
DocAgent: A Multi-Agent System for Automated Code Documentation Generation	Apr 11, 2025	Code Documentation Generation	CodeCode Available	3	5
Geometric-aware Pretraining for Vision-centric 3D Object Detection	Apr 6, 2023	3D Object DetectionAutonomous Driving	CodeCode Available	3	5
Physics-Informed Diffusion Models	Mar 21, 2024	Denoising	CodeCode Available	3	5
An end-to-end strategy for recovering a free-form potential from a snapshot of stellar coordinates	May 26, 2023	FormSymbolic Regression	CodeCode Available	3	5
MELODI: Exploring Memory Compression for Long Contexts	Oct 4, 2024		CodeCode Available	3	5
Accelerating Production LLMs with Combined Token/Embedding Speculators	Apr 29, 2024		CodeCode Available	3	5