The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10476–10500 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Dynamic Spatial Propagation Network for Depth Completion	Feb 20, 2022	Depth Completion	CodeCode Available	2	5
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling	Jun 25, 2025	Language ModelingLanguage Modelling	CodeCode Available	2	5
Perception Test: A Diagnostic Benchmark for Multimodal Video Models	May 23, 2023	DiagnosticGrounded Video Question Answering	CodeCode Available	2	5
RITA: a Study on Scaling Up Generative Protein Sequence Models	May 11, 2022	PredictionProtein Design	CodeCode Available	2	5
Multi-target stain normalization for histology slides	Jun 4, 2024		CodeCode Available	2	5
MedS^3: Towards Medical Small Language Models with Self-Evolved Slow Thinking	Jan 21, 2025	Multiple-choice	CodeCode Available	2	5
Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline	Mar 9, 2024	Object TrackingRgb-T Tracking	CodeCode Available	2	5
ChaCha for Online AutoML	Jun 9, 2021	AutoMLScheduling	CodeCode Available	2	5
Graph-based Topology Reasoning for Driving Scenes	Apr 11, 2023	3D Lane DetectionAutonomous Driving	CodeCode Available	2	5
SegFix: Model-Agnostic Boundary Refinement for Segmentation	Jul 8, 2020	modelSegmentation	CodeCode Available	2	5
Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction	May 31, 2022	Surface Reconstruction	CodeCode Available	2	5
TrafficGPT: An LLM Approach for Open-Set Encrypted Traffic Classification	Aug 6, 2024	Traffic Classification	CodeCode Available	2	5
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models	Feb 5, 2024	Data AugmentationData Poisoning	CodeCode Available	2	5
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models	Sep 28, 2023	10-shot image generation1 Image, 2*2 Stitchi	CodeCode Available	2	5
Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data	Jul 11, 2024	Autonomous NavigationPrediction	CodeCode Available	2	5
Probability density estimation for sets of large graphs with respect to spectral information using stochastic block models	Jul 5, 2022	Density Estimation	CodeCode Available	2	5
MTAD: Tools and Benchmarks for Multivariate Time Series Anomaly Detection	Jan 10, 2024	Anomaly DetectionTime Series	CodeCode Available	2	5
One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts	Dec 28, 2023	AllAnatomy	CodeCode Available	2	5
OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation	Apr 14, 2025	3D Shape Generation	CodeCode Available	2	5
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models	Jun 27, 2023	Automated Theorem ProvingGPU	CodeCode Available	2	5
LongVLM: Efficient Long Video Understanding via Large Language Models	Apr 4, 2024	Question AnsweringVideo Question Answering	CodeCode Available	2	5
Geometry-Informed Neural Networks	Feb 21, 2024	Diversity	CodeCode Available	2	5
MOROCCO: Model Resource Comparison Framework	Nov 16, 2021	Computational Efficiencymodel	CodeCode Available	2	5
DeepCache: Accelerating Diffusion Models for Free	Dec 1, 2023	DenoisingImage Generation	CodeCode Available	2	5
Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph	Jun 21, 2024	BenchmarkingText Generation	CodeCode Available	2	5