The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1351–1375 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
Deep Patch Visual SLAM	Aug 3, 2024	GPUVisual Odometry	CodeCode Available	4	5
Towards Automated Circuit Discovery for Mechanistic Interpretability	Apr 28, 2023		CodeCode Available	4	5
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning	May 17, 2025	2D Object DetectionObject Counting	CodeCode Available	4	5
TigerBot: An Open Multilingual Multitask LLM	Dec 14, 2023		CodeCode Available	4	5
PLAID: An Efficient Engine for Late Interaction Retrieval	May 19, 2022	CPUGPU	CodeCode Available	4	5
Knowledge Fusion of Large Language Models	Jan 19, 2024	Code GenerationCommon Sense Reasoning	CodeCode Available	4	5
TALENT: A Tabular Analytics and Learning Toolbox	Jul 4, 2024		CodeCode Available	4	5
Osprey: Pixel Understanding with Visual Instruction Tuning	Dec 15, 2023	Language Modelling	CodeCode Available	4	5
Let's Verify Step by Step	May 31, 2023	Active LearningMath	CodeCode Available	4	5
Agent-as-a-Judge: Evaluate Agents with Agents	Oct 14, 2024	Code Generation	CodeCode Available	4	5
TUMTraf V2X Cooperative Perception Dataset	Mar 2, 2024	3D Object DetectionAutonomous Vehicles	CodeCode Available	4	5
Attention on the Sphere	May 16, 2025	Depth EstimationImage Segmentation	CodeCode Available	4	5
Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise	Dec 5, 2024	DenoisingImage Restoration	CodeCode Available	4	5
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction	Dec 5, 2024	3D Semantic Occupancy PredictionAutonomous Driving	CodeCode Available	4	5
Vision-Language Models for Vision Tasks: A Survey	Apr 3, 2023	BenchmarkingKnowledge Distillation	CodeCode Available	4	5
A Survey on Visual Mamba	Apr 24, 2024	Image RegistrationImage Restoration	CodeCode Available	4	5
End-to-end Autonomous Driving: Challenges and Frontiers	Jun 29, 2023	Autonomous Drivingmotion prediction	CodeCode Available	4	5
TensoRF: Tensorial Radiance Fields	Mar 17, 2022	Low-Dose X-Ray Ct ReconstructionNeRF	CodeCode Available	4	5
A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph Data	Mar 12, 2023	Computational Efficiency	CodeCode Available	4	5
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks	Nov 17, 2022	DecoderLanguage Modelling	CodeCode Available	4	5
Generating Structured Outputs from Language Models: Benchmark and Studies	Jan 18, 2025		CodeCode Available	4	5
Semi-Mamba-UNet: Pixel-Level Contrastive and Pixel-Level Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation	Feb 11, 2024	Cardiac SegmentationContrastive Learning	CodeCode Available	4	5
Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis	Mar 7, 2024	CT ReconstructionNeRF	CodeCode Available	4	5
Timer-XL: Long-Context Transformers for Unified Time Series Forecasting	Oct 7, 2024	Time SeriesTime Series Forecasting	CodeCode Available	4	5
TRUE: Re-evaluating Factual Consistency Evaluation	Apr 11, 2022	Question GenerationQuestion-Generation	CodeCode Available	4	5