The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2651–2675 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Vision-Language Pre-training: Basics, Recent Advances, and Future Trends	Oct 17, 2022	Few-Shot LearningImage Captioning	CodeCode Available	3	5
CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning	Jun 30, 2023	Causal InferenceMedical Report Generation	CodeCode Available	3	5
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models	Apr 3, 2024	GSM8KQuantization	CodeCode Available	3	5
MLZero: A Multi-Agent System for End-to-end Machine Learning Automation	May 20, 2025	AutoMLCode Generation	CodeCode Available	3	5
Deformable DETR: Deformable Transformers for End-to-End Object Detection	Oct 8, 2020	2D Object DetectionObject Detection	CodeCode Available	3	5
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation	Sep 6, 2024	Image Generation	CodeCode Available	3	5
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't	Mar 20, 2025	Mathematical ReasoningReinforcement Learning (RL)	CodeCode Available	3	5
Vine Copulas as Differentiable Computational Graphs	Jun 16, 2025	GPUScheduling	CodeCode Available	3	5
Safe RLHF: Safe Reinforcement Learning from Human Feedback	Oct 19, 2023	reinforcement-learningReinforcement Learning	CodeCode Available	3	5
Predicting from Strings: Language Model Embeddings for Bayesian Optimization	Oct 14, 2024	Bayesian OptimizationExperimental Design	CodeCode Available	3	5
Discovering Language Model Behaviors with Model-Written Evaluations	Dec 19, 2022	Language ModelingLanguage Modelling	CodeCode Available	3	5
A Survey of Camouflaged Object Detection and Beyond	Aug 26, 2024	Instance SegmentationObject	CodeCode Available	3	5
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving	Sep 23, 2024	3D Multi-Object TrackingAutonomous Driving	CodeCode Available	3	5
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents	Mar 4, 2024	Contrastive Learning	CodeCode Available	3	5
PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition	Jul 15, 2024	Automated Theorem Proving	CodeCode Available	3	5
A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond	Mar 21, 2024	Survey	CodeCode Available	3	5
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo	Jan 22, 2024	3D ReconstructionDepth Estimation	CodeCode Available	3	5
Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video	Apr 28, 2025		CodeCode Available	3	5
MyoSuite -- A contact-rich simulation suite for musculoskeletal motor control	May 26, 2022	continuous-controlContinuous Control	CodeCode Available	3	5
Effects of charging and discharging capabilities on trade-offs between model accuracy and computational efficiency in pumped thermal electricity storage	Nov 8, 2024	Computational Efficiency	CodeCode Available	3	5
Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey	Jun 11, 2024	DeepFake DetectionFace Swapping	CodeCode Available	3	5
Towards Kinetic Manipulation of the Latent Space	Sep 15, 2024		CodeCode Available	3	5
Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation	Apr 25, 2023	Image SegmentationMedical Image Segmentation	CodeCode Available	3	5
AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP	Mar 9, 2025	Anomaly DetectionAnomaly Localization	CodeCode Available	3	5
xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart	Jul 1, 2024	3D Medical Imaging Segmentationimage-classification	CodeCode Available	3	5