The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers


Showing 2651–2700 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning	Jun 30, 2023	Causal Inference, Medical Report Generation	Code Available	3	5
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models	Apr 3, 2024	GSM8K, Quantization	Code Available	3	5
MLZero: A Multi-Agent System for End-to-end Machine Learning Automation	May 20, 2025	AutoML, Code Generation	Code Available	3	5
Deformable DETR: Deformable Transformers for End-to-End Object Detection	Oct 8, 2020	2D Object Detection, Object Detection	Code Available	3	5
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation	Sep 6, 2024	Image Generation	Code Available	3	5
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't	Mar 20, 2025	Mathematical Reasoning, Reinforcement Learning (RL)	Code Available	3	5
Vine Copulas as Differentiable Computational Graphs	Jun 16, 2025	GPU, Scheduling	Code Available	3	5
Safe RLHF: Safe Reinforcement Learning from Human Feedback	Oct 19, 2023	reinforcement-learning, Reinforcement Learning	Code Available	3	5
Predicting from Strings: Language Model Embeddings for Bayesian Optimization	Oct 14, 2024	Bayesian Optimization, Experimental Design	Code Available	3	5
Discovering Language Model Behaviors with Model-Written Evaluations	Dec 19, 2022	Language Modeling, Language Modelling	Code Available	3	5
A Survey of Camouflaged Object Detection and Beyond	Aug 26, 2024	Instance Segmentation, Object	Code Available	3	5
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving	Sep 23, 2024	3D Multi-Object Tracking, Autonomous Driving	Code Available	3	5
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents	Mar 4, 2024	Contrastive Learning	Code Available	3	5
PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition	Jul 15, 2024	Automated Theorem Proving	Code Available	3	5
A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond	Mar 21, 2024	Survey	Code Available	3	5
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo	Jan 22, 2024	3D Reconstruction, Depth Estimation	Code Available	3	5
Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video	Apr 28, 2025		Code Available	3	5
MyoSuite -- A contact-rich simulation suite for musculoskeletal motor control	May 26, 2022	continuous-control, Continuous Control	Code Available	3	5
Effects of charging and discharging capabilities on trade-offs between model accuracy and computational efficiency in pumped thermal electricity storage	Nov 8, 2024	Computational Efficiency	Code Available	3	5
Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey	Jun 11, 2024	DeepFake Detection, Face Swapping	Code Available	3	5
Towards Kinetic Manipulation of the Latent Space	Sep 15, 2024		Code Available	3	5
Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation	Apr 25, 2023	Image Segmentation, Medical Image Segmentation	Code Available	3	5
AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIP	Mar 9, 2025	Anomaly Detection, Anomaly Localization	Code Available	3	5
xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart	Jul 1, 2024	3D Medical Imaging Segmentation, image-classification	Code Available	3	5
Open-Source Skull Reconstruction with MONAI	Nov 25, 2022	C++ code, Deep Learning	Code Available	3	5
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent	Jul 2, 2024		Code Available	3	5
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models	Jan 7, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	3	5
RelBench: A Benchmark for Deep Learning on Relational Databases	Jul 29, 2024	Deep Learning, Feature Engineering	Code Available	3	5
A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions	Jun 9, 2024	3D visual grounding, Survey	Code Available	3	5
Learning Bipedal Walking On Planned Footsteps For Humanoid Robots	Jul 26, 2022	Deep Reinforcement Learning, MuJoCo	Code Available	3	5
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling	Jul 31, 2024	GSM8K, Math	Code Available	3	5
ECG-FM: An Open Electrocardiogram Foundation Model	Aug 9, 2024	Contrastive Learning, Diagnostic	Code Available	3	5
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation	Aug 9, 2024	object-detection, Object Detection	Code Available	3	5
SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning	Jan 26, 2023	imbalanced classification	Code Available	3	5
SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity	Sep 13, 2024	Deep Attention, Representation Learning	Code Available	3	5
CAD-Recode: Reverse Engineering CAD Code from Point Clouds	Dec 18, 2024	CAD Reconstruction, Decoder	Code Available	3	5
EmergentTTS-Eval: Evaluating TTS Models on Complex Prosodic, Expressiveness, and Linguistic Challenges Using Model-as-a-Judge	May 29, 2025	text-to-speech, Text to Speech	Code Available	3	5
DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection	Jul 4, 2023	DeepFake Detection, Face Swapping	Code Available	3	5
FlowDock: Geometric Flow Matching for Generative Protein-Ligand Docking and Affinity Prediction	Dec 14, 2024	Blind Docking, Drug Discovery	Code Available	3	5
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models	Apr 18, 2025	Feature Upsampling	Code Available	3	5
ImageFolder: Autoregressive Image Generation with Folded Tokens	Oct 2, 2024	Image Generation, Image Reconstruction	Code Available	3	5
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation	Feb 6, 2024	Image to Video Generation, Video Generation	Code Available	3	5
Simple linear attention language models balance the recall-throughput tradeoff	Feb 28, 2024	Language Modelling, Mamba	Code Available	3	5
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System	Mar 12, 2025	Chunking, Computational Efficiency	Code Available	3	5
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features	Jan 6, 2025	Feature Engineering, Time Series	Code Available	3	5
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow	Sep 7, 2022	Domain Adaptation, Image Generation	Code Available	3	5
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer	Dec 18, 2024	Attribute, Text Generation	Code Available	3	5
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization	Jun 15, 2024	GPU, Image Manipulation	Code Available	3	5
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact	Mar 2, 2024	Language Modeling, Language Modelling	Code Available	3	5
Multi-agent Architecture Search via Agentic Supernet	Feb 6, 2025	Language Modeling, Language Modelling	Code Available	3	5