The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7951–7975 of 474278 papers

Title	Date	Status
GeneFlow: Translation of Single-cell Gene Expression to Histopathological Images via Rectified Flow	Oct 31, 2025	CodeCode Available
A Dual Large Language Models Architecture with Herald Guided Prompts for Parallel Fine Grained Traffic Signal Control	Oct 31, 2025	CodeCode Available
BlurGuard: A Simple Approach for Robustifying Image Protection Against AI-Powered Editing	Oct 31, 2025	CodeCode Available
RaanA: A Fast, Flexible, and Data-Efficient Post-Training Quantization Algorithm	Oct 31, 2025	CodeCode Available
MedM2T: A MultiModal Framework for Time-Aware Modeling with Electronic Health Record and Electrocardiogram Data	Oct 31, 2025	CodeCode Available
MeisenMeister: A Simple Two Stage Pipeline for Breast Cancer Classification on MRI	Oct 31, 2025	CodeCode Available
Understanding the Implicit User Intention via Reasoning with Large Language Model for Image Editing	Oct 31, 2025	CodeCode Available
Context-Gated Cross-Modal Perception with Visual Mamba for PET-CT Lung Tumor Segmentation	Oct 31, 2025	CodeCode Available
VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision	Oct 31, 2025	CodeCode Available
HiRA: A Hierarchical Reasoning Framework for Decoupled Planning and Execution in Deep Search	Oct 31, 2025	CodeCode Available
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations	Oct 31, 2025	CodeCode Available
Normative Reasoning in Large Language Models: A Comparative Benchmark from Logical and Modal Perspectives	Oct 31, 2025	CodeCode Available
Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler	Oct 31, 2025	CodeCode Available
MedCalc-Eval and MedCalc-Env: Advancing Medical Calculation Capabilities of Large Language Models	Oct 31, 2025	CodeCode Available
NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding	Oct 31, 2025	CodeCode Available
Mechanics of Learned Reasoning 1: TempoBench, A Benchmark for Interpretable Deconstruction of Reasoning System Performance	Oct 31, 2025	CodeCode Available
Sketch-to-Layout: Sketch-Guided Multimodal Layout Generation	Oct 31, 2025	CodeCode Available
Gaussian Combined Distance: A Generic Metric for Object Detection	Oct 31, 2025	CodeCode Available
Continuous Autoregressive Language Models	Oct 31, 2025	CodeCode Available
Soft Task-Aware Routing of Experts for Equivariant Representation Learning	Oct 31, 2025	CodeCode Available
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs	Oct 31, 2025	CodeCode Available
Higher-order Linear Attention	Oct 31, 2025	CodeCode Available
T3: Test-Time Model Merging in VLMs for Zero-Shot Medical Imaging Analysis	Oct 31, 2025	CodeCode Available
RL-Exec: Impact-Aware Reinforcement Learning for Opportunistic Optimal Liquidation, Outperforms TWAP and a Book-Liquidity VWAP on BTC-USD Replays	Oct 30, 2025	CodeCode Available
Urban-MAS: Human-Centered Urban Prediction with LLM-Based Multi-Agent System	Oct 30, 2025	CodeCode Available