The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8,401–8,425 of 474,278 papers

Title	Date	Status
Horizon Reduction Makes RL Scalable	Oct 22, 2025	Code Available
PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs	Oct 22, 2025	Code Available
Beyond GPT-5: Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing	Oct 22, 2025	Code Available
Preconditioned Norms: A Unified Framework for Steepest Descent, Quasi-Newton and Adaptive Methods	Oct 22, 2025	Code Available
X-Ego: Acquiring Team-Level Tactical Situational Awareness via Cross-Egocentric Contrastive Video Representation Learning	Oct 22, 2025	Code Available
The Zero-Step Thinking: An Empirical Study of Mode Selection as Harder Early Exit in Reasoning Models	Oct 22, 2025	Code Available
SCEESR: Semantic-Control Edge Enhancement for Diffusion-Based Super-Resolution	Oct 22, 2025	Code Available
JointCQ: Improving Factual Hallucination Detection with Joint Claim and Query Generation	Oct 22, 2025	Code Available
Balancing Rewards in Text Summarization: Multi-Objective Reinforcement Learning via HyperVolume Optimization	Oct 22, 2025	Code Available
Graph Unlearning Meets Influence-aware Negative Preference Optimization	Oct 22, 2025	Code Available
Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation	Oct 22, 2025	Code Available
XBench: A Comprehensive Benchmark for Visual-Language Explanations in Chest Radiography	Oct 22, 2025	Code Available
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning	Oct 22, 2025	Code Available
Monitoring LLM-based Multi-Agent Systems Against Corruptions via Node Evaluation	Oct 22, 2025	Code Available
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders	Oct 22, 2025	Code Available
Enhanced Cyclic Coordinate Descent Methods for Elastic Net Penalized Linear Models	Oct 22, 2025	Code Available
Towards Strong Certified Defense with Universal Asymmetric Randomization	Oct 22, 2025	Code Available
Motion2Meaning: A Clinician-Centered Framework for Contestable LLM in Parkinson's Disease Gait Interpretation	Oct 21, 2025	Code Available
VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting	Oct 21, 2025	Unverified
DiffGRM: Diffusion-based Generative Recommendation Model	Oct 21, 2025	Code Available
Crucible: Quantifying the Potential of Control Algorithms through LLM Agents	Oct 21, 2025	Code Available
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?	Oct 21, 2025	Unverified
SimKO: Simple Pass@K Policy Optimization	Oct 21, 2025	Unverified
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation	Oct 21, 2025	Unverified
IF-VidCap: Can Video Caption Models Follow Instructions?	Oct 21, 2025	Unverified