The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11401–11450 of 661570 papers

Title	Date	Status	Hype
EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs	Mar 4, 2026	—Unverified	0
When to restart? Exploring escalating restarts on convergence	Mar 4, 2026	—Unverified	0
CONCUR: Benchmarking LLMs for Concurrent Code Generation	Mar 4, 2026	—Unverified	0
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier	Mar 4, 2026	—Unverified	1
UrbanHuRo: A Two-Layer Human-Robot Collaboration Framework for the Joint Optimization of Heterogeneous Urban Services	Mar 4, 2026	—Unverified	0
MPFlow: Multi-modal Posterior-Guided Flow Matching for Zero-Shot MRI Reconstruction	Mar 4, 2026	—Unverified	0
Why Do Unlearnable Examples Work: A Novel Perspective of Mutual Information	Mar 4, 2026	—Unverified	0
PROSPECT: Unified Streaming Vision-Language Navigation via Semantic--Spatial Fusion and Latent Predictive Representation	Mar 4, 2026	—Unverified	0
HALyPO: Heterogeneous-Agent Lyapunov Policy Optimization for Human-Robot Collaboration	Mar 4, 2026	—Unverified	0
ErrorLLM: Modeling SQL Errors for Text-to-SQL Refinement	Mar 4, 2026	—Unverified	0
RAGNav: A Retrieval-Augmented Topological Reasoning Framework for Multi-Goal Visual-Language Navigation	Mar 4, 2026	—Unverified	0
JANUS: Structured Bidirectional Generation for Guaranteed Constraints and Analytical Uncertainty	Mar 4, 2026	—Unverified	0
Assessing the Effectiveness of LLMs in Delivering Cognitive Behavioral Therapy	Mar 4, 2026	—Unverified	0
WSI-INR: Implicit Neural Representations for Lesion Segmentation in Whole-Slide Images	Mar 4, 2026	—Unverified	0
Interaction-Aware Whole-Body Control for Compliant Object Transport	Mar 4, 2026	—Unverified	0
Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning	Mar 4, 2026	—Unverified	0
Agentic Peer-to-Peer Networks: From Content Distribution to Capability and Action Sharing	Mar 4, 2026	—Unverified	0
Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding	Mar 4, 2026	—Unverified	0
LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving	Mar 4, 2026	—Unverified	0
Cognition to Control - Multi-Agent Learning for Human-Humanoid Collaborative Transport	Mar 4, 2026	—Unverified	0
Not All Candidates are Created Equal: A Heterogeneity-Aware Approach to Pre-ranking in Recommender Systems	Mar 4, 2026	—Unverified	0
Towards Effective Orchestration of AI x DB Workloads	Mar 4, 2026	—Unverified	0
Inverse Contextual Bandits without Rewards: Learning from a Non-Stationary Learner via Suffix Imitation	Mar 4, 2026	—Unverified	0
MACC: Multi-Agent Collaborative Competition for Scientific Exploration	Mar 4, 2026	—Unverified	0
DisenReason: Behavior Disentanglement and Latent Reasoning for Shared-Account Sequential Recommendation	Mar 4, 2026	—Unverified	0
Specification-Driven Generation and Evaluation of Discrete-Event World Models via the DEVS Formalism	Mar 4, 2026	—Unverified	0
Observationally Informed Adaptive Causal Experimental Design	Mar 4, 2026	—Unverified	0
Small Object Detection in Complex Backgrounds with Multi-Scale Attention and Global Relation Modeling	Mar 4, 2026	—Unverified	0
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning	Mar 4, 2026	—Unverified	0
TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration	Mar 4, 2026	—Unverified	0
A Rubric-Supervised Critic from Sparse Real-World Outcomes	Mar 4, 2026	—Unverified	0
Unsupervised Surrogate-Assisted Synthesis of Free-Form Planar Antenna Topologies for IoT Applications	Mar 4, 2026	—Unverified	0
Separators in Enhancing Autoregressive Pretraining for Vision Mamba	Mar 4, 2026	—Unverified	0
Universal Pansharpening Foundation Model	Mar 4, 2026	—Unverified	0
Adaptive Enhancement and Dual-Pooling Sequential Attention for Lightweight Underwater Object Detection with YOLOv10	Mar 4, 2026	—Unverified	0
In-Context Environments Induce Evaluation-Awareness in Language Models	Mar 4, 2026	—Unverified	0
PatchDecomp: Interpretable Patch-Based Time Series Forecasting	Mar 4, 2026	—Unverified	0
Semantic Bridging Domains: Pseudo-Source as Test-Time Connector	Mar 4, 2026	—Unverified	0
Non-Invasive Reconstruction of Cardiac Activation Dynamics Using Physics-Informed Neural Networks	Mar 4, 2026	—Unverified	0
Structure-Aware Distributed Backdoor Attacks in Federated Learning	Mar 4, 2026	—Unverified	0
All-in-One Image Restoration via Causal-Deconfounding Wavelet-Disentangled Prompt Network	Mar 4, 2026	—Unverified	0
On the Suitability of LLM-Driven Agents for Dark Pattern Audits	Mar 4, 2026	—Unverified	0
Benchmarking Motivational Interviewing Competence of Large Language Models	Mar 4, 2026	—Unverified	0
Coupling Local Context and Global Semantic Prototypes via a Hierarchical Architecture for Rhetorical Roles Labeling	Mar 4, 2026	—Unverified	0
k-hop Fairness: Addressing Disparities in Graph Link Prediction Beyond First-Order Neighborhoods	Mar 4, 2026	—Unverified	0
Believe Your Model: Distribution-Guided Confidence Calibration	Mar 4, 2026	—Unverified	0
How Predicted Links Influence Network Evolution: Disentangling Choice and Algorithmic Feedback in Dynamic Graphs	Mar 4, 2026	—Unverified	0
UniRain: Unified Image Deraining with RAG-based Dataset Distillation and Multi-objective Reweighted Optimization	Mar 4, 2026	—Unverified	0
UniSync: Towards Generalizable and High-Fidelity Lip Synchronization for Challenging Scenarios	Mar 4, 2026	—Unverified	0
A novel network for classification of cuneiform tablet metadata	Mar 4, 2026	—Unverified	0