The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers


Showing 4,501–4,525 of 661,570 papers

Title	Date	Status	Hype
Meta-Reinforcement Learning with Self-Reflection for Agentic Search	Mar 18, 2026	Code Available	0
Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLM Reward Models	Mar 18, 2026	Code Available	0
Towards Motion-aware Referring Image Segmentation	Mar 18, 2026	Code Available	0
UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal Models	Mar 18, 2026	Code Available	0
Procedural Generation of Algorithm Discovery Tasks in Machine Learning	Mar 18, 2026	Code Available	0
Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients	Mar 18, 2026	Code Available	0
Self-Attention And Beyond the Infinite: Towards Linear Transformers with Infinite Self-Attention	Mar 18, 2026	Code Available	0
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation	Mar 18, 2026	Unverified	1
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery	Mar 18, 2026	Unverified	4
Complementary Reinforcement Learning	Mar 18, 2026	Unverified	1
Stereo World Model: Camera-Guided Stereo Video Generation	Mar 18, 2026	Unverified	1
Tree Search for LLM Agent Reinforcement Learning	Mar 18, 2026	Unverified	3
Generative Refocusing: Flexible Defocus Control from a Single Image	Mar 18, 2026	Unverified	3
Learning Goal-Oriented Vision-and-Language Navigation with Self-Improving Demonstrations at Scale	Mar 18, 2026	Unverified	1
FoMo X: Modular Explainability Signals for Outlier Detection Foundation Models	Mar 18, 2026	Unverified	0
Parameter-Efficient Modality-Balanced Symmetric Fusion for Multimodal Remote Sensing Semantic Segmentation	Mar 18, 2026	Code Available	0
Deep Learning Multi-Horizon Irradiance Nowcasting: A Comparative Evaluation of Three Methods for Leveraging Sky Images	Mar 17, 2026	Unverified	0
PI-Mamba: Linear-Time Protein Backbone Generation via Spectrally Initialized Flow Matching	Mar 17, 2026	Unverified	0
The Cognitive Divergence: AI Context Windows, Human Attention Decline, and the Delegation Feedback Loop	Mar 17, 2026	Unverified	0
Agentic AI for Human Resources: LLM-Driven Candidate Assessment	Mar 17, 2026	Unverified	0
On the Carbon Footprint of Economic Research in the Age of Generative AI	Mar 17, 2026	Unverified	0
EMPD: An Event-based Multimodal Physiological Dataset for Remote Pulse Wave Detection	Mar 17, 2026	Unverified	0
Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores	Mar 17, 2026	Unverified	0
Scaling Attention via Feature Sparsity	Mar 17, 2026	Unverified	0
Latent Semantic Manifolds in Large Language Models	Mar 17, 2026	Unverified	0