The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7401–7425 of 474278 papers

Title	Date	Status
Prompt Triage: Structured Optimization Enhances Vision-Language Model Performance on Medical Imaging Benchmarks	Nov 14, 2025	CodeCode Available
Latent Motion Profiling for Annotation-free Cardiac Phase Detection in Adult and Fetal Echocardiography Videos	Nov 14, 2025	CodeCode Available
ICL-Router: In-Context Learned Model Representations for LLM Routing	Nov 14, 2025	CodeCode Available
Hierarchical Mixing Architecture for Low-light RAW Image Enhancement	Nov 14, 2025	CodeCode Available
A Closer Look at Knowledge Distillation in Spiking Neural Network Training	Nov 14, 2025	CodeCode Available
FreDFT: Frequency Domain Fusion Transformer for Visible-Infrared Object Detection	Nov 14, 2025	CodeCode Available
From Proof to Program: Characterizing Tool-Induced Reasoning Hallucinations in Large Language Models	Nov 14, 2025	CodeCode Available
Exposing Weak Links in Multi-Agent Systems under Adversarial Prompting	Nov 14, 2025	CodeCode Available
Multi-agent Undercover Gaming: Hallucination Removal via Counterfactual Test for Multimodal Reasoning	Nov 14, 2025	CodeCode Available
Q-Doc: Benchmarking Document Image Quality Assessment Capabilities in Multi-modal Large Language Models	Nov 14, 2025	CodeCode Available
VoxTell: Free-Text Promptable Universal 3D Medical Image Segmentation	Nov 14, 2025	CodeCode Available
Towards Mitigating Systematics in Large-Scale Surveys via Few-Shot Optimal Transport-Based Feature Alignment	Nov 14, 2025	CodeCode Available
TopoPerception: A Shortcut-Free Evaluation of Global Visual Perception in Large Vision-Language Models	Nov 14, 2025	CodeCode Available
PI-NAIM: Path-Integrated Neural Adaptive Imputation Model	Nov 14, 2025	CodeCode Available
Multi-agent In-context Coordination via Decentralized Memory Retrieval	Nov 13, 2025	CodeCode Available
Beyond Perplexity: Let the Reader Select Retrieval Summaries via Spectrum Projection Score	Nov 13, 2025	—Unverified
Test-Time Reinforcement Learning for GUI Grounding via Region Consistency	Nov 13, 2025	—Unverified
MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs	Nov 13, 2025	—Unverified
PROPA: Toward Process-level Optimization in Visual Reasoning via Reinforcement Learning	Nov 13, 2025	CodeCode Available
SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding	Nov 13, 2025	—Unverified
URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding	Nov 13, 2025	CodeCode Available
Depth Anything 3: Recovering the Visual Space from Any Views	Nov 13, 2025	—Unverified
SPOT: Sparsification with Attention Dynamics via Token Relevance in Vision Transformers	Nov 13, 2025	CodeCode Available
BanglaTalk: Towards Real-Time Speech Assistance for Bengali Regional Dialects	Nov 13, 2025	CodeCode Available
Retrieval-Augmented Generation for Reliable Interpretation of Radio Regulations	Nov 13, 2025	CodeCode Available