The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8251–8300 of 661570 papers

Title	Date	Status	Hype
Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition	Mar 10, 2026	—Unverified	1
REAP the Experts: Why Pruning Prevails for One-Shot MoE compression	Mar 10, 2026	—Unverified	0
Proper Body Landmark Subset Enables More Accurate and 5X Faster Recognition of Isolated Signs in LIBRAS	Mar 10, 2026	—Unverified	0
SynHLMA:Synthesizing Hand Language Manipulation for Articulated Object with Discrete Human Object Interaction Representation	Mar 10, 2026	—Unverified	0
GraphKeeper: Graph Domain-Incremental Learning via Knowledge Disentanglement and Preservation	Mar 10, 2026	—Unverified	0
SPAN: Spatial-Projection Alignment for Monocular 3D Object Detection	Mar 10, 2026	—Unverified	0
PRISM of Opinions: A Persona-Reasoned Multimodal Framework for User-centric Conversational Stance Detection	Mar 10, 2026	—Unverified	0
Mitigating Long-Tail Bias in HOI Detection via Adaptive Diversity Cache	Mar 10, 2026	—Unverified	0
Bootstrap Dynamic-Aware 3D Visual Representation for Scalable Robot Learning	Mar 10, 2026	—Unverified	0
Audio-Visual World Models: Towards Multisensory Imagination in Sight and Sound	Mar 10, 2026	—Unverified	0
AVGGT: Rethinking Global Attention for Accelerating VGGT	Mar 10, 2026	—Unverified	0
From Veracity to Diffusion: Adressing Operational Challenges in Moving From Fake-News Detection to Information Disorders	Mar 10, 2026	—Unverified	0
ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning	Mar 10, 2026	—Unverified	0
Do Spatial Descriptors Improve Multi-DoF Finger Movement Decoding from HD sEMG?	Mar 10, 2026	—Unverified	0
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning	Mar 10, 2026	—Unverified	0
Empowering All-in-Loop Health Management of Spacecraft Power System in the Mega-Constellation Era via Human-AI Collaboration	Mar 10, 2026	—Unverified	0
Rewards as Labels: Revisiting RLVR from a Classification Perspective	Mar 10, 2026	—Unverified	0
Energy-Aware Spike Budgeting for Continual Learning in Spiking Neural Networks for Neuromorphic Vision	Mar 10, 2026	—Unverified	0
Continual uncertainty learning	Mar 10, 2026	—Unverified	0
VLN-Cache: Enabling Token Caching for VLN Models with Visual/Semantic Dynamics Awareness	Mar 10, 2026	—Unverified	0
OrthoAI: A Neurosymbolic Framework for Evidence-Grounded Biomechanical Reasoning in Clear Aligner Orthodontics	Mar 10, 2026	—Unverified	0
DUEL: Exact Likelihood for Masked Diffusion via Deterministic Unmasking	Mar 10, 2026	—Unverified	0
PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking	Mar 10, 2026	—Unverified	0
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation	Mar 10, 2026	—Unverified	0
Latent Generative Models with Tunable Complexity for Compressed Sensing and other Inverse Problems	Mar 10, 2026	—Unverified	0
PolyBlocks: A Compiler Infrastructure for AI Chips and Programming Frameworks	Mar 10, 2026	—Unverified	0
Property-driven Protein Inverse Folding With Multi-Objective Preference Alignment	Mar 10, 2026	—Unverified	0
VirtueBench: Evaluating Trustworthiness under Uncertainty in Long Video Understanding	Mar 10, 2026	—Unverified	0
Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA	Mar 10, 2026	—Unverified	0
ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts	Mar 10, 2026	—Unverified	0
From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation	Mar 10, 2026	—Unverified	0
YOLO-NAS-Bench: A Surrogate Benchmark with Self-Evolving Predictors for YOLO Architecture Search	Mar 10, 2026	—Unverified	0
Reviving ConvNeXt for Efficient Convolutional Diffusion Models	Mar 10, 2026	—Unverified	0
RiO-DETR: DETR for Real-time Oriented Object Detection	Mar 10, 2026	—Unverified	0
Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health	Mar 10, 2026	—Unverified	0
You Didn't Have to Say It like That: Subliminal Learning from Faithful Paraphrases	Mar 10, 2026	—Unverified	0
MetaDAT: Generalizable Trajectory Prediction via Meta Pre-training and Data-Adaptive Test-Time Updating	Mar 10, 2026	—Unverified	0
CERES: A Probabilistic Early Warning System for Acute Food Insecurity	Mar 10, 2026	—Unverified	0
AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems	Mar 10, 2026	—Unverified	0
From Weighting to Modeling: A Nonparametric Estimator for Off-Policy Evaluation	Mar 10, 2026	—Unverified	0
GIIM: Graph-based Learning of Inter- and Intra-view Dependencies for Multi-view Medical Image Diagnosis	Mar 10, 2026	—Unverified	0
A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation	Mar 10, 2026	—Unverified	0
Declarative Scenario-based Testing with RoadLogic	Mar 10, 2026	—Unverified	0
TopoOR: A Unified Topological Scene Representation for the Operating Room	Mar 10, 2026	—Unverified	0
Evolving Prompt Adaptation for Vision-Language Models	Mar 10, 2026	—Unverified	0
OmniEarth: A Benchmark for Evaluating Vision-Language Models in Geospatial Tasks	Mar 10, 2026	—Unverified	0
Telogenesis: Goal Is All U Need	Mar 10, 2026	—Unverified	0
Vibe-Creation: The Epistemology of Human-AI Emergent Cognition	Mar 10, 2026	—Unverified	0
TrainDeeploy: Hardware-Accelerated Parameter-Efficient Fine-Tuning of Small Transformer Models at the Extreme Edge	Mar 10, 2026	—Unverified	0
Probing the Reliability of Driving VLMs: From Inconsistent Responses to Grounded Temporal Reasoning	Mar 10, 2026	—Unverified	0