The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8551–8600 of 661570 papers

Title	Date	Status	Hype
Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports	Mar 10, 2026	—Unverified	2
Robot Control Stack: A Lean Ecosystem for Robot Learning at Scale	Mar 10, 2026	—Unverified	2
Reward Prediction with Factorized World States	Mar 10, 2026	—Unverified	1
ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA	Mar 10, 2026	—Unverified	2
GNNs for Time Series Anomaly Detection: An Open-Source Framework and a Critical Evaluation	Mar 10, 2026	—Unverified	0
Temporal-Conditioned Normalizing Flows for Multivariate Time Series Anomaly Detection	Mar 10, 2026	—Unverified	0
TemporalDoRA: Temporal PEFT for Robust Surgical Video Question Answering	Mar 10, 2026	—Unverified	0
Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions	Mar 10, 2026	—Unverified	0
BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models	Mar 10, 2026	CodeCode Available	0
Missing-by-Design: Certifiable Modality Deletion for Revocable Multimodal Sentiment Analysis	Mar 10, 2026	—Unverified	0
DOCFORGE-BENCH: A Comprehensive 0-shot Benchmark for Document Forgery Detection and Analysis	Mar 10, 2026	—Unverified	0
Distributed Convolutional Neural Networks for Object Recognition	Mar 10, 2026	—Unverified	0
NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions	Mar 10, 2026	—Unverified	0
Robust Provably Secure Image Steganography via Latent Iterative Optimization	Mar 10, 2026	—Unverified	0
SCALAR: Learning and Composing Skills through LLM Guided Symbolic Planning and Deep RL Grounding	Mar 10, 2026	—Unverified	0
QUSR: Quality-Aware and Uncertainty-Guided Image Super-Resolution Diffusion Model	Mar 10, 2026	CodeCode Available	0
TableMind++: An Uncertainty-Aware Programmatic Agent for Tool-Augmented Table Reasoning	Mar 10, 2026	—Unverified	0
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data	Mar 10, 2026	—Unverified	2
How Contrastive Decoding Enhances Large Audio Language Models?	Mar 10, 2026	—Unverified	0
Predictive Spectral Calibration for Source-Free Test-Time Regression	Mar 10, 2026	—Unverified	0
DenoiseSplat: Feed-Forward Gaussian Splatting for Noisy 3D Scene Reconstruction	Mar 10, 2026	—Unverified	0
Reinforcing Numerical Reasoning in LLMs for Tabular Prediction via Structural Priors	Mar 10, 2026	—Unverified	0
Exploiting the Final Component of Generator Architectures for AI-Generated Image Detection	Mar 10, 2026	—Unverified	0
Active Prompt Learning with Vision-Language Model Priors	Mar 10, 2026	—Unverified	0
OPENXRD: A Comprehensive Benchmark Framework for LLM/MLLM XRD Question Answering	Mar 10, 2026	—Unverified	0
Operator Learning for Consolidation: An Architectural Comparison for DeepONet Variants	Mar 10, 2026	—Unverified	0
Improving Large Vision-Language Models' Understanding for Flow Field Data	Mar 10, 2026	—Unverified	0
EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering	Mar 10, 2026	—Unverified	0
You Only Pose Once: A Minimalist's Detection Transformer for Monocular RGB Category-level 9D Multi-Object Pose Estimation	Mar 10, 2026	—Unverified	0
RF-Informed Graph Neural Networks for Accurate and Data-Efficient Circuit Performance Prediction	Mar 10, 2026	—Unverified	0
A Surrogate model for High Temperature Superconducting Magnets to Predict Current Distribution with Neural Network	Mar 10, 2026	—Unverified	0
VocSegMRI: Multimodal Learning for Precise Vocal Tract Segmentation in Real-time MRI	Mar 10, 2026	—Unverified	0
Automated Coral Spawn Monitoring for Reef Restoration: The Coral Spawn and Larvae Imaging Camera System (CSLICS)	Mar 10, 2026	—Unverified	0
RECODE: Reasoning Through Code Generation for Visual Question Answering	Mar 10, 2026	—Unverified	0
ZeroSiam: An Efficient Asymmetry for Test-Time Entropy Optimization without Collapse	Mar 10, 2026	—Unverified	0
VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning	Mar 10, 2026	—Unverified	0
v-HUB: A Benchmark for Video Humor Understanding from Vision and Sound	Mar 10, 2026	—Unverified	0
Real-Time Neural Video Compression with Unified Intra and Inter Coding	Mar 10, 2026	—Unverified	0
AlphaApollo: A System for Deep Agentic Reasoning	Mar 10, 2026	—Unverified	1
Does Scientific Writing Converge to U.S. English? Evidence from Generative AI-Assisted Publications	Mar 10, 2026	—Unverified	0
Lightweight Time Series Data Valuation on Time Series Foundation Models via In-Context Finetuning	Mar 10, 2026	—Unverified	0
When Robots Obey the Patch: Universal Transferable Patch Attacks on Vision-Language-Action Models	Mar 10, 2026	—Unverified	0
Multi-Agent Reinforcement Learning with Communication-Constrained Priors	Mar 10, 2026	—Unverified	0
SA^2GFM: Enhancing Robust Graph Foundation Models with Structure-Aware Semantic Augmentation	Mar 10, 2026	—Unverified	0
EMFusion: Conditional Diffusion Framework for Trustworthy Frequency Selective EMF Forecasting in Wireless Networks	Mar 10, 2026	—Unverified	0
Reinforcement Learning for Self-Improving Agent with Skill Library	Mar 10, 2026	—Unverified	0
DEER: A Benchmark for Evaluating Deep Research Agents on Expert Report Generation	Mar 10, 2026	—Unverified	0
An AI-powered Bayesian Generative Modeling Approach for Arbitrary Conditional Inference	Mar 10, 2026	CodeCode Available	0
Low-rank Orthogonal Subspace Intervention for Generalizable Face Forgery Detection	Mar 10, 2026	—Unverified	0
From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents	Mar 10, 2026	—Unverified	0