The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6176–6200 of 474278 papers

Title	Date	Status
VABench: A Comprehensive Benchmark for Audio-Video Generation	Dec 10, 2025	—Unverified
H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos	Dec 10, 2025	—Unverified
Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs	Dec 10, 2025	—Unverified
Closing the Train-Test Gap in World Models for Gradient-Based Planning	Dec 10, 2025	—Unverified
AGORA: Adversarial Generation Of Real-time Animatable 3D Gaussian Head Avatars	Dec 10, 2025	—Unverified
Interpretable Embeddings with Sparse Autoencoders: A Data Analysis Toolkit	Dec 10, 2025	—Unverified
Push Smarter, Not Harder: Hierarchical RL-Diffusion Policy for Efficient Nonprehensile Manipulation	Dec 10, 2025	CodeCode Available
ARE: Scaling Up Agent Environments and Evaluations	Dec 10, 2025	—Unverified
TeleEgo: Benchmarking Egocentric AI Assistants in the Wild	Dec 10, 2025	—Unverified
Attention Sinks in Diffusion Language Models	Dec 10, 2025	—Unverified
VFM-ISRefiner: Towards Better Adapting Vision Foundation Models for Interactive Segmentation of Remote Sensing Images	Dec 10, 2025	CodeCode Available
Deep Edge Filter: Return of the Human-Crafted Layer in Deep Learning	Dec 10, 2025	CodeCode Available
Adaptive Gradient Calibration for Single-Positive Multi-Label Learning in Remote Sensing Image Scene Classification	Dec 10, 2025	CodeCode Available
Dual Refinement Cycle Learning: Unsupervised Text Classification of Mamba and Community Detection on Text Attributed Graph	Dec 10, 2025	CodeCode Available
GLACIA: Instance-Aware Positional Reasoning for Glacial Lake Segmentation via Multimodal Large Language Model	Dec 10, 2025	CodeCode Available
Contrastive Learning for Semi-Supervised Deep Regression with Generalized Ordinal Rankings from Spectral Seriation	Dec 10, 2025	CodeCode Available
MelanomaNet: Explainable Deep Learning for Skin Lesion Classification	Dec 10, 2025	CodeCode Available
Label-free Motion-Conditioned Diffusion Model for Cardiac Ultrasound Synthesis	Dec 10, 2025	CodeCode Available
NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization	Dec 10, 2025	CodeCode Available
Local LLM Ensembles for Zero-shot Portuguese Named Entity Recognition	Dec 10, 2025	CodeCode Available
LxCIM: a new rank-based binary classifier performance metric invariant to local exchange of classes	Dec 10, 2025	CodeCode Available
Visual Heading Prediction for Autonomous Aerial Vehicles	Dec 10, 2025	CodeCode Available
Rethinking Chain-of-Thought Reasoning for Videos	Dec 10, 2025	CodeCode Available
Bring Your Dreams to Life: Continual Text-to-Video Customization	Dec 10, 2025	CodeCode Available
Decoupling Template Bias in CLIP: Harnessing Empty Prompts for Enhanced Few-Shot Learning	Dec 10, 2025	CodeCode Available