The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 9301–9325 of 474278 papers

Title	Date	Status
Code4MeV2: a Research-oriented Code-completion Platform	Oct 4, 2025	—Unverified
SAR-TEXT: A Large-Scale SAR Image-Text Dataset Built with SAR-Narrator and A Progressive Learning Strategy for Downstream Tasks	Oct 4, 2025	CodeCode Available
Self-Correction Bench: Uncovering and Addressing the Self-Correction Blind Spot in Large Language Models	Oct 4, 2025	—Unverified
Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs	Oct 4, 2025	CodeCode Available
ReMoMask: Retrieval-Augmented Masked Motion Generation	Oct 4, 2025	CodeCode Available
ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory	Oct 4, 2025	CodeCode Available
Adaptively Sampling-Reusing-Mixing Decomposed Gradients to Speed Up Sharpness Aware Minimization	Oct 4, 2025	CodeCode Available
LLM-Guided Evolutionary Program Synthesis for Quasi-Monte Carlo Design	Oct 4, 2025	CodeCode Available
Neural Low-Discrepancy Sequences	Oct 4, 2025	CodeCode Available
Cross-Lingual Multi-Granularity Framework for Interpretable Parkinson's Disease Diagnosis from Speech	Oct 4, 2025	CodeCode Available
Destination-to-Chutes Task Mapping Optimization for Multi-Robot Coordination in Robotic Sorting Systems	Oct 3, 2025	CodeCode Available
EGSTalker: Real-Time Audio-Driven Talking Head Generation with Efficient Gaussian Deformation	Oct 3, 2025	—Unverified
Knowledge Graph-Guided Multi-Agent Distillation for Reliable Industrial Question Answering with Datasets	Oct 3, 2025	CodeCode Available
Towards Size-invariant Salient Object Detection: A Generic Evaluation and Optimization Approach	Oct 3, 2025	CodeCode Available
Self-Reflective Generation at Test Time	Oct 3, 2025	—Unverified
Dual-Stage Reweighted MoE for Long-Tailed Egocentric Mistake Detection	Oct 3, 2025	CodeCode Available
ZeroShotOpt: Towards Zero-Shot Pretrained Models for Efficient Black-Box Optimization	Oct 3, 2025	CodeCode Available
Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models	Oct 3, 2025	—Unverified
LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers	Oct 3, 2025	—Unverified
Hyperparameter Loss Surfaces Are Simple Near their Optima	Oct 3, 2025	CodeCode Available
Towards Scalable and Consistent 3D Editing	Oct 3, 2025	—Unverified
LHGEL: Large Heterogeneous Graph Ensemble Learning using Batch View Aggregation	Oct 3, 2025	CodeCode Available
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models	Oct 3, 2025	—Unverified
Leave No TRACE: Black-box Detection of Copyrighted Dataset Usage in Large Language Models via Watermarking	Oct 3, 2025	CodeCode Available
Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile Devices	Oct 3, 2025	CodeCode Available