The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3701–3725 of 661570 papers

Title	Date	Status	Hype
OpenT2M: No-frill Motion Generation with Open-source,Large-scale, High-quality Data	Mar 19, 2026	—Unverified	0
Automatic detection of Gen-AI texts: A comparative framework of neural models	Mar 19, 2026	—Unverified	0
WeNLEX: Weakly Supervised Natural Language Explanations for Multilabel Chest X-ray Classification	Mar 19, 2026	—Unverified	0
Towards Verifiable AI with Lightweight Cryptographic Proofs of Inference	Mar 19, 2026	—Unverified	0
Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably	Mar 19, 2026	—Unverified	0
D-Mem: A Dual-Process Memory System for LLM Agents	Mar 19, 2026	—Unverified	0
Communication-Efficient and Robust Multi-Modal Federated Learning via Latent-Space Consensus	Mar 19, 2026	—Unverified	0
Multi-Domain Causal Empirical Bayes Under Linear Mixing	Mar 19, 2026	—Unverified	0
Multimodal Task Interference: A Benchmark and Analysis of History-Target Mismatch in Multimodal LLMs	Mar 19, 2026	—Unverified	0
On the Peril of (Even a Little) Nonstationarity in Satisficing Regret Minimization	Mar 19, 2026	—Unverified	0
Can LLM generate interesting mathematical research problems?	Mar 19, 2026	—Unverified	0
SuperDec: 3D Scene Decomposition with Superquadric Primitives	Mar 19, 2026	—Unverified	0
Online Convex Optimization with Heavy Tails: Old Algorithms, New Regrets, and Applications	Mar 19, 2026	—Unverified	0
CrossHOI-Bench: A Unified Benchmark for HOI Evaluation across Vision-Language Models and HOI-Specific Methods	Mar 19, 2026	—Unverified	0
AI-driven Dispensing of Coral Reseeding Devices for Broad-scale Restoration of the Great Barrier Reef	Mar 19, 2026	—Unverified	0
Closed-form _r norm scaling with data for overparameterized linear regression and diagonal linear networks under _p bias	Mar 19, 2026	—Unverified	0
LucidFlux: Caption-Free Photo-Realistic Image Restoration via a Large-Scale Diffusion Transformer	Mar 19, 2026	—Unverified	5
Splines-Based Feature Importance in Kolmogorov-Arnold Networks: A Framework for Supervised Tabular Data Dimensionality Reduction	Mar 19, 2026	—Unverified	0
Support Basis: Fast Attention Beyond Bounded Entries	Mar 19, 2026	—Unverified	0
If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models	Mar 19, 2026	—Unverified	0
Evaluating Hallucinations in Audio-Visual Multimodal LLMs with Spoken Queries under Diverse Acoustic Conditions	Mar 19, 2026	—Unverified	0
Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models	Mar 19, 2026	—Unverified	0
StoryBox: Collaborative Multi-Agent Simulation for Hybrid Bottom-Up Long-Form Story Generation Using Large Language Models	Mar 19, 2026	—Unverified	0
Linear Attention for Joint Power Optimization and User-Centric Clustering in Cell-Free Networks	Mar 19, 2026	—Unverified	0
Adaptive Accountability in Networked MAS: Tracing and Mitigating Emergent Norms at Scale	Mar 19, 2026	—Unverified	0