The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8626–8650 of 474278 papers

Title	Date	Status
LeMat-Traj: A Scalable and Unified Dataset of Materials Trajectories for Atomistic Modeling	Oct 17, 2025	CodeCode Available
Learning More with Less: A Generalizable, Self-Supervised Framework for Privacy-Preserving Capacity Estimation with EV Charging Data	Oct 17, 2025	CodeCode Available
BSGS: Bi-stage 3D Gaussian Splatting for Camera Motion Deblurring	Oct 17, 2025	CodeCode Available
The Face of Persuasion: Analyzing Bias and Generating Culture-Aware Ads	Oct 17, 2025	CodeCode Available
Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation	Oct 17, 2025	CodeCode Available
The Road Less Traveled: Enhancing Exploration in LLMs via Sequential Sampling	Oct 17, 2025	CodeCode Available
GraphMind: Interactive Novelty Assessment System for Accelerating Scientific Discovery	Oct 17, 2025	CodeCode Available
NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image Generation	Oct 17, 2025	CodeCode Available
ReCon: Region-Controllable Data Augmentation with Rectification and Alignment for Object Detection	Oct 17, 2025	CodeCode Available
RewardRank: Optimizing True Learning-to-Rank Utility	Oct 17, 2025	CodeCode Available
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle	Oct 17, 2025	CodeCode Available
STABLE: Gated Continual Learning for Large Language Models	Oct 17, 2025	CodeCode Available
Narrowing Action Choices with AI Improves Human Sequential Decisions	Oct 17, 2025	CodeCode Available
AtomBench: A Benchmark for Generative Atomic Structure Models using GPT, Diffusion, and Flow Architectures	Oct 17, 2025	CodeCode Available
Memory-SAM: Human-Prompt-Free Tongue Segmentation via Retrieval-to-Prompt	Oct 17, 2025	CodeCode Available
OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies	Oct 16, 2025	CodeCode Available
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning	Oct 16, 2025	—Unverified
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents	Oct 16, 2025	—Unverified
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar	Oct 16, 2025	—Unverified
Are Large Reasoning Models Interruptible?	Oct 16, 2025	—Unverified
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding	Oct 16, 2025	—Unverified
C4D: 4D Made from 3D through Dual Correspondences	Oct 16, 2025	—Unverified
LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training	Oct 16, 2025	—Unverified
WithAnyone: Towards Controllable and ID Consistent Image Generation	Oct 16, 2025	—Unverified
Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation	Oct 16, 2025	—Unverified