The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8,776–8,800 of 474,278 papers

Title	Date	Status
Designing Tools with Control Confidence	Oct 14, 2025	Code Available
Few Shot Semi-Supervised Learning for Abnormal Stop Detection from Sparse GPS Trajectories	Oct 14, 2025	Code Available
PET Head Motion Estimation Using Supervised Deep Learning with Attention	Oct 14, 2025	Code Available
The Harder The Better: Maintaining Supervised Fine-tuning Generalization with Less but Harder Data	Oct 14, 2025	Code Available
KnowledgeSmith: Uncovering Knowledge Updating in LLMs with Model Editing and Unlearning	Oct 14, 2025	Code Available
CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction	Oct 14, 2025	Code Available
KonfAI: A Modular and Fully Configurable Framework for Deep Learning in Medical Imaging	Oct 14, 2025	Code Available
Limited Preference Data? Learning Better Reward Model with Latent Space Synthesis	Oct 14, 2025	Code Available
Biased-Attention Guided Risk Prediction for Safe Decision-Making at Unsignalized Intersections	Oct 14, 2025	Code Available
One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration	Oct 14, 2025	Unverified
Robot Learning: A Tutorial	Oct 14, 2025	Unverified
Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval	Oct 14, 2025	Code Available
GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning	Oct 14, 2025	Unverified
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions	Oct 14, 2025	Unverified
An Adaptive Edge-Guided Dual-Network Framework for Fast QR Code Motion Deblurring	Oct 14, 2025	Code Available
In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding with Gaze-Guided Prompting	Oct 14, 2025	Unverified
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts	Oct 14, 2025	Unverified
SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning	Oct 14, 2025	Unverified
Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment	Oct 14, 2025	Code Available
CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving	Oct 14, 2025	Code Available
PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks	Oct 14, 2025	Code Available
MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking	Oct 14, 2025	Code Available
Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation	Oct 14, 2025	Code Available
Towards Robust and Realible Multimodal Misinformation Recognition with Incomplete Modality	Oct 14, 2025	Code Available
Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models	Oct 14, 2025	Code Available