The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7,851–7,875 of 474,278 papers

Title	Date	Status
Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation	Nov 3, 2025	Code Available
SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents	Nov 3, 2025	Code Available
Rethinking LLM Human Simulation: When a Graph is What You Need	Nov 3, 2025	Code Available
UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs	Nov 3, 2025	Code Available
Black-Box Membership Inference Attack for LVLMs via Prior Knowledge-Calibrated Memory Probing	Nov 3, 2025	Code Available
SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment	Nov 3, 2025	Code Available
NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation	Nov 3, 2025	Code Available
Driving scenario generation and evaluation using a structured layer representation and foundational models	Nov 3, 2025	Code Available
HADSF: Aspect Aware Semantic Control for Explainable Recommendation	Nov 3, 2025	Code Available
Perturb a Model, Not an Image: Towards Robust Privacy Protection via Anti-Personalized Diffusion Models	Nov 3, 2025	Code Available
ChartAB: A Benchmark for Chart Grounding & Dense Alignment	Nov 3, 2025	Unverified
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation	Nov 3, 2025	Unverified
Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models	Nov 3, 2025	Unverified
Enhancing Time Awareness in Generative Recommendation	Nov 3, 2025	Code Available
MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation	Nov 3, 2025	Code Available
Web-Scale Collection of Video Data for 4D Animal Reconstruction	Nov 3, 2025	Code Available
MVSMamba: Multi-View Stereo with State Space Model	Nov 3, 2025	Code Available
DepthVanish: Optimizing Adversarial Interval Structures for Stereo-Depth-Invisible Patches	Nov 3, 2025	Code Available
FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-Design	Nov 3, 2025	Code Available
MicroRemed: Benchmarking LLMs in Microservices Remediation	Nov 3, 2025	Code Available
FEval-TTC: Fair Evaluation Protocol for Test-Time Compute	Nov 3, 2025	Code Available
Detecting Generated Images by Fitting Natural Image Distributions	Nov 3, 2025	Code Available
Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism	Nov 3, 2025	Code Available
Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point Clouds	Nov 2, 2025	Code Available
Video Models Start to Solve Chess, Maze, Sudoku, Mental Rotation, and Raven' Matrices	Nov 2, 2025	Code Available