The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10726–10750 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps	Feb 1, 2025	Autonomous Drivingmotion prediction	CodeCode Available	2	5
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering	Feb 5, 2025	Hallucination	CodeCode Available	2	5
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL	Feb 17, 2025	Few-Shot LearningHeuristic Search	CodeCode Available	2	5
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling	Feb 21, 2025	Autonomous DrivingImitation Learning	CodeCode Available	2	5
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection	Apr 9, 2025	Contrastive Learningcounterfactual	CodeCode Available	2	5
A Survey on Industrial Anomalies Synthesis	Feb 23, 2025	Survey	CodeCode Available	2	5
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think	Feb 27, 2025	Image GenerationText to Image Generation	CodeCode Available	2	5
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video	Feb 27, 2025	3DGSTalking Head Generation	CodeCode Available	2	5
AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning	Mar 3, 2025	Image Super-ResolutionSuper-Resolution	CodeCode Available	2	5
WritingBench: A Comprehensive Benchmark for Generative Writing	Mar 7, 2025	Text Generation	CodeCode Available	2	5
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing	Mar 18, 2025	DenoisingMotion Generation	CodeCode Available	2	5
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling	Jan 20, 2025	Imitation LearningLanguage Modeling	CodeCode Available	2	5
MegaMath: Pushing the Limits of Open Math Corpora	Apr 3, 2025	DiversityMath	CodeCode Available	2	5
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction	Apr 8, 2025	3D ReconstructionDepth Estimation	CodeCode Available	2	5
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation	May 16, 2025	3D geometryNavigate	CodeCode Available	2	5
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning	May 16, 2025	Data Augmentation	CodeCode Available	2	5
μPC: Scaling Predictive Coding to 100+ Layer Networks	May 19, 2025		CodeCode Available	2	5
VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank	May 20, 2025	Image GenerationImage Quality Assessment	CodeCode Available	2	5
CSTrack: Enhancing RGB-X Tracking via Compact Spatiotemporal Features	May 26, 2025		CodeCode Available	2	5
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction	May 27, 2025	Image Generation	CodeCode Available	2	5
Play to Generalize: Learning to Reason Through Game Play	Jun 9, 2025	Domain GeneralizationMath	CodeCode Available	2	5
ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark	Jun 12, 2025		CodeCode Available	2	5
Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction	Jun 26, 2025	Point cloud reconstruction	CodeCode Available	2	5
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster	Jun 22, 2025	DecoderImage Segmentation	CodeCode Available	2	5
LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation	Jun 16, 2025	Collaborative FilteringSequential Recommendation	CodeCode Available	2	5