The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Recently Added | Most Hyped | Most Active | Needs Verification | Most Verified

Showing 9351–9375 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
ExpeL: LLM Agents Are Experiential Learners	Aug 20, 2023	Decision Making, Transfer Learning	Code Available	2	5
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words	Jun 19, 2024	Dialogue Understanding	Code Available	2	5
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind	Aug 22, 2024		Code Available	2	5
Retrieval-Augmented Diffusion Models for Time Series Forecasting	Oct 24, 2024	Denoising, Retrieval	Code Available	2	5
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient	Nov 26, 2024	GPU, Image Generation	Code Available	2	5
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection	Oct 5, 2022	3D Object Detection, object-detection	Code Available	2	5
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation	Jun 24, 2021	MuJoCo, OpenAI Gym	Code Available	2	5
SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery	Jun 26, 2024	Domain Adaptation, Earth Observation	Code Available	2	5
Machine learning interatomic potential can infer electrical response	Apr 7, 2025		Code Available	2	5
HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization	Jun 9, 2025	Combinatorial Optimization, Memorization	Code Available	2	5
Fully Sparse 3D Occupancy Prediction	Dec 28, 2023	Autonomous Driving, Prediction	Code Available	2	5
SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity Recognition	Oct 14, 2024	Activity Recognition, Descriptive	Code Available	2	5
MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration	Jan 25, 2024	Computed Tomography (CT), Image Registration	Code Available	2	5
Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction	Sep 1, 2021	3D Reconstruction, Neural Rendering	Code Available	2	5
Human Pose as Compositional Tokens	Mar 21, 2023	Decoder, Pose Estimation	Code Available	2	5
Dense Distinct Query for End-to-End Object Detection	Mar 22, 2023	Object, object-detection	Code Available	2	5
Deduplicating Training Data Makes Language Models Better	Jul 14, 2021	Language Modeling, Language Modelling	Code Available	2	5
Approximate Convex Decomposition for 3D Meshes with Collision-Aware Concavity and Tree Search	May 5, 2022		Code Available	2	5
Autonomous GIS: the next-generation AI-powered GIS	May 10, 2023	Code Generation, Information Retrieval	Code Available	2	5
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning	Jun 2, 2025	Math, Mathematical Reasoning	Code Available	2	5
DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data	Jun 6, 2024	3D Generation, Text to 3D	Code Available	2	5
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation	Sep 19, 2024	Vision-Language-Action	Code Available	2	5
Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic Response	Nov 10, 2024	Decision Making, Graph Neural Network	Code Available	2	5
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis	Mar 20, 2025	Document Layout Analysis, Document Summarization	Code Available	2	5
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching	Jun 20, 2023	Brain Tumor Classification, Contrastive Learning	Code Available	2	5