The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8251–8300 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments	May 27, 2024	AI AgentBayesian Optimization	CodeCode Available	2	5
3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation	Sep 12, 2022	3D Face AnimationDisentanglement	CodeCode Available	2	5
ExpeL: LLM Agents Are Experiential Learners	Aug 20, 2023	Decision MakingTransfer Learning	CodeCode Available	2	5
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words	Jun 19, 2024	Dialogue Understanding	CodeCode Available	2	5
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind	Aug 22, 2024		CodeCode Available	2	5
Retrieval-Augmented Diffusion Models for Time Series Forecasting	Oct 24, 2024	DenoisingRetrieval	CodeCode Available	2	5
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient	Nov 26, 2024	GPUImage Generation	CodeCode Available	2	5
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection	Oct 5, 2022	3D Object Detectionobject-detection	CodeCode Available	2	5
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation	Jun 24, 2021	MuJoCoOpenAI Gym	CodeCode Available	2	5
SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery	Jun 26, 2024	Domain AdaptationEarth Observation	CodeCode Available	2	5
Machine learning interatomic potential can infer electrical response	Apr 7, 2025		CodeCode Available	2	5
HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization	Jun 9, 2025	Combinatorial OptimizationMemorization	CodeCode Available	2	5
Fully Sparse 3D Occupancy Prediction	Dec 28, 2023	Autonomous DrivingPrediction	CodeCode Available	2	5
SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity Recognition	Oct 14, 2024	Activity RecognitionDescriptive	CodeCode Available	2	5
MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration	Jan 25, 2024	Computed Tomography (CT)Image Registration	CodeCode Available	2	5
Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction	Sep 1, 2021	3D ReconstructionNeural Rendering	CodeCode Available	2	5
Human Pose as Compositional Tokens	Mar 21, 2023	DecoderPose Estimation	CodeCode Available	2	5
Dense Distinct Query for End-to-End Object Detection	Mar 22, 2023	Objectobject-detection	CodeCode Available	2	5
Deduplicating Training Data Makes Language Models Better	Jul 14, 2021	Language ModelingLanguage Modelling	CodeCode Available	2	5
Approximate Convex Decomposition for 3D Meshes with Collision-Aware Concavity and Tree Search	May 5, 2022		CodeCode Available	2	5
Autonomous GIS: the next-generation AI-powered GIS	May 10, 2023	Code GenerationInformation Retrieval	CodeCode Available	2	5
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning	Jun 2, 2025	MathMathematical Reasoning	CodeCode Available	2	5
DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data	Jun 6, 2024	3D GenerationText to 3D	CodeCode Available	2	5
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation	Sep 19, 2024	Vision-Language-Action	CodeCode Available	2	5
Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic Response	Nov 10, 2024	Decision MakingGraph Neural Network	CodeCode Available	2	5
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis	Mar 20, 2025	Document Layout AnalysisDocument Summarization	CodeCode Available	2	5
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching	Jun 20, 2023	Brain Tumor ClassificationContrastive Learning	CodeCode Available	2	5
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning	Jul 15, 2022	Autonomous DrivingBird's-Eye View Semantic Segmentation	CodeCode Available	2	5
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation	Mar 30, 2023	3D Human Pose EstimationClassification	CodeCode Available	2	5
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model	Dec 5, 2024	DeepFake DetectionFace Swapping	CodeCode Available	2	5
Bracketing Image Restoration and Enhancement with High-Low Frequency Decomposition	Apr 21, 2024	Image Restoration	CodeCode Available	2	5
LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation	Dec 28, 2023	Answer GenerationChatbot	CodeCode Available	2	5
Overview of the PromptCBLUE Shared Task in CHIP2023	Dec 29, 2023	In-Context Learning	CodeCode Available	2	5
DebugBench: Evaluating Debugging Capability of Large Language Models	Jan 9, 2024	Code Generation	CodeCode Available	2	5
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning	Dec 14, 2022	Multi-agent Reinforcement Learningreinforcement-learning	CodeCode Available	2	5
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs	Apr 22, 2024	Misinformation	CodeCode Available	2	5
PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation	Jan 15, 2024	Image SegmentationMedical Image Segmentation	CodeCode Available	2	5
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion	Feb 8, 2024	Computational EfficiencyMultimodal Reasoning	CodeCode Available	2	5
VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation	Dec 6, 2023	Language ModellingNavigate	CodeCode Available	2	5
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft	Jun 1, 2023	Decision MakingImage Generation	CodeCode Available	2	5
An Efficient and Mixed Heterogeneous Model for Image Restoration	Apr 15, 2025	Image RestorationMamba	CodeCode Available	2	5
Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Jun 11, 2024	Multiple-choiceSelection bias	CodeCode Available	2	5
DreamLIP: Language-Image Pre-training with Long Captions	Mar 25, 2024	Contrastive LearningImage-text Retrieval	CodeCode Available	2	5
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning	Mar 29, 2024	Continual LearningContinual Panoptic Segmentation	CodeCode Available	2	5
Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development	Feb 18, 2021	BIG-bench Machine LearningDrug Discovery	CodeCode Available	2	5
Unleashing the Power of Multi-Task Learning: A Comprehensive Survey Spanning Traditional, Deep, and Pretrained Foundation Model Eras	Apr 29, 2024	Multi-Task LearningPrognosis	CodeCode Available	2	5
2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection	Jun 15, 2023	Anomaly DetectionAnomaly Localization	CodeCode Available	2	5
TeCH: Text-guided Reconstruction of Lifelike Clothed Humans	Aug 16, 2023	DescriptiveQuestion Answering	CodeCode Available	2	5
BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models	Jun 17, 2025	BenchmarkingLanguage Modeling	CodeCode Available	2	5
LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking	Jan 14, 2025	Autonomous DrivingDecision Making	CodeCode Available	2	5