The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3001–3050 of 659983 papers

Title	Date	Tasks	Status	Hype
RobustSAM: Segment Anything Robustly on Degraded Images	Jun 13, 2024	DeblurringImage Dehazing	CodeCode Available	3
Centaur: a foundation model of human cognition	Oct 26, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics	Feb 9, 2024		CodeCode Available	3
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation	Apr 3, 2025	Image GenerationWorld Knowledge	CodeCode Available	3
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors	Jun 3, 2024		CodeCode Available	3
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks	Feb 6, 2024	In-Context LearningLanguage Modeling	CodeCode Available	3
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation	Jun 11, 2024	DecoderKnowledge Distillation	CodeCode Available	3
Model-based Asynchronous Hyperparameter and Neural Architecture Search	Mar 24, 2020	AutoMLBayesian Optimization	CodeCode Available	3
ContextCite: Attributing Model Generation to Context	Sep 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials Science	May 23, 2023		CodeCode Available	3
Language Model Inversion	Nov 22, 2023	Language ModelingLanguage Modelling	CodeCode Available	3
Evalverse: Unified and Accessible Library for Large Language Model Evaluation	Apr 1, 2024	Language Model EvaluationLanguage Modeling	CodeCode Available	3
DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping	Mar 20, 2024	Optical Flow EstimationSensor Fusion	CodeCode Available	3
GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion	Aug 22, 2024	Computational Efficiency	CodeCode Available	3
Improved motif-scaffolding with SE(3) flow matching	Jan 8, 2024	Data AugmentationDiversity	CodeCode Available	3
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA	Apr 1, 2024	GPUMultiobjective Optimization	CodeCode Available	3
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression	Mar 12, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
OmniPred: Language Models as Universal Regressors	Feb 22, 2024	Experimental Designregression	CodeCode Available	3
Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-Identification	Feb 23, 2023	Multi-Object TrackingObject	CodeCode Available	3
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution	Feb 2, 2024	Combinatorial OptimizationEvolutionary Algorithms	CodeCode Available	3
ADBench: Anomaly Detection Benchmark	Jun 19, 2022	Anomaly DetectionOutlier Detection	CodeCode Available	3
94% on CIFAR-10 in 3.29 Seconds on a Single GPU	Mar 30, 2024	GPU	CodeCode Available	3
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction	Aug 10, 2023	Autonomous DrivingOnline Vectorized HD Map Construction	CodeCode Available	3
On the Trajectory Regularity of ODE-based Diffusion Sampling	May 18, 2024	DenoisingImage Generation	CodeCode Available	3
Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting	Jan 28, 2025	SpecificityTime Series	CodeCode Available	3
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models	May 22, 2025	BenchmarkingInstruction Following	CodeCode Available	3
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis	Nov 29, 2023	NeRFTalking Face Generation	CodeCode Available	3
Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey	Jul 11, 2024	Deep LearningImage Restoration	CodeCode Available	3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model	Jan 21, 2025	Image GenerationInstruction Following	CodeCode Available	3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning	Oct 10, 2024	3D Parameter-Efficient Fine-Tuning for Classification3D Point Cloud Classification	CodeCode Available	3
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting	Nov 24, 2023	NeRF	CodeCode Available	3
GraphStorm: all-in-one graph machine learning framework for industry applications	Jun 10, 2024	Allgraph construction	CodeCode Available	3
TokenPacker: Efficient Visual Projector for Multimodal LLM	Jul 2, 2024	Language ModellingLarge Language Model	CodeCode Available	3
WeatherMesh-3: Fast and accurate operational global weather forecasting	Mar 28, 2025	Computational EfficiencyGPU	CodeCode Available	3
NdLinear Is All You Need for Representation Learning	Mar 21, 2025	AllRepresentation Learning	CodeCode Available	3
Bake off redux: a review and experimental evaluation of recent time series classification algorithms	Apr 25, 2023	Dynamic Time WarpingTime Series	CodeCode Available	3
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation	Apr 5, 2025		CodeCode Available	3
CameraHMR: Aligning People with Perspective	Nov 12, 2024	3D human pose and shape estimation	CodeCode Available	3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge	Jul 6, 2025	Image GenerationMultimodal Reasoning	CodeCode Available	3
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Jan 16, 2025	Depth EstimationDisparity Estimation	CodeCode Available	3
Rainbow: Combining Improvements in Deep Reinforcement Learning	Oct 6, 2017	Atari GamesDeep Reinforcement Learning	CodeCode Available	3
Mambular: A Sequential Model for Tabular Deep Learning	Aug 12, 2024	Deep LearningMamba	CodeCode Available	3
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization	Jun 6, 2024	DenoisingImage Generation	CodeCode Available	3
WHAC: World-grounded Humans and Cameras	Mar 19, 2024	Camera Pose EstimationPose Estimation	CodeCode Available	3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations	Feb 19, 2024	Card GamesLogical Reasoning	CodeCode Available	3
Generative AI Act II: Test Time Scaling Drives Cognition Engineering	Apr 18, 2025	Prompt Engineering	CodeCode Available	3
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models	Oct 25, 2024		CodeCode Available	3
Cognify: Supercharging Gen-AI Workflows With Hierarchical Autotuning	Feb 12, 2025	RAGText to SQL	CodeCode Available	3
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI	Jan 25, 2024		CodeCode Available	3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents	Oct 7, 2024	Natural Language Visual GroundingNavigate	CodeCode Available	3