The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3751–3800 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2	Aug 3, 2024	DiversitySegmentation	CodeCode Available	3	5
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors	Oct 13, 2024	Contrastive LearningMedical Image Analysis	CodeCode Available	3	5
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding	Jul 29, 2019	Chinese Named Entity RecognitionChinese Reading Comprehension	CodeCode Available	3	5
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection	Apr 8, 2024	Anomaly DetectionLanguage Modeling	CodeCode Available	3	5
SINERGYM -- A virtual testbed for building energy optimization with Reinforcement Learning	Dec 11, 2024	continuous-controlContinuous Control	CodeCode Available	3	5
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities	May 18, 2023	1 Image, 2*2 StitchiAction Classification	CodeCode Available	3	5
Video ReCap: Recursive Captioning of Hour-Long Videos	Feb 20, 2024	EgoSchemaVideo Captioning	CodeCode Available	3	5
Magnitude-aware Probabilistic Speaker Embeddings	Feb 28, 2022	Out-of-Distribution DetectionSpeaker Verification	CodeCode Available	3	5
RobustSAM: Segment Anything Robustly on Degraded Images	Jun 13, 2024	DeblurringImage Dehazing	CodeCode Available	3	5
Centaur: a foundation model of human cognition	Oct 26, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics	Feb 9, 2024		CodeCode Available	3	5
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation	Apr 3, 2025	Image GenerationWorld Knowledge	CodeCode Available	3	5
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors	Jun 3, 2024		CodeCode Available	3	5
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks	Feb 6, 2024	In-Context LearningLanguage Modeling	CodeCode Available	3	5
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation	Jun 11, 2024	DecoderKnowledge Distillation	CodeCode Available	3	5
Model-based Asynchronous Hyperparameter and Neural Architecture Search	Mar 24, 2020	AutoMLBayesian Optimization	CodeCode Available	3	5
ContextCite: Attributing Model Generation to Context	Sep 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials Science	May 23, 2023		CodeCode Available	3	5
Language Model Inversion	Nov 22, 2023	Language ModelingLanguage Modelling	CodeCode Available	3	5
Evalverse: Unified and Accessible Library for Large Language Model Evaluation	Apr 1, 2024	Language Model EvaluationLanguage Modeling	CodeCode Available	3	5
DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping	Mar 20, 2024	Optical Flow EstimationSensor Fusion	CodeCode Available	3	5
GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion	Aug 22, 2024	Computational Efficiency	CodeCode Available	3	5
Improved motif-scaffolding with SE(3) flow matching	Jan 8, 2024	Data AugmentationDiversity	CodeCode Available	3	5
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA	Apr 1, 2024	GPUMultiobjective Optimization	CodeCode Available	3	5
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression	Mar 12, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
OmniPred: Language Models as Universal Regressors	Feb 22, 2024	Experimental Designregression	CodeCode Available	3	5
Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-Identification	Feb 23, 2023	Multi-Object TrackingObject	CodeCode Available	3	5
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution	Feb 2, 2024	Combinatorial OptimizationEvolutionary Algorithms	CodeCode Available	3	5
ADBench: Anomaly Detection Benchmark	Jun 19, 2022	Anomaly DetectionOutlier Detection	CodeCode Available	3	5
94% on CIFAR-10 in 3.29 Seconds on a Single GPU	Mar 30, 2024	GPU	CodeCode Available	3	5
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction	Aug 10, 2023	Autonomous DrivingOnline Vectorized HD Map Construction	CodeCode Available	3	5
On the Trajectory Regularity of ODE-based Diffusion Sampling	May 18, 2024	DenoisingImage Generation	CodeCode Available	3	5
Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting	Jan 28, 2025	SpecificityTime Series	CodeCode Available	3	5
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models	May 22, 2025	BenchmarkingInstruction Following	CodeCode Available	3	5
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis	Nov 29, 2023	NeRFTalking Face Generation	CodeCode Available	3	5
Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey	Jul 11, 2024	Deep LearningImage Restoration	CodeCode Available	3	5
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model	Jan 21, 2025	Image GenerationInstruction Following	CodeCode Available	3	5
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning	Oct 10, 2024	3D Parameter-Efficient Fine-Tuning for Classification3D Point Cloud Classification	CodeCode Available	3	5
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting	Nov 24, 2023	NeRF	CodeCode Available	3	5
GraphStorm: all-in-one graph machine learning framework for industry applications	Jun 10, 2024	Allgraph construction	CodeCode Available	3	5
TokenPacker: Efficient Visual Projector for Multimodal LLM	Jul 2, 2024	Language ModellingLarge Language Model	CodeCode Available	3	5
WeatherMesh-3: Fast and accurate operational global weather forecasting	Mar 28, 2025	Computational EfficiencyGPU	CodeCode Available	3	5
NdLinear Is All You Need for Representation Learning	Mar 21, 2025	AllRepresentation Learning	CodeCode Available	3	5
Bake off redux: a review and experimental evaluation of recent time series classification algorithms	Apr 25, 2023	Dynamic Time WarpingTime Series	CodeCode Available	3	5
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation	Apr 5, 2025		CodeCode Available	3	5
CameraHMR: Aligning People with Perspective	Nov 12, 2024	3D human pose and shape estimation	CodeCode Available	3	5
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge	Jul 6, 2025	Image GenerationMultimodal Reasoning	CodeCode Available	3	5
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Jan 16, 2025	Depth EstimationDisparity Estimation	CodeCode Available	3	5
Rainbow: Combining Improvements in Deep Reinforcement Learning	Oct 6, 2017	Atari GamesDeep Reinforcement Learning	CodeCode Available	3	5
Mambular: A Sequential Model for Tabular Deep Learning	Aug 12, 2024	Deep LearningMamba	CodeCode Available	3	5