The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4051–4075 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Do generative video models understand physical principles?	Jan 14, 2025	Video Generation	CodeCode Available	3	5
Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor Search	May 21, 2025	Information Retrieval	CodeCode Available	3	5
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos	Jun 23, 2022	Imitation LearningMinecraft	CodeCode Available	3	5
VISTA3D: Versatile Imaging SegmenTation and Annotation model for 3D Computed Tomography	Jun 7, 2024	Computed Tomography (CT)Image Segmentation	CodeCode Available	3	5
Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey	Dec 3, 2024	Change DetectionDescriptive	CodeCode Available	3	5
STG-Mamba: Spatial-Temporal Graph Learning via Selective State Space Model	Mar 19, 2024	Computational EfficiencyGraph Learning	CodeCode Available	3	5
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion	Nov 6, 2023	Segmentation	CodeCode Available	3	5
Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping	Feb 29, 2024	Image Generation	CodeCode Available	3	5
ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models	Feb 18, 2024	Language ModellingQuestion Answering	CodeCode Available	3	5
ACE2: Accurately learning subseasonal to decadal atmospheric variability and forced responses	Nov 18, 2024		CodeCode Available	3	5
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech	Sep 24, 2024	Audio Generation	CodeCode Available	3	5
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents	Mar 5, 2024	HallucinationSelf-Learning	CodeCode Available	3	5
Scaling Analysis of Interleaved Speech-Text Language Models	Apr 3, 2025	Transfer Learning	CodeCode Available	3	5
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities	Aug 1, 2024	MathMM-Vet	CodeCode Available	3	5
GPU-accelerated Evolutionary Many-objective Optimization Using Tensorized NSGA-III	Apr 8, 2025	Computational EfficiencyCPU	CodeCode Available	3	5
Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models	Jun 5, 2024	Data Integrationgraph construction	CodeCode Available	3	5
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark	Jun 11, 2024	Cross-corpusEmotion Recognition	CodeCode Available	3	5
A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant Frameworks	Jan 17, 2025	Survey	CodeCode Available	3	5
PyThaiNLP: Thai Natural Language Processing in Python	Dec 7, 2023		CodeCode Available	3	5
FRACTAL: An Ultra-Large-Scale Aerial Lidar Dataset for 3D Semantic Segmentation of Diverse Landscapes	May 7, 2024	3D Point Cloud Classification3D Semantic Segmentation	CodeCode Available	3	5
A Survey of Large Language Models in Medicine: Progress, Application, and Challenge	Nov 9, 2023		CodeCode Available	3	5
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation	Feb 16, 2023	Image GenerationText to Image Generation	CodeCode Available	3	5
Rule Based Rewards for Language Model Safety	Nov 2, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
Hyper-parameter tuning for text guided image editing	Jul 31, 2024	text-guided-image-editing	CodeCode Available	3	5
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph	Oct 3, 2024	Code Generation	CodeCode Available	3	5