The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 176–200 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications	Mar 10, 2025	Continual LearningMeta-Learning	CodeCode Available	9	5
2 OLMo 2 Furious	Dec 31, 2024		CodeCode Available	9	5
LTX-Video: Realtime Video Latent Diffusion	Dec 30, 2024	DenoisingGPU	CodeCode Available	9	5
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models	Jan 17, 2024	Text-to-Video GenerationVideo Generation	CodeCode Available	9	5
s1: Simple test-time scaling	Jan 31, 2025	Language ModelingLanguage Modelling	CodeCode Available	9	5
FastVLM: Efficient Vision Encoding for Vision Language Models	Dec 17, 2024		CodeCode Available	9	5
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data	Jan 19, 2024	Data AugmentationDepth Estimation	CodeCode Available	9	5
Arcee's MergeKit: A Toolkit for Merging Large Language Models	Mar 20, 2024	Language ModelingLanguage Modelling	CodeCode Available	9	5
SkyServe: Serving AI Models across Regions and Clouds with Spot Instances	Nov 3, 2024		CodeCode Available	9	5
PP-FormulaNet: Bridging Accuracy and Efficiency in Advanced Formula Recognition	Mar 24, 2025		CodeCode Available	9	5
When Do We Not Need Larger Vision Models?	Mar 19, 2024	Depth Estimation	CodeCode Available	9	5
garak: A Framework for Security Probing Large Language Models	Jun 16, 2024	Red Teaming	CodeCode Available	9	5
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression	Mar 19, 2024	GSM8KLanguage Modelling	CodeCode Available	9	5
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment	Oct 12, 2024	Language ModellingPhilosophy	CodeCode Available	9	5
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence	Jun 17, 2024	16kLanguage Modeling	CodeCode Available	9	5
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory	Nov 18, 2024	Object TrackingVisual Object Tracking	CodeCode Available	9	5
InternLM2 Technical Report	Mar 26, 2024	4kLong-Context Understanding	CodeCode Available	9	5
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception	Oct 16, 2024	Document Layout Analysisdocument understanding	CodeCode Available	9	5
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction	Mar 21, 2025	CPUDocument Layout Analysis	CodeCode Available	9	5
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model	Apr 10, 2025	Language ModelingLanguage Modelling	CodeCode Available	9	5
UFO: A UI-Focused Agent for Windows OS Interaction	Feb 8, 2024	Navigate	CodeCode Available	9	5
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation	Mar 26, 2024	DiversityFace Reenactment	CodeCode Available	9	5
RULER: What's the Real Context Size of Your Long-Context Language Models?	Apr 9, 2024	Long-Context Understanding	CodeCode Available	9	5
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher	Jul 29, 2024	2D Semantic Segmentation task 1 (8 classes)graph construction	CodeCode Available	9	5
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation	Jan 27, 2025		CodeCode Available	9	5