The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers247,172 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 226–250 of 658356 papers

Title	Date	Tasks	Status	Hype
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis	Jun 2, 2022	Document Layout AnalysisObject Detection	CodeCode Available	8
Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models	Oct 18, 2022	Language ModellingSentence	CodeCode Available	8
Qwen3-ASR Technical Report	Jan 30, 2026		—Unverified	7
SAM 3D Body: Robust Full-Body Human Mesh Recovery	Feb 17, 2026		—Unverified	7
Attention Residuals	Mar 16, 2026		—Unverified	7
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning	Mar 12, 2026		—Unverified	7
dLLM: Simple Diffusion Language Modeling	Feb 26, 2026		—Unverified	7
Pretraining Large Language Models with NVFP4	Mar 4, 2026		—Unverified	7
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning	Feb 26, 2026		—Unverified	7
Advancing Open-source World Models	Jan 28, 2026		—Unverified	7
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem	Mar 12, 2026		—Unverified	7
Transparent Image Layer Diffusion using Latent Transparency	Feb 27, 2024		CodeCode Available	7
One-Step Image Translation with Text-to-Image Models	Mar 18, 2024	DenoisingTranslation	CodeCode Available	7
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models	Apr 20, 2023	Image DescriptionLanguage Modelling	CodeCode Available	7
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration	Oct 3, 2024	Image GenerationQuantization	CodeCode Available	7
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets	Jun 17, 2025	Language ModelingLanguage Modelling	CodeCode Available	7
Robust Inverse Graphics via Probabilistic Inference	Feb 2, 2024	NeRF	CodeCode Available	7
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding	Mar 22, 2024	Action ClassificationAction Recognition	CodeCode Available	7
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation	Jan 20, 2025	Language ModelingLanguage Modelling	CodeCode Available	7
HealthBench: Evaluating Large Language Models Towards Improved Human Health	May 13, 2025	Instruction FollowingMultiple-choice	CodeCode Available	7
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models	Oct 12, 2023	Language ModellingLarge Language Model	CodeCode Available	7
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers	Jan 21, 2024	Image Generation	CodeCode Available	7
OmniGen: Unified Image Generation	Sep 17, 2024	Edge DetectionImage Generation	CodeCode Available	7
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds	Mar 13, 2025	3D Human Reconstruction	CodeCode Available	7
FourierKAN outperforms MLP on Text Classification Head Fine-tuning	Aug 16, 2024	ClassificationKolmogorov-Arnold Networks	CodeCode Available	7