The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers


Showing 451–475 of 177,339 papers

Title	Date	Tasks	Status	Hype	Score
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing	Oct 17, 2024	Attribute, Code Completion	Code Available	7	5
AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving	Jun 14, 2025		Code Available	7	5
MAGI-1: Autoregressive Video Generation at Scale	May 19, 2025	Video Generation	Code Available	7	5
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding	May 14, 2024	Image Generation, Language Modeling	Code Available	7	5
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development	Jun 5, 2025	Large Language Model	Code Available	7	5
Kimi-Audio Technical Report	Apr 25, 2025	Audio Question Answering, Question Answering	Code Available	7	5
Bilateral Reference for High-Resolution Dichotomous Image Segmentation	Jan 7, 2024	Camouflaged Object Segmentation, Dichotomous Image Segmentation	Code Available	7	5
EvoGP: A GPU-accelerated Framework for Tree-based Genetic Programming	Jan 21, 2025	Feature Engineering, GPU	Code Available	7	5
AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems	Mar 9, 2025		Code Available	7	5
StarCoder 2 and The Stack v2: The Next Generation	Feb 29, 2024	Code Completion, Code Generation	Code Available	7	5
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming	Aug 29, 2024	Speech Synthesis	Code Available	7	5
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems	Dec 12, 2024		Code Available	7	5
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers	Jan 5, 2023	In-Context Learning, Language Modeling	Code Available	7	5
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing	Oct 16, 2024		Code Available	7	5
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases	Feb 5, 2024	Prompt Engineering	Code Available	7	5
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance	Oct 3, 2022	Denoising, Diversity	Code Available	7	5
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture	May 29, 2024	Image Generation, Video Generation	Code Available	7	5
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters	May 26, 2025	Human Animation	Code Available	7	5
MagicQuill: An Intelligent Interactive Image Editing System	Nov 14, 2024	Language Modeling, Language Modelling	Code Available	7	5
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training	May 16, 2025		Code Available	7	5
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding	Jun 4, 2024		Code Available	7	5
Faster Video Diffusion with Trainable Sparse Attention	May 19, 2025		Code Available	7	5
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?	Oct 4, 2024	Data Visualization	Code Available	7	5
EasySpider: A No-Code Visual System for Crawling the Web	Apr 30, 2023	Data Integration, Marketing	Code Available	7	5
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving	Nov 27, 2024	Fairness, GPU	Code Available	7	5