The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 659983 papers

Title	Date	Tasks	Status	Hype
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning	Oct 14, 2023	Image ClassificationImage Description	CodeCode Available	7
AudioLM: a Language Modeling Approach to Audio Generation	Sep 7, 2022	Audio Generation	CodeCode Available	7
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model	Mar 31, 2025		CodeCode Available	7
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models	Mar 27, 2024	Image ClassificationImage Comprehension	CodeCode Available	7
Goku: Flow Based Video Generative Foundation Models	Feb 7, 2025	Image GenerationText to Image Generation	CodeCode Available	7
NVILA: Efficient Frontier Visual Language Models	Dec 5, 2024	Video Question Answering	CodeCode Available	7
OpenVoice: Versatile Instant Voice Cloning	Dec 3, 2023	RhythmVoice Cloning	CodeCode Available	7
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile	Feb 10, 2025	Video Generation	CodeCode Available	7
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and Orchestration	Apr 24, 2024	ManagementPrompt Engineering	CodeCode Available	7
Byte Latent Transformer: Patches Scale Better Than Tokens	Dec 13, 2024		CodeCode Available	7
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation	Nov 15, 2024	Audio-Driven Body AnimationHuman Animation	CodeCode Available	7
OmniGen2: Exploration to Advanced Multimodal Generation	Jun 23, 2025	Image Generationmultimodal generation	CodeCode Available	7
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation	Feb 7, 2024		CodeCode Available	7
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance	Mar 21, 2024	Animated GIF GenerationImage Animation	CodeCode Available	7
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning	Jul 1, 2025	document understandingMultimodal Reasoning	CodeCode Available	7
Gravity-aligned Rotation Averaging with Circular Regression	Oct 16, 2024	Mixed Realityregression	CodeCode Available	7
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!	Feb 11, 2025	Large Language ModelMath	CodeCode Available	7
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning	Mar 12, 2025	Question AnsweringRAG	CodeCode Available	7
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer	May 28, 2025	Image GenerationMixture-of-Experts	CodeCode Available	7
LLM Post-Training: A Deep Dive into Reasoning Large Language Models	Feb 28, 2025		CodeCode Available	7
Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation	Oct 10, 2024	4kImage Animation	CodeCode Available	7
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset	Sep 21, 2023	ChatbotDiversity	CodeCode Available	7
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy	Mar 21, 2024	Contrastive LearningDescriptive	CodeCode Available	7
HuixiangDou2: A Robustly Optimized GraphRAG Approach	Mar 9, 2025	RetrievalRetrieval-augmented Generation	CodeCode Available	7
MaskSketch: Unpaired Structure-guided Masked Image Generation	Feb 10, 2023	Conditional Image GenerationDiversity	CodeCode Available	7
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models	Jan 29, 2024	HallucinationMixture-of-Experts	CodeCode Available	7
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction	Feb 17, 2025	Instruction FollowingVoice Cloning	CodeCode Available	7
Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training	May 23, 2024	GSM8KMixture-of-Experts	CodeCode Available	7
Step1X-Edit: A Practical Framework for General Image Editing	Apr 24, 2025	Image Editing	CodeCode Available	7
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step	Nov 15, 2024	Logical ReasoningMultimodal Reasoning	CodeCode Available	7
Zero-shot Voice Conversion with Diffusion Transformers	Nov 15, 2024	In-Context LearningVoice Conversion	CodeCode Available	7
xLSTM: Extended Long Short-Term Memory	May 7, 2024	Language ModelingLanguage Modelling	CodeCode Available	7
Full Scaling Automation for Sustainable Development of Green Data Centers	May 1, 2023	Cloud ComputingCPU	CodeCode Available	7
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds	Mar 13, 2025	3D Human Reconstruction	CodeCode Available	7
LLaMA: Open and Efficient Foundation Language Models	Feb 27, 2023	Arithmetic ReasoningCode Generation	CodeCode Available	7
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization	Nov 17, 2024	Image GenerationQuantization	CodeCode Available	7
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers	Jan 21, 2024	Image Generation	CodeCode Available	7
Transparent Image Layer Diffusion using Latent Transparency	Feb 27, 2024		CodeCode Available	7
InternVideo2: Scaling Foundation Models for Multimodal Video Understanding	Mar 22, 2024	Action ClassificationAction Recognition	CodeCode Available	7
AutoRAG: Automated Framework for optimization of Retrieval Augmented Generation Pipeline	Oct 28, 2024	RAGRetrieval	CodeCode Available	7
Robust Inverse Graphics via Probabilistic Inference	Feb 2, 2024	NeRF	CodeCode Available	7
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback	Jun 13, 2024	Instruction FollowingMath	CodeCode Available	7
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets	Jun 17, 2025	Language ModelingLanguage Modelling	CodeCode Available	7
Direct Preference Optimization: Your Language Model is Secretly a Reward Model	May 29, 2023	Language ModelingLanguage Modelling	CodeCode Available	6
Vision Transformers Need Registers	Sep 28, 2023	Object DiscoverySelf-Supervised Image Classification	CodeCode Available	6
iTransformer: Inverted Transformers Are Effective for Time Series Forecasting	Oct 10, 2023	Time SeriesTime Series Forecasting	CodeCode Available	6
L-Eval: Instituting Standardized Evaluation for Long Context Language Models	Jul 20, 2023	Instruction Following	CodeCode Available	6
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone	Oct 30, 2023	Disentanglement	CodeCode Available	6
RWKV: Reinventing RNNs for the Transformer Era	May 22, 2023	Computational EfficiencyNatural Language Inference	CodeCode Available	6
A Watermark for Large Language Models	Jan 24, 2023	Language ModelingLanguage Modelling	CodeCode Available	6