The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 676–700 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding	Jan 22, 2025	PhilosophyVideo Question Answering	CodeCode Available	5	5
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text	Mar 21, 2024	Text-to-Video GenerationVideo Generation	CodeCode Available	5	5
GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond	Mar 28, 2024	3DGSNovel View Synthesis	CodeCode Available	5	5
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection	Mar 9, 2023	DecoderObject Detection	CodeCode Available	5	5
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Jan 14, 2025	Optical Flow Estimation	CodeCode Available	5	5
TrustRAG: An Information Assistant with Retrieval Augmented Generation	Feb 19, 2025	Answer GenerationChunking	CodeCode Available	5	5
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving	Apr 25, 2024	Diversity	CodeCode Available	5	5
Parrot: Multilingual Visual Instruction Tuning	Jun 4, 2024	Mixture-of-Experts	CodeCode Available	5	5
Improved Differentially Private Regression via Gradient Boosting	Mar 6, 2023	regression	CodeCode Available	5	5
AIDE: AI-Driven Exploration in the Space of Code	Feb 18, 2025		CodeCode Available	5	5
WizardLM: Empowering Large Language Models to Follow Complex Instructions	Apr 24, 2023	Instruction Following	CodeCode Available	5	5
Ovis: Structural Embedding Alignment for Multimodal Large Language Model	May 31, 2024	Language ModelingMultimodal Large Language Model	CodeCode Available	5	5
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation	Oct 24, 2024	Image RestorationPrompt Learning	CodeCode Available	5	5
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond	Aug 24, 2023	Chart Question AnsweringFS-MEVQA	CodeCode Available	5	5
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning	Oct 24, 2023		CodeCode Available	5	5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models	May 2, 2024	Language ModelingLanguage Modelling	CodeCode Available	5	5
Assessing Language Model Deployment with Risk Cards	Mar 31, 2023	Language ModelingLanguage Modelling	CodeCode Available	5	5
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions	May 9, 2025	Robot ManipulationVision-Language-Action	CodeCode Available	5	5
SantaCoder: don't reach for the stars!	Jan 9, 2023	Code GenerationPII Redaction	CodeCode Available	5	5
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts	May 18, 2024	Mixture-of-ExpertsVisual Question Answering	CodeCode Available	5	5
Evolutionary Optimization of Model Merging Recipes	Mar 19, 2024	Evolutionary AlgorithmsMath	CodeCode Available	5	5
Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator	Mar 13, 2024		CodeCode Available	5	5
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models	Oct 23, 2024	Diversity	CodeCode Available	5	5
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients	Jul 11, 2024	Quantization	CodeCode Available	5	5
GraphCast: Learning skillful medium-range global weather forecasting	Dec 24, 2022	Decision MakingWeather Forecasting	CodeCode Available	5	5