The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1951–1975 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models	Apr 19, 2022	FairnessFew-Shot Image Classification	CodeCode Available	4	5
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer	Apr 15, 2025	Image Animation	CodeCode Available	4	5
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving	Feb 22, 2023	Deep Learning	CodeCode Available	4	5
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback	Apr 11, 2024	SSIM	CodeCode Available	4	5
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks	May 18, 2023	DecoderLanguage Modeling	CodeCode Available	4	5
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert	Apr 18, 2023	Audio GenerationExpressive Speech Synthesis	CodeCode Available	4	5
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step	Feb 25, 2024	Code GenerationHumanEval	CodeCode Available	4	5
A New Formulation of Lipschitz Constrained With Functional Gradient Learning for GANs	Jan 20, 2025	DiversityImage Generation	CodeCode Available	4	5
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves	Nov 7, 2023		CodeCode Available	4	5
AnyDoor: Zero-shot Object-level Image Customization	Jul 18, 2023	ObjectVirtual Try-on	CodeCode Available	4	5
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS	Aug 2, 2024	GPUNavigate	CodeCode Available	4	5
S^3Gaussian: Self-Supervised Street Gaussians for Autonomous Driving	May 30, 2024	3DGS3D Reconstruction	CodeCode Available	4	5
Tracking Everything Everywhere All at Once	Jun 8, 2023	AllMotion Estimation	CodeCode Available	4	5
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion	Jan 27, 2023	GPUImage Generation	CodeCode Available	4	5
sbi reloaded: a toolkit for simulation-based inference workflows	Nov 26, 2024	Bayesian InferenceDiagnostic	CodeCode Available	4	5
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations	May 23, 2023	Diversity	CodeCode Available	4	5
Zero-shot forecasting of chaotic systems	Sep 24, 2024	AttributeIn-Context Learning	CodeCode Available	4	5
One Embedder, Any Task: Instruction-Finetuned Text Embeddings	Dec 19, 2022	Information RetrievalLearning Word Embeddings	CodeCode Available	4	5
JetMoE: Reaching Llama2 Performance with 0.1M Dollars	Apr 11, 2024	GPUMixture-of-Experts	CodeCode Available	4	5
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models	Jan 31, 2025	Caption GenerationLanguage Modeling	CodeCode Available	4	5
A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective	May 8, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	4	5
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models	Feb 16, 2023	Image GenerationStyle Transfer	CodeCode Available	4	5
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding	Jun 5, 2023	Language ModelingLanguage Modelling	CodeCode Available	4	5
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators	Mar 23, 2023	Image GenerationText-to-Video Generation	CodeCode Available	4	5
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal	Feb 6, 2024	Red Teaming	CodeCode Available	4	5