The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 474278 papers

Title	Date	Tasks	Status	Hype
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training	May 23, 2025	Automatic Speech RecognitionEmotion Recognition	CodeCode Available	11
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer	Aug 12, 2024	Text-to-Video GenerationVideo Alignment	CodeCode Available	11
Eliza: A Web3 friendly AI Agent Operating System	Jan 12, 2025	AI AgentRAG	CodeCode Available	11
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation	Oct 17, 2024	Visual Question Answering	CodeCode Available	11
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning	Aug 10, 2024	HallucinationOptical Character Recognition	CodeCode Available	11
LangGPT: Rethinking Structured Reusable Prompt Design Framework for LLMs from the Programming Language	Feb 26, 2024	Prompt Engineering	CodeCode Available	11
Pixtral 12B	Oct 9, 2024	Language ModelingLanguage Modelling	CodeCode Available	11
Structured 3D Latents for Scalable and Versatile 3D Generation	Dec 2, 2024	3D Generation	CodeCode Available	11
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness	May 27, 2024	HallucinationImage Captioning	CodeCode Available	11
Qwen2.5-VL Technical Report	Feb 19, 2025	document understanding	CodeCode Available	11
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model	Nov 26, 2024		CodeCode Available	11
Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models	Apr 16, 2024	Data InteractionText to SQL	CodeCode Available	11
ROMAS: A Role-Based Multi-Agent System for Database monitoring and Planning	Dec 18, 2024		CodeCode Available	11
Agent S: An Open Agentic Framework that Uses Computers Like a Human	Oct 10, 2024	AI AgentTask Planning	CodeCode Available	11
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery	Aug 12, 2024	Language ModelingLanguage Modelling	CodeCode Available	11
WebLLM: A High-Performance In-Browser LLM Inference Engine	Dec 20, 2024	CPUGPU	CodeCode Available	11
Deep Time Series Models: A Comprehensive Survey and Benchmark	Jul 18, 2024	SurveyTime Series	CodeCode Available	11
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling	Jan 29, 2025	Image Generation	CodeCode Available	11
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control	Jul 3, 2024	Computational EfficiencyFace Reenactment	CodeCode Available	11
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation	May 29, 2025	Large Language Model	CodeCode Available	11
Wan: Open and Advanced Large-Scale Video Generative Models	Mar 26, 2025	Video EditingVideo Generation	CodeCode Available	11
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation	Jan 21, 2025	Texture Synthesis	CodeCode Available	11
SCORE: Systematic COnsistency and Robustness Evaluation for Large Language Models	Feb 28, 2025	MMLU	CodeCode Available	11
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Mar 5, 2025	HallucinationInstruction Following	CodeCode Available	11
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models	Dec 13, 2024	In-Context LearningQuantization	CodeCode Available	11