The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Wan: Open and Advanced Large-Scale Video Generative Models	Mar 26, 2025	Video EditingVideo Generation	CodeCode Available	11	5
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation	Jan 21, 2025	Texture Synthesis	CodeCode Available	11	5
SCORE: Systematic COnsistency and Robustness Evaluation for Large Language Models	Feb 28, 2025	MMLU	CodeCode Available	11	5
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Mar 5, 2025	HallucinationInstruction Following	CodeCode Available	11	5
CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models	Dec 13, 2024	In-Context LearningQuantization	CodeCode Available	11	5
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs	Jul 4, 2024	Emotion RecognitionEvent Detection	CodeCode Available	11	5
Data Formulator 2: Iterative Creation of Data Visualizations, with AI Transforming Data Along the Way	Aug 28, 2024	Code GenerationNavigate	CodeCode Available	11	5
WebDancer: Towards Autonomous Information Seeking Agency	May 28, 2025		CodeCode Available	11	5
YOLOE: Real-Time Seeing Anything	Mar 10, 2025	10-shot image generation	CodeCode Available	11	5
VGGT: Visual Geometry Grounded Transformer	Mar 14, 2025	Depth EstimationNovel View Synthesis	CodeCode Available	11	5
Bridging The Gap between Low-rank and Orthogonal Adaptation via Householder Reflection Adaptation	May 24, 2024		CodeCode Available	11	5
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality	May 31, 2024	Language ModelingLanguage Modelling	CodeCode Available	11	5
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation	Apr 17, 2025		CodeCode Available	11	5
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens	Mar 3, 2025	Attributetext-to-speech	CodeCode Available	11	5
olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models	Feb 25, 2025	DiversityLanguage Modeling	CodeCode Available	11	5
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents	Apr 1, 2025	AI AgentTask Planning	CodeCode Available	11	5
TinyLlama: An Open-Source Small Language Model	Jan 4, 2024	Computational EfficiencyLanguage Modeling	CodeCode Available	11	5
Open-Sora Plan: Open-Source Large Video Generation Model	Nov 28, 2024	Video Generation	CodeCode Available	11	5
YOLOv10: Real-Time End-to-End Object Detection	May 23, 2024	2D Object DetectionData Augmentation	CodeCode Available	11	5
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis	Nov 29, 2024	DisentanglementMotion Generation	CodeCode Available	11	5
Very Large-Scale Multi-Agent Simulation in AgentScope	Jul 25, 2024		CodeCode Available	11	5
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens	Jul 7, 2024	Language ModellingLarge Language Model	CodeCode Available	11	5
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation	Nov 12, 2024	Language ModelingLanguage Modelling	CodeCode Available	11	5
WebSailor: Navigating Super-human Reasoning for Web Agent	Jul 3, 2025		CodeCode Available	11	5
Magika: AI-Powered Content-Type Detection	Sep 18, 2024	CPUMalware Analysis	CodeCode Available	11	5