The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2251–2275 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation	Mar 15, 2023	Code GenerationDenoising	CodeCode Available	4	5
LLM Inference Unveiled: Survey and Roofline Model Insights	Feb 26, 2024	Knowledge DistillationLanguage Modelling	CodeCode Available	4	5
Multimodal Whole Slide Foundation Model for Pathology	Nov 29, 2024	Cross-Modal Retrievalmodel	CodeCode Available	4	5
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch	Oct 27, 2023	Self-Supervised LearningSpeech Enhancement	CodeCode Available	4	5
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing	Jun 12, 2024		CodeCode Available	4	5
MonSter: Marry Monodepth to Stereo Unleashes Power	Jan 15, 2025	Depth EstimationMonocular Depth Estimation	CodeCode Available	4	5
Large Models for Time Series and Spatio-Temporal Data: A Survey and Outlook	Oct 16, 2023	Time SeriesTime Series Analysis	CodeCode Available	4	5
Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference	Mar 8, 2023	Hyperparameter OptimizationLanguage Modeling	CodeCode Available	4	5
Efficient Post-training Quantization with FP8 Formats	Sep 26, 2023	image-classificationImage Classification	CodeCode Available	4	5
Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournaments	Jun 24, 2024	Benchmarking	CodeCode Available	4	5
Transformers in Time Series: A Survey	Feb 15, 2022	Anomaly DetectionSurvey	CodeCode Available	4	5
RaTEScore: A Metric for Radiology Report Generation	Jun 24, 2024	DiagnosticEntity Embeddings	CodeCode Available	4	5
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching	Jul 12, 2025	Dialogue Generationtext-to-speech	CodeCode Available	4	5
Atom of Thoughts for Markov LLM Test-Time Scaling	Feb 17, 2025		CodeCode Available	4	5
Mixtral of Experts	Jan 8, 2024	Code GenerationCommon Sense Reasoning	CodeCode Available	4	5
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy	Feb 8, 2025	Q-LearningSafe Exploration	CodeCode Available	3	5
KwaiAgents: Generalized Information-seeking Agent System with Large Language Models	Dec 8, 2023		CodeCode Available	3	5
FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation	Jun 14, 2025	Language ModelingLanguage Modelling	CodeCode Available	3	5
How Far Are We From AGI: Are LLMs All We Need?	May 16, 2024	All	CodeCode Available	3	5
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework	Mar 25, 2024	Denoising	CodeCode Available	3	5
Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting	Mar 15, 2024	3D GenerationImage to 3D	CodeCode Available	3	5
TKAN: Temporal Kolmogorov-Arnold Networks	May 12, 2024	Kolmogorov-Arnold NetworksManagement	CodeCode Available	3	5
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition	Oct 9, 2023	Code GenerationInstruction Following	CodeCode Available	3	5
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?	Mar 10, 2024	Depth EstimationImage Matting	CodeCode Available	3	5
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation	Jan 24, 2025	Autonomous DrivingLanguage Modeling	CodeCode Available	3	5