The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers


Showing 1001–1025 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model	Feb 11, 2024		Code Available	5	5
Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions	Jan 7, 2024	Benchmarking, Image Segmentation	Code Available	5	5
aeon: a Python toolkit for learning from time series	Jun 20, 2024	Anomaly Detection, Model Selection	Code Available	5	5
Controllable Generation with Text-to-Image Diffusion Models: A Survey	Mar 7, 2024	Denoising	Code Available	5	5
Datasets for Large Language Models: A Comprehensive Survey	Feb 28, 2024	Language Modelling, Large Language Model	Code Available	5	5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding	Jan 15, 2024	Language Modeling, Language Modelling	Code Available	5	5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis	Jan 16, 2024	3D Reconstruction, Face Generation	Code Available	5	5
Make Your LLM Fully Utilize the Context	Apr 25, 2024	4k, Information Retrieval	Code Available	5	5
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training	May 23, 2023	Contrastive Learning, Self-Supervised Learning	Code Available	5	5
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Sep 3, 2024	Depth Estimation, Diversity	Code Available	5	5
ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills	Feb 3, 2025		Code Available	5	5
MambaIRv2: Attentive State Space Restoration	Nov 22, 2024	Computational Efficiency, Image Restoration	Code Available	5	5
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue	Feb 8, 2024	Conversational Web Navigation, Text Generation	Code Available	5	5
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models	Nov 20, 2024	Benchmarking, Image Generation	Code Available	5	5
Trust Regions for Explanations via Black-Box Probabilistic Certification	Feb 17, 2024		Code Available	5	5
MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments	Feb 1, 2024	Embodied Question Answering, Language Modeling	Code Available	5	5
EasyPhoto: Your Smart AI Photo Generator	Oct 7, 2023		Code Available	5	5
Language Agents as Optimizable Graphs	Feb 26, 2024	Prompt Engineering	Code Available	5	5
Data-Juicer: A One-Stop Data Processing System for Large Language Models	Sep 5, 2023	Distributed Computing	Code Available	5	5
Training Large Language Models to Reason in a Continuous Latent Space	Dec 9, 2024	Logical Reasoning	Code Available	5	5
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception	Jun 21, 2025	Computational Efficiency, object-detection	Code Available	5	5
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications	Sep 7, 2022	GPU, Object Detection	Code Available	5	5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification	Oct 14, 2024	Image Generation	Code Available	5	5
OminiControl2: Efficient Conditioning for Diffusion Transformers	Mar 11, 2025	Conditional Image Generation, Denoising	Code Available	5	5
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B	Jun 11, 2024	Decision Making, GSM8K	Code Available	5	5