The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1001–1050 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions	Jan 7, 2024	BenchmarkingImage Segmentation	CodeCode Available	5	5
aeon: a Python toolkit for learning from time series	Jun 20, 2024	Anomaly DetectionModel Selection	CodeCode Available	5	5
Controllable Generation with Text-to-Image Diffusion Models: A Survey	Mar 7, 2024	Denoising	CodeCode Available	5	5
Datasets for Large Language Models: A Comprehensive Survey	Feb 28, 2024	Language ModellingLarge Language Model	CodeCode Available	5	5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding	Jan 15, 2024	Language ModelingLanguage Modelling	CodeCode Available	5	5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis	Jan 16, 2024	3D ReconstructionFace Generation	CodeCode Available	5	5
Make Your LLM Fully Utilize the Context	Apr 25, 2024	4kInformation Retrieval	CodeCode Available	5	5
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training	May 23, 2023	Contrastive LearningSelf-Supervised Learning	CodeCode Available	5	5
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos	Sep 3, 2024	Depth EstimationDiversity	CodeCode Available	5	5
ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills	Feb 3, 2025		CodeCode Available	5	5
MambaIRv2: Attentive State Space Restoration	Nov 22, 2024	Computational EfficiencyImage Restoration	CodeCode Available	5	5
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue	Feb 8, 2024	Conversational Web NavigationText Generation	CodeCode Available	5	5
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models	Nov 20, 2024	BenchmarkingImage Generation	CodeCode Available	5	5
Trust Regions for Explanations via Black-Box Probabilistic Certification	Feb 17, 2024		CodeCode Available	5	5
MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments	Feb 1, 2024	Embodied Question AnsweringLanguage Modeling	CodeCode Available	5	5
EasyPhoto: Your Smart AI Photo Generator	Oct 7, 2023		CodeCode Available	5	5
Language Agents as Optimizable Graphs	Feb 26, 2024	Prompt Engineering	CodeCode Available	5	5
Data-Juicer: A One-Stop Data Processing System for Large Language Models	Sep 5, 2023	Distributed Computing	CodeCode Available	5	5
Training Large Language Models to Reason in a Continuous Latent Space	Dec 9, 2024	Logical Reasoning	CodeCode Available	5	5
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception	Jun 21, 2025	Computational Efficiencyobject-detection	CodeCode Available	5	5
YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications	Sep 7, 2022	GPUObject Detection	CodeCode Available	5	5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification	Oct 14, 2024	Image Generation	CodeCode Available	5	5
OminiControl2: Efficient Conditioning for Diffusion Transformers	Mar 11, 2025	Conditional Image GenerationDenoising	CodeCode Available	5	5
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B	Jun 11, 2024	Decision MakingGSM8K	CodeCode Available	5	5
Semantic Operators: A Declarative Model for Rich, AI-based Data Processing	Jul 16, 2024	Extreme Multi-Label ClassificationFact Checking	CodeCode Available	5	5
OMG-Seg: Is One Model Good Enough For All Segmentation?	Jan 18, 2024	AllDecoder	CodeCode Available	5	5
Ferret: Refer and Ground Anything Anywhere at Any Granularity	Oct 11, 2023	HallucinationLanguage Modeling	CodeCode Available	5	5
TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting	May 23, 2024	Future predictionTime Series	CodeCode Available	5	5
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI	Nov 27, 2023	Complex Query AnsweringLogical Reasoning	CodeCode Available	5	5
SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition	May 21, 2025		CodeCode Available	5	5
Masked Completion via Structured Diffusion with White-Box Transformers	Apr 3, 2024	Representation Learning	CodeCode Available	5	5
Inpaint Anything: Segment Anything Meets Image Inpainting	Apr 13, 2023	Image Inpainting	CodeCode Available	5	5
Extreme Compression of Large Language Models via Additive Quantization	Jan 11, 2024	CPUGPU	CodeCode Available	5	5
Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning	Jul 8, 2024		CodeCode Available	5	5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning	Feb 29, 2024	GPULanguage Modeling	CodeCode Available	5	5
CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling	May 26, 2024		CodeCode Available	5	5
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities	May 5, 2025	Image GenerationSurvey	CodeCode Available	5	5
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation	Apr 16, 2023	Instruction Following	CodeCode Available	5	5
MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model	Sep 4, 2024	Language ModelingLanguage Modelling	CodeCode Available	5	5
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search	Dec 24, 2024		CodeCode Available	5	5
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models	Jul 21, 2024	AllFashion Synthesis	CodeCode Available	5	5
Arbitrary-steps Image Super-resolution via Diffusion Inversion	Dec 12, 2024	Image Super-ResolutionSuper-Resolution	CodeCode Available	5	5
SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks	Apr 15, 2024	Quantization	CodeCode Available	5	5
SymbolicAI: A framework for logic-based approaches combining generative models and solvers	Feb 1, 2024	Few-Shot LearningIn-Context Learning	CodeCode Available	5	5
That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design	Nov 15, 2024	Deep Reinforcement Learning	CodeCode Available	5	5
GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation	Nov 27, 2024	Depth EstimationDiversity	CodeCode Available	5	5
Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction	May 31, 2024	Speech Synthesis	CodeCode Available	5	5
A quantum semantic framework for natural language processing	Jun 11, 2025		CodeCode Available	5	5
Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers	May 10, 2024		CodeCode Available	5	5
The Path To Autonomous Cyber Defense	Apr 12, 2024		CodeCode Available	5	5