The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 951–975 of 659983 papers

Title	Date	Tasks	Status	Hype
UQLM: A Python Package for Uncertainty Quantification in Large Language Models	Jul 8, 2025	HallucinationUncertainty Quantification	CodeCode Available	5
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese	Nov 2, 2022	Contrastive Learningimage-classification	CodeCode Available	5
ControlNeXt: Powerful and Efficient Control for Image and Video Generation	Aug 12, 2024	Video Generation	CodeCode Available	5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs	Feb 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	5
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation	Jan 12, 2025	RAGRetrieval	CodeCode Available	5
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More	Aug 8, 2024	Image SegmentationMedical Image Segmentation	CodeCode Available	5
WizardCoder: Empowering Code Large Language Models with Evol-Instruct	Jun 14, 2023	Code GenerationHumanEval	CodeCode Available	5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities	Feb 2, 2024	Acoustic Scene ClassificationAudio captioning	CodeCode Available	5
Long-term Forecasting with TiDE: Time-series Dense Encoder	Apr 17, 2023	Anomaly DetectionDecoder	CodeCode Available	5
From System 1 to System 2: A Survey of Reasoning Large Language Models	Feb 24, 2025	Logical Reasoning	CodeCode Available	5
Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions	Feb 10, 2025		CodeCode Available	5
Wonder3D: Single Image to 3D using Cross-Domain Diffusion	Oct 23, 2023	3D geometryImage to 3D	CodeCode Available	5
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model	Feb 6, 2024	AutoMLLanguage Modeling	CodeCode Available	5
MV-Adapter: Multi-view Consistent Image Generation Made Easy	Dec 4, 2024	3D GenerationImage Generation	CodeCode Available	5
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows	Feb 16, 2024	Synthetic Data Generation	CodeCode Available	5
DeepPhase: Periodic Autoencoders for Learning Motion Phase Manifolds	Jul 22, 2022	Motion Synthesis	CodeCode Available	5
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling	Aug 29, 2024	Language ModelingLanguage Modelling	CodeCode Available	5
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model	Jun 16, 2025	Large Language Modelmultimodal interaction	CodeCode Available	5
Understanding R1-Zero-Like Training: A Critical Perspective	Mar 26, 2025	Reinforcement Learning (RL)	CodeCode Available	5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations	Dec 10, 2024	AttributeBenchmarking	CodeCode Available	5
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification	May 22, 2025	2D Semantic SegmentationActivity Prediction	CodeCode Available	5
CogAgent: A Visual Language Model for GUI Agents	Dec 14, 2023	Language Modeling	CodeCode Available	5
Transformer-Squared: Self-adaptive LLMs	Jan 9, 2025		CodeCode Available	5
CogVLM: Visual Expert for Pretrained Language Models	Nov 6, 2023	1 Image, 2*2 StitchingFS-MEVQA	CodeCode Available	5
Aria: An Open Multimodal Native Mixture-of-Experts Model	Oct 8, 2024	Instruction FollowingMixture-of-Experts	CodeCode Available	5