The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers


Showing 1,101–1,150 of 659,983 papers

Title	Date	Tasks	Status	Hype
MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data	Jun 24, 2024	Data Augmentation, Optical Character Recognition (OCR)	Code Available	5
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention	Mar 28, 2023	Instruction Following, Language Modelling	Code Available	5
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds	Mar 29, 2024	3D Reconstruction, Novel View Synthesis	Code Available	5
AugLy: Data Augmentations for Robustness	Jan 17, 2022	Adversarial Robustness, Data Augmentation	Code Available	5
The Rise and Potential of Large Language Model Based Agents: A Survey	Sep 14, 2023	Language Modeling, Language Modelling	Code Available	5
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs	Jun 24, 2024	Representation Learning, Visual Grounding	Code Available	5
Feature Refinement to Improve High Resolution Image Inpainting	Jun 27, 2022	Image Inpainting, Vocal Bursts Intensity Prediction	Code Available	5
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection	Nov 23, 2024	Face Swapping, Synthetic Image Detection	Code Available	5
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion	Sep 19, 2024		Code Available	5
Tree of Thoughts: Deliberate Problem Solving with Large Language Models	May 17, 2023	Arithmetic Reasoning, Decision Making	Code Available	5
SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation	Jan 16, 2025	Benchmarking	Code Available	5
Monolith: Real Time Recommendation System With Collisionless Embedding Table	Sep 16, 2022		Code Available	5
Consistency Models	Mar 2, 2023	Colorization, Image Generation	Code Available	5
Process Reinforcement through Implicit Rewards	Feb 3, 2025	Math, Reinforcement Learning (RL)	Code Available	5
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU	Mar 13, 2023	CPU, GPU	Code Available	5
RealFusion: 360° Reconstruction of Any Object from a Single Image	Feb 21, 2023	3D Reconstruction, Object	Code Available	5
YOLOv6 v3.0: A Full-Scale Reloading	Jan 13, 2023	GPU, Object Detection	Code Available	5
Text-to-Image Rectified Flow as Plug-and-Play Priors	Jun 5, 2024	3D Generation, Text to 3D	Code Available	5
Agents: An Open-source Framework for Autonomous Language Agents	Sep 14, 2023		Code Available	5
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention	Apr 22, 2025	GPU	Code Available	5
ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models	Jun 21, 2024		Code Available	5
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning	Jun 23, 2025	Reinforcement Learning (RL), Text Generation	Code Available	5
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models	Oct 9, 2023	GSM8K, In-Context Learning	Code Available	5
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model	Jun 28, 2023	Hallucination, Knowledge Graphs	Code Available	5
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation	Dec 18, 2024	3D Reconstruction, 4k	Code Available	5
OPT: Open Pre-trained Transformer Language Models	May 2, 2022	Decoder, Hate Speech Detection	Code Available	5
Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer	Oct 10, 2024		Code Available	5
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X	Mar 30, 2023	Benchmarking, Code Generation	Code Available	5
Deep Confident Steps to New Pockets: Strategies for Docking Generalization	Feb 28, 2024	Blind Docking	Code Available	5
Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRI	Oct 11, 2024	Uncertainty Quantification	Code Available	5
skfolio: Portfolio Optimization in Python	Jul 5, 2025	Management, Portfolio Optimization	Code Available	5
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG	Jan 15, 2025	Natural Language Understanding, RAG	Code Available	5
Instruction-Following Evaluation for Large Language Models	Nov 14, 2023	Instruction Following	Code Available	5
ShowUI: One Vision-Language-Action Model for GUI Visual Agent	Nov 26, 2024	Instruction Following, Natural Language Visual Grounding	Code Available	5
NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results	Apr 22, 2024	4k, Image Enhancement	Code Available	5
SpatialTracker: Tracking Any 2D Pixels in 3D Space	Apr 5, 2024		Code Available	5
Autoformalization in the Era of Large Language Models: A Survey	May 29, 2025	Automated Theorem Proving	Code Available	5
BM25S: Orders of magnitude faster lexical search via eager sparse scoring	Jul 4, 2024	Passage Retrieval, Retrieval	Code Available	5
DEIM: DETR with Improved Matching for Fast Convergence	Dec 5, 2024	Data Augmentation, GPU	Code Available	5
UQLM: A Python Package for Uncertainty Quantification in Large Language Models	Jul 8, 2025	Hallucination, Uncertainty Quantification	Code Available	5
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese	Nov 2, 2022	Contrastive Learning, image-classification	Code Available	5
ControlNeXt: Powerful and Efficient Control for Image and Video Generation	Aug 12, 2024	Video Generation	Code Available	5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs	Feb 23, 2024	Language Modeling, Language Modelling	Code Available	5
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation	Jan 12, 2025	RAG, Retrieval	Code Available	5
SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More	Aug 8, 2024	Image Segmentation, Medical Image Segmentation	Code Available	5
WizardCoder: Empowering Code Large Language Models with Evol-Instruct	Jun 14, 2023	Code Generation, HumanEval	Code Available	5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities	Feb 2, 2024	Acoustic Scene Classification, Audio captioning	Code Available	5
Long-term Forecasting with TiDE: Time-series Dense Encoder	Apr 17, 2023	Anomaly Detection, Decoder	Code Available	5
From System 1 to System 2: A Survey of Reasoning Large Language Models	Feb 24, 2025	Logical Reasoning	Code Available	5
Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions	Feb 10, 2025		Code Available	5