The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 4001–4050 of 661,570 papers

Title	Date	Tasks	Status	Hype
HtFLlib: A Comprehensive Heterogeneous Federated Learning Library and Benchmark	Jun 4, 2025	Federated Learning, Transfer Learning	Code Available	3
Motion Anything: Any to Motion Generation	Mar 10, 2025	Motion Generation, Motion Synthesis	Code Available	3
RAP-SAM: Towards Real-Time All-Purpose Segment Anything	Jan 18, 2024	All, Decoder	Code Available	3
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning	Jun 17, 2024	Language Modeling, Language Modelling	Code Available	3
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian	Dec 19, 2024	Novel View Synthesis	Code Available	3
A Survey on Data Selection for Language Models	Feb 26, 2024	Survey, Unsupervised Pre-training	Code Available	3
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions	Mar 28, 2024	Image Retrieval, Implicit Relations	Code Available	3
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation	Feb 7, 2025	Computational Efficiency, Text-to-Video Generation	Code Available	3
A Survey on Deep Learning for Theorem Proving	Apr 15, 2024	Automated Theorem Proving, Deep Learning	Code Available	3
APOLLO: SGD-like Memory, AdamW-level Performance	Dec 6, 2024	GPU, Quantization	Code Available	3
MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Mar 3, 2025	3D Reconstruction, Articles	Code Available	3
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference	Oct 6, 2024	Language Modeling, Language Modelling	Code Available	3
Inversion-Free Image Editing with Language-Guided Diffusion Models	Jan 1, 2024	Denoising, Image Manipulation	Code Available	3
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB	Apr 1, 2025	Decision Making, RAG	Code Available	3
OpenSpiel: A Framework for Reinforcement Learning in Games	Aug 26, 2019	General Reinforcement Learning, reinforcement-learning	Code Available	3
Scikit-fingerprints: easy and efficient computation of molecular fingerprints in Python	Jul 18, 2024	Molecular Property Prediction, Property Prediction	Code Available	3
NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields	Oct 24, 2022	NeRF	Code Available	3
CLIMB: Class-imbalanced Learning Benchmark on Tabular Data	May 23, 2025		Code Available	3
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation	Dec 9, 2024	Denoising, Photo geolocation estimation	Code Available	3
Take the aTrain. Introducing an Interface for the Accessible Transcription of Interviews	Oct 18, 2023	CPU, GPU	Code Available	3
Meta-Transformer: A Unified Framework for Multimodal Learning	Jul 20, 2023	Time Series	Code Available	3
GroundingGPT: Language Enhanced Multi-modal Grounding Model	Jan 11, 2024	Language Modelling, Large Language Model	Code Available	3
Evaluating Large Language Models with fmeval	Jul 15, 2024	Question Answering	Code Available	3
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey	Sep 26, 2024	Safety Alignment	Code Available	3
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale	Jun 27, 2024	Visual Question Answering (VQA)	Code Available	3
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts	Jun 19, 2024	Language Modeling, Language Modelling	Code Available	3
Rethinking Early Stopping: Refine, Then Calibrate	Jan 31, 2025	Decision Making	Code Available	3
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark	Sep 17, 2024		Code Available	3
Better by Default: Strong Pre-Tuned MLPs and Boosted Trees on Tabular Data	Jul 5, 2024	Classification, regression	Code Available	3
Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making	Apr 6, 2024	Decision Making	Code Available	3
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought	Dec 23, 2024	Machine Translation, Math	Code Available	3
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces	Feb 1, 2024	Computational Efficiency, GPU	Code Available	3
MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis	Oct 2, 2024	3DGS, NeRF	Code Available	3
Classification Done Right for Vision-Language Pre-Training	Nov 5, 2024	Classification	Code Available	3
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation	Jun 14, 2024	Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	3
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems	Sep 2, 2024	Benchmarking, Instruction Following	Code Available	3
AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation	Apr 19, 2024	Action Generation	Code Available	3
Anatomy-informed Data Augmentation for Enhanced Prostate Cancer Detection	Sep 7, 2023	Anatomy, Data Augmentation	Code Available	3
Improving Model Evaluation using SMART Filtering of Benchmark Datasets	Oct 26, 2024	Chatbot, Diversity	Code Available	3
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory	Mar 16, 2025	CPU, GPU	Code Available	3
A new face swap method for image and video domains: a technical report	Feb 7, 2022	Action Recognition In Videos, Face Recognition	Code Available	3
MooER: LLM-based Speech Recognition and Translation Models from Moore Threads	Aug 9, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	3
Reinforcement Learning Enhanced LLMs: A Survey	Dec 5, 2024	reinforcement-learning, Reinforcement Learning	Code Available	3
PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Oct 29, 2024	3DGS, 3D Reconstruction	Code Available	3
RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision	Sep 13, 2024	Decoder, object-detection	Code Available	3
AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception	Jan 16, 2024	MLLM Evaluation: Aesthetics	Code Available	3
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Jan 23, 2025	Image Generation, Story Generation	Code Available	3
An Imitative Reinforcement Learning Framework for Autonomous Dogfight	Jun 17, 2024	Imitation Learning, reinforcement-learning	Code Available	3
FusionBench: A Comprehensive Benchmark of Deep Model Fusion	Jun 5, 2024	image-classification, Image Classification	Code Available	3
FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models	Jun 4, 2024	Text Generation, Transfer Learning	Code Available	3