The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3901–3925 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
APOLLO: SGD-like Memory, AdamW-level Performance	Dec 6, 2024	GPUQuantization	CodeCode Available	3	5
MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Mar 3, 2025	3D ReconstructionArticles	CodeCode Available	3	5
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference	Oct 6, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
Inversion-Free Image Editing with Language-Guided Diffusion Models	Jan 1, 2024	DenoisingImage Manipulation	CodeCode Available	3	5
Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB	Apr 1, 2025	Decision MakingRAG	CodeCode Available	3	5
OpenSpiel: A Framework for Reinforcement Learning in Games	Aug 26, 2019	General Reinforcement Learningreinforcement-learning	CodeCode Available	3	5
Scikit-fingerprints: easy and efficient computation of molecular fingerprints in Python	Jul 18, 2024	Molecular Property PredictionProperty Prediction	CodeCode Available	3	5
NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields	Oct 24, 2022	NeRF	CodeCode Available	3	5
CLIMB: Class-imbalanced Learning Benchmark on Tabular Data	May 23, 2025		CodeCode Available	3	5
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation	Dec 9, 2024	DenoisingPhoto geolocation estimation	CodeCode Available	3	5
Take the aTrain. Introducing an Interface for the Accessible Transcription of Interviews	Oct 18, 2023	CPUGPU	CodeCode Available	3	5
Meta-Transformer: A Unified Framework for Multimodal Learning	Jul 20, 2023	Time Series	CodeCode Available	3	5
GroundingGPT:Language Enhanced Multi-modal Grounding Model	Jan 11, 2024	Language ModellingLarge Language Model	CodeCode Available	3	5
Evaluating Large Language Models with fmeval	Jul 15, 2024	Question Answering	CodeCode Available	3	5
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey	Sep 26, 2024	Safety Alignment	CodeCode Available	3	5
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale	Jun 27, 2024	Visual Question Answering (VQA)	CodeCode Available	3	5
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts	Jun 19, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
Rethinking Early Stopping: Refine, Then Calibrate	Jan 31, 2025	Decision Making	CodeCode Available	3	5
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark	Sep 17, 2024		CodeCode Available	3	5
Better by Default: Strong Pre-Tuned MLPs and Boosted Trees on Tabular Data	Jul 5, 2024	Classificationregression	CodeCode Available	3	5
Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making	Apr 6, 2024	Decision Making	CodeCode Available	3	5
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought	Dec 23, 2024	Machine TranslationMath	CodeCode Available	3	5
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces	Feb 1, 2024	Computational EfficiencyGPU	CodeCode Available	3	5
MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis	Oct 2, 2024	3DGSNeRF	CodeCode Available	3	5
Classification Done Right for Vision-Language Pre-Training	Nov 5, 2024	Classification	CodeCode Available	3	5