The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2051–2075 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Discovering faster matrix multiplication algorithms with reinforcement learning	Oct 5, 2022	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	4	5
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree	Oct 21, 2024	Heuristic SearchObject	CodeCode Available	4	5
TotalSegmentator: robust segmentation of 104 anatomical structures in CT images	Aug 11, 2022	Segmentation	CodeCode Available	4	5
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks	Jun 24, 2023	PhilosophyTransfer Learning	CodeCode Available	4	5
Benchmarking Neural Network Training Algorithms	Jun 12, 2023	Benchmarking	CodeCode Available	4	5
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination	May 28, 2025	Neural Rendering	CodeCode Available	4	5
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation	Jun 26, 2025	AttributeImage Generation	CodeCode Available	4	5
Deepchecks: A Library for Testing and Validating Machine Learning Models and Data	Mar 16, 2022	BIG-bench Machine Learning	CodeCode Available	4	5
Effective Whole-body Pose Estimation with Two-stages Distillation	Jul 29, 2023	2D Human Pose EstimationKnowledge Distillation	CodeCode Available	4	5
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets	May 30, 2024	2k3D geometry	CodeCode Available	4	5
The Importance of Directional Feedback for LLM-based Optimizers	May 26, 2024		CodeCode Available	4	5
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation	Jun 13, 2023	Patch MatchingTranslation	CodeCode Available	4	5
Theseus: A Library for Differentiable Nonlinear Optimization	Jul 19, 2022	GPU	CodeCode Available	4	5
SnAG: Scalable and Accurate Video Grounding	Apr 2, 2024	Video GroundingVideo Understanding	CodeCode Available	4	5
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion	Aug 2, 2023		CodeCode Available	4	5
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models	Sep 25, 2023	Language ModellingLarge Language Model	CodeCode Available	4	5
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance	Aug 15, 2024	TARVideo Generation	CodeCode Available	4	5
Old Optimizer, New Norm: An Anthology	Sep 30, 2024		CodeCode Available	4	5
Time-LLM: Time Series Forecasting by Reprogramming Large Language Models	Oct 3, 2023	Time SeriesTime Series Forecasting	CodeCode Available	4	5
The Llama 3 Herd of Models	Jul 31, 2024	answerability predictionLanguage Modeling	CodeCode Available	4	5
ControlVAE: Tuning, Analytical Properties, and Performance Analysis	Oct 31, 2020	DisentanglementImage Generation	CodeCode Available	4	5
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height	Sep 17, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	4	5
Diffusion Policy Policy Optimization	Sep 1, 2024	continuous-controlContinuous Control	CodeCode Available	4	5
Scaling Granite Code Models to 128K Context	Jul 18, 2024	2k4k	CodeCode Available	4	5
AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society	Feb 12, 2025		CodeCode Available	4	5