The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11001–11050 of 661570 papers

Title	Date	Tasks	Status	Hype
Execution Guided Line-by-Line Code Generation	Jun 12, 2025	Code Generation	CodeCode Available	2
SEMv3: A Fast and Robust Approach to Table Separation Line Detection	May 20, 2024	Line Detection	CodeCode Available	2
GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction	Sep 5, 2023	3D Reconstructionglobal-optimization	CodeCode Available	2
Interpreting the Weight Space of Customized Diffusion Models	Jun 13, 2024		CodeCode Available	2
HugNLP: A Unified and Comprehensive Library for Natural Language Processing	Feb 28, 2023		CodeCode Available	2
UNeXt: MLP-based Rapid Medical Image Segmentation Network	Mar 9, 2022	DecoderImage Segmentation	CodeCode Available	2
FrontierNet: Learning Visual Cues to Explore	Jan 8, 2025	Object Discovery	CodeCode Available	2
POTATO: The Portable Text Annotation Tool	Dec 16, 2022	Active Learningtext annotation	CodeCode Available	2
LLaVAction: evaluating and training multi-modal large language models for action recognition	Mar 24, 2025	Action RecognitionAction Understanding	CodeCode Available	2
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem	May 30, 2022	Decision MakingMuJoCo	CodeCode Available	2
Convergence Analysis of Probability Flow ODE for Score-based Generative Models	Apr 15, 2024		CodeCode Available	2
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens	Feb 26, 2025		CodeCode Available	2
E3x: E(3)-Equivariant Deep Learning Made Easy	Jan 15, 2024	Deep Learning	CodeCode Available	2
An Economic Framework for 6-DoF Grasp Detection	Jul 11, 2024	Robotic Grasping	CodeCode Available	2
Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution	Oct 25, 2023	DenoisingLanguage Modeling	CodeCode Available	2
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models	Jan 4, 2025	Computational Efficiency	CodeCode Available	2
Fine-tuned In-Context Learning Transformers are Excellent Tabular Data Classifiers	May 22, 2024	In-Context Learning	CodeCode Available	2
Three Bricks to Consolidate Watermarks for Large Language Models	Jul 26, 2023	valid	CodeCode Available	2
NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions	Sep 27, 2023		CodeCode Available	2
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning	Mar 14, 2024	Deep Reinforcement LearningDictionary Learning	CodeCode Available	2
High-Performance Transformers for Table Structure Recognition Need Early Convolutions	Nov 9, 2023	DecoderRepresentation Learning	CodeCode Available	2
mbrs: A Library for Minimum Bayes Risk Decoding	Aug 8, 2024	Text Generation	CodeCode Available	2
DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping	Feb 2, 2024	3D ReconstructionEarth Observation	CodeCode Available	2
Text2Light: Zero-Shot Text-Driven HDR Panorama Generation	Sep 20, 2022	4kinverse tone mapping	CodeCode Available	2
MTLoRA: Low-Rank Adaptation Approach for Efficient Multi-Task Learning	Jan 1, 2024	Multi-Task Learningparameter-efficient fine-tuning	CodeCode Available	2
Zeus: Understanding and Optimizing GPU Energy Consumption of DNN Training	Aug 12, 2022	GPU	CodeCode Available	2
Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation	Oct 11, 2024	Open-Domain Question AnsweringQuestion Answering	CodeCode Available	2
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices	Jun 12, 2024	Navigate	CodeCode Available	2
Blind Video Deflickering by Neural Filtering with a Flawed Atlas	Mar 14, 2023	Video GenerationVideo Temporal Consistency	CodeCode Available	2
4Hammer: a board-game reinforcement learning environment for the hour long time frame	May 19, 2025	Board Gamesreinforcement-learning	CodeCode Available	2
WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit	Oct 30, 2022	Keyword Spotting	CodeCode Available	2
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation	Nov 29, 2023	Hallucination	CodeCode Available	2
RoCo: Dialectic Multi-Robot Collaboration with Large Language Models	Jul 10, 2023	Trajectory Planning	CodeCode Available	2
Towards a Unified Multi-Dimensional Evaluator for Text Generation	Oct 13, 2022	nlg evaluationQuestion Answering	CodeCode Available	2
Mass-Editing Memory in a Transformer	Oct 13, 2022	Language ModelingLanguage Modelling	CodeCode Available	2
[Reproducibility Report] Path Planning using Neural A* Search	Jul 16, 2022	Motion Planning	CodeCode Available	2
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache	Mar 24, 2025	Computational EfficiencyGPU	CodeCode Available	2
Graph Data Augmentation for Graph Machine Learning: A Survey	Feb 17, 2022	BIG-bench Machine LearningData Augmentation	CodeCode Available	2
PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents	Mar 13, 2023	image-classificationImage Classification	CodeCode Available	2
VampNet: Music Generation via Masked Acoustic Token Modeling	Jul 10, 2023	Music CompressionMusic Generation	CodeCode Available	2
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation	Jan 6, 2021	Text Generation	CodeCode Available	2
Generative Image as Action Models	Jul 10, 2024	Image GenerationRobot Manipulation	CodeCode Available	2
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion	Oct 6, 2024	DeepFake DetectionDomain Generalization	CodeCode Available	2
Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering	Sep 29, 2023	Image to textPassage Retrieval	CodeCode Available	2
DataComp: In search of the next generation of multimodal datasets	Apr 27, 2023		CodeCode Available	2
Do We Need Domain-Specific Embedding Models? An Empirical Investigation	Sep 27, 2024		CodeCode Available	2
MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation	May 31, 2022	BIG-bench Machine Learningcounterfactual	CodeCode Available	2
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training	Mar 17, 2022	Chatbot	CodeCode Available	2
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild	Nov 21, 2024	3D ReconstructionObject	CodeCode Available	2
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks	Oct 14, 2021	Language ModelingLanguage Modelling	CodeCode Available	2