The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4126–4150 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery	May 7, 2024	BenchmarkingDecision Making	CodeCode Available	3	5
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning	Oct 11, 2022	reinforcement-learningReinforcement Learning	CodeCode Available	3	5
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks	Feb 29, 2024	Language ModelingLanguage Modelling	CodeCode Available	3	5
Embodied Understanding of Driving Scenarios	Mar 7, 2024	Autonomous DrivingLanguage Modeling	CodeCode Available	3	5
Personalized Image Generation with Deep Generative Models: A Decade Survey	Feb 18, 2025	Image GenerationPersonalized Image Generation	CodeCode Available	3	5
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO	May 22, 2025	Reinforcement Learning (RL)	CodeCode Available	3	5
Datasheet for the Pile	Jan 13, 2022	Language ModelingLanguage Modelling	CodeCode Available	3	5
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition	Apr 23, 2024	DecoderDiversity	CodeCode Available	3	5
imitation: Clean Imitation Learning Implementations	Nov 22, 2022	Imitation Learningreinforcement-learning	CodeCode Available	3	5
Efficient Video Action Detection with Token Dropout and Context Refinement	Apr 17, 2023	Action DetectionDecoder	CodeCode Available	3	5
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization	May 23, 2024		CodeCode Available	3	5
LLM-Pruner: On the Structural Pruning of Large Language Models	May 19, 2023	Text Generationzero-shot-classification	CodeCode Available	3	5
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model	Sep 20, 2023	8kLanguage Modeling	CodeCode Available	3	5
HI-SLAM2: Geometry-Aware Gaussian SLAM for Fast Monocular Scene Reconstruction	Nov 27, 2024	3DGS	CodeCode Available	3	5
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training	May 14, 2024	Data AugmentationSelf-Supervised Learning	CodeCode Available	3	5
White-Box Transformers via Sparse Rate Reduction	Jun 1, 2023	Representation Learning	CodeCode Available	3	5
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures	Mar 3, 2025	Crack SegmentationMamba	CodeCode Available	3	5
Fine-Tuning Language Models from Human Preferences	Sep 18, 2019	DescriptiveLanguage Modelling	CodeCode Available	3	5
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts	Mar 3, 2024	Binary ClassificationLanguage Modeling	CodeCode Available	3	5
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model	Jul 24, 2024	Image InpaintingObject	CodeCode Available	3	5
Beyond Specialization: Assessing the Capabilities of MLLMs in Age and Gender Estimation	Mar 4, 2024	Age And Gender ClassificationAge and Gender Estimation	CodeCode Available	3	5
EvoTorch: Scalable Evolutionary Computation in Python	Feb 24, 2023	GPUreinforcement-learning	CodeCode Available	3	5
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection	Oct 24, 2023	3D Object Detectionobject-detection	CodeCode Available	3	5
Are We Done with MMLU?	Jun 6, 2024	MMLUVirology	CodeCode Available	3	5
Does End-to-End Autonomous Driving Really Need Perception Tasks?	Sep 26, 2024	Autonomous Driving	CodeCode Available	3	5