The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 951–975 of 659,983 papers

Title	Date	Tasks	Status	Hype
Uni-Mol Docking V2: Towards Realistic and Accurate Binding Pose Prediction	May 20, 2024	Drug Design, Molecular Docking	Code Available	5
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts	May 18, 2024	Mixture-of-Experts, Visual Question Answering	Code Available	5
RLHF Workflow: From Reward Modeling to Online RLHF	May 13, 2024	Chatbot, HumanEval	Code Available	5
Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers	May 10, 2024		Code Available	5
Evaluating Real-World Robot Manipulation Policies in Simulation	May 9, 2024	Robotic Grasping, Robot Manipulation	Code Available	5
Granite Code Models: A Family of Open Foundation Models for Code Intelligence	May 7, 2024	Code Generation, Decoder	Code Available	5
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding	May 6, 2024	Metric Learning, Self-Supervised Learning	Code Available	5
When LLMs Meet Cybersecurity: A Systematic Literature Review	May 6, 2024	Systematic Literature Review	Code Available	5
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation	May 2, 2024	MuJoCo, Reinforcement Learning (RL)	Code Available	5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models	May 2, 2024	Language Modeling, Language Modelling	Code Available	5
XFeat: Accelerated Features for Lightweight Image Matching	Apr 30, 2024	CPU, Keypoint detection and image matching	Code Available	5
Make Your LLM Fully Utilize the Context	Apr 25, 2024	4k, Information Retrieval	Code Available	5
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving	Apr 25, 2024	Diversity	Code Available	5
NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results	Apr 22, 2024	4k, Image Enhancement	Code Available	5
MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit	Apr 22, 2024	Math	Code Available	5
Do "English" Named Entity Recognizers Work Well on Global Englishes?	Apr 20, 2024	named-entity-recognition, Named Entity Recognition	Code Available	5
Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs	Apr 19, 2024	Event Extraction, In-Context Learning	Code Available	5
Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean	Apr 18, 2024	Automated Theorem Proving, Hallucination	Code Available	5
Gaussian Opacity Fields: Efficient Adaptive Surface Reconstruction in Unbounded Scenes	Apr 16, 2024	3DGS, Novel View Synthesis	Code Available	5
SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks	Apr 15, 2024	Quantization	Code Available	5
Magic Clothing: Controllable Garment-Driven Image Synthesis	Apr 15, 2024	Image Generation	Code Available	5
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization	Apr 15, 2024	Audio Generation	Code Available	5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts	Apr 13, 2024	Diversity, Language Modeling	Code Available	5
The Path To Autonomous Cyber Defense	Apr 12, 2024		Code Available	5
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models	Apr 10, 2024	Decision Making	Code Available	5