The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7,426–7,450 of 177,340 papers

Title	Date	Tasks	Status	Hype	Score
Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding	Mar 2, 2022	Image Inpainting	Code Available	2	5
Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives	Oct 21, 2024	Reinforcement Learning (RL)	Code Available	2	5
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models	Oct 23, 2024	Instruction Following, Language Modelling	Code Available	2	5
Infinite Recommendation Networks: A Data-Centric Approach	Jun 3, 2022	Information Retrieval, Recommendation Systems	Code Available	2	5
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback	Apr 12, 2022	Code Generation, Out of Distribution (OOD) Detection	Code Available	2	5
Efficient LLM Inference on CPUs	Nov 1, 2023	Quantization	Code Available	2	5
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations	Jun 9, 2022	Benchmarking, continuous-control	Code Available	2	5
VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning	Jul 7, 2022	Benchmarking, Multi-agent Reinforcement Learning	Code Available	2	5
Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision	Mar 11, 2022		Code Available	2	5
Deep Learning Methods for Partial Differential Equations and Related Parameter Identification Problems	Dec 6, 2022	Deep Learning	Code Available	2	5
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model	Apr 2, 2024	Decoder, Mamba	Code Available	2	5
Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning	Mar 8, 2024	point cloud upsampling	Code Available	2	5
Critique-out-Loud Reward Models	Aug 21, 2024	Language Modelling, Large Language Model	Code Available	2	5
Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion	Jan 8, 2024	Image Enhancement, Low-Light Image Enhancement	Code Available	2	5
LLM-PBE: Assessing Data Privacy in Large Language Models	Aug 23, 2024		Code Available	2	5
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them	Oct 17, 2022	Language Modelling	Code Available	2	5
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain	Feb 21, 2024	Autonomous Driving, Decision Making	Code Available	2	5
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis	Dec 19, 2024	Object	Code Available	2	5
MAT: Mask-Aware Transformer for Large Hole Image Inpainting	Mar 29, 2022	Diversity, Image Inpainting	Code Available	2	5
MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment	Apr 19, 2022	Image Quality Assessment, No-Reference Image Quality Assessment	Code Available	2	5
A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases	Sep 22, 2022	Inductive Bias	Code Available	2	5
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning	Mar 31, 2025	General Reinforcement Learning, Instruction Following	Code Available	2	5
MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning	Jun 28, 2023	Deep Learning, Multimodal Deep Learning	Code Available	2	5
RecDiff: Diffusion Model for Social Recommendation	Jun 1, 2024	Denoising, model	Code Available	2	5
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection	Mar 15, 2023	3D Object Detection, Autonomous Driving	Code Available	2	5