The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4,151–4,200 of 661,570 papers

Title	Date	Tasks	Status	Hype
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale	Aug 10, 2024	GPU, Language Modelling	Code Available	3
OctoPack: Instruction Tuning Code Large Language Models	Aug 14, 2023	Code Generation, Code Repair	Code Available	3
Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies	Oct 15, 2024		Code Available	3
Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization Landscape	Jan 20, 2022		Code Available	3
MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts	May 2, 2024	Combinatorial Optimization, Mixture-of-Experts	Code Available	3
On the use of deep learning for phase recovery	Aug 2, 2023	Deep Learning	Code Available	3
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models	Mar 19, 2024	Hallucination	Code Available	3
NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer	May 24, 2024	Novel View Synthesis	Code Available	3
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model	Jan 28, 2022	Few-Shot Learning, Language Modeling	Code Available	3
MAPIE: an open-source library for distribution-free uncertainty quantification	Jul 25, 2022	Conformal Prediction, Multi-class Classification	Code Available	3
PhysX: Physical-Grounded 3D Asset Generation	Jul 16, 2025	3D Generation, Image to 3D	Code Available	3
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation	Apr 5, 2024	Decoder, Mamba	Code Available	3
HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale	Sep 9, 2024	Code Generation, Fault localization	Code Available	3
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages	May 7, 2023	Attribute, Instruction Following	Code Available	3
DeSiRe-GS: 4D Street Gaussians for Static-Dynamic Decomposition and Surface Reconstruction for Urban Driving Scenes	Nov 18, 2024	Autonomous Driving, Surface Reconstruction	Code Available	3
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning	Jan 26, 2023	Benchmarking, Deep Reinforcement Learning	Code Available	3
LLM4CP: Adapting Large Language Models for Channel Prediction	Jun 20, 2024	Prediction, Time Series Analysis	Code Available	3
Universal Actions for Enhanced Embodied Foundation Models	Jan 17, 2025		Code Available	3
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding	Nov 27, 2024		Code Available	3
DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Nov 26, 2024	Camera Calibration, Depth Estimation	Code Available	3
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection	Feb 27, 2025	Action Detection, Benchmarking	Code Available	3
Relaxing Accurate Initialization Constraint for 3D Gaussian Splatting	Mar 14, 2024	3DGS, 3D Reconstruction	Code Available	3
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition	Apr 26, 2024	Emotion Recognition, Multi-Label Learning	Code Available	3
DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders	Dec 22, 2022	Colorization, Decoder	Code Available	3
PCDCNet: A Surrogate Model for Air Quality Forecasting with Physical-Chemical Dynamics and Constraints	May 26, 2025	Deep Learning	Code Available	3
MACE: Mass Concept Erasure in Diffusion Models	Mar 10, 2024	Text-to-Image Generation	Code Available	3
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding	Feb 22, 2024	Computational Efficiency, Prediction	Code Available	3
TopoTune : A Framework for Generalized Combinatorial Complex Neural Networks	Oct 9, 2024	Graph Neural Network	Code Available	3
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations	Nov 16, 2024	Visual Storytelling	Code Available	3
DoWhy: An End-to-End Library for Causal Inference	Nov 9, 2020	Causal Inference, valid	Code Available	3
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Jan 9, 2025	Depth Estimation, Monocular Depth Estimation	Code Available	3
DistiLLM: Towards Streamlined Distillation for Large Language Models	Feb 6, 2024	Instruction Following, Knowledge Distillation	Code Available	3
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos	Apr 24, 2025	MME, Video MME	Code Available	3
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization	Mar 17, 2025		Code Available	3
Music2Latent: Consistency Autoencoders for Latent Audio Compression	Aug 12, 2024	Audio Compression, Information Retrieval	Code Available	3
Advanced Video Inpainting Using Optical Flow-Guided Efficient Diffusion	Dec 1, 2024	Denoising, Optical Flow Estimation	Code Available	3
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection	Apr 9, 2024	Anomaly Detection, Decoder	Code Available	3
A Survey on the Memory Mechanism of Large Language Model based Agents	Apr 21, 2024	Language Modeling, Language Modelling	Code Available	3
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery	May 7, 2024	Benchmarking, Decision Making	Code Available	3
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning	Oct 11, 2022	reinforcement-learning, Reinforcement Learning	Code Available	3
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks	Feb 29, 2024	Language Modeling, Language Modelling	Code Available	3
Embodied Understanding of Driving Scenarios	Mar 7, 2024	Autonomous Driving, Language Modeling	Code Available	3
Personalized Image Generation with Deep Generative Models: A Decade Survey	Feb 18, 2025	Image Generation, Personalized Image Generation	Code Available	3
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO	May 22, 2025	Reinforcement Learning (RL)	Code Available	3
Datasheet for the Pile	Jan 13, 2022	Language Modeling, Language Modelling	Code Available	3
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition	Apr 23, 2024	Decoder, Diversity	Code Available	3
imitation: Clean Imitation Learning Implementations	Nov 22, 2022	Imitation Learning, reinforcement-learning	Code Available	3
Efficient Video Action Detection with Token Dropout and Context Refinement	Apr 17, 2023	Action Detection, Decoder	Code Available	3
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization	May 23, 2024		Code Available	3
LLM-Pruner: On the Structural Pruning of Large Language Models	May 19, 2023	Text Generation, zero-shot-classification	Code Available	3