The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 4,351–4,400 of 661,570 papers

Title	Date	Tasks	Status	Hype
Is Value Learning Really the Main Bottleneck in Offline RL?	Jun 13, 2024	Imitation Learning, Offline RL	Code Available	3
DANA: Domain-Aware Neurosymbolic Agents for Consistency and Accuracy	Sep 27, 2024	Financial Analysis	Code Available	3
Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields	Aug 7, 2024	3DGS, Model Compression	Code Available	3
MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM	Nov 25, 2024	Autonomous Driving, Novel View Synthesis	Code Available	3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2	Aug 9, 2024	All	Code Available	3
DPLM-2: A Multimodal Diffusion Protein Language Model	Oct 17, 2024	Language Modeling, Language Modelling	Code Available	3
Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary Algorithms	Mar 13, 2022	Evolutionary Algorithms	Code Available	3
The False Promise of Imitating Proprietary LLMs	May 25, 2023	Language Modelling	Code Available	3
Visual Geometry Grounded Deep Structure From Motion	Dec 7, 2023	Point Tracking	Code Available	3
A Foundation Model for the Earth System	May 20, 2024	Computational Efficiency, Deep Learning	Code Available	3
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning	Jun 14, 2024	Offline RL	Code Available	3
Human-level play in the game of Diplomacy by combining language models with strategic reasoning	Nov 22, 2022	AI Agent, Language Modeling	Code Available	3
Improving Text Embeddings with Large Language Models	Dec 31, 2023	Decoder, Diversity	Code Available	3
Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes	Aug 29, 2017	BIG-bench Machine Learning, CPU	Code Available	3
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models	Oct 3, 2024		Code Available	3
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control	May 27, 2024		Code Available	3
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models	Dec 18, 2024	Representation Learning, Robot Manipulation	Code Available	3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation	Mar 8, 2024	Code Generation, Hallucination	Code Available	3
Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders	Jul 19, 2024		Code Available	3
DataDecide: How to Predict Best Pretraining Data with Small Experiments	Apr 15, 2025	ARC, HellaSwag	Code Available	3
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry	Feb 6, 2024		Code Available	3
UCF: Uncovering Common Features for Generalizable Deepfake Detection	Apr 27, 2023	Binary Classification, Decoder	Code Available	3
Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection	Mar 19, 2024	Anomaly Detection, Benchmarking	Code Available	3
REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers	Apr 15, 2025	Image Generation	Code Available	3
C-Adapter: Adapting Deep Classifiers for Efficient Conformal Prediction Sets	Oct 12, 2024	Conformal Prediction, Prediction	Code Available	3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis	May 16, 2024	Language Modelling, Large Language Model	Code Available	3
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification	Mar 13, 2022	Audio Classification, Knowledge Distillation	Code Available	3
Modular Duality in Deep Learning	Oct 28, 2024	Deep Learning, GPU	Code Available	3
Distributed Prioritized Experience Replay	Mar 2, 2018	Atari Games, Deep Reinforcement Learning	Code Available	3
PromptHMR: Promptable Human Mesh Recovery	Apr 8, 2025	3D Human Pose Estimation, Human Mesh Recovery	Code Available	3
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem	Nov 26, 2024	GPU, Language Modeling	Code Available	3
U-Net: Convolutional Networks for Biomedical Image Segmentation	May 18, 2015	Cell Segmentation, Cell Tracking	Code Available	3
History-Guided Video Diffusion	Feb 10, 2025	Video Generation	Code Available	3
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services	Apr 25, 2024	GPU	Code Available	3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval	Feb 17, 2025	Information Retrieval, Retrieval	Code Available	3
Probabilistic Volumetric Fusion for Dense Monocular SLAM	Oct 3, 2022		Code Available	3
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation	May 30, 2023	Machine Translation, Segmentation	Code Available	3
Discovered Policy Optimisation	Oct 11, 2022	Ingenuity, Meta-Learning	Code Available	3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning	May 13, 2024	Data Augmentation, GSM8K	Code Available	3
On Distillation of Guided Diffusion Models	Oct 6, 2022	Denoising, Image Generation	Code Available	3
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java	Aug 26, 2024		Code Available	3
SoundStream: An End-to-End Neural Audio Codec	Jul 7, 2021	CPU, Decoder	Code Available	3
Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective	Feb 2, 2025	Multi-Task Learning	Code Available	3
On the Content Bias in Fréchet Video Distance	Apr 18, 2024	Video Generation	Code Available	3
Flow Matching for Generative Modeling	Oct 6, 2022	Density Estimation, Image Generation	Code Available	3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training	Aug 7, 2021	Contrastive Learning, Language Modeling	Code Available	3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations	Feb 16, 2024	Denoising, Robot Manipulation	Code Available	3
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion	Jun 6, 2024	3D Generation	Code Available	3
SkyMath: Technical Report	Oct 25, 2023	GSM8K, Language Modeling	Code Available	3
XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters	May 19, 2023		Code Available	3