The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3,851–3,900 of 661,570 papers

Title	Date	Tasks	Status	Hype
Distributed Prioritized Experience Replay	Mar 2, 2018	Atari Games, Deep Reinforcement Learning	Code Available	3
PromptHMR: Promptable Human Mesh Recovery	Apr 8, 2025	3D Human Pose Estimation, Human Mesh Recovery	Code Available	3
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem	Nov 26, 2024	GPU, Language Modeling	Code Available	3
U-Net: Convolutional Networks for Biomedical Image Segmentation	May 18, 2015	Cell Segmentation, Cell Tracking	Code Available	3
History-Guided Video Diffusion	Feb 10, 2025	Video Generation	Code Available	3
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services	Apr 25, 2024	GPU	Code Available	3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval	Feb 17, 2025	Information Retrieval, Retrieval	Code Available	3
Probabilistic Volumetric Fusion for Dense Monocular SLAM	Oct 3, 2022		Code Available	3
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation	May 30, 2023	Machine Translation, Segmentation	Code Available	3
Discovered Policy Optimisation	Oct 11, 2022	Ingenuity, Meta-Learning	Code Available	3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning	May 13, 2024	Data Augmentation, GSM8K	Code Available	3
On Distillation of Guided Diffusion Models	Oct 6, 2022	Denoising, Image Generation	Code Available	3
SWE-bench-java: A GitHub Issue Resolving Benchmark for Java	Aug 26, 2024		Code Available	3
SoundStream: An End-to-End Neural Audio Codec	Jul 7, 2021	CPU, Decoder	Code Available	3
Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective	Feb 2, 2025	Multi-Task Learning	Code Available	3
On the Content Bias in Fréchet Video Distance	Apr 18, 2024	Video Generation	Code Available	3
Flow Matching for Generative Modeling	Oct 6, 2022	Density Estimation, Image Generation	Code Available	3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training	Aug 7, 2021	Contrastive Learning, Language Modeling	Code Available	3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations	Feb 16, 2024	Denoising, Robot Manipulation	Code Available	3
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion	Jun 6, 2024	3D Generation	Code Available	3
SkyMath: Technical Report	Oct 25, 2023	GSM8K, Language Modeling	Code Available	3
XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters	May 19, 2023		Code Available	3
Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning	Mar 26, 2025	Few-Shot Learning, Visual Reasoning	Code Available	3
Designing and building the mlpack open-source machine learning library	Aug 17, 2017	BIG-bench Machine Learning	Code Available	3
One-step Diffusion with Distribution Matching Distillation	Nov 30, 2023		Code Available	3
EAFormer: Scene Text Segmentation with Edge-Aware Transformers	Jul 24, 2024	Decoder, Segmentation	Code Available	3
Accurate clinical and biomedical Named entity recognition at scale	Jul 19, 2022	Clinical Concept Extraction, De-identification	Code Available	3
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1	Oct 3, 2024	Scheduling	Code Available	3
EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models	Feb 18, 2024	Event Extraction, Hallucination	Code Available	3
LRM: Large Reconstruction Model for Single Image to 3D	Nov 8, 2023	Image to 3D, NeRF	Code Available	3
GluonTS: Probabilistic Time Series Models in Python	Jun 12, 2019	Anomaly Detection, Time Series	Code Available	3
Practical Deep Reinforcement Learning Approach for Stock Trading	Nov 19, 2018	Deep Reinforcement Learning, reinforcement-learning	Code Available	3
CodeBLEU: a Method for Automatic Evaluation of Code Synthesis	Sep 22, 2020	Code Translation, Translation	Code Available	3
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction	Dec 5, 2024	Multimodal Reasoning, Natural Language Visual Grounding	Code Available	3
Merlin: A Vision Language Foundation Model for 3D Computed Tomography	Jun 10, 2024	3D Semantic Segmentation, Computed Tomography (CT)	Code Available	3
Text Embeddings Reveal (Almost) As Much As Text	Oct 10, 2023		Code Available	3
dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching	May 17, 2025	Denoising	Code Available	3
SkillMimic: Learning Basketball Interaction Skills from Demonstrations	Aug 12, 2024	Diversity, Human-Object Interaction Detection	Code Available	3
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation	Jan 28, 2025	3D Generation	Code Available	3
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding	Mar 18, 2025	document understanding, Question Answering	Code Available	3
MiniViT: Compressing Vision Transformers with Weight Multiplexing	Apr 14, 2022	Diversity, Image Classification	Code Available	3
SPMamba: State-space model is all you need in speech separation	Apr 2, 2024	All, Mamba	Code Available	3
Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection	Aug 6, 2024	audio moment retrieval, Highlight Detection	Code Available	3
Vision as LoRA	Mar 26, 2025		Code Available	3
Deep Limit Order Book Forecasting	Mar 14, 2024	Deep Learning	Code Available	3
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding	Mar 14, 2024	Mamba, Moment Retrieval	Code Available	3
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting	Jul 23, 2023	Image Super-Resolution, Super-Resolution	Code Available	3
EfficientFormer: Vision Transformers at MobileNet Speed	Jun 2, 2022		Code Available	3
Demystify Mamba in Vision: A Linear Attention Perspective	May 26, 2024	image-classification, Image Classification	Code Available	3
Visual Large Language Models for Generalized and Specialized Applications	Jan 6, 2025	Ethics	Code Available	3