The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5476–5500 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Diffusion Models and Representation Learning: A Survey	Jun 30, 2024	DenoisingRepresentation Learning	CodeCode Available	2	5
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors	Jul 26, 2024	Depth EstimationGPU	CodeCode Available	2	5
XMainframe: A Large Language Model for Mainframe Modernization	Aug 5, 2024	Code SummarizationLanguage Modeling	CodeCode Available	2	5
Learning Generative Interactive Environments By Trained Agent Exploration	Sep 10, 2024		CodeCode Available	2	5
Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration	Oct 7, 2024	Image RestorationNavigate	CodeCode Available	2	5
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models	Nov 22, 2024		CodeCode Available	2	5
A Comprehensive Guide to Explainable AI: From Classical Models to LLMs	Dec 1, 2024	Causal Inferencecounterfactual	CodeCode Available	2	5
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification	Dec 1, 2024	Computational Efficiencyimage-classification	CodeCode Available	2	5
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners	Dec 23, 2024	Mathematical Reasoning	CodeCode Available	2	5
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models	Apr 3, 2025		CodeCode Available	2	5
ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation	Dec 2, 2023	3D GenerationObject	CodeCode Available	2	5
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations	Jul 1, 2024	Benchmarkingdocument understanding	CodeCode Available	2	5
Saving 77% of the Parameters in Large Language Models Technical Report	Feb 9, 2025	GPUText Generation	CodeCode Available	2	5
Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point clouds	Feb 22, 2018	Data AugmentationTranslation	CodeCode Available	2	5
RARE: Retrieval-Augmented Reasoning Modeling	Mar 30, 2025	HallucinationMemorization	CodeCode Available	2	5
Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design	May 2, 2024	Model CompressionNeural Network Compression	CodeCode Available	2	5
Data Science Education in Undergraduate Physics: Lessons Learned from a Community of Practice	Mar 1, 2024		CodeCode Available	2	5
Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding	Jan 1, 2024	Attribute	CodeCode Available	2	5
Adaptive Multi-Agent Reasoning via Automated Workflow Generation	Jul 18, 2025		CodeCode Available	2	5
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX	Nov 16, 2023	CPUGPU	CodeCode Available	2	5
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios	Mar 7, 2024	Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA)	CodeCode Available	2	5
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments	Mar 13, 2024	Decision MakingLanguage Modeling	CodeCode Available	2	5
Interactive and Explainable Region-guided Radiology Report Generation	Apr 17, 2023	Medical Report Generation	CodeCode Available	2	5
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation	Sep 30, 2024	AttributeCollaborative Filtering	CodeCode Available	2	5
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act	Oct 10, 2024	BenchmarkingFairness	CodeCode Available	2	5