The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7276–7300 of 474278 papers

Title	Date	Tasks	Status	Hype
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement	Oct 6, 2024	Mathematical ReasoningMeta-Learning	CodeCode Available	2
DeFoG: Discrete Flow Matching for Graph Generation	Oct 5, 2024	DenoisingGraph Generation	CodeCode Available	2
A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models	Oct 5, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution	Oct 5, 2024	Image Super-ResolutionKnowledge Distillation	CodeCode Available	2
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains	Oct 5, 2024	DiagnosticEvent Detection	CodeCode Available	2
SyllableLM: Learning Coarse Semantic Units for Speech Language Models	Oct 5, 2024	ClusteringLanguage Modeling	CodeCode Available	2
Steering Large Language Models between Code Execution and Textual Reasoning	Oct 4, 2024	Code GenerationMath	CodeCode Available	2
ToolGen: Unified Tool Retrieval and Calling via Generation	Oct 4, 2024	RetrievalText Generation	CodeCode Available	2
Scaling Large Motion Models with Million-Level Human Motions	Oct 4, 2024	Motion Generation	CodeCode Available	2
Mamba in Vision: A Comprehensive Survey of Techniques and Applications	Oct 4, 2024	MambaState Space Models	CodeCode Available	2
Learning Truncated Causal History Model for Video Restoration	Oct 4, 2024	DeblurringDenoising	CodeCode Available	2
Exploring the Benefit of Activation Sparsity in Pre-training	Oct 4, 2024		CodeCode Available	2
Generative Artificial Intelligence for Navigating Synthesizable Chemical Space	Oct 4, 2024	Drug DiscoveryNavigate	CodeCode Available	2
Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review	Oct 4, 2024	Knowledge DistillationLogical Reasoning	CodeCode Available	2
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models	Oct 4, 2024	DecoderHallucination	CodeCode Available	2
Dynamic Diffusion Transformer	Oct 4, 2024	Image Generation	CodeCode Available	2
AutoPenBench: Benchmarking Generative Agents for Penetration Testing	Oct 4, 2024	Benchmarking	CodeCode Available	2
GraphRouter: A Graph-based Router for LLM Selections	Oct 4, 2024	Transductive Learning	CodeCode Available	2
Multi-Robot Motion Planning with Diffusion Models	Oct 4, 2024	Motion Planning	CodeCode Available	2
Autoregressive Action Sequence Learning for Robotic Manipulation	Oct 4, 2024	ChunkingLanguage Modeling	CodeCode Available	2
MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task	Oct 4, 2024	Translation	CodeCode Available	2
Oscillatory State-Space Models	Oct 4, 2024	MambaState Space Models	CodeCode Available	2
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models	Oct 4, 2024	Dense Video CaptioningSentence	CodeCode Available	2
Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering	Oct 4, 2024	Depth EstimationMonocular Depth Estimation	CodeCode Available	2
Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models	Oct 4, 2024		CodeCode Available	2