SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 7276–7300 of 474278 papers

Title | Status | Hype
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement | Code | 2
DeFoG: Discrete Flow Matching for Graph Generation | Code | 2
A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Code | 2
Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | Code | 2
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains | Code | 2
SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Code | 2
Steering Large Language Models between Code Execution and Textual Reasoning | Code | 2
ToolGen: Unified Tool Retrieval and Calling via Generation | Code | 2
Scaling Large Motion Models with Million-Level Human Motions | Code | 2
Mamba in Vision: A Comprehensive Survey of Techniques and Applications | Code | 2
Learning Truncated Causal History Model for Video Restoration | Code | 2
Exploring the Benefit of Activation Sparsity in Pre-training | Code | 2
Generative Artificial Intelligence for Navigating Synthesizable Chemical Space | Code | 2
Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review | Code | 2
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models | Code | 2
Dynamic Diffusion Transformer | Code | 2
AutoPenBench: Benchmarking Generative Agents for Penetration Testing | Code | 2
GraphRouter: A Graph-based Router for LLM Selections | Code | 2
Multi-Robot Motion Planning with Diffusion Models | Code | 2
Autoregressive Action Sequence Learning for Robotic Manipulation | Code | 2
MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task | Code | 2
Oscillatory State-Space Models | Code | 2
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models | Code | 2
Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering | Code | 2
Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models | Code | 2