SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2501-2510 of 474,278 papers

Title | Status | Hype
Harnessing the Universal Geometry of Embeddings | Code | 3
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward | Code | 3
dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching | Code | 3
SongEval: A Benchmark Dataset for Song Aesthetics Evaluation | Code | 3
Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis | Code | 3
Time Travel is Cheating: Going Live with DeepFund for Real-Time Fund Investment Benchmarking | Code | 3
Visual Planning: Let's Think Only with Images | Code | 3
MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning | Code | 3
Parallel Scaling Law for Language Models | Code | 3
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation | Code | 3