SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 34713480 of 474278 papers

TitleStatusHype
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming ServicesCode3
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information RetrievalCode3
Probabilistic Volumetric Fusion for Dense Monocular SLAMCode3
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence SegmentationCode3
Discovered Policy OptimisationCode3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical ReasoningCode3
On Distillation of Guided Diffusion ModelsCode3
SWE-bench-java: A GitHub Issue Resolving Benchmark for JavaCode3
SoundStream: An End-to-End Neural Audio CodecCode3
Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization PerspectiveCode3
Show:102550
← PrevPage 348 of 47428Next →