SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 39713980 of 177340 papers

TitleStatusHype
Training Verifiers to Solve Math Word ProblemsCode3
Interactive Medical Image Segmentation: A Benchmark Dataset and BaselineCode3
Generating Long Sequences with Sparse TransformersCode3
Towards Generalizable Tumor SynthesisCode3
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement LearningCode3
Pipeline Parallelism with Controllable MemoryCode3
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative PipelineCode3
L0: Reinforcement Learning to Become General AgentsCode3
MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly DetectionCode3
ASFT: Aligned Supervised Fine-Tuning through Absolute LikelihoodCode3
Show:102550
← PrevPage 398 of 17734Next →