SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 12011210 of 177340 papers

TitleStatusHype
Safurai 001: New Qualitative Approach for Code LLM EvaluationCode4
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language ModelsCode4
RePaint: Inpainting using Denoising Diffusion Probabilistic ModelsCode4
A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQLCode4
MTEB: Massive Text Embedding BenchmarkCode4
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement LearningCode4
Identify Critical KV Cache in LLM Inference from an Output Perturbation PerspectiveCode4
Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic DataCode4
FinBen: A Holistic Financial Benchmark for Large Language ModelsCode4
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion ModelsCode4
Show:102550
← PrevPage 121 of 17734Next →