SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 23012310 of 661570 papers

TitleStatusHype
Safurai 001: New Qualitative Approach for Code LLM EvaluationCode4
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language ModelsCode4
RePaint: Inpainting using Denoising Diffusion Probabilistic ModelsCode4
A Preview of XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQLCode4
MTEB: Massive Text Embedding BenchmarkCode4
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement LearningCode4
Identify Critical KV Cache in LLM Inference from an Output Perturbation PerspectiveCode4
Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic DataCode4
FinBen: A Holistic Financial Benchmark for Large Language ModelsCode4
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion ModelsCode4
Show:102550
← PrevPage 231 of 66157Next →