SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 33113320 of 474278 papers

TitleStatusHype
Evaluating Large Language Models with fmevalCode3
Fast Matrix Multiplications for Lookup Table-Quantized LLMsCode3
An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use CasesCode3
Learning Dynamics of LLM FinetuningCode3
PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical CompetitionCode3
OVLW-DETR: Open-Vocabulary Light-Weighted Detection TransformerCode3
Restoring Images in Adverse Weather Conditions via Histogram TransformerCode3
A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and LocalizationCode3
Human-like Episodic Memory for Infinite Context LLMsCode3
LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion ModelsCode3
Show:102550
← PrevPage 332 of 47428Next →