SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2041-2050 of 661,570 papers

Title | Status | Hype
AgentBench: Evaluating LLMs as Agents | Code | 4
TOPIQ: A Top-down Approach from Semantics to Distortions for Image Quality Assessment | Code | 4
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models | Code | 4
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion | Code | 4
LISA: Reasoning Segmentation via Large Language Model | Code | 4
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models | Code | 4
Effective Whole-body Pose Estimation with Two-stages Distillation | Code | 4
Universal and Transferable Adversarial Attacks on Aligned Language Models | Code | 4
Guaranteed Approximation Bounds for Mixed-Precision Neural Operators | Code | 4
Turning Whisper into Real-Time Transcription System | Code | 4