SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 701725 of 177339 papers

TitleStatusHype
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPUCode5
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion ModelsCode5
Automated Design of Agentic SystemsCode5
EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language ModelsCode5
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical AgentsCode5
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language ModelsCode5
Off-Policy Primal-Dual Safe Reinforcement LearningCode5
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-ExpertsCode5
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-SpeechCode5
AudioLCM: Text-to-Audio Generation with Latent Consistency ModelsCode5
When LLMs Meet Cybersecurity: A Systematic Literature ReviewCode5
Phantom: Subject-consistent video generation via cross-modal alignmentCode5
SpeechAlign: Aligning Speech Generation to Human PreferencesCode5
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB VideosCode5
Search-o1: Agentic Search-Enhanced Large Reasoning ModelsCode5
MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBenchCode5
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank ProjectionCode5
Getting SMARTER for Motion Planning in Autonomous Driving SystemsCode5
UnCommon Objects in 3DCode5
Hybrid Transformers for Music Source SeparationCode5
ImageBind: One Embedding Space To Bind Them AllCode5
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement LearningCode5
rerankers: A Lightweight Python Library to Unify Ranking MethodsCode5
Xwin-LM: Strong and Scalable Alignment Practice for LLMsCode5
rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified DatasetCode5
Show:102550
← PrevPage 29 of 7094Next →