SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 901925 of 659983 papers

TitleStatusHype
Automatic Interactive Evaluation for Large Language Models with State Aware Patient SimulatorCode5
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal ModelsCode5
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank GradientsCode5
GraphCast: Learning skillful medium-range global weather forecastingCode5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPUCode5
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion ModelsCode5
Automated Design of Agentic SystemsCode5
EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language ModelsCode5
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical AgentsCode5
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language ModelsCode5
Off-Policy Primal-Dual Safe Reinforcement LearningCode5
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-ExpertsCode5
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-SpeechCode5
AudioLCM: Text-to-Audio Generation with Latent Consistency ModelsCode5
When LLMs Meet Cybersecurity: A Systematic Literature ReviewCode5
Phantom: Subject-consistent video generation via cross-modal alignmentCode5
SpeechAlign: Aligning Speech Generation to Human PreferencesCode5
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB VideosCode5
Search-o1: Agentic Search-Enhanced Large Reasoning ModelsCode5
MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBenchCode5
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank ProjectionCode5
Getting SMARTER for Motion Planning in Autonomous Driving SystemsCode5
UnCommon Objects in 3DCode5
Hybrid Transformers for Music Source SeparationCode5
ImageBind: One Embedding Space To Bind Them AllCode5
Show:102550
← PrevPage 37 of 26400Next →