SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 901-950 of 659,983 papers

Title | Status | Hype
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models | Code | 5
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients | Code | 5
GraphCast: Learning skillful medium-range global weather forecasting | Code | 5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Code | 5
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Code | 5
Automated Design of Agentic Systems | Code | 5
EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models | Code | 5
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents | Code | 5
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models | Code | 5
Off-Policy Primal-Dual Safe Reinforcement Learning | Code | 5
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts | Code | 5
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech | Code | 5
AudioLCM: Text-to-Audio Generation with Latent Consistency Models | Code | 5
When LLMs Meet Cybersecurity: A Systematic Literature Review | Code | 5
Phantom: Subject-consistent video generation via cross-modal alignment | Code | 5
SpeechAlign: Aligning Speech Generation to Human Preferences | Code | 5
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos | Code | 5
Search-o1: Agentic Search-Enhanced Large Reasoning Models | Code | 5
MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBench | Code | 5
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection | Code | 5
Getting SMARTER for Motion Planning in Autonomous Driving Systems | Code | 5
UnCommon Objects in 3D | Code | 5
Hybrid Transformers for Music Source Separation | Code | 5
ImageBind: One Embedding Space To Bind Them All | Code | 5
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning | Code | 5
rerankers: A Lightweight Python Library to Unify Ranking Methods | Code | 5
Xwin-LM: Strong and Scalable Alignment Practice for LLMs | Code | 5
rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset | Code | 5
Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation | Code | 5
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | Code | 5
Underwater Camouflaged Object Tracking Meets Vision-Language SAM2 | Code | 5
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image Recognition | Code | 5
VideoMamba: State Space Model for Efficient Video Understanding | Code | 5
Repetition Improves Language Model Embeddings | Code | 5
Lanpaint: Training-Free Diffusion Inpainting with Exact and Fast Conditional Inference | Code | 5
KBLaM: Knowledge Base augmented Language Model | Code | 5
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications | Code | 5
M-Prometheus: A Suite of Open Multilingual LLM Judges | Code | 5
NeMo Guardrails: A Toolkit for Controllable and Safe LLM Applications with Programmable Rails | Code | 5
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler | Code | 5
Reinforcement Learning from Human Feedback | Code | 5
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation | Code | 5
Slicing Aided Hyper Inference and Fine-tuning for Small Object Detection | Code | 5
EBEN: Extreme bandwidth extension network applied to speech signals captured with noise-resilient body-conduction microphones | Code | 5
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting | Code | 5
Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security | Code | 5
Point-E: A System for Generating 3D Point Clouds from Complex Prompts | Code | 5
Segment Anything | Code | 5
Nougat: Neural Optical Understanding for Academic Documents | Code | 5
HDVIO2.0: Wind and Disturbance Estimation with Hybrid Dynamics VIO | Code | 5
Page 19 of 13,200