SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 126150 of 177340 papers

TitleStatusHype
DeepSeek LLM: Scaling Open-Source Language Models with LongtermismCode9
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion TransformerCode9
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language ModelCode9
Language agents achieve superhuman synthesis of scientific knowledgeCode9
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-trainingCode9
Diffusion Forcing: Next-token Prediction Meets Full-Sequence DiffusionCode9
Liger Kernel: Efficient Triton Kernels for LLM TrainingCode9
CogVLM2: Visual Language Models for Image and Video UnderstandingCode9
SuperSimpleNet: Unifying Unsupervised and Supervised Learning for Fast and Reliable Surface Defect DetectionCode9
Grounded SAM: Assembling Open-World Models for Diverse Visual TasksCode9
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video GenerationCode9
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse AttentionCode9
ORPO: Monolithic Preference Optimization without Reference ModelCode9
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference ServingCode9
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent CollaborationCode9
Symbolic Learning Enables Self-Evolving AgentsCode9
Aviary: training language agents on challenging scientific tasksCode9
Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial ResearchCode9
Metis: A Foundation Speech Generation Model with Masked Generative Pre-trainingCode9
Dolphin: Document Image Parsing via Heterogeneous Anchor PromptingCode9
CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding BenchmarkCode9
YOLO-World: Real-Time Open-Vocabulary Object DetectionCode9
Yi: Open Foundation Models by 01.AICode9
Steering Language Models with Game-Theoretic SolversCode9
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the WildCode9
Show:102550
← PrevPage 6 of 7094Next →