SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers247,172 code links4,818 tasks

Papers

Showing 176200 of 658356 papers

TitleStatusHype
OpenVLA: An Open-Source Vision-Language-Action ModelCode9
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated ParametersCode9
PowerInfer-2: Fast Large Language Model Inference on a SmartphoneCode9
LawGPT: A Chinese Legal Knowledge-Enhanced Large Language ModelCode9
LW-DETR: A Transformer Replacement to YOLO for Real-Time DetectionCode9
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent CollaborationCode9
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge FusionCode9
FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language ModelsCode9
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary TextsCode9
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language ModelCode9
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video GenerationCode9
OpenELM: An Efficient Language Model Family with Open Training and Inference FrameworkCode9
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language ModelsCode9
Visually Descriptive Language Model for Vector Graphics ReasoningCode9
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training StrategiesCode9
RULER: What's the Real Context Size of Your Long-Context Language Models?Code9
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale PredictionCode9
Model Stock: All we need is just a few fine-tuned modelsCode9
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait AnimationCode9
InternLM2 Technical ReportCode9
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-TuningCode9
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the WildCode9
Arcee's MergeKit: A Toolkit for Merging Large Language ModelsCode9
When Do We Not Need Larger Vision Models?Code9
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt CompressionCode9
Show:102550
← PrevPage 8 of 26335Next →