SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers258,216 code links4,818 tasks

Papers

Showing 151200 of 658356 papers

TitleStatusHype
MuseTalk: Real-Time High-Fidelity Video Dubbing via Spatio-Temporal SamplingCode9
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion TransformersCode9
Toward Guidance-Free AR Visual Generation via Condition Contrastive AlignmentCode9
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-trainingCode9
Depth Pro: Sharp Monocular Metric Depth in Less Than a SecondCode9
Moshi: a speech-text foundation model for real-time dialogueCode9
Do Large Language Models Need a Content Delivery Network?Code9
Language agents achieve superhuman synthesis of scientific knowledgeCode9
KAG: Boosting LLMs in Professional Domains via Knowledge Augmented GenerationCode9
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end ModelCode9
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec TransformerCode9
CogVLM2: Visual Language Models for Image and Video UnderstandingCode9
Sapiens: Foundation for Human Vision ModelsCode9
Transformer Explainer: Interactive Learning of Text-Generative ModelsCode9
SuperSimpleNet: Unifying Unsupervised and Supervised Learning for Fast and Reliable Surface Defect DetectionCode9
MindSearch: Mimicking Human Minds Elicits Deep AI SearcherCode9
NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context?Code9
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse AttentionCode9
Diffusion Forcing: Next-token Prediction Meets Full-Sequence DiffusionCode9
Symbolic Learning Enables Self-Evolving AgentsCode9
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code IntelligenceCode9
Infinigen Indoors: Photorealistic Indoor Scenes using Procedural GenerationCode9
garak: A Framework for Security Probing Large Language ModelsCode9
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-HaystackCode9
Depth Anything V2Code9
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image AnimationCode9
OpenVLA: An Open-Source Vision-Language-Action ModelCode9
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated ParametersCode9
PowerInfer-2: Fast Large Language Model Inference on a SmartphoneCode9
LawGPT: A Chinese Legal Knowledge-Enhanced Large Language ModelCode9
LW-DETR: A Transformer Replacement to YOLO for Real-Time DetectionCode9
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent CollaborationCode9
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge FusionCode9
FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language ModelsCode9
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary TextsCode9
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language ModelCode9
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video GenerationCode9
OpenELM: An Efficient Language Model Family with Open Training and Inference FrameworkCode9
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language ModelsCode9
Visually Descriptive Language Model for Vector Graphics ReasoningCode9
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training StrategiesCode9
RULER: What's the Real Context Size of Your Long-Context Language Models?Code9
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale PredictionCode9
Model Stock: All we need is just a few fine-tuned modelsCode9
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait AnimationCode9
InternLM2 Technical ReportCode9
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-TuningCode9
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the WildCode9
Arcee's MergeKit: A Toolkit for Merging Large Language ModelsCode9
When Do We Not Need Larger Vision Models?Code9
Show:102550
← PrevPage 4 of 13168Next →