SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers258,216 code links4,818 tasks

Papers

Showing 101125 of 658356 papers

TitleStatusHype
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code IntelligenceCode11
InstantID: Zero-shot Identity-Preserving Generation in SecondsCode11
TinyLlama: An Open-Source Small Language ModelCode11
PaperBanana: Automating Academic Illustration for AI Scientists9
Qwen3-TTS Technical Report9
Kodezi Chronos: A Debugging-First Language Model for Repository-Scale, Memory-Driven Code UnderstandingCode9
MiniCPM4: Ultra-Efficient LLMs on End DevicesCode9
MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet ParadigmCode9
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion TransformersCode9
Dolphin: Document Image Parsing via Heterogeneous Anchor PromptingCode9
Emerging Properties in Unified Multimodal PretrainingCode9
UFO2: The Desktop AgentOSCode9
SkyReels-V2: Infinite-length Film Generative ModelCode9
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language ModelCode9
UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented GenerationCode9
PP-FormulaNet: Bridging Accuracy and Efficiency in Advanced Formula RecognitionCode9
AgentRxiv: Towards Collaborative Autonomous ResearchCode9
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data ConstructionCode9
RWKV-7 "Goose" with Expressive Dynamic State EvolutionCode9
YuE: Scaling Open Foundation Models for Long-Form Music GenerationCode9
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and ApplicationsCode9
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PCCode9
AutoAgent: A Fully-Automated and Zero-Code Framework for LLM AgentsCode9
Metis: A Foundation Speech Generation Model with Masked Generative Pre-trainingCode9
s1: Simple test-time scalingCode9
Show:102550
← PrevPage 5 of 26335Next →