SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers247,172 code links4,818 tasks

Papers

Showing 141150 of 658356 papers

TitleStatusHype
NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context?Code9
YuE: Scaling Open Foundation Models for Long-Form Music GenerationCode9
Depth Anything V2Code9
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-TuningCode9
Visually Descriptive Language Model for Vector Graphics ReasoningCode9
KAG: Boosting LLMs in Professional Domains via Knowledge Augmented GenerationCode9
World Model on Million-Length Video And Language With Blockwise RingAttentionCode9
UFO2: The Desktop AgentOSCode9
Contextual Augmented Multi-Model Programming (CAMP): A Hybrid Local-Cloud Copilot FrameworkCode9
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-onCode9
Show:102550
← PrevPage 15 of 65836Next →