SOTAVerified

Navigate

Papers

Showing 110 of 1982 papers

TitleStatusHype
Optimizing Instructions and Demonstrations for Multi-Stage Language Model ProgramsCode14
Data Formulator 2: Iterative Creation of Data Visualizations, with AI Transforming Data Along the WayCode11
SWE-agent: Agent-Computer Interfaces Enable Automated Software EngineeringCode11
UFO: A UI-Focused Agent for Windows OS InteractionCode9
Mirage: A Multi-Level Superoptimizer for Tensor ProgramsCode7
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
Training Compute-Optimal Large Language ModelsCode6
WebThinker: Empowering Large Reasoning Models with Deep Research CapabilityCode5
IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI SystemsCode5
ChatDBG: Augmenting Debugging with Large Language ModelsCode5
Show:102550
← PrevPage 1 of 199Next →

No leaderboard results yet.