SOTAVerified

Decision Making

Papers

Showing 1-10 of 12311 papers

Title | Status | Hype
FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | Code | 9
Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research | Code | 9
PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC | Code | 9
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion | Code | 9
Better than classical? The subtle art of benchmarking quantum machine learning models | Code | 7
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning | Code | 7
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models | Code | 6
Deep Lake: a Lakehouse for Deep Learning | Code | 5
GenCast: Diffusion-based ensemble forecasting for medium-range weather | Code | 5
Differentiable Tree Search Network | Code | 5

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified