SOTAVerified

Decision Making

Papers

Showing 261-270 of 12311 papers

Title | Status | Hype
RTBAgent: A LLM-based Agent System for Real-Time Bidding | Code | 1
Vintix: Action Model via In-Context Reinforcement Learning | Code | 1
Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs | Code | 1
A Survey of World Models for Autonomous Driving | Code | 1
MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought Thinking | Code | 1
NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes | Code | 1
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning | Code | 1
ICFNet: Integrated Cross-modal Fusion Network for Survival Prediction | Code | 1
Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Code | 1
MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified