SOTAVerified

Decision Making

Papers

Showing 126150 of 12311 papers

TitleStatusHype
Alphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingCode2
ExpeL: LLM Agents Are Experiential LearnersCode2
Do As I Can, Not As I Say: Grounding Language in Robotic AffordancesCode2
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement LearningCode2
Doe-1: Closed-Loop Autonomous Driving with Large World ModelCode2
Disentangling Memory and Reasoning Ability in Large Language ModelsCode2
Aligning Superhuman AI with Human Behavior: Chess as a Model SystemCode2
Distribution-Free, Risk-Controlling Prediction SetsCode2
DrivingSphere: Building a High-fidelity 4D World for Closed-loop SimulationCode2
AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsCode2
Digital Player: Evaluating Large Language Models based Human-like Agent in GamesCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Diffusion Actor-Critic with Entropy RegulatorCode2
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language ModelsCode2
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous DrivingCode2
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous DrivingCode2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingCode2
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM AgentsCode2
Context is Key: A Benchmark for Forecasting with Essential Textual InformationCode2
Agentic Knowledgeable Self-awarenessCode2
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPRCode2
ADAPT: Action-aware Driving Caption TransformerCode2
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing GamesCode2
Show:102550
← PrevPage 6 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified