SOTAVerified

Decision Making

Papers

Showing 151-175 of 12311 papers

Title | Status | Hype
Doe-1: Closed-Loop Autonomous Driving with Large World Model | Code | 2
Disentangling Memory and Reasoning Ability in Large Language Models | Code | 2
A Survey of Financial AI: Architectures, Advances and Open Challenges | Code | 2
Distribution-Free, Risk-Controlling Prediction Sets | Code | 2
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Code | 2
ExpeL: LLM Agents Are Experiential Learners | Code | 2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Code | 2
Diffusion Actor-Critic with Entropy Regulator | Code | 2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Code | 2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Code | 2
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | Code | 2
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Code | 2
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Code | 2
Cumulative Reasoning with Large Language Models | Code | 2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Code | 2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Code | 2
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR | Code | 2
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents | Code | 2
Cross-Prediction-Powered Inference | Code | 2
Digital Player: Evaluating Large Language Models based Human-like Agent in Games | Code | 2
GPD-1: Generative Pre-training for Driving | Code | 2
Adversarial attacks and defenses in explainable artificial intelligence: A survey | Code | 2
A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Code | 2
Concept Bottleneck Language Models For protein design | Code | 2
CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified