SOTAVerified

Decision Making

Papers

Showing 76-100 of 12311 papers

Title | Status | Hype
ExpeL: LLM Agents Are Experiential Learners | Code | 2
Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning | Code | 2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | Code | 2
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents | Code | 2
A Review of Safe Reinforcement Learning: Methods, Theory and Applications | Code | 2
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | Code | 2
Fairness Evaluation for Uplift Modeling in the Absence of Ground Truth | Code | 2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models | Code | 2
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Code | 2
Dungeons and Data: A Large-Scale NetHack Dataset | Code | 2
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving | Code | 2
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning | Code | 2
Distribution-Free, Risk-Controlling Prediction Sets | Code | 2
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances | Code | 2
Distributional Soft Actor-Critic with Three Refinements | Code | 2
Embodied LLM Agents Learn to Cooperate in Organized Teams | Code | 2
Aligning Superhuman AI with Human Behavior: Chess as a Model System | Code | 2
Diffusion Actor-Critic with Entropy Regulator | Code | 2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions | Code | 2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Code | 2
Cumulative Reasoning with Large Language Models | Code | 2
Doe-1: Closed-Loop Autonomous Driving with Large World Model | Code | 2
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Code | 2
Cross-Prediction-Powered Inference | Code | 2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified