SOTAVerified

Sequential Decision Making

Papers

Showing 5175 of 1210 papers

TitleStatusHype
Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand SystemsCode1
Extracting Reward Functions from Diffusion ModelsCode1
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit TasksCode1
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource AllocationCode1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
On Generalization Across Environments In Multi-Objective Reinforcement LearningCode1
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningCode1
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive LossCode1
Pursuing Overall Welfare in Federated Learning through Sequential Decision MakingCode1
Curriculum-based Reinforcement Learning for Distribution System Critical Load RestorationCode1
DataEnvGym: Data Generation Agents in Teacher Environments with Student FeedbackCode1
Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control ProblemCode1
Dynamic Causal Bayesian OptimizationCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
Deep Reinforcement Learning for Entity AlignmentCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Decision Stacks: Flexible Reinforcement Learning via Modular Generative ModelsCode1
Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions ModelingCode1
An empirical evaluation of active inference in multi-armed banditsCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt TuningCode1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning ApproachCode1
An Alternative Softmax Operator for Reinforcement LearningCode1
Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply ChainsCode1
Show:102550
← PrevPage 3 of 49Next →

No leaderboard results yet.