| Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search | Nov 6, 2023 | Decision MakingGraph Generation | —Unverified | 0 |
| Using General Value Functions to Learn Domain-Backed Inventory Management Policies | Nov 3, 2023 | Decision MakingManagement | —Unverified | 0 |
| Safe Sequential Optimization for Switching Environments | Nov 3, 2023 | Bayesian OptimizationChange Point Detection | —Unverified | 0 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Nov 1, 2023 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Regret-Minimization Algorithms for Multi-Agent Cooperative Learning Systems | Oct 30, 2023 | Cloud ComputingDecision Making | —Unverified | 0 |
| High-Dimensional Prediction for Sequential Decision Making | Oct 26, 2023 | Combinatorial OptimizationConformal Prediction | —Unverified | 0 |
| Robust Visual Imitation Learning with Inverse Dynamics Representations | Oct 22, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models | Oct 22, 2023 | Decision MakingIn-Context Learning | —Unverified | 0 |
| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | Oct 20, 2023 | Decision MakingMulti-Task Learning | —Unverified | 0 |
| Auction-Based Scheduling | Oct 18, 2023 | Decision MakingFairness | —Unverified | 0 |
| Partially Observable Stochastic Games with Neural Perception Mechanisms | Oct 17, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Oct 17, 2023 | counterfactualDecision Making | CodeCode Available | 0 |
| Autonomous Tree-search Ability of Large Language Models | Oct 14, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Imitation Learning from Purified Demonstrations | Oct 11, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Evaluating Explanation Methods for Vision-and-Language Navigation | Oct 10, 2023 | Decision MakingNavigate | —Unverified | 0 |
| Global Convergence of Policy Gradient Methods in Reinforcement Learning, Games and Control | Oct 8, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach | Oct 7, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Oct 4, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Towards a Unified Framework for Sequential Decision Making | Oct 3, 2023 | Bayesian InferenceDecision Making | —Unverified | 0 |
| Learning to Make Adherence-Aware Advice | Oct 1, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| TraCE: Trajectory Counterfactual Explanation Scores | Sep 27, 2023 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding | Sep 21, 2023 | Decision MakingSelf-Learning | —Unverified | 0 |
| Delays in Reinforcement Learning | Sep 20, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Safe POMDP Online Planning via Shielding | Sep 19, 2023 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback | Sep 16, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| Efficient quantum recurrent reinforcement learning via quantum reservoir computing | Sep 13, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning | Sep 12, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning | Sep 7, 2023 | Brain Computer InterfaceDecision Making | —Unverified | 0 |
| Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Sep 7, 2023 | Decision MakingFairness | CodeCode Available | 0 |
| INTAGS: Interactive Agent-Guided Simulation | Sep 4, 2023 | Algorithmic TradingCausal Inference | —Unverified | 0 |
| Improving Generalization in Reinforcement Learning Training Regimes for Social Robot Navigation | Aug 29, 2023 | Decision MakingNavigate | CodeCode Available | 0 |
| Pure Exploration under Mediators' Feedback | Aug 29, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages | Aug 28, 2023 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Bayesian Exploration Networks | Aug 24, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying | Aug 21, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments | Aug 18, 2023 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes | Aug 18, 2023 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making | Aug 17, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| Value-Distributional Model-Based Reinforcement Learning | Aug 12, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception | Aug 10, 2023 | Decision MakingRobot Manipulation | —Unverified | 0 |
| Bayesian Inverse Transition Learning for Offline Settings | Aug 9, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Robust Goal-Based Wealth Management | Jul 25, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| DIP-RL: Demonstration-Inferred Preference Learning in Minecraft | Jul 22, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| On the Expressivity of Multidimensional Markov Reward | Jul 22, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Jul 21, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |