| Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments | Aug 23, 2023 | CyberBattleSim, CyberBattleSim (RL) chain scenario | Code Available | 1 |
| LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying | Aug 21, 2023 | Decision Making, reinforcement-learning | Code Available | 0 |
| A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments | Aug 18, 2023 | Decision Making, Multi-Objective Reinforcement Learning | Unverified | 0 |
| Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes | Aug 18, 2023 | Decision Making, Multi-Objective Reinforcement Learning | Unverified | 0 |
| IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making | Aug 17, 2023 | Decision Making, Imitation Learning | Unverified | 0 |
| Value-Distributional Model-Based Reinforcement Learning | Aug 12, 2023 | continuous-control, Continuous Control | Code Available | 0 |
| Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception | Aug 10, 2023 | Decision Making, Robot Manipulation | Unverified | 0 |
| Bayesian Inverse Transition Learning for Offline Settings | Aug 9, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Deep Reinforcement Learning for Robust Goal-Based Wealth Management | Jul 25, 2023 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| DIP-RL: Demonstration-Inferred Preference Learning in Minecraft | Jul 22, 2023 | Decision Making, Minecraft | Unverified | 0 |
| On the Expressivity of Multidimensional Markov Reward | Jul 22, 2023 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors | Jul 21, 2023 | Decision Making, Language Modeling | Unverified | 0 |
| Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Jul 21, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback | Jul 20, 2023 | Decision Making, reinforcement-learning | Code Available | 1 |
| Online Learning with Costly Features in Non-stationary Environments | Jul 18, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards | Jul 18, 2023 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance | Jul 16, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Multi-Player Zero-Sum Markov Games with Networked Separable Interactions | Jul 13, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Probabilistic Constrained Reinforcement Learning with Formal Interpretability | Jul 13, 2023 | Decision Making, reinforcement-learning | Code Available | 0 |
| FAIRO: Fairness-aware Adaptation in Sequential-Decision Making for Human-in-the-Loop Systems | Jul 12, 2023 | Decision Making, Fairness | Unverified | 0 |
| BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Jul 7, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation | Jul 6, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| TGRL: An Algorithm for Teacher Guided Reinforcement Learning | Jul 6, 2023 | counterfactual, Decision Making | Unverified | 0 |
| Generative Flow Networks: a Markov Chain Perspective | Jul 4, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |