| Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization | Jan 5, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Local Differential Privacy for Sequential Decision Making in a Changing Environment | Jan 2, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent | Dec 30, 2022 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Risk-Sensitive Policy with Distributional Reinforcement Learning | Dec 30, 2022 | Decision Making, Distributional Reinforcement Learning | Code Available | 1 |
| Quantile Off-Policy Evaluation via Deep Conditional Generative Learning | Dec 29, 2022 | Decision Making, Off-policy evaluation | Unverified | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | Dec 28, 2022 | Decision Making, Offline RL | Unverified | 0 |
| Linear Combinatorial Semi-Bandit with Causally Related Rewards | Dec 25, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Online Statistical Inference in Decision-Making with Matrix Context | Dec 21, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems | Dec 15, 2022 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Invariant Lipschitz Bandits: A Side Observation Approach | Dec 14, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Autoregressive Bandits | Dec 12, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Reinforced Approximate Exploratory Data Analysis | Dec 12, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Information-Theoretic Safe Exploration with Gaussian Processes | Dec 9, 2022 | Decision Making, Gaussian Processes | Code Available | 0 |
| Model-based trajectory stitching for improved behavioural cloning and its applications | Dec 8, 2022 | Behavioural cloning, Benchmarking | Unverified | 0 |
| Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance | Dec 4, 2022 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Automaton-Based Representations of Task Knowledge from Generative Language Models | Dec 4, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision Making, Multi-agent Reinforcement Learning | Code Available | 2 |
| Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach | Nov 28, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Continuous Episodic Control | Nov 28, 2022 | continuous-control, Continuous Control | Unverified | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | All, Decision Making | Unverified | 0 |
| Multi-Environment Pretraining Enables Transfer to Action Limited Datasets | Nov 23, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | Nov 22, 2022 | Autonomous Driving, Decision Making | Unverified | 0 |
| β-Multivariational Autoencoder for Entangled Representation Learning in Video Frames | Nov 22, 2022 | Decision Making, Object | Code Available | 0 |
| A Deep Reinforcement Learning Approach to Rare Event Estimation | Nov 22, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| UniMASK: Unified Inference in Sequential Decision Problems | Nov 20, 2022 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Agent-State Construction with Auxiliary Inputs | Nov 15, 2022 | Decision Making, reinforcement-learning | Code Available | 0 |
| Doubly Inhomogeneous Reinforcement Learning | Nov 8, 2022 | Change Point Detection, Clustering | Code Available | 0 |
| Learning to Follow Instructions in Text-Based Games | Nov 8, 2022 | Decision Making, Instruction Following | Code Available | 0 |
| A Survey on Reinforcement Learning in Aviation Applications | Nov 3, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Dungeons and Data: A Large-Scale NetHack Dataset | Nov 1, 2022 | Decision Making, NetHack | Code Available | 2 |
| Teacher-student curriculum learning for reinforcement learning | Oct 31, 2022 | Board Games, Decision Making | Unverified | 0 |
| HSVI can solve zero-sum Partially Observable Stochastic Games | Oct 26, 2022 | Decision Making, Heuristic Search | Unverified | 0 |
| Quantum deep recurrent reinforcement learning | Oct 26, 2022 | Decision Making, Q-Learning | Unverified | 0 |
| PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits | Oct 25, 2022 | Decision Making, Experimental Design | Code Available | 0 |
| Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings | Oct 25, 2022 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | Oct 21, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Fine-Grained Session Recommendations in E-commerce using Deep Reinforcement Learning | Oct 20, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Movement Penalized Bayesian Optimization with Application to Wind Energy Systems | Oct 14, 2022 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Markup-to-Image Diffusion Models with Scheduled Sampling | Oct 11, 2022 | Decision Making, Denoising | Code Available | 1 |
| Learning on the Edge: Online Learning with Stochastic Feedback Graphs | Oct 9, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop | Oct 7, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems | Oct 7, 2022 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Hyperbolic Deep Reinforcement Learning | Oct 4, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Continuous Monte Carlo Graph Search | Oct 4, 2022 | continuous-control, Continuous Control | Code Available | 0 |
| Generalizing Bayesian Optimization with Decision-theoretic Entropies | Oct 4, 2022 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization | Oct 3, 2022 | Decision Making, Policy Gradient Methods | Code Available | 1 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision Making, Offline RL | Unverified | 0 |