Transfer learning with causal counterfactual reasoning in Decision Transformers | Oct 27, 2021 | counterfactual Counterfactual Reasoning | Unverified | 0
Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks | Oct 27, 2021 | Multi-agent Reinforcement Learning reinforcement-learning | Code Available | 1
APPTeK: Agent-Based Predicate Prediction in Temporal Knowledge Graphs | Oct 27, 2021 | Knowledge Graphs Prediction | Unverified | 0
Reinforcement Learning in Factored Action Spaces using Tensor Decompositions | Oct 27, 2021 | Multi-agent Reinforcement Learning reinforcement-learning | Unverified | 0
DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention | Oct 27, 2021 | OpenAI Gym reinforcement-learning | Unverified | 0
Learning Domain Invariant Representations in Goal-conditioned Block MDPs | Oct 27, 2021 | Deep Reinforcement Learning Domain Generalization | Code Available | 1
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids | Oct 27, 2021 | Q-Learning reinforcement-learning | Unverified | 0
A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems | Oct 27, 2021 | reinforcement-learning Reinforcement Learning (RL) | Unverified | 0
Enhancing Reinforcement Learning with discrete interfaces to learn the Dyck Language | Oct 27, 2021 | reinforcement-learning Reinforcement Learning (RL) | Unverified | 0
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations | Oct 27, 2021 | Model-based Reinforcement Learning reinforcement-learning | Code Available | 1
Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem | Oct 27, 2021 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
ABIDES-Gym: Gym Environments for Multi-Agent Discrete Event Simulation and Application to Financial Markets | Oct 27, 2021 | OpenAI Gym Reinforcement Learning (RL) | Code Available | 1
Fragment-based Sequential Translation for Molecular Optimization | Oct 26, 2021 | Drug Discovery Reinforcement Learning (RL) | Unverified | 0
Multi-Agent Advisor Q-Learning | Oct 26, 2021 | Decision Making Multi-agent Reinforcement Learning | Code Available | 0
Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning | Oct 26, 2021 | Off-policy evaluation Open-Ended Question Answering | Code Available | 0
The Difficulty of Passive Learning in Deep Reinforcement Learning | Oct 26, 2021 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee | Oct 26, 2021 | Decision Making Federated Learning | Code Available | 1
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning | Oct 26, 2021 | Efficient Exploration Hierarchical Reinforcement Learning | Code Available | 1
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control | Oct 26, 2021 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
Applications of Multi-Agent Reinforcement Learning in Future Internet: A Comprehensive Survey | Oct 26, 2021 | Decision Making Deep Reinforcement Learning | Unverified | 0
Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling | Oct 26, 2021 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
Learning Robust Controllers Via Probabilistic Model-Based Policy Search | Oct 26, 2021 | Model-based Reinforcement Learning reinforcement-learning | Unverified | 0
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization | Oct 26, 2021 | reinforcement-learning Reinforcement Learning | Unverified | 0
Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization | Oct 26, 2021 | Multi-agent Reinforcement Learning reinforcement-learning | Code Available | 1
Average-Reward Learning and Planning with Options | Oct 26, 2021 | reinforcement-learning Reinforcement Learning (RL) | Unverified | 0
Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective | Oct 26, 2021 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
Distributional Reinforcement Learning for Multi-Dimensional Reward Functions | Oct 26, 2021 | Distributional Reinforcement Learning reinforcement-learning | Code Available | 0
Automating Control of Overestimation Bias for Reinforcement Learning | Oct 26, 2021 | Continuous Control Q-Learning | Unverified | 0
A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments | Oct 25, 2021 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning | Oct 25, 2021 | Multi-agent Reinforcement Learning reinforcement-learning | Unverified | 0
Operator Shifting for Model-based Policy Evaluation | Oct 25, 2021 | model Model-based Reinforcement Learning | Unverified | 0
Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks | Oct 25, 2021 | Benchmarking continuous-control | Code Available | 0
Mixture-of-Variational-Experts for Continual Learning | Oct 25, 2021 | Continual Learning Domain-IL Continual Learning | Code Available | 0
Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning | Oct 25, 2021 | Domain Adaptation reinforcement-learning | Unverified | 0
Recurrent Off-policy Baselines for Memory-based Continuous Control | Oct 25, 2021 | continuous-control Continuous Control | Code Available | 1
Uniformly Conservative Exploration in Reinforcement Learning | Oct 25, 2021 | Deep Reinforcement Learning reinforcement-learning | Code Available | 1
Self-Consistent Models and Values | Oct 25, 2021 | reinforcement-learning Reinforcement Learning (RL) | Unverified | 0
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning | Oct 25, 2021 | reinforcement-learning Reinforcement Learning | Code Available | 1
Learning What to Memorize: Using Intrinsic Motivation to Form Useful Memory in Partially Observable Reinforcement Learning | Oct 25, 2021 | Form Partially Observable Reinforcement Learning | Unverified | 0
Can Q-Learning be Improved with Advice? | Oct 25, 2021 | Q-Learning reinforcement-learning | Unverified | 0
Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks | Oct 24, 2021 | Deep Reinforcement Learning Q-Learning | Unverified | 0
Understanding the World Through Action | Oct 24, 2021 | reinforcement-learning Reinforcement Learning | Code Available | 1
False Correlation Reduction for Offline Reinforcement Learning | Oct 24, 2021 | D4RL Decision Making | Code Available | 1
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision Making Multi-Armed Bandits | Unverified | 0
Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning | Oct 23, 2021 | continuous-control Continuous Control | Unverified | 0
Foresight of Graph Reinforcement Learning Latent Permutations Learnt by Gumbel Sinkhorn Network | Oct 23, 2021 | Graph Attention reinforcement-learning | Unverified | 0
Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL | Oct 23, 2021 | Model Predictive Control MuJoCo | Unverified | 0
Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction | Oct 22, 2021 | continuous-control Continuous Control | Unverified | 0
A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow | Oct 22, 2021 | Distributed Optimization Q-Learning | Unverified | 0
Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming | Oct 22, 2021 | Multi-agent Reinforcement Learning Reinforcement Learning (RL) | Unverified | 0