A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments Oct 25, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Operator Shifting for Model-based Policy Evaluation Oct 25, 2021 model Model-based Reinforcement Learning
— Unverified 0Mixture-of-Variational-Experts for Continual Learning Oct 25, 2021 Continual Learning Domain-IL Continual Learning
Code Code Available 0Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks Oct 25, 2021 Benchmarking continuous-control
Code Code Available 0Self-Consistent Models and Values Oct 25, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning Oct 25, 2021 Domain Adaptation reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks Oct 24, 2021 Deep Reinforcement Learning Q-Learning
— Unverified 0Foresight of Graph Reinforcement Learning Latent Permutations Learnt by Gumbel Sinkhorn Network Oct 23, 2021 Graph Attention reinforcement-learning
— Unverified 0Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning Oct 23, 2021 continuous-control Continuous Control
— Unverified 0Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits Oct 23, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL Oct 23, 2021 Model Predictive Control MuJoCo
— Unverified 0Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction Oct 22, 2021 continuous-control Continuous Control
— Unverified 0ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models Oct 22, 2021 counterfactual Decision Making
Code Code Available 0Reinforcement Learning for Process Control with Application in Semiconductor Manufacturing Oct 22, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Patient level simulation and reinforcement learning to discover novel strategies for treating ovarian cancer Oct 22, 2021 Prognosis reinforcement-learning
— Unverified 0Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming Oct 22, 2021 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow Oct 22, 2021 Distributed Optimization Q-Learning
— Unverified 0C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks Oct 22, 2021 Reinforcement Learning (RL)
— Unverified 0Is High Variance Unavoidable in RL? A Case Study in Continuous Control Oct 21, 2021 continuous-control Continuous Control
— Unverified 0Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information Oct 21, 2021 Imitation Learning Reinforcement Learning (RL)
— Unverified 0Can Q-learning solve Multi Armed Bantids? Oct 21, 2021 Decision Making Q-Learning
— Unverified 0Anti-Concentrated Confidence Bonuses for Scalable Exploration Oct 21, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations Oct 21, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Off-Dynamics Inverse Reinforcement Learning from Hetero-Domain Oct 21, 2021 continuous-control Continuous Control
— Unverified 0Neuro-Symbolic Reinforcement Learning with First-Order Logic Oct 21, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Model-based Reinforcement Learning for Service Mesh Fault Resiliency in a Web Application-level Oct 21, 2021 Attribute Management
— Unverified 0Reinforcement Learning Based Optimal Camera Placement for Depth Observation of Indoor Scenes Oct 21, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences Oct 20, 2021 Efficient Exploration Open-Ended Question Answering
— Unverified 0Playing 2048 With Reinforcement Learning Oct 20, 2021 Playing the Game of 2048 Q-Learning
Code Code Available 0Transferring Reinforcement Learning for DC-DC Buck Converter Control via Duty Ratio Mapping: From Simulation to Implementation Oct 20, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Computationally Efficient Safe Reinforcement Learning for Power Systems Oct 20, 2021 Model Predictive Control reinforcement-learning
— Unverified 0Socialbots on Fire: Modeling Adversarial Behaviors of Socialbots via Multi-Agent Hierarchical Reinforcement Learning Oct 20, 2021 Adversarial Attack Hierarchical Reinforcement Learning
— Unverified 0Distributed Reinforcement Learning for Privacy-Preserving Dynamic Edge Caching Oct 20, 2021 Edge-computing Federated Learning
— Unverified 0Feedback Linearization of Car Dynamics for Racing via Reinforcement Learning Oct 20, 2021 Car Racing reinforcement-learning
— Unverified 0Learning Robotic Manipulation Skills Using an Adaptive Force-Impedance Action Space Oct 19, 2021 Contact-rich Manipulation Decision Making
— Unverified 0Balancing Value Underestimation and Overestimation with Realistic Actor-Critic Oct 19, 2021 continuous-control Continuous Control
Code Code Available 0Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization Oct 19, 2021 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Continuous Control with Action Quantization from Demonstrations Oct 19, 2021 continuous-control Continuous Control
— Unverified 0Aesthetic Photo Collage with Deep Reinforcement Learning Oct 19, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Improved cooperation by balancing exploration and exploitation in intertemporal social dilemma tasks Oct 19, 2021 Attribute Diversity
— Unverified 0Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes Oct 19, 2021 Privacy Preserving reinforcement-learning
— Unverified 0State-based Episodic Memory for Multi-Agent Reinforcement Learning Oct 19, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game Oct 19, 2021 Reinforcement Learning (RL)
— Unverified 0Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm Oct 19, 2021 Reinforcement Learning (RL)
— Unverified 0Sim-to-Real Transfer in Multi-agent Reinforcement Networking for Federated Edge Computing Oct 18, 2021 Edge-computing Federated Learning
— Unverified 0Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs Oct 18, 2021 Reinforcement Learning (RL)
— Unverified 0Provable Hierarchy-Based Meta-Reinforcement Learning Oct 18, 2021 Diversity Hierarchical Reinforcement Learning
— Unverified 0Reinforcement Learning-Based Coverage Path Planning with Implicit Cellular Decomposition Oct 18, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Option Transfer and SMDP Abstraction with Successor Features Oct 18, 2021 Reinforcement Learning (RL)
— Unverified 0Improving Robustness of Reinforcement Learning for Power System Control with Adversarial Training Oct 18, 2021 Decision Making reinforcement-learning
— Unverified 0