Title | Date | Tasks | Code
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi | Mar 22, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available
A Primer on Maximum Causal Entropy Inverse Reinforcement Learning | Mar 22, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Insights From the NeurIPS 2021 NetHack Challenge | Mar 22, 2022 | NetHack, Reinforcement Learning (RL) | Code Available
Long Short-Term Memory for Spatial Encoding in Multi-Agent Path Planning | Mar 21, 2022 | reinforcement-learning, Reinforcement Learning | Code Available
Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes | Mar 21, 2022 | continuous-control, Continuous Control | Unverified
Self-Imitation Learning from Demonstrations | Mar 21, 2022 | Imitation Learning, Reinforcement Learning (RL) | Unverified
Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning | Mar 21, 2022 | Autonomous Driving, Offline RL | Unverified
ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement Learning | Mar 21, 2022 | Deep Reinforcement Learning, feature selection | Code Available
Perceiving the World: Question-guided Reinforcement Learning for Text-based Games | Mar 20, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects | Mar 20, 2022 | Decision Making, Multi-agent Reinforcement Learning | Unverified
MicroRacer: a didactic environment for Deep Reinforcement Learning | Mar 20, 2022 | Car Racing, Deep Reinforcement Learning | Code Available
Reinforcement learning reward function in unmanned aerial vehicle control tasks | Mar 20, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Policy Gradients using Variational Quantum Circuits | Mar 20, 2022 | Benchmarking, Quantum Machine Learning | Unverified
Entailment Relation Aware Paraphrase Generation | Mar 20, 2022 | Natural Language Inference, Paraphrase Generation | Unverified
Explicit User Manipulation in Reinforcement Learning Based Recommender Systems | Mar 20, 2022 | Recommendation Systems, reinforcement-learning | Unverified
Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study | Mar 20, 2022 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified
Learning on the Job: Long-Term Behavioural Adaptation in Human-Robot Interactions | Mar 20, 2022 | Reinforcement Learning (RL) | Unverified
Thompson Sampling on Asymmetric α-Stable Bandits | Mar 19, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning | Mar 18, 2022 | Data Augmentation, Reinforcement Learning (RL) | Unverified
Risk-Sensitive Bayesian Games for Multi-Agent Reinforcement Learning under Policy Uncertainty | Mar 18, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified
Privacy-Preserving Reinforcement Learning Beyond Expectation | Mar 18, 2022 | Decision Making, Privacy Preserving | Unverified
Deep reinforcement learning guided graph neural networks for brain network analysis | Mar 18, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning | Mar 18, 2022 | Deep Reinforcement Learning, Q-Learning | Unverified
GAC: A Deep Reinforcement Learning Model Toward User Incentivization in Unknown Social Networks | Mar 17, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available
The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents | Mar 17, 2022 | Decision Making, reinforcement-learning | Unverified
Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs | Mar 17, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Meta-Reinforcement Learning for the Tuning of PI Controllers: An Offline Approach | Mar 17, 2022 | Meta-Learning, Meta Reinforcement Learning | Unverified
Semi-Markov Offline Reinforcement Learning for Healthcare | Mar 17, 2022 | Offline RL, reinforcement-learning | Code Available
Strategic Maneuver and Disruption with Reinforcement Learning Approaches for Multi-Agent Coordination | Mar 17, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
A Survey of Multi-Agent Deep Reinforcement Learning with Communication | Mar 16, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified
Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents | Mar 16, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available
Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act | Mar 16, 2022 | Atari Games, Decision Making | Unverified
How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for Efficient and Safe Driving Strategies | Mar 16, 2022 | Autonomous Driving, Autonomous Vehicles | Unverified
A Deep Reinforcement Learning-Based Caching Strategy for IoT Networks with Transient Data | Mar 16, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Mar 16, 2022 | Offline RL, reinforcement-learning | Code Available
Backpropagation through Time and Space: Learning Numerical Methods with Multi-Agent Reinforcement Learning | Mar 16, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified
Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning | Mar 15, 2022 | Diagnostic, reinforcement-learning | Unverified
Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning | Mar 15, 2022 | Collision Avoidance, reinforcement-learning | Unverified
A Differentiable Approach to Combinatorial Optimization using Dataless Neural Networks | Mar 15, 2022 | Combinatorial Optimization, Community Detection | Unverified
An Introduction to Multi-Agent Reinforcement Learning and Review of its Application to Autonomous Mobility | Mar 15, 2022 | Autonomous Vehicles, Multi-agent Reinforcement Learning | Unverified
Multi-View Dreaming: Multi-View World Model with Contrastive Learning | Mar 15, 2022 | Contrastive Learning, reinforcement-learning | Unverified
Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling | Mar 15, 2022 | Reinforcement Learning (RL) | Unverified
Uncertainty Estimation for Language Reward Models | Mar 14, 2022 | Active Learning, Reinforcement Learning (RL) | Unverified
The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications | Mar 14, 2022 | Multi-Agent Path Finding, Multi-agent Reinforcement Learning | Unverified
Reinforcement Learning for Optimal Control of a District Cooling Energy Plant | Mar 14, 2022 | Model Predictive Control, Q-Learning | Unverified
Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning | Mar 14, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
Orchestrated Value Mapping for Reinforcement Learning | Mar 14, 2022 | Ensemble Learning, Q-Learning | Code Available
FRL-FI: Transient Fault Analysis for Federated Reinforcement Learning-Based Navigation Systems | Mar 14, 2022 | Fault Detection, reinforcement-learning | Unverified
Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation | Mar 14, 2022 | Autonomous Driving, Gaussian Processes | Unverified
L2Explorer: A Lifelong Reinforcement Learning Assessment Environment | Mar 14, 2022 | Continual Learning, Lifelong learning | Code Available