Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods Feb 25, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Twin actor twin delayed deep deterministic policy gradient (TATD3) learning for batch process control Feb 25, 2021 continuous-control Continuous Control
— Unverified 0Emerging Trends in Federated Learning: From Model Fusion to Federated X Learning Feb 25, 2021 Federated Learning Meta-Learning
— Unverified 0No-Regret Reinforcement Learning with Heavy-Tailed Rewards Feb 25, 2021 Deep Reinforcement Learning Q-Learning
— Unverified 0Reinforcement learning approach for resource allocation in humanitarian logistics Feb 25, 2021 Humanitarian Q-Learning
— Unverified 0Reinforcement Learning of Implicit and Explicit Control Flow in Instructions Feb 25, 2021 Minecraft reinforcement-learning
— Unverified 0Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians Feb 25, 2021 Collision Avoidance Deep Reinforcement Learning
— Unverified 0On The Effect of Auxiliary Tasks on Representation Dynamics Feb 25, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0PFRL: Pose-Free Reinforcement Learning for 6D Pose Estimation Feb 24, 2021 6D Pose Estimation Pose Estimation
— Unverified 0The Logical Options Framework Feb 24, 2021 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic Feb 24, 2021 Deep Reinforcement Learning Motion Planning
Code Code Available 0Towards Safe Continuing Task Reinforcement Learning Feb 24, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning Feb 24, 2021 Autonomous Driving reinforcement-learning
Code Code Available 0Deep Reinforcement Learning for Safe Landing Site Selection with Concurrent Consideration of Divert Maneuvers Feb 24, 2021 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Learning Emergent Discrete Message Communication for Cooperative Reinforcement Learning Feb 24, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Fast Approximate Solutions using Reinforcement Learning for Dynamic Capacitated Vehicle Routing with Time Windows Feb 24, 2021 Reinforcement Learning (RL)
— Unverified 0FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism Feb 24, 2021 CPU Deep Reinforcement Learning
— Unverified 0Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning Feb 24, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning Feb 24, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning Feb 24, 2021 Meta-Learning Multi-agent Reinforcement Learning
— Unverified 0Combining Off and On-Policy Training in Model-Based Reinforcement Learning Feb 24, 2021 Atari Games Board Games
— Unverified 0Greedy-Step Off-Policy Reinforcement Learning Feb 23, 2021 Q-Learning reinforcement-learning
— Unverified 0Differentiable Logic Machines Feb 23, 2021 Decision Making Inductive logic programming
— Unverified 0Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL Feb 23, 2021 Reinforcement Learning (RL)
— Unverified 0DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning Feb 23, 2021 Continuous Control Offline RL
— Unverified 0A Robotic Model of Hippocampal Reverse Replay for Reinforcement Learning Feb 23, 2021 Hippocampus reinforcement-learning
— Unverified 0State Augmented Constrained Reinforcement Learning: Overcoming the Limitations of Learning with Rewards Feb 23, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget Feb 23, 2021 Reinforcement Learning (RL)
— Unverified 0MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning Feb 23, 2021 Reinforcement Learning (RL) Uncertainty Quantification
— Unverified 0Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning Feb 22, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning Feb 22, 2021 Decision Making Distributional Reinforcement Learning
— Unverified 0Reinforcement Learning of the Prediction Horizon in Model Predictive Control Feb 22, 2021 Model Predictive Control Prediction
— Unverified 0Return-Based Contrastive Representation Learning for Reinforcement Learning Feb 22, 2021 Atari Games Deep Reinforcement Learning
— Unverified 0Communication Efficient Parallel Reinforcement Learning Feb 22, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning Feb 22, 2021 Autonomous Driving continuous-control
— Unverified 0Provably Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning Feb 22, 2021 Contrastive Learning Meta-Learning
— Unverified 0Action Redundancy in Reinforcement Learning Feb 22, 2021 MuJoCo reinforcement-learning
— Unverified 0Explore the Context: Optimal Data Collection for Context-Conditional Dynamics Models Feb 22, 2021 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Improved Learning of Robot Manipulation Tasks via Tactile Intrinsic Motivation Feb 22, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization Feb 22, 2021 Reinforcement Learning (RL) Scheduling
— Unverified 0A Novel Framework for Neural Architecture Search in the Hill Climbing Domain Feb 22, 2021 GPU Neural Architecture Search
— Unverified 0Deep Reinforcement Learning for Dynamic Spectrum Sharing of LTE and NR Feb 22, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations Feb 22, 2021 Decision Making Graph Attention
— Unverified 0Learning Efficient Navigation in Vortical Flow Fields Feb 21, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Safe Reinforcement Learning Using Robust Action Governor Feb 21, 2021 RAG reinforcement-learning
— Unverified 0Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach Feb 20, 2021 Model-based Reinforcement Learning Off-policy evaluation
— Unverified 0How To Train Your HERON Feb 20, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Decaying Clipping Range in Proximal Policy Optimization Feb 20, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Importance of Environment Design in Reinforcement Learning: A Study of a Robotic Environment Feb 20, 2021 Decision Making reinforcement-learning
— Unverified 0A Reinforcement Learning Approach to Age of Information in Multi-User Networks with HARQ Feb 19, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0