The Role of Exploration for Task Transfer in Reinforcement Learning Oct 11, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Edge-Cloud Cooperation for DNN Inference via Reinforcement Learning and Supervised Learning Oct 11, 2022 image-classification Image Classification
— Unverified 0Broad-persistent Advice for Interactive Reinforcement Learning Scenarios Oct 11, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Experiential Explanations for Reinforcement Learning Oct 10, 2022 Chunking counterfactual
Code Code Available 0A policy gradient approach for Finite Horizon Constrained Markov Decision Processes Oct 10, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient Oct 10, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning Oct 10, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems Oct 10, 2022 continuous-control Continuous Control
— Unverified 0Simulating Coverage Path Planning with Roomba Oct 10, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies Oct 10, 2022 continuous-control Continuous Control
— Unverified 0State Advantage Weighting for Offline RL Oct 9, 2022 D4RL Offline RL
— Unverified 0The Role of Coverage in Online Reinforcement Learning Oct 9, 2022 Efficient Exploration Offline RL
— Unverified 0Equivalence of Optimality Criteria for Markov Decision Process and Model Predictive Control Oct 9, 2022 Model Predictive Control reinforcement-learning
— Unverified 0Dynamically meeting performance objectives for multiple services on a service mesh Oct 8, 2022 Blocking Management
— Unverified 0Cognitive Models as Simulators: The Case of Moral Decision-Making Oct 8, 2022 Decision Making Fairness
— Unverified 0Algorithmic Trading Using Continuous Action Space Deep Reinforcement Learning Oct 7, 2022 Algorithmic Trading Deep Reinforcement Learning
— Unverified 0Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization Oct 7, 2022 continuous-control Continuous Control
Code Code Available 0Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop Oct 7, 2022 Decision Making reinforcement-learning
— Unverified 0Large Language Models can Implement Policy Iteration Oct 7, 2022 In-Context Learning Language Modelling
— Unverified 0How to Enable Uncertainty Estimation in Proximal Policy Optimization Oct 7, 2022 Deep Reinforcement Learning Out of Distribution (OOD) Detection
— Unverified 0Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach Oct 7, 2022 Blocking reinforcement-learning
Code Code Available 0Multi-agent Deep Covering Skill Discovery Oct 7, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems Oct 7, 2022 Combinatorial Optimization Decision Making
— Unverified 0Meta Reinforcement Learning for Optimal Design of Legged Robots Oct 6, 2022 Meta Reinforcement Learning reinforcement-learning
— Unverified 0Lyapunov Function Consistent Adaptive Network Signal Control with Back Pressure and Reinforcement Learning Oct 6, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning with Large Action Spaces for Neural Machine Translation Oct 6, 2022 Machine Translation NMT
— Unverified 0Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning Oct 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Learning Algorithms for Intelligent Agents and Mechanisms Oct 6, 2022 Decision Making reinforcement-learning
— Unverified 0Distributionally Adaptive Meta Reinforcement Learning Oct 6, 2022 Meta Reinforcement Learning reinforcement-learning
— Unverified 0Deep Inventory Management Oct 6, 2022 Deep Reinforcement Learning Management
— Unverified 0Digital Human Interactive Recommendation Decision-Making Based on Reinforcement Learning Oct 6, 2022 Decision Making Graph Embedding
— Unverified 0A Novel Entropy-Maximizing TD3-based Reinforcement Learning for Automatic PID Tuning Oct 5, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Query The Agent: Improving sample efficiency through epistemic uncertainty estimation Oct 5, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Neural Distillation as a State Representation Bottleneck in Reinforcement Learning Oct 5, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0On Neural Consolidation for Transfer in Reinforcement Learning Oct 5, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning Oct 5, 2022 Deep Reinforcement Learning Q-Learning
Code Code Available 0Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning Oct 5, 2022 continuous-control Continuous Control
— Unverified 0Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning Oct 4, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees Oct 4, 2022 counterfactual Imitation Learning
— Unverified 0Using Deep Reinforcement Learning for mmWave Real-Time Scheduling Oct 4, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Handling Sparse Rewards in Reinforcement Learning Using Model Predictive Control Oct 4, 2022 Model Predictive Control reinforcement-learning
— Unverified 0Federated Reinforcement Learning for Real-Time Electric Vehicle Charging and Discharging Control Oct 4, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Hyperbolic Deep Reinforcement Learning Oct 4, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Learning Dynamic Abstract Representations for Sample-Efficient Reinforcement Learning Oct 4, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Evaluating Disentanglement in Generative Models Without Knowledge of Latent Factors Oct 4, 2022 Disentanglement Fairness
— Unverified 0Learning Perception-Aware Agile Flight in Cluttered Environments Oct 4, 2022 Imitation Learning Reinforcement Learning (RL)
— Unverified 0Accelerate Reinforcement Learning with PID Controllers in the Pendulum Simulations Oct 3, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders Oct 3, 2022 Deep Reinforcement Learning Q-Learning
— Unverified 0CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning Oct 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient Oct 3, 2022 Decision Making Offline RL
— Unverified 0