DearFSAC: An Approach to Optimizing Unreliable Federated Learning via Deep Reinforcement Learning | Jan 30, 2022 | Deep Reinforcement Learning, Federated Learning | No code listed
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Jan 29, 2022 | Decision Making, Model-based Reinforcement Learning | Code available
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software | Jan 29, 2022 | Deep Reinforcement Learning, reinforcement-learning | No code listed
Explaining Reinforcement Learning Policies through Counterfactual Trajectories | Jan 29, 2022 | counterfactual, Decision Making | Code available
ApolloRL: a Reinforcement Learning Platform for Autonomous Driving | Jan 29, 2022 | Autonomous Driving, reinforcement-learning | No code listed
Zeroth-Order Actor-Critic: An Evolutionary Framework for Sequential Decision Problems | Jan 29, 2022 | continuous-control, Continuous Control | Code available
Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints | Jan 28, 2022 | Reinforcement Learning (RL), Safe Exploration | No code listed
Overcoming Exploration: Deep Reinforcement Learning for Continuous Control in Cluttered Environments from Temporal Logic Specifications | Jan 28, 2022 | continuous-control, Continuous Control | No code listed
Leveraging class abstraction for commonsense reinforcement learning via residual policy gradient methods | Jan 28, 2022 | Knowledge Graphs, Policy Gradient Methods | Code available
A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise | Jan 28, 2022 | Q-Learning, reinforcement-learning | No code listed
Dynamic Temporal Reconciliation by Reinforcement learning | Jan 28, 2022 | reinforcement-learning, Reinforcement Learning | No code listed
FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems | Jan 28, 2022 | Decision Making, reinforcement-learning | Code available
Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation | Jan 28, 2022 | Data Augmentation, Reinforcement Learning (RL) | No code listed
Discovering Exfiltration Paths Using Reinforcement Learning with Attack Graphs | Jan 28, 2022 | reinforcement-learning, Reinforcement Learning | No code listed
Joint Differentiable Optimization and Verification for Certified Reinforcement Learning | Jan 28, 2022 | Bilevel Optimization, Model-based Reinforcement Learning | No code listed
Modeling Human Exploration Through Resource-Rational Reinforcement Learning | Jan 27, 2022 | Meta-Learning, reinforcement-learning | Code available
Generative Adversarial Exploration for Reinforcement Learning | Jan 27, 2022 | Generative Adversarial Network, Montezuma's Revenge | No code listed
Human-centered mechanism design with Democratic AI | Jan 27, 2022 | reinforcement-learning, Reinforcement Learning (RL) | No code listed
The Challenges of Exploration for Offline Reinforcement Learning | Jan 27, 2022 | Model Predictive Control, Offline RL | No code listed
Reinforcement Learning-Empowered Mobile Edge Computing for 6G Edge Intelligence | Jan 27, 2022 | Edge-computing, reinforcement-learning | No code listed
Boosting Exploration in Multi-Task Reinforcement Learning using Adversarial Networks | Jan 27, 2022 | Decision Making, reinforcement-learning | Code available
Quantile-Based Policy Optimization for Reinforcement Learning | Jan 27, 2022 | reinforcement-learning, Reinforcement Learning | No code listed
Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center | Jan 27, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | No code listed
Probe-Based Interventions for Modifying Agent Behavior | Jan 26, 2022 | Decision Making, Multi-agent Reinforcement Learning | No code listed
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes | Jan 26, 2022 | Reinforcement Learning (RL) | No code listed
Exploiting Semantic Epsilon Greedy Exploration Strategy in Multi-Agent Reinforcement Learning | Jan 26, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | No code listed
Hyperparameter Tuning for Deep Reinforcement Learning Applications | Jan 26, 2022 | Decision Making, Deep Reinforcement Learning | No code listed
Learning Invariable Semantical Representation from Language for Extensible Policy Generalization | Jan 26, 2022 | Reinforcement Learning (RL) | No code listed
Using Deep Reinforcement Learning for Zero Defect Smart Forging | Jan 25, 2022 | Deep Reinforcement Learning, reinforcement-learning | No code listed
MOORe: Model-based Offline-to-Online Reinforcement Learning | Jan 25, 2022 | D4RL, model | No code listed
Reinforcement Learning Based Query Vertex Ordering Model for Subgraph Matching | Jan 25, 2022 | reinforcement-learning, Reinforcement Learning | No code listed
Accelerated Intravascular Ultrasound Imaging using Deep Reinforcement Learning | Jan 24, 2022 | Deep Reinforcement Learning, reinforcement-learning | No code listed
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning | Jan 24, 2022 | reinforcement-learning, Reinforcement Learning | Code available
Large-Scale Graph Reinforcement Learning in Wireless Control Systems | Jan 24, 2022 | Deep Reinforcement Learning, reinforcement-learning | No code listed
State-Conditioned Adversarial Subgoal Generation | Jan 24, 2022 | continuous-control, Continuous Control | No code listed
Understanding the Effects of Second-Order Approximations in Natural Policy Gradient Reinforcement Learning | Jan 22, 2022 | Policy Gradient Methods, reinforcement-learning | Code available
Online Attentive Kernel-Based Temporal Difference Learning | Jan 22, 2022 | Acrobot, Reinforcement Learning (RL) | No code listed
Multi-Agent Adversarial Attacks for Multi-Channel Communications | Jan 22, 2022 | channel selection, Reinforcement Learning (RL) | No code listed
Reinforcement Learning Your Way: Agent Characterization through Policy Regularization | Jan 21, 2022 | reinforcement-learning, Reinforcement Learning | No code listed
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning | Jan 21, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Code available
Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search | Jan 21, 2022 | Multi-Armed Bandits, Reinforcement Learning (RL) | No code listed
Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective | Jan 21, 2022 | Drug Design, Drug Discovery | No code listed
Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning | Jan 21, 2022 | Decision Making, Deep Reinforcement Learning | No code listed
Deep Reinforcement Learning with Spiking Q-learning | Jan 21, 2022 | Atari Games, Deep Reinforcement Learning | No code listed
Instance-Dependent Confidence and Early Stopping for Reinforcement Learning | Jan 21, 2022 | reinforcement-learning, Reinforcement Learning | No code listed
Environment Generation for Zero-Shot Compositional Reinforcement Learning | Jan 21, 2022 | Deep Reinforcement Learning, Navigate | No code listed
Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation | Jan 21, 2022 | Decision Making, Deep Reinforcement Learning | No code listed
A Prescriptive Dirichlet Power Allocation Policy with Deep Reinforcement Learning | Jan 20, 2022 | Deep Reinforcement Learning, reinforcement-learning | No code listed
Self-Awareness Safety of Deep Reinforcement Learning in Road Traffic Junction Driving | Jan 20, 2022 | Autonomous Driving, Deep Reinforcement Learning | No code listed
Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning | Jan 20, 2022 | reinforcement-learning, Reinforcement Learning (RL) | No code listed