Regret Bounds for Risk-Sensitive Reinforcement Learning | Oct 11, 2022 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Multi-Object Navigation with dynamically learned neural implicit representations | Oct 11, 2022 | Object, Reinforcement Learning (RL) | Code Available | 1
Multiagent Reinforcement Learning Based on Fusion-Multiactor-Attention-Critic for Multiple-Unmanned-Aerial-Vehicle Navigation Control | Oct 10, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1
Simulating Coverage Path Planning with Roomba | Oct 10, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies | Oct 10, 2022 | continuous-control, Continuous Control | Unverified | 0
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems | Oct 10, 2022 | continuous-control, Continuous Control | Unverified | 0
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes | Oct 10, 2022 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient | Oct 10, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 0
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous Navigation, Benchmarking | Code Available | 1
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning | Oct 10, 2022 | Data Augmentation, reinforcement-learning | Code Available | 1
In-Hand Object Rotation via Rapid Motor Adaptation | Oct 10, 2022 | Object, Reinforcement Learning (RL) | Code Available | 2
Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning | Oct 10, 2022 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Experiential Explanations for Reinforcement Learning | Oct 10, 2022 | Chunking, counterfactual | Code Available | 0
Equivalence of Optimality Criteria for Markov Decision Process and Model Predictive Control | Oct 9, 2022 | Model Predictive Control, reinforcement-learning | Unverified | 0
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning | Oct 9, 2022 | Decision Making, Meta Reinforcement Learning | Code Available | 1
The Role of Coverage in Online Reinforcement Learning | Oct 9, 2022 | Efficient Exploration, Offline RL | Unverified | 0
Skeleton2Humanoid: Animating Simulated Characters for Physically-plausible Motion In-betweening | Oct 9, 2022 | motion in-betweening, Motion Synthesis | Code Available | 1
State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RL, Offline RL | Unverified | 0
Dynamically meeting performance objectives for multiple services on a service mesh | Oct 8, 2022 | Blocking, Management | Unverified | 0
Cognitive Models as Simulators: The Case of Moral Decision-Making | Oct 8, 2022 | Decision Making, Fairness | Unverified | 0
Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization | Oct 7, 2022 | All, Combinatorial Optimization | Code Available | 1
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning | Oct 7, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1
Large Language Models can Implement Policy Iteration | Oct 7, 2022 | In-Context Learning, Language Modelling | Unverified | 0
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization | Oct 7, 2022 | continuous-control, Continuous Control | Code Available | 0
Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach | Oct 7, 2022 | Blocking, reinforcement-learning | Code Available | 0
Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems | Oct 7, 2022 | Combinatorial Optimization, Decision Making | Unverified | 0
Multi-agent Deep Covering Skill Discovery | Oct 7, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Oct 7, 2022 | Autonomous Driving, Backdoor Attack | Code Available | 1
How to Enable Uncertainty Estimation in Proximal Policy Optimization | Oct 7, 2022 | Deep Reinforcement Learning, Out of Distribution (OOD) Detection | Unverified | 0
Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop | Oct 7, 2022 | Decision Making, reinforcement-learning | Unverified | 0
Algorithmic Trading Using Continuous Action Space Deep Reinforcement Learning | Oct 7, 2022 | Algorithmic Trading, Deep Reinforcement Learning | Unverified | 0
Low-Thrust Orbital Transfer using Dynamics-Agnostic Reinforcement Learning | Oct 6, 2022 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Digital Human Interactive Recommendation Decision-Making Based on Reinforcement Learning | Oct 6, 2022 | Decision Making, Graph Embedding | Unverified | 0
Deep Inventory Management | Oct 6, 2022 | Deep Reinforcement Learning, Management | Unverified | 0
Exploration via Planning for Information about the Optimal Trajectory | Oct 6, 2022 | Reinforcement Learning (RL) | Code Available | 1
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery | Oct 6, 2022 | Deep Reinforcement Learning, Diversity | Code Available | 1
Lyapunov Function Consistent Adaptive Network Signal Control with Back Pressure and Reinforcement Learning | Oct 6, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering | Oct 6, 2022 | Question Answering, Reinforcement Learning (RL) | Code Available | 1
Meta Reinforcement Learning for Optimal Design of Legged Robots | Oct 6, 2022 | Meta Reinforcement Learning, reinforcement-learning | Unverified | 0
Reinforcement Learning with Large Action Spaces for Neural Machine Translation | Oct 6, 2022 | Machine Translation, NMT | Unverified | 0
Learning Algorithms for Intelligent Agents and Mechanisms | Oct 6, 2022 | Decision Making, reinforcement-learning | Unverified | 0
Deep Reinforcement Learning based Evasion Generative Adversarial Network for Botnet Detection | Oct 6, 2022 | Deep Reinforcement Learning, Generative Adversarial Network | Code Available | 1
Distributionally Adaptive Meta Reinforcement Learning | Oct 6, 2022 | Meta Reinforcement Learning, reinforcement-learning | Unverified | 0
Discovering faster matrix multiplication algorithms with reinforcement learning | Oct 5, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 4
A Novel Entropy-Maximizing TD3-based Reinforcement Learning for Automatic PID Tuning | Oct 5, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control | Oct 5, 2022 | Imitation Learning, Multi-Task Learning | Code Available | 1
DreamShard: Generalizable Embedding Table Placement for Recommender Systems | Oct 5, 2022 | GPU, Recommendation Systems | Code Available | 1
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation | Oct 5, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learning, Q-Learning | Code Available | 0
Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing Local and Remote Computers | Oct 5, 2022 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1