Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction Dec 14, 2022 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Robust Policy Optimization in Deep Reinforcement Learning Dec 14, 2022 continuous-control Continuous Control
Code Code Available 0Quantum Control based on Deep Reinforcement Learning Dec 14, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems Dec 14, 2022 Decision Making Deep Reinforcement Learning
Code Code Available 1Cross-Domain Transfer via Semantic Skill Imitation Dec 14, 2022 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning Dec 14, 2022 Graph Attention Multi-agent Reinforcement Learning
— Unverified 0Efficient Exploration in Resource-Restricted Reinforcement Learning Dec 14, 2022 Efficient Exploration reinforcement-learning
— Unverified 0Explaining Agent's Decision-making in a Hierarchical Reinforcement Learning Scenario Dec 14, 2022 Decision Making Hierarchical Reinforcement Learning
— Unverified 0Improving generalization in reinforcement learning through forked agents Dec 13, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Review of Off-Policy Evaluation in Reinforcement Learning Dec 13, 2022 Off-policy evaluation reinforcement-learning
— Unverified 0Single Cell Training on Architecture Search for Image Denoising Dec 13, 2022 Computational Efficiency Denoising
— Unverified 0PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration Dec 13, 2022 continuous-control Continuous Control
— Unverified 0Model-Free Approach to Fair Solar PV Curtailment Using Reinforcement Learning Dec 13, 2022 Fairness reinforcement-learning
— Unverified 0Scalable and Sample Efficient Distributed Policy Gradient Algorithms in Multi-Agent Networked Systems Dec 13, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Variance-Reduced Conservative Policy Iteration Dec 12, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes Dec 12, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations Dec 12, 2022 Deep Reinforcement Learning Model-based Reinforcement Learning
Code Code Available 1VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation Dec 12, 2022 Q-Learning regression
— Unverified 0Reinforcement Learning and Tree Search Methods for the Unit Commitment Problem Dec 12, 2022 Decision Making reinforcement-learning
Code Code Available 1Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes Dec 12, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks Dec 12, 2022 Autonomous Driving reinforcement-learning
— Unverified 0A Survey on Reinforcement Learning Security with Application to Autonomous Driving Dec 12, 2022 Autonomous Driving reinforcement-learning
— Unverified 0Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization Dec 11, 2022 Deep Reinforcement Learning Hierarchical Reinforcement Learning
— Unverified 0Generalization Through the Lens of Learning Dynamics Dec 11, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks Dec 11, 2022 Deep Reinforcement Learning MuJoCo
— Unverified 0Relate to Predict: Towards Task-Independent Knowledge Representations for Reinforcement Learning Dec 10, 2022 Inductive Bias Object
— Unverified 0Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking Dec 10, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning Dec 10, 2022 Audio-Visual Speech Recognition reinforcement-learning
— Unverified 0Effects of Spectral Normalization in Multi-agent Reinforcement Learning Dec 10, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Reinforcement Learning for Predicting Traffic Accidents Dec 9, 2022 Accident Anticipation Autonomous Driving
— Unverified 0Reinforcement Learning and Mixed-Integer Programming for Power Plant Scheduling in Low Carbon Systems: Comparison and Hybridisation Dec 9, 2022 Reinforcement Learning (RL) Scheduling
— Unverified 0Near-Optimal Differentially Private Reinforcement Learning Dec 9, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games Dec 8, 2022 Continual Learning Lifelong learning
— Unverified 0Compiler Optimization for Quantum Computing Using Reinforcement Learning Dec 8, 2022 Compiler Optimization reinforcement-learning
Code Code Available 1Confidence-Conditioned Value Functions for Offline Reinforcement Learning Dec 8, 2022 Offline RL reinforcement-learning
— Unverified 0A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces Dec 8, 2022 Deep Reinforcement Learning Image Compression
— Unverified 0Enhanced method for reinforcement learning based dynamic obstacle avoidance by assessment of collision risk Dec 8, 2022 Reinforcement Learning (RL)
— Unverified 0Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning Dec 8, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Reinforcement Learning for Resilient Power Grids Dec 8, 2022 Q-Learning reinforcement-learning
— Unverified 0Selector-Enhancer: Learning Dynamic Selection of Local and Non-local Attention Operation for Speech Enhancement Dec 7, 2022 Denoising Reinforcement Learning (RL)
— Unverified 0Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble Dec 7, 2022 continuous-control Continuous Control
— Unverified 0Adaptive Risk-Aware Bidding with Budget Constraint in Display Advertising Dec 6, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation Dec 6, 2022 continuous-control Continuous Control
— Unverified 0Few-Shot Preference Learning for Human-in-the-Loop RL Dec 6, 2022 Meta-Learning Multi-Task Learning
— Unverified 0Understanding Self-Predictive Learning for Reinforcement Learning Dec 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for UAV control with Policy and Reward Shaping Dec 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for Molecular Dynamics Optimization: A Stochastic Pontryagin Maximum Principle Approach Dec 6, 2022 Decision Making Drug Discovery
Code Code Available 0Safe Inverse Reinforcement Learning via Control Barrier Function Dec 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Scalable Planning and Learning Framework Development for Swarm-to-Swarm Engagement Problems Dec 6, 2022 Reinforcement Learning (RL)
— Unverified 0Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning Dec 6, 2022 Image Captioning reinforcement-learning
Code Code Available 0