Hindsight Learning for MDPs with Exogenous Inputs Jul 13, 2022 counterfactual Decision Making
Code Code Available 0Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel Test Environments Jul 13, 2022 Reinforcement Learning (RL)
Code Code Available 0GriddlyJS: A Web IDE for Reinforcement Learning Jul 13, 2022 Offline RL reinforcement-learning
— Unverified 0Continual Meta-Reinforcement Learning for UAV-Aided Vehicular Wireless Networks Jul 13, 2022 Meta Reinforcement Learning reinforcement-learning
— Unverified 0Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework Jul 12, 2022 Multi-agent Reinforcement Learning Policy Gradient Methods
— Unverified 0Optimistic PAC Reinforcement Learning: the Instance-Dependent View Jul 12, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Learning Bellman Complete Representations for Offline Policy Evaluation Jul 12, 2022 continuous-control Continuous Control
Code Code Available 0DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization Jul 12, 2022 Diversity reinforcement-learning
Code Code Available 1Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning Jul 12, 2022 Disentanglement reinforcement-learning
Code Code Available 1Online Game Level Generation from Music Jul 12, 2022 Game Design reinforcement-learning
Code Code Available 0Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning Jul 12, 2022 Lifelong learning Policy Gradient Methods
Code Code Available 1PAC Reinforcement Learning for Predictive State Representations Jul 12, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Grounding Aleatoric Uncertainty for Unsupervised Environment Design Jul 11, 2022 Reinforcement Learning (RL)
— Unverified 0Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning Jul 11, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Learning Temporally Extended Skills in Continuous Domains as Symbolic Actions for Planning Jul 11, 2022 continuous-control Continuous Control
— Unverified 0Reinforcement Learningx2013Based Transient Response Shaping for Microgrids Jul 11, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0State Dropout-Based Curriculum Reinforcement Learning for Self-Driving at Unsignalized Intersections Jul 10, 2022 Autonomous Driving Autonomous Vehicles
— Unverified 0Deep Reinforcement Learning for Long-Term Voltage Stability Control Jul 9, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents Jul 8, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0High Performance Simulation for Scalable Multi-Agent Reinforcement Learning Jul 8, 2022 GPU Multi-agent Reinforcement Learning
— Unverified 0Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning Jul 8, 2022 Diversity Multi-agent Reinforcement Learning
Code Code Available 1CompoSuite: A Compositional Reinforcement Learning Benchmark Jul 8, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 1Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse Management Jul 8, 2022 Management reinforcement-learning
Code Code Available 1Safe reinforcement learning for multi-energy management systems with known constraint functions Jul 8, 2022 energy management Management
— Unverified 0Reinforced Lin-Kernighan-Helsgaun Algorithms for the Traveling Salesman Problems Jul 8, 2022 Combinatorial Optimization Q-Learning
Code Code Available 1Variational multiscale reinforcement learning for discovering reduced order closure models of nonlinear spatiotemporal transport systems Jul 7, 2022 Reinforcement Learning (RL)
— Unverified 0Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition Jul 7, 2022 Cross-corpus Deep Reinforcement Learning
— Unverified 0gym-DSSAT: a crop model turned into a Reinforcement Learning environment Jul 7, 2022 Management reinforcement-learning
— Unverified 0Evaluating Human-like Explanations for Robot Actions in Reinforcement Learning Scenarios Jul 7, 2022 counterfactual Decision Making
— Unverified 0DRL-ISP: Multi-Objective Camera ISP with Deep Reinforcement Learning Jul 7, 2022 2D Object Detection Deep Reinforcement Learning
— Unverified 0Stochastic optimal well control in subsurface reservoirs using reinforcement learning Jul 7, 2022 Management reinforcement-learning
Code Code Available 0Robust optimal well control using an adaptive multi-grid reinforcement learning framework Jul 7, 2022 Computational Efficiency reinforcement-learning
Code Code Available 0Vessel-following model for inland waterways based on deep reinforcement learning Jul 7, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Multi-objective Optimization of Notifications Using Offline Reinforcement Learning Jul 7, 2022 Q-Learning reinforcement-learning
— Unverified 0Reinforcement Learning for Distributed Transient Frequency Control with Stability and Safety Guarantees Jul 7, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Inferring and Conveying Intentionality: Beyond Numerical Rewards to Logical Intentions Jul 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design Jul 6, 2022 Reinforcement Learning (RL)
— Unverified 0A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative Object Jul 6, 2022 Motion Planning Object
Code Code Available 1Model Selection in Reinforcement Learning with General Function Approximations Jul 6, 2022 Model Selection Multi-Armed Bandits
— Unverified 0Reinforcement Learning Portfolio Manager Framework with Monte Carlo Simulation Jul 6, 2022 Management reinforcement-learning
— Unverified 0Deep Reinforcement Learning Approach for Trading Automation in The Stock Market Jul 5, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Decentralized scheduling through an adaptive, trading-based multi-agent system Jul 5, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0AVDDPG: Federated reinforcement learning applied to autonomous platoon control Jul 5, 2022 Federated Learning reinforcement-learning
— Unverified 0Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning Jul 5, 2022 Decoder Multi-agent Reinforcement Learning
Code Code Available 1Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs Jul 5, 2022 Fairness reinforcement-learning
Code Code Available 1CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning Jul 5, 2022 Code Generation Decoder
Code Code Available 2Explainability in Deep Reinforcement Learning, a Review into Current Methods and Applications Jul 5, 2022 Deep Reinforcement Learning Explainable artificial intelligence
— Unverified 0Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization Jul 5, 2022 continuous-control Continuous Control
Code Code Available 0Tackling Real-World Autonomous Driving using Deep Reinforcement Learning Jul 5, 2022 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Resource Allocation in Multicore Elastic Optical Networks: A Deep Reinforcement Learning Approach Jul 5, 2022 Blocking Deep Reinforcement Learning
— Unverified 0