A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes Jul 30, 2022 Decision Making reinforcement-learning
— Unverified 0Unified Automatic Control of Vehicular Systems with Reinforcement Learning Jul 30, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1Solving the vehicle routing problem with deep reinforcement learning Jul 30, 2022 Combinatorial Optimization Deep Reinforcement Learning
— Unverified 0Reinforcement learning with experience replay and adaptation of action dispersion Jul 30, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation and Complexity Analysis Jul 29, 2022 Meta-Learning Meta Reinforcement Learning
Code Code Available 0Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions Jul 29, 2022 Decision Making Reinforcement Learning (RL)
— Unverified 0Meta Reinforcement Learning with Successor Feature Based Context Jul 29, 2022 continuous-control Continuous Control
— Unverified 0Combining Evolutionary Search with Behaviour Cloning for Procedurally Generated Content Jul 29, 2022 Reinforcement Learning (RL) valid
— Unverified 0Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization Jul 29, 2022 Deep Reinforcement Learning MuJoCo
Code Code Available 0Deep Reinforcement Learning for System-on-Chip: Myths and Realities Jul 29, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning Jul 29, 2022 Contrastive Learning Deep Reinforcement Learning
Code Code Available 1Graph Inverse Reinforcement Learning from Diverse Videos Jul 28, 2022 Diversity reinforcement-learning
— Unverified 0Latent Properties of Lifelong Learning Systems Jul 28, 2022 Lifelong learning reinforcement-learning
— Unverified 0RangL: A Reinforcement Learning Competition Platform Jul 28, 2022 OpenAI Gym reinforcement-learning
— Unverified 0Playing a 2D Game Indefinitely using NEAT and Reinforcement Learning Jul 28, 2022 Q-Learning reinforcement-learning
— Unverified 0Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits Jul 28, 2022 Model-based Reinforcement Learning Multi-Armed Bandits
— Unverified 0POSET-RL: Phase ordering for Optimizing Size and Execution Time using Reinforcement Learning Jul 27, 2022 CPU reinforcement-learning
— Unverified 0Multi-Objective Provisioning of Network Slices using Deep Reinforcement Learning Jul 27, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Structural Similarity for Improved Transfer in Reinforcement Learning Jul 27, 2022 Q-Learning reinforcement-learning
— Unverified 0Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control Jul 27, 2022 continuous-control Continuous Control
— Unverified 0Dynamic Shielding for Reinforcement Learning in Black-Box Environments Jul 27, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Contact-Safe Reinforcement Learning Framework for Contact-Rich Robot Manipulation Jul 27, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms Jul 27, 2022 continuous-control Continuous Control
Code Code Available 0Unsupervised Training for Neural TSP Solver Jul 27, 2022 Graph Neural Network reinforcement-learning
— Unverified 0Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning Jul 26, 2022 Decision Making Reinforcement Learning (RL)
— Unverified 0Learning Bipedal Walking On Planned Footsteps For Humanoid Robots Jul 26, 2022 Deep Reinforcement Learning MuJoCo
Code Code Available 3Offline Reinforcement Learning at Multiple Frequencies Jul 26, 2022 Offline RL reinforcement-learning
— Unverified 0Semi-analytical Industrial Cooling System Model for Reinforcement Learning Jul 26, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Planning and Learning: Path-Planning for Autonomous Vehicles, a Review of the Literature Jul 26, 2022 Autonomous Vehicles reinforcement-learning
— Unverified 0Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning Jul 25, 2022 Natural Language Understanding reinforcement-learning
— Unverified 0Cooperative Actor-Critic via TD Error Aggregation Jul 25, 2022 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks Jul 25, 2022 Chemical Process Decision Making
— Unverified 0Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy Jul 25, 2022 continuous-control Continuous Control
Code Code Available 0Online Reinforcement Learning for Periodic MDP Jul 25, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations Jul 25, 2022 Decision Making Meta-Learning
— Unverified 0Post-processing Networks: Method for Optimizing Pipeline Task-oriented Dialogue Systems using Reinforcement Learning Jul 25, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0REPNP: Plug-and-Play with Deep Reinforcement Learning Prior for Robust Image Restoration Jul 25, 2022 Deblurring Deep Reinforcement Learning
— Unverified 0Lifelong Machine Learning of Functionally Compositional Structures Jul 25, 2022 BIG-bench Machine Learning Continual Learning
Code Code Available 1Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts Jul 24, 2022 Deep Reinforcement Learning Humanoid Control
Code Code Available 1Adaptive Decision Making at the Intersection for Autonomous Vehicles Based on Skill Discovery Jul 24, 2022 Autonomous Driving Autonomous Vehicles
— Unverified 0Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System Jul 24, 2022 Reinforcement Learning (RL)
— Unverified 0Driver Dojo: A Benchmark for Generalizable Reinforcement Learning for Autonomous Driving Jul 23, 2022 Autonomous Driving reinforcement-learning
Code Code Available 1Halftoning with Multi-Agent Deep Reinforcement Learning Jul 23, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Epersist: A Self Balancing Robot Using PID Controller And Deep Reinforcement Learning Jul 23, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Hierarchical Kickstarting for Skill Transfer in Reinforcement Learning Jul 23, 2022 Inductive Bias NetHack
Code Code Available 1Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution Jul 22, 2022 Algorithmic Trading continuous-control
— Unverified 0Robust Knowledge Adaptation for Dynamic Graph Neural Networks Jul 22, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 1Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning Jul 21, 2022 Autonomous Driving reinforcement-learning
— Unverified 0Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise Jul 21, 2022 Management Reinforcement Learning (RL)
Code Code Available 0Strategising template-guided needle placement for MR-targeted prostate biopsy Jul 21, 2022 Anatomy Decision Making
— Unverified 0