Transportation-Inequalities, Lyapunov Stability and Sampling for Dynamical Systems on Continuous State Space May 25, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0MAVIPER: Learning Decision Tree Policies for Interpretable Multi-Agent Reinforcement Learning May 25, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning May 25, 2022 continuous-control Continuous Control
Code Code Available 0Trust-based Consensus in Multi-Agent Reinforcement Learning Systems May 25, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Robust Reinforcement Learning on Graphs for Logistics optimization May 25, 2022 Graph Neural Network reinforcement-learning
— Unverified 0Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret May 25, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation May 25, 2022 Q-Learning reinforcement-learning
— Unverified 0Impartial Games: A Challenge for Reinforcement Learning May 25, 2022 Board Games Position
Code Code Available 0Learning to Query Internet Text for Informing Reinforcement Learning Agents May 25, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization May 25, 2022 Hierarchical Reinforcement Learning Meta Reinforcement Learning
— Unverified 0Learning in Mean Field Games: A Survey May 25, 2022 Reinforcement Learning (RL) Survey
— Unverified 0Reward Uncertainty for Exploration in Preference-based Reinforcement Learning May 24, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 1Penalized Proximal Policy Optimization for Safe Reinforcement Learning May 24, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Meta Policy Learning for Cold-Start Conversational Recommendation May 24, 2022 Conversational Recommendation Meta Reinforcement Learning
Code Code Available 0History Compression via Language Models in Reinforcement Learning May 24, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 1Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning May 24, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Deep Reinforcement Learning for Multi-class Imbalanced Training May 24, 2022 Deep Reinforcement Learning imbalanced classification
Code Code Available 0Concurrent Credit Assignment for Data-efficient Reinforcement Learning May 24, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning to Drive Using Sparse Imitation Reinforcement Learning May 24, 2022 Autonomous Driving reinforcement-learning
— Unverified 0Graph Convolutional Reinforcement Learning for Collaborative Queuing Agents May 24, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Efficient Reinforcement Learning from Demonstration Using Local Ensemble and Reparameterization with Split and Merge of Expert Policies May 23, 2022 continuous-control Continuous Control
— Unverified 0Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines May 23, 2022 Meta-Learning Meta Reinforcement Learning
Code Code Available 0RL with KL penalties is better viewed as Bayesian inference May 23, 2022 Bayesian Inference Language Modelling
— Unverified 0Multiple Domain Cyberspace Attack and Defense Game Based on Reward Randomization Reinforcement Learning May 23, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning May 23, 2022 Open-Ended Question Answering reinforcement-learning
— Unverified 0Logarithmic regret bounds for continuous-time average-reward Markov decision processes May 23, 2022 Point Processes reinforcement-learning
— Unverified 0Spreading Factor assisted LoRa Localization with Deep Reinforcement Learning May 23, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs May 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments May 23, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning May 23, 2022 D4RL Offline RL
Code Code Available 1Learning to Advise and Learning from Advice in Cooperative Multi-Agent Reinforcement Learning May 23, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation May 23, 2022 Reinforcement Learning (RL)
— Unverified 0Learning to branch with Tree MDPs May 23, 2022 Reinforcement Learning (RL)
Code Code Available 1Cooperative Reinforcement Learning on Traffic Signal Control May 23, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Contextual Information-Directed Sampling May 22, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Inverse-Inverse Reinforcement Learning. How to Hide Strategy from an Adversarial Inverse Reinforcement Learner May 22, 2022 Reinforcement Learning (RL)
— Unverified 0Power and accountability in reinforcement learning applications to environmental policy May 22, 2022 Decision Making Management
— Unverified 0Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation May 22, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning May 22, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Reinforced Pedestrian Attribute Recognition with Group Optimization Reward May 21, 2022 Attribute Decision Making
— Unverified 0User-Interactive Offline Reinforcement Learning May 21, 2022 Offline RL reinforcement-learning
— Unverified 0De novo design of protein target specific scaffold-based Inhibitors via Reinforcement Learning May 21, 2022 Drug Discovery Graph Neural Network
— Unverified 0CORAL: Contextual Response Retrievability Loss Function for Training Dialog Generation Models May 21, 2022 Reinforcement Learning (RL) Text Generation
— Unverified 0Coordinating Policies Among Multiple Agents via an Intelligent Communication Channel May 21, 2022 Intelligent Communication Multi-agent Reinforcement Learning
— Unverified 0ARLO: A Framework for Automated Reinforcement Learning May 20, 2022 feature selection MuJoCo
Code Code Available 1Prototyping three key properties of specific curiosity in computational reinforcement learning May 20, 2022 Decision Making reinforcement-learning
— Unverified 0Synthesis from Satisficing and Temporal Goals May 20, 2022 Reinforcement Learning (RL)
Code Code Available 0Towards biologically plausible Dreaming and Planning in recurrent spiking networks May 20, 2022 Autonomous Driving Model-based Reinforcement Learning
Code Code Available 0Long Run Incremental Cost (LRIC) Distribution Network Pricing in UK, advising China's Distribution Network May 20, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Survey on Fair Reinforcement Learning: Theory and Practice May 20, 2022 Articles Decision Making
— Unverified 0