Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings Oct 30, 2021 Policy Gradient Methods reinforcement-learning
— Unverified 0 Context Meta-Reinforcement Learning via Neuromodulation Oct 30, 2021 continuous-control Continuous Control
Code Available 0 Adjacency constraint for efficient hierarchical reinforcement learning Oct 30, 2021 continuous-control Continuous Control
— Unverified 0 Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner Oct 30, 2021 Imitation Learning Reinforcement Learning (RL)
— Unverified 0 A Decentralized Reinforcement Learning Framework for Efficient Passage of Emergency Vehicles Oct 30, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning Oct 29, 2021 Deep Reinforcement Learning Object
— Unverified 0 Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System Oct 29, 2021 Multi-agent Reinforcement Learning Q-Learning
— Unverified 0 GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL Oct 29, 2021 Out of Distribution (OOD) Detection Reinforcement Learning (RL)
— Unverified 0 Adaptive Discretization in Online Reinforcement Learning Oct 29, 2021 Management reinforcement-learning
— Unverified 0 Reinforced Workload Distribution Fairness Oct 29, 2021 Fairness Reinforcement Learning (RL)
— Unverified 0 Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning Oct 29, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0 Open Problem: Tight Online Confidence Intervals for RKHS Elements Oct 28, 2021 Reinforcement Learning (RL)
— Unverified 0 Efficient Meta Subspace Optimization Oct 28, 2021 Reinforcement Learning (RL)
Code Available 0 Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in Partially Observed Markov Decision Processes Oct 28, 2021 Causal Inference Management
Code Available 0 Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives Oct 28, 2021 Efficient Exploration reinforcement-learning
— Unverified 0 An Adaptable Approach to Learn Realistic Legged Locomotion without Examples Oct 28, 2021 Reinforcement Learning (RL)
— Unverified 0 Choosing the Best of Both Worlds: Diverse and Novel Recommendations through Multi-Objective Reinforcement Learning Oct 28, 2021 Diversity Multi-Objective Reinforcement Learning
— Unverified 0 Extracting Expert's Goals by What-if Interpretable Modeling Oct 28, 2021 Additive models reinforcement-learning
— Unverified 0 Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning Oct 28, 2021 Experimental Design reinforcement-learning
— Unverified 0 Data Informed Residual Reinforcement Learning for High-Dimensional Robotic Tracking Control Oct 28, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 D2RLIR : an improved and diversified ranking function in interactive recommendation systems based on deep reinforcement learning Oct 28, 2021 Deep Reinforcement Learning Diversity
— Unverified 0 Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem Oct 27, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 Enhancing Reinforcement Learning with discrete interfaces to learn the Dyck Language Oct 27, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning Oct 27, 2021 Decision Making Multi-agent Reinforcement Learning
— Unverified 0 Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids Oct 27, 2021 Q-Learning reinforcement-learning
— Unverified 0 DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention Oct 27, 2021 OpenAI Gym reinforcement-learning
— Unverified 0 A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems Oct 27, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning Oct 27, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 Stabilising viscous extensional flows using Reinforcement Learning Oct 27, 2021 reinforcement-learning Reinforcement Learning
Code Available 0 APPTeK: Agent-Based Predicate Prediction in Temporal Knowledge Graphs Oct 27, 2021 Knowledge Graphs Prediction
— Unverified 0 Model based Multi-agent Reinforcement Learning with Tensor Decompositions Oct 27, 2021 Model-based Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0 Transfer learning with causal counterfactual reasoning in Decision Transformers Oct 27, 2021 counterfactual Counterfactual Reasoning
— Unverified 0 Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection Oct 27, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 Reinforcement Learning in Factored Action Spaces using Tensor Decompositions Oct 27, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0 Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning Oct 26, 2021 Off-policy evaluation Open-Ended Question Answering
Code Available 0 The Difficulty of Passive Learning in Deep Reinforcement Learning Oct 26, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 Multi-Agent Advisor Q-Learning Oct 26, 2021 Decision Making Multi-agent Reinforcement Learning
Code Available 0 Fragment-based Sequential Translation for Molecular Optimization Oct 26, 2021 Drug Discovery Reinforcement Learning (RL)
— Unverified 0 Average-Reward Learning and Planning with Options Oct 26, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 Distributional Reinforcement Learning for Multi-Dimensional Reward Functions Oct 26, 2021 Distributional Reinforcement Learning reinforcement-learning
Code Available 0 Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling Oct 26, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization Oct 26, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective Oct 26, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 Automating Control of Overestimation Bias for Reinforcement Learning Oct 26, 2021 Continuous Control Q-Learning
— Unverified 0 Learning Robust Controllers Via Probabilistic Model-Based Policy Search Oct 26, 2021 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0 Applications of Multi-Agent Reinforcement Learning in Future Internet: A Comprehensive Survey Oct 26, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0 Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control Oct 26, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 Learning What to Memorize: Using Intrinsic Motivation to Form Useful Memory in Partially Observable Reinforcement Learning Oct 25, 2021 Form Partially Observable Reinforcement Learning
— Unverified 0 Can Q-Learning be Improved with Advice? Oct 25, 2021 Q-Learning reinforcement-learning
— Unverified 0 Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning Oct 25, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0