L2C2: Locally Lipschitz Continuous Constraint towards Stable and Smooth Reinforcement Learning | Feb 15, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
Deep Reinforcement Learning Based Multi-Access Edge Computing Schedule for Internet of Vehicle | Feb 15, 2022 | Deep Reinforcement Learning, Edge-computing | Unverified
Learning to Mitigate AI Collusion on Economic Platforms | Feb 15, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Energy-Efficient Parking Analytics System using Deep Reinforcement Learning | Feb 15, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available
Learning Reward Models for Cooperative Trajectory Planning with Inverse Reinforcement Learning and Monte Carlo Tree Search | Feb 14, 2022 | Decision Making, reinforcement-learning | Code Available
Convex Programs and Lyapunov Functions for Reinforcement Learning: A Unified Perspective on the Analysis of Value-Based Methods | Feb 14, 2022 | Reinforcement Learning (RL) | Unverified
Statistical Inference After Adaptive Sampling for Longitudinal Data | Feb 14, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision Making, Model-based Reinforcement Learning | Unverified
Robust Policy Learning over Multiple Uncertainty Sets | Feb 14, 2022 | Reinforcement Learning (RL) | Unverified
Reinforcement Learning in Presence of Discrete Markovian Context Evolution | Feb 14, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Sequential Bayesian experimental designs via reinforcement learning | Feb 14, 2022 | Bayesian Inference, Decision Making | Unverified
Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation | Feb 14, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality | Feb 14, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Motivating Physical Activity via Competitive Human-Robot Interaction | Feb 14, 2022 | Multi-agent Reinforcement Learning, Reinforcement Learning (RL) | Unverified
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost | Feb 13, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Deep Reinforcement Learning and Convex Mean-Variance Optimisation for Portfolio Management | Feb 13, 2022 | Deep Reinforcement Learning, Management | Unverified
Individual-Level Inverse Reinforcement Learning for Mean Field Games | Feb 13, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Goal Recognition as Reinforcement Learning | Feb 13, 2022 | Q-Learning, reinforcement-learning | Code Available
Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D Environments with Dynamic Obstacles | Feb 13, 2022 | Deep Reinforcement Learning, Management | Unverified
End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints Representation | Feb 12, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
Neural NID Rules | Feb 12, 2022 | Common Sense Reasoning, Graph Neural Network | Unverified
Robust Learning from Observation with Model Misspecification | Feb 12, 2022 | continuous-control, Continuous Control | Code Available
Rate-matching the regret lower-bound in the linear quadratic regulator with unknown dynamics | Feb 11, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization | Feb 11, 2022 | Combinatorial Optimization, Reinforcement Learning (RL) | Code Available
Regularized Q-learning | Feb 11, 2022 | Q-Learning, reinforcement-learning | Unverified
Computational-Statistical Gaps in Reinforcement Learning | Feb 11, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search | Feb 11, 2022 | Atari Games, Decision Making | Unverified
Group-Agent Reinforcement Learning | Feb 10, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified
AI-based Robust Resource Allocation in End-to-End Network Slicing under Demand and CSI Uncertainties | Feb 10, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Interpretable pipelines with evolutionarily optimized modules for RL tasks with visual inputs | Feb 10, 2022 | Decision Making, Evolutionary Algorithms | Unverified
Abstraction for Deep Reinforcement Learning | Feb 10, 2022 | BIG-bench Machine Learning, Deep Reinforcement Learning | Unverified
Universal Learning Waveform Selection Strategies for Adaptive Target Tracking | Feb 10, 2022 | reinforcement-learning, Reinforcement Learning (RL) | Unverified
Settling the Communication Complexity for Distributed Offline Reinforcement Learning | Feb 10, 2022 | Multi-Armed Bandits, Offline RL | Unverified
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning | Feb 10, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition | Feb 10, 2022 | reinforcement-learning, Reinforcement Learning | Unverified
Uncovering Instabilities in Variational-Quantum Deep Q-Networks | Feb 10, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory | Feb 10, 2022 | Off-policy evaluation, Reinforcement Learning (RL) | Unverified
Reinforcement Learning in the Wild: Scalable RL Dispatching Algorithm Deployed in Ridehailing Marketplace | Feb 10, 2022 | Causal Inference, reinforcement-learning | Unverified
Scenario-Assisted Deep Reinforcement Learning | Feb 9, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
Understanding and Shifting Preferences for Battery Electric Vehicles | Feb 9, 2022 | Reinforcement Learning (RL) | Unverified
Offline Reinforcement Learning with Realizability and Single-policy Concentrability | Feb 9, 2022 | Offline RL, reinforcement-learning | Unverified
Transferred Q-learning | Feb 9, 2022 | Offline RL, Q-Learning | Unverified
Bayesian Nonparametrics for Offline Skill Discovery | Feb 9, 2022 | Imitation Learning, reinforcement-learning | Code Available
Intelligent Autonomous Intersection Management | Feb 9, 2022 | Autonomous Vehicles, Management | Unverified
A Reinforcement Learning Approach to Domain-Knowledge Inclusion Using Grammar Guided Symbolic Regression | Feb 9, 2022 | regression, reinforcement-learning | Code Available
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence | Feb 8, 2022 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Unverified
Energy Management Based on Multi-Agent Deep Reinforcement Learning for A Multi-Energy Industrial Park | Feb 8, 2022 | counterfactual, Deep Reinforcement Learning | Unverified
Local Explanations for Reinforcement Learning | Feb 8, 2022 | Deep Reinforcement Learning, reinforcement-learning | Unverified
PolicyCleanse: Backdoor Detection and Mitigation in Reinforcement Learning | Feb 8, 2022 | Machine Unlearning, reinforcement-learning | Unverified
GrASP: Gradient-Based Affordance Selection for Planning | Feb 8, 2022 | Reinforcement Learning (RL) | Unverified