Beyond Value: CHECKLIST for Testing Inferences in Planning-Based RL Jun 4, 2022 Reinforcement Learning (RL)
— Unverified 0Reward Poisoning Attacks on Offline Multi-Agent Reinforcement Learning Jun 4, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0MACC: Cross-Layer Multi-Agent Congestion Control with Deep Reinforcement Learning Jun 4, 2022 Deep Reinforcement Learning Management
— Unverified 0Reinforcement Learning with Neural Radiance Fields Jun 3, 2022 Decoder NeRF
— Unverified 0Offline Reinforcement Learning with Causal Structured World Models Jun 3, 2022 Model-based Reinforcement Learning Offline RL
— Unverified 0Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement Learning Jun 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Deep Reinforcement Learning Framework For Column Generation Jun 3, 2022 Decision Making Deep Reinforcement Learning
Code Code Available 0Joint Energy Dispatch and Unit Commitment in Microgrids Based on Deep Reinforcement Learning Jun 3, 2022 Deep Reinforcement Learning energy management
— Unverified 0KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems Jun 3, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Equivariant Reinforcement Learning for Quadrotor UAV Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Incrementality Bidding via Reinforcement Learning under Mixed and Delayed Rewards Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0HEX: Human-in-the-loop Explainability via Deep Reinforcement Learning Jun 2, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Sample-Efficient Reinforcement Learning of Partially Observable Markov Games Jun 2, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes Jun 2, 2022 Reinforcement Learning (RL)
— Unverified 0Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning Jun 2, 2022 continuous-control Continuous Control
— Unverified 0RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning Jun 2, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Reinforcement learning based parameters adaption method for particle swarm optimization Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Offline Reinforcement Learning with Differential Privacy Jun 2, 2022 Offline RL reinforcement-learning
— Unverified 0RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation Jun 1, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0On Gap-dependent Bounds for Offline Reinforcement Learning Jun 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Predecessor Features Jun 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Model Generation with Provable Coverability for Offline Reinforcement Learning Jun 1, 2022 Offline RL Out-of-Distribution Generalization
— Unverified 0Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus Jun 1, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation Jun 1, 2022 4k Lifelong learning
— Unverified 0Neural Improvement Heuristics for Graph Combinatorial Optimization Problems Jun 1, 2022 Combinatorial Optimization Graph Neural Network
Code Code Available 0The Phenomenon of Policy Churn Jun 1, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0DM^2: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching Jun 1, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning Jun 1, 2022 Data Augmentation Deep Reinforcement Learning
— Unverified 0Byzantine-Robust Online and Offline Distributed Reinforcement Learning Jun 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL Jun 1, 2022 D4RL Offline RL
— Unverified 0A Database of Multimodal Data to Construct a Simulated Dialogue Partner with Varying Degrees of Cognitive Health Jun 1, 2022 Dialogue Management Management
— Unverified 0A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud May 31, 2022 CPU Decision Making
Code Code Available 0A Mixture-of-Expert Approach to RL-based Dialogue Management May 31, 2022 Attribute Dialogue Management
— Unverified 0Lessons Learned from Data-Driven Building Control Experiments: Contrasting Gaussian Process-based MPC, Bilevel DeePC, and Deep Reinforcement Learning May 31, 2022 Deep Reinforcement Learning Gaussian Processes
— Unverified 0Graph Backup: Data Efficient Backup Exploiting Markovian Transitions May 31, 2022 Atari Games counterfactual
Code Code Available 0k-Means Maximum Entropy Exploration May 31, 2022 Density Estimation reinforcement-learning
— Unverified 0Provable General Function Class Representation Learning in Multitask Bandits and MDPs May 31, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning May 31, 2022 Reinforcement Learning (RL)
— Unverified 0Multi-Agent Learning of Numerical Methods for Hyperbolic PDEs with Factored Dec-MDP May 31, 2022 Decision Making reinforcement-learning
— Unverified 0Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints May 31, 2022 Reinforcement Learning (RL)
— Unverified 0Robust Longitudinal Control for Vehicular Autonomous Platoons Using Deep Reinforcement Learning May 31, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems May 31, 2022 Efficient Exploration reinforcement-learning
— Unverified 0Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game May 31, 2022 Offline RL Reinforcement Learning (RL)
— Unverified 0Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning May 30, 2022 Multiple Instance Learning Reinforcement Learning (RL)
Code Code Available 0Residual Q-Networks for Value Function Factorizing in Multi-Agent Reinforcement Learning May 30, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets May 30, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Reinforcement Learning with a Terminator May 30, 2022 Autonomous Driving reinforcement-learning
Code Code Available 0SEREN: Knowing When to Explore and When to Exploit May 30, 2022 MuJoCo Reinforcement Learning (RL)
— Unverified 0Stock Trading Optimization through Model-based Reinforcement Learning with Resistance Support Relative Strength May 30, 2022 Decision Making Model-based Reinforcement Learning
— Unverified 0