Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement Learning Jun 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems Jun 3, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning with Neural Radiance Fields Jun 3, 2022 Decoder NeRF
— Unverified 0Offline Reinforcement Learning with Causal Structured World Models Jun 3, 2022 Model-based Reinforcement Learning Offline RL
— Unverified 0Joint Energy Dispatch and Unit Commitment in Microgrids Based on Deep Reinforcement Learning Jun 3, 2022 Deep Reinforcement Learning energy management
— Unverified 0Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress Jun 3, 2022 Atari Games Humanoid Control
Code Code Available 1Incrementality Bidding via Reinforcement Learning under Mixed and Delayed Rewards Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0HEX: Human-in-the-loop Explainability via Deep Reinforcement Learning Jun 2, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Equivariant Reinforcement Learning for Quadrotor UAV Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning Jun 2, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning Jun 2, 2022 continuous-control Continuous Control
— Unverified 0Sample-Efficient Reinforcement Learning of Partially Observable Markov Games Jun 2, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Offline Reinforcement Learning with Differential Privacy Jun 2, 2022 Offline RL reinforcement-learning
— Unverified 0When does return-conditioned supervised learning work for offline reinforcement learning? Jun 2, 2022 D4RL reinforcement-learning
Code Code Available 1Reinforcement learning based parameters adaption method for particle swarm optimization Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0NeuralSympCheck: A Symptom Checking and Disease Diagnostic Neural Model with Logic Regularization Jun 2, 2022 Diagnostic Reinforcement Learning (RL)
Code Code Available 1Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes Jun 2, 2022 Reinforcement Learning (RL)
— Unverified 0Deep Transformer Q-Networks for Partially Observable Reinforcement Learning Jun 2, 2022 Partially Observable Reinforcement Learning reinforcement-learning
Code Code Available 1Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning Jun 2, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Database of Multimodal Data to Construct a Simulated Dialogue Partner with Varying Degrees of Cognitive Health Jun 1, 2022 Dialogue Management Management
— Unverified 0RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation Jun 1, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor Jun 1, 2022 Reinforcement Learning (RL) Sequential Recommendation
Code Code Available 1Model Generation with Provable Coverability for Offline Reinforcement Learning Jun 1, 2022 Offline RL Out-of-Distribution Generalization
— Unverified 0Neural Improvement Heuristics for Graph Combinatorial Optimization Problems Jun 1, 2022 Combinatorial Optimization Graph Neural Network
Code Code Available 0Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus Jun 1, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting Jun 1, 2022 Language Modelling Reinforcement Learning (RL)
Code Code Available 1Predecessor Features Jun 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation Jun 1, 2022 4k Lifelong learning
— Unverified 0On Gap-dependent Bounds for Offline Reinforcement Learning Jun 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0The Phenomenon of Policy Churn Jun 1, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL Jun 1, 2022 D4RL Offline RL
— Unverified 0DM^2: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching Jun 1, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning Jun 1, 2022 Data Augmentation Deep Reinforcement Learning
— Unverified 0Byzantine-Robust Online and Offline Distributed Reinforcement Learning Jun 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents May 31, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 1A Mixture-of-Expert Approach to RL-based Dialogue Management May 31, 2022 Attribute Dialogue Management
— Unverified 0Human-AI Shared Control via Policy Dissection May 31, 2022 Autonomous Driving Reinforcement Learning (RL)
Code Code Available 2Robust Longitudinal Control for Vehicular Autonomous Platoons Using Deep Reinforcement Learning May 31, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game May 31, 2022 Offline RL Reinforcement Learning (RL)
— Unverified 0Provable General Function Class Representation Learning in Multitask Bandits and MDPs May 31, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints May 31, 2022 Reinforcement Learning (RL)
— Unverified 0Multi-Agent Learning of Numerical Methods for Hyperbolic PDEs with Factored Dec-MDP May 31, 2022 Decision Making reinforcement-learning
— Unverified 0One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning May 31, 2022 Reinforcement Learning (RL)
— Unverified 0Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems May 31, 2022 Efficient Exploration reinforcement-learning
— Unverified 0k-Means Maximum Entropy Exploration May 31, 2022 Density Estimation reinforcement-learning
— Unverified 0Graph Backup: Data Efficient Backup Exploiting Markovian Transitions May 31, 2022 Atari Games counterfactual
Code Code Available 0Lessons Learned from Data-Driven Building Control Experiments: Contrasting Gaussian Process-based MPC, Bilevel DeePC, and Deep Reinforcement Learning May 31, 2022 Deep Reinforcement Learning Gaussian Processes
— Unverified 0A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud May 31, 2022 CPU Decision Making
Code Code Available 0DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems May 30, 2022 Diversity reinforcement-learning
Code Code Available 2A Simulation Environment and Reinforcement Learning Method for Waste Reduction May 30, 2022 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0