Specializing Versatile Skill Libraries using Local Mixture of Experts Dec 8, 2021 Incremental Learning Mixture-of-Experts
Code Code Available 0 CoMPS: Continual Meta Policy Search Dec 8, 2021 Continual Learning continuous-control
— Unverified 0 Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization Dec 8, 2021 Bayesian Optimization reinforcement-learning
— Unverified 0 Application of Deep Reinforcement Learning to Payment Fraud Dec 8, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0 Learning to Select the Next Reasonable Mention for Entity Linking Dec 8, 2021 Entity Linking Knowledge Graphs
— Unverified 0 Learning over All Stabilizing Nonlinear Controllers for a Partially-Observed Linear System Dec 8, 2021 All Reinforcement Learning (RL)
— Unverified 0 A Review for Deep Reinforcement Learning in Atari: Benchmarks, Challenges, and Solutions Dec 8, 2021 Atari Games Deep Reinforcement Learning
— Unverified 0 Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach Dec 8, 2021 counterfactual Decision Making
— Unverified 0 Deep Q-Learning Market Makers in a Multi-Agent Simulated Stock Market Dec 8, 2021 Q-Learning Reinforcement Learning (RL)
— Unverified 0 JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning Dec 7, 2021 Efficient Exploration Hierarchical Reinforcement Learning
— Unverified 0 Attention-Based Model and Deep Reinforcement Learning for Distribution of Event Processing Tasks Dec 7, 2021 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0 A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules Dec 7, 2021 BIG-bench Machine Learning Deep Reinforcement Learning
— Unverified 0 First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach Dec 7, 2021 Decision Making reinforcement-learning
— Unverified 0 Synthetic Acute Hypotension and Sepsis Datasets Based on MIMIC-III and Published as Part of the Health Gym Project Dec 7, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 QKSA: Quantum Knowledge Seeking Agent -- resource-optimized reinforcement learning using quantum process tomography Dec 7, 2021 Quantum Machine Learning reinforcement-learning
— Unverified 0 Model-free Nearly Optimal Control of Constrained-Input Nonlinear Systems Based on Synchronous Reinforcement Learning Dec 7, 2021 Reinforcement Learning (RL)
— Unverified 0 MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance Dec 7, 2021 continuous-control Continuous Control
— Unverified 0 Organ localisation using supervised and semi supervised approaches combining reinforcement learning with imitation learning Dec 6, 2021 Imitation Learning Reinforcement Learning (RL)
— Unverified 0 Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning Dec 6, 2021 Causal Discovery Decision Making
— Unverified 0 Virtual Replay Cache Dec 6, 2021 Atari Games Deep Reinforcement Learning
Code Code Available 0 MDPFuzz: Testing Models Solving Markov Decision Processes Dec 6, 2021 Autonomous Driving Collision Avoidance
— Unverified 0 MDPGT: Momentum-based Decentralized Policy Gradient Tracking Dec 6, 2021 Multi-agent Reinforcement Learning Policy Gradient Methods
Code Code Available 0 Flexible Option Learning Dec 6, 2021 Deep Reinforcement Learning Hierarchical Reinforcement Learning
Code Code Available 0 Lecture Notes on Partially Known MDPs Dec 6, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 Deep differentiable reinforcement learning and optimal trading Dec 6, 2021 Portfolio Optimization reinforcement-learning
— Unverified 0 Distilled Domain Randomization Dec 6, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 Benchmark for Out-of-Distribution Detection in Deep Reinforcement Learning Dec 5, 2021 Deep Reinforcement Learning Out-of-Distribution Detection
— Unverified 0 Deep Policy Iteration with Integer Programming for Inventory Management Dec 4, 2021 Decision Making Management
— Unverified 0 Reinforcement learning for options on target volatility funds Dec 3, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 An Analytical Update Rule for General Policy Optimization Dec 3, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 Divergent representations of ethological visual inputs emerge from supervised, unsupervised, and reinforcement learning Dec 3, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 Differentially Private Exploration in Reinforcement Learning with Linear Representation Dec 2, 2021 Privacy Preserving reinforcement-learning
— Unverified 0 Adversarial Robustness of Deep Reinforcement Learning based Dynamic Recommender Systems Dec 2, 2021 Adversarial Robustness counterfactual
— Unverified 0 Convergence Guarantees for Deep Epsilon Greedy Policy Learning Dec 2, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 Architecting and Visualizing Deep Reinforcement Learning Models Dec 2, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0 Towards Personalization of User Preferences in Partially Observable Smart Home Environments Dec 2, 2021 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0 Maximum Entropy Model-based Reinforcement Learning Dec 2, 2021 Dota 2 model
— Unverified 0 A Generic Graph Sparsification Framework using Deep Reinforcement Learning Dec 2, 2021 Decision Making Deep Reinforcement Learning
Code Code Available 0 Towards Interactive Reinforcement Learning with Intrinsic Feedback Dec 2, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 Reward-Free Attacks in Multi-Agent Reinforcement Learning Dec 2, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0 Sample Complexity of Robust Reinforcement Learning with a Generative Model Dec 2, 2021 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 0 Safe Reinforcement Learning for Grid Voltage Control Dec 2, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 Reinforcement Learning Enhanced Explainer for Graph Neural Networks Dec 1, 2021 Combinatorial Optimization Graph Generation
— Unverified 0 Multi-Agent Transfer Learning in Reinforcement Learning-Based Ride-Sharing Systems Dec 1, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0 Reinforcement Learning with State Observation Costs in Action-Contingent Noiselessly Observable Markov Decision Processes Dec 1, 2021 Reinforcement Learning (RL)
Code Code Available 0 DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning Dec 1, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0 Taming Communication and Sample Complexities in Decentralized Policy Evaluation for Cooperative Multi-Agent Reinforcement Learning Dec 1, 2021 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0 On the Practical Consistency of Meta-Reinforcement Learning Algorithms Dec 1, 2021 Meta-Learning Meta Reinforcement Learning
— Unverified 0 Structural Credit Assignment in Neural Networks using Reinforcement Learning Dec 1, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0 RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents Dec 1, 2021 Multi-agent Reinforcement Learning quantile regression
— Unverified 0