| Show Us the Way: Learning to Manage Dialog from Demonstrations | Apr 17, 2020 | dialog state trackingManagement | —Unverified | 0 |
| K-spin Hamiltonian for quantum-resolvable Markov decision processes | Apr 13, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Self Punishment and Reward Backfill for Deep Q-Learning | Apr 10, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Zero-Shot Learning of Text Adventure Games with Sentence-Level Semantics | Apr 6, 2020 | ClusteringQ-Learning | —Unverified | 0 |
| Multi-agent Reinforcement Learning for Resource Allocation in IoT networks with Edge Computing | Apr 5, 2020 | Cloud ComputingDistributed Computing | —Unverified | 0 |
| Minimizing Age-of-Information for Fog Computing-supported Vehicular Networks with Deep Q-learning | Apr 4, 2020 | Autonomous DrivingQ-Learning | —Unverified | 0 |
| Reinforcement Learning for Mixed-Integer Problems Based on MPC | Apr 3, 2020 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality? | Apr 2, 2020 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning | Apr 1, 2020 | NegationQ-Learning | —Unverified | 0 |
| Augmented Q Imitation Learning (AQIL) | Mar 31, 2020 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning: Results for the Fighting Game AI Competition | Mar 31, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Learning medical triage from clinicians using Deep Q-Learning | Mar 28, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Robust Q-learning | Mar 27, 2020 | Q-Learningregression | —Unverified | 0 |
| A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms | Mar 27, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Convergence of Recursive Stochastic Algorithms using Wasserstein Divergence | Mar 25, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-Learning in Regularized Mean-field Games | Mar 24, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Using Deep Reinforcement Learning Methods for Autonomous Vessels in 2D Environments | Mar 23, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Distributed Reinforcement Learning for Cooperative Multi-Robot Object Manipulation | Mar 21, 2020 | ObjectQ-Learning | —Unverified | 0 |
| FlapAI Bird: Training an Agent to Play Flappy Bird Using Reinforcement Learning Techniques | Mar 21, 2020 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Constrained Q-learning | Mar 20, 2020 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Deep Reinforcement Learning with Weighted Q-Learning | Mar 20, 2020 | Deep Reinforcement LearningGaussian Processes | —Unverified | 0 |
| DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction | Mar 16, 2020 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 1 |
| Active Perception and Representation for Robotic Manipulation | Mar 15, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Application of Deep Q-Network in Portfolio Management | Mar 13, 2020 | Deep Reinforcement LearningFace Recognition | —Unverified | 0 |
| A General Framework for Learning Mean-Field Games | Mar 13, 2020 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints | Mar 11, 2020 | Q-LearningScheduling | —Unverified | 0 |
| Privacy-Cost Management in Smart Meters Using Deep Reinforcement Learning | Mar 10, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Indirect and Direct Training of Spiking Neural Networks for End-to-End Control of a Lane-Keeping Vehicle | Mar 10, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Software-Level Accuracy Using Stochastic Computing With Charge-Trap-Flash Based Weight Matrix | Mar 9, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Multi-Agent Reinforcement Learning Approach For Safe and Efficient Behavior Planning Of Connected Autonomous Vehicles | Mar 9, 2020 | Autonomous VehiclesMulti-agent Reinforcement Learning | —Unverified | 0 |
| Transfer Reinforcement Learning under Unobserved Contextual Information | Mar 9, 2020 | Motion PlanningQ-Learning | —Unverified | 0 |
| Reinforcement Learning Based Cooperative Coded Caching under Dynamic Popularities in Ultra-Dense Networks | Mar 8, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning | Mar 3, 2020 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Adaptive Structural Hyper-Parameter Configuration by Q-Learning | Mar 2, 2020 | Evolutionary AlgorithmsQ-Learning | —Unverified | 0 |
| Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts | Feb 29, 2020 | Mixture-of-ExpertsOpenAI Gym | —Unverified | 0 |
| Deep Reinforcement Learning for FlipIt Security Game | Feb 28, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| ConQUR: Mitigating Delusional Bias in Deep Q-learning | Feb 27, 2020 | Atari GamesQ-Learning | CodeCode Available | 0 |
| Optimistic Exploration even with a Pessimistic Initialisation | Feb 26, 2020 | Efficient ExplorationQ-Learning | CodeCode Available | 1 |
| Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization | Feb 25, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning | Feb 25, 2020 | ManagementQ-Learning | —Unverified | 0 |
| A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint | Feb 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning | Feb 24, 2020 | Distributional Reinforcement LearningQ-Learning | —Unverified | 0 |
| Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning | Feb 24, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Periodic Q-Learning | Feb 23, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Anypath Routing Protocol Design via Q-Learning for Underwater Sensor Networks | Feb 22, 2020 | Q-Learning | —Unverified | 0 |
| UAV Aided Search and Rescue Operation Using Reinforcement Learning | Feb 19, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity | Feb 17, 2020 | Q-Learning | —Unverified | 0 |
| Maxmin Q-learning: Controlling the Estimation Bias of Q-learning | Feb 16, 2020 | Q-Learning | CodeCode Available | 1 |
| Listwise Learning to Rank with Deep Q-Networks | Feb 13, 2020 | Decision MakingLearning-To-Rank | —Unverified | 0 |