| Regret Bounds for Discounted MDPs | Feb 12, 2020 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Mean-Field Controls with Q-learning for Cooperative MARL: Convergence and Complexity Analysis | Feb 10, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Learning State Abstractions for Transfer in Continuous Control | Feb 8, 2020 | continuous-control, Continuous Control | Code Available | 0 |
| GLSearch: Maximum Common Subgraph Detection via Learning to Search | Feb 8, 2020 | Cloud Computing, Graph Embedding | Unverified | 0 |
| Manipulating Reinforcement Learning: Poisoning Attacks on Cost Signals | Feb 7, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Safe Wasserstein Constrained Deep Q-Learning | Feb 7, 2020 | Q-Learning | Unverified | 0 |
| A Stochastic Game Framework for Efficient Energy Management in Microgrid Networks | Feb 6, 2020 | energy management, energy trading | Code Available | 1 |
| Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes | Feb 3, 2020 | Q-Learning, Reinforcement Learning | Unverified | 0 |
| Finite-Time Analysis of Asynchronous Stochastic Approximation and Q-Learning | Feb 1, 2020 | Q-Learning | Unverified | 0 |
| Autonomous Control of a Line Follower Robot Using a Q-Learning Controller | Jan 23, 2020 | Friction, Q-Learning | Unverified | 0 |
| Q-Learning in enormous action spaces via amortized approximate maximization | Jan 22, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Discriminator Soft Actor Critic without Extrinsic Rewards | Jan 19, 2020 | Imitation Learning, Q-Learning | Code Available | 1 |
| Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping | Jan 15, 2020 | Model-based Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| A storage expansion planning framework using reinforcement learning and simulation-based optimization | Jan 10, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| A Probabilistic Simulator of Spatial Demand for Product Allocation | Jan 9, 2020 | Q-Learning | Unverified | 0 |
| EEG-based Drowsiness Estimation for Driving Safety using Deep Q-Learning | Jan 8, 2020 | Brain Computer Interface, Deep Reinforcement Learning | Unverified | 0 |
| Experimental Analysis of Reinforcement Learning Techniques for Spectrum Sharing Radar | Jan 6, 2020 | Q-Learning, reinforcement-learning | Unverified | 0 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| SVQN: Sequential Variational Soft Q-Learning Networks | Jan 1, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog | Jan 1, 2020 | Deep Reinforcement Learning, OpenAI Gym | Unverified | 0 |
| Information Theoretic Model Predictive Q-Learning | Dec 31, 2019 | Decision Making, model | Unverified | 0 |
| The Gambler's Problem and Beyond | Dec 31, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Learning in Discounted-cost and Average-cost Mean-field Games | Dec 31, 2019 | Q-Learning | Unverified | 0 |
| Hamilton-Jacobi-Bellman Equations for Q-Learning in Continuous Time | Dec 23, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Learning an Interpretable Traffic Signal Control Policy | Dec 23, 2019 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| Soft Q Network | Dec 20, 2019 | Q-Learning | Unverified | 0 |
| Sepsis World Model: A MIMIC-based OpenAI Gym "World Model" Simulator for Sepsis Treatment | Dec 15, 2019 | model, OpenAI Gym | Unverified | 0 |
| Provably Efficient Reinforcement Learning with Aggregated States | Dec 13, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| High dimensional precision medicine from patient-derived xenografts | Dec 13, 2019 | Q-Learning, Vocal Bursts Intensity Prediction | Unverified | 0 |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | Dec 10, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Value-of-Information based Arbitration between Model-based and Model-free Control | Dec 8, 2019 | Computational Efficiency, model | Unverified | 0 |
| Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery | Dec 7, 2019 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| Combining Q-Learning and Search with Amortized Value Estimates | Dec 5, 2019 | Q-Learning | Unverified | 0 |
| Reinforcement Learning with Non-Markovian Rewards | Dec 5, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms | Dec 4, 2019 | Q-Learning | Unverified | 0 |
| Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks | Dec 4, 2019 | Combinatorial Optimization, Graph Attention | Unverified | 0 |
| Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning | Dec 3, 2019 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| Neural Temporal-Difference Learning Converges to Global Optima | Dec 1, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle | Dec 1, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters | Dec 1, 2019 | Atari Games, Q-Learning | Code Available | 0 |
| Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces | Dec 1, 2019 | Privacy Preserving, Q-Learning | Code Available | 0 |
| Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach | Dec 1, 2019 | Q-Learning | Unverified | 0 |
| Quadratic Q-network for Learning Continuous Control for Autonomous Vehicles | Nov 29, 2019 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| QMR:Q-learning based Multi-objective optimization Routing protocol for Flying Ad Hoc Networks | Nov 27, 2019 | Q-Learning | Code Available | 0 |
| Join Query Optimization with Deep Reinforcement Learning Algorithms | Nov 26, 2019 | Attribute, Deep Reinforcement Learning | Code Available | 0 |
| Control-Tutored Reinforcement Learning: an application to the Herding Problem | Nov 26, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks | Nov 25, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control | Nov 25, 2019 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Mitigate Bias in Face Recognition using Skewness-Aware Reinforcement Learning | Nov 25, 2019 | Face Recognition, Fairness | Unverified | 0 |
| Which Channel to Ask My Question? Personalized Customer Service RequestStream Routing using DeepReinforcement Learning | Nov 24, 2019 | Chatbot, Deep Reinforcement Learning | Unverified | 0 |