| Stochastic approximation with cone-contractive operators: Sharp _-bounds for Q-learning | May 15, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction | May 15, 2019 | ManagementOpenAI Gym | —Unverified | 0 |
| Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning | May 10, 2019 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Domain Adversarial Reinforcement Learning for Partial Domain Adaptation | May 10, 2019 | Domain AdaptationPartial Domain Adaptation | —Unverified | 0 |
| A Reinforcement Learning Perspective on the Optimal Control of Mutation Probabilities for the (1+1) Evolutionary Algorithm: First Results on the OneMax Problem | May 9, 2019 | Evolutionary AlgorithmsQ-Learning | —Unverified | 0 |
| Pretrain Soft Q-Learning with Imperfect Demonstrations | May 9, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning | May 9, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Accelerated Target Updates for Q-learning | May 7, 2019 | Atari GamesQ-Learning | —Unverified | 0 |
| Deep Ordinal Reinforcement Learning | May 6, 2019 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Comprehensible Context-driven Text Game Playing | May 6, 2019 | Q-Learning | CodeCode Available | 0 |
| Efficient Model-free Reinforcement Learning in Metric Spaces | May 1, 2019 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Soft Q-Learning with Mutual-Information Regularization | May 1, 2019 | Decision MakingQ-Learning | —Unverified | 0 |
| Learning agents with prioritization and parameter noise in continuous state and action space | May 1, 2019 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| Two-Timescale Networks for Nonlinear Value Function Approximation | May 1, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks | Apr 30, 2019 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Zap Q-Learning for Optimal Stopping Time Problems | Apr 25, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Target-Based Temporal Difference Learning | Apr 24, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Stochastic Lipschitz Q-Learning | Apr 24, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Deep Q-Learning for Nash Equilibria: Nash-DQN | Apr 23, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Driving Decision and Control for Autonomous Lane Change based on Deep Reinforcement Learning | Apr 23, 2019 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Deep Q Learning Driven CT Pancreas Segmentation with Geometry-Aware U-Net | Apr 19, 2019 | Medical Image AnalysisPancreas Segmentation | —Unverified | 0 |
| "Jam Me If You Can'': Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications | Apr 8, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams | Apr 3, 2019 | Hard Attentionobject-detection | —Unverified | 0 |
| Personalized Cancer Chemotherapy Schedule: a numerical comparison of performance and robustness in model-based and model-free scheduling methodologies | Apr 2, 2019 | Deep Reinforcement Learningmodel | —Unverified | 0 |
| Learning Automata Based Q-learning for Content Placement in Cooperative Caching | Mar 30, 2019 | Q-Learning | —Unverified | 0 |
| Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to ATARI games | Mar 26, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Q-Learning for Continuous Actions with Cross-Entropy Guided Policies | Mar 25, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Towards Characterizing Divergence in Deep Q-Learning | Mar 21, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Online Antenna Tuning in Heterogeneous Cellular Networks with Deep Reinforcement Learning | Mar 15, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning with Dynamic Boltzmann Softmax Updates | Mar 14, 2019 | Atari GamesQ-Learning | CodeCode Available | 0 |
| Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces | Mar 12, 2019 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft | Mar 11, 2019 | MinecraftQ-Learning | CodeCode Available | 0 |
| Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control | Mar 11, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Successive Over Relaxation Q-Learning | Mar 9, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Learning Heuristics over Large Graphs via Deep Reinforcement Learning | Mar 8, 2019 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| Distributed Edge Caching via Reinforcement Learning in Fog Radio Access Networks | Feb 27, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Unifying Ensemble Methods for Q-learning via Social Choice Theory | Feb 27, 2019 | DiversityQ-Learning | —Unverified | 0 |
| Diagnosing Bottlenecks in Deep Q-learning Algorithms | Feb 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Optimal and Fast Real-time Resources Slicing with Deep Dueling Neural Networks | Feb 26, 2019 | Combinatorial OptimizationQ-Learning | —Unverified | 0 |
| Distributionally Robust Reinforcement Learning | Feb 23, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Autonomous Airline Revenue Management: A Deep Reinforcement Learning Approach to Seat Inventory Control and Overbooking | Feb 18, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Heuristics, Answer Set Programming and Markov Decision Process for Solving a Set of Spatial Puzzles | Feb 16, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Long and Short Memory Balancing in Visual Co-Tracking using Q-Learning | Feb 14, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Sample-Optimal Parametric Q-Learning Using Linearly Additive Features | Feb 13, 2019 | Q-Learning | —Unverified | 0 |
| Learning Best Response Strategies for Agents in Ad Exchanges | Feb 10, 2019 | Q-Learning | —Unverified | 0 |
| Dynamic-Weighted Simplex Strategy for Learning Enabled Cyber Physical Systems | Feb 6, 2019 | Autonomous DrivingQ-Learning | CodeCode Available | 0 |
| Finite-Sample Analysis for SARSA with Linear Function Approximation | Feb 6, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| A Theory of Regularized Markov Decision Processes | Jan 31, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Privacy-preserving Q-Learning with Functional Noise in Continuous State Spaces | Jan 30, 2019 | Privacy PreservingQ-Learning | CodeCode Available | 0 |
| Making Deep Q-learning methods robust to time discretization | Jan 28, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |