| A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management | Mar 2, 2022 | ManagementQ-Learning | —Unverified | 0 |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Feb 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation | Feb 26, 2022 | Edge-computingQ-Learning | —Unverified | 0 |
| Autonomous Warehouse Robot using Deep Q-Learning | Feb 21, 2022 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| PooL: Pheromone-inspired Communication Framework forLarge Scale Multi-Agent Reinforcement Learning | Feb 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| UAV Base Station Trajectory Optimization Based on Reinforcement Learning in Post-disaster Search and Rescue Operations | Feb 17, 2022 | ClusteringQ-Learning | —Unverified | 0 |
| Goal Recognition as Reinforcement Learning | Feb 13, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Artificial Intelligence and Auction Design | Feb 12, 2022 | Q-Learning | —Unverified | 0 |
| Microservice Deployment in Edge Computing Based on Deep Q Learning | Feb 11, 2022 | Edge-computingQ-Learning | CodeCode Available | 1 |
| Regularized Q-learning | Feb 11, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Transferred Q-learning | Feb 9, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Intelligent Autonomous Intersection Management | Feb 9, 2022 | Autonomous VehiclesManagement | —Unverified | 0 |
| Multiple Correlated Jammers Nullification using LSTM-based Deep Dueling Neural Network | Feb 8, 2022 | Q-Learning | —Unverified | 0 |
| Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning | Feb 6, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise | Jan 28, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Q-learning: a robust control approach | Jan 21, 2022 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| Optimal variance-reduced stochastic approximation in Banach spaces | Jan 21, 2022 | Q-Learning | —Unverified | 0 |
| Deep Reinforcement Learning with Spiking Q-learning | Jan 21, 2022 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Addressing Maximization Bias in Reinforcement Learning with Two-Sample Testing | Jan 20, 2022 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning | Jan 16, 2022 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning | Jan 13, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Task Independent Capsule-Based Agents for Deep Q-Learning | Jan 11, 2022 | Deep Reinforcement LearningObject Recognition | —Unverified | 0 |
| Age-of-information minimization via opportunistic sampling by an energy harvesting source | Jan 8, 2022 | Q-Learning | —Unverified | 0 |
| Sales Time Series Analytics Using Deep Q-Learning | Jan 6, 2022 | Active LearningDecision Making | —Unverified | 0 |
| Reinforcement Learning for Task Specifications with Action-Constraints | Jan 2, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning | Jan 1, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Statistical Analysis of Polyak-Ruppert Averaged Q-learning | Dec 29, 2021 | Q-Learning | CodeCode Available | 0 |
| A Graph Attention Learning Approach to Antenna Tilt Optimization | Dec 27, 2021 | Graph AttentionQ-Learning | —Unverified | 0 |
| Task and Model Agnostic Adversarial Attack on Graph Neural Networks | Dec 25, 2021 | Adversarial AttackQ-Learning | CodeCode Available | 0 |
| Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning | Dec 23, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Aerial Base Station Positioning and Power Control for Securing Communications: A Deep Q-Network Approach | Dec 21, 2021 | PositionQ-Learning | —Unverified | 0 |
| Amortized Noisy Channel Neural Machine Translation | Dec 16, 2021 | Imitation LearningKnowledge Distillation | —Unverified | 0 |
| Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games | Dec 15, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Teaching a Robot to Walk Using Reinforcement Learning | Dec 13, 2021 | OpenAI GymQ-Learning | —Unverified | 0 |
| Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control | Dec 11, 2021 | OpenAI GymQ-Learning | —Unverified | 0 |
| Quantum Architecture Search via Continual Reinforcement Learning | Dec 10, 2021 | Continual LearningDeep Reinforcement Learning | —Unverified | 0 |
| High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning | Dec 9, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Q-Learning Market Makers in a Multi-Agent Simulated Stock Market | Dec 8, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives | Dec 8, 2021 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Convergence Results For Q-Learning With Experience Replay | Dec 8, 2021 | Q-Learning | —Unverified | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Replay For Safety | Dec 8, 2021 | Q-Learning | —Unverified | 0 |
| Pragmatic Implementation of Reinforcement Algorithms For Path Finding On Raspberry Pi | Dec 7, 2021 | Collision AvoidanceQ-Learning | —Unverified | 0 |
| A Risk-Averse Preview-based Q-Learning Algorithm: Application to Highway Driving of Autonomous Vehicles | Dec 6, 2021 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Dec 1, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Finite Sample Analysis of Average-Reward TD Learning and Q-Learning | Dec 1, 2021 | Q-Learning | —Unverified | 0 |
| Faster Non-asymptotic Convergence for Double Q-learning | Dec 1, 2021 | Q-Learning | —Unverified | 0 |
| Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning | Nov 30, 2021 | Autonomous VehiclesQ-Learning | CodeCode Available | 0 |
| Continuous Control With Ensemble Deep Deterministic Policy Gradients | Nov 30, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks | Nov 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |