| Transferred Q-learning | Feb 9, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Intelligent Autonomous Intersection Management | Feb 9, 2022 | Autonomous VehiclesManagement | —Unverified | 0 |
| Multiple Correlated Jammers Nullification using LSTM-based Deep Dueling Neural Network | Feb 8, 2022 | Q-Learning | —Unverified | 0 |
| Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning | Feb 6, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise | Jan 28, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning with Spiking Q-learning | Jan 21, 2022 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Optimal variance-reduced stochastic approximation in Banach spaces | Jan 21, 2022 | Q-Learning | —Unverified | 0 |
| Deep Q-learning: a robust control approach | Jan 21, 2022 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning | Jan 16, 2022 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Criticality-Based Varying Step-Number Algorithm for Reinforcement Learning | Jan 13, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Task Independent Capsule-Based Agents for Deep Q-Learning | Jan 11, 2022 | Deep Reinforcement LearningObject Recognition | —Unverified | 0 |
| Age-of-information minimization via opportunistic sampling by an energy harvesting source | Jan 8, 2022 | Q-Learning | —Unverified | 0 |
| Sales Time Series Analytics Using Deep Q-Learning | Jan 6, 2022 | Active LearningDecision Making | —Unverified | 0 |
| Reinforcement Learning for Task Specifications with Action-Constraints | Jan 2, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning | Jan 1, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| A Statistical Analysis of Polyak-Ruppert Averaged Q-learning | Dec 29, 2021 | Q-Learning | CodeCode Available | 0 |
| A Graph Attention Learning Approach to Antenna Tilt Optimization | Dec 27, 2021 | Graph AttentionQ-Learning | —Unverified | 0 |
| Task and Model Agnostic Adversarial Attack on Graph Neural Networks | Dec 25, 2021 | Adversarial AttackQ-Learning | CodeCode Available | 0 |
| Aerial Base Station Positioning and Power Control for Securing Communications: A Deep Q-Network Approach | Dec 21, 2021 | PositionQ-Learning | —Unverified | 0 |
| Amortized Noisy Channel Neural Machine Translation | Dec 16, 2021 | Imitation LearningKnowledge Distillation | —Unverified | 0 |
| Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games | Dec 15, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Teaching a Robot to Walk Using Reinforcement Learning | Dec 13, 2021 | OpenAI GymQ-Learning | —Unverified | 0 |
| Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control | Dec 11, 2021 | OpenAI GymQ-Learning | —Unverified | 0 |
| Quantum Architecture Search via Continual Reinforcement Learning | Dec 10, 2021 | Continual LearningDeep Reinforcement Learning | —Unverified | 0 |
| High-Dimensional Stock Portfolio Trading with Deep Reinforcement Learning | Dec 9, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Replay For Safety | Dec 8, 2021 | Q-Learning | —Unverified | 0 |
| Deep Q-Learning Market Makers in a Multi-Agent Simulated Stock Market | Dec 8, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Convergence Results For Q-Learning With Experience Replay | Dec 8, 2021 | Q-Learning | —Unverified | 0 |
| Pragmatic Implementation of Reinforcement Algorithms For Path Finding On Raspberry Pi | Dec 7, 2021 | Collision AvoidanceQ-Learning | —Unverified | 0 |
| A Risk-Averse Preview-based Q-Learning Algorithm: Application to Highway Driving of Autonomous Vehicles | Dec 6, 2021 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| Finite Sample Analysis of Average-Reward TD Learning and Q-Learning | Dec 1, 2021 | Q-Learning | —Unverified | 0 |
| Faster Non-asymptotic Convergence for Double Q-learning | Dec 1, 2021 | Q-Learning | —Unverified | 0 |
| Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning | Nov 30, 2021 | Autonomous VehiclesQ-Learning | CodeCode Available | 0 |
| Continuous Control With Ensemble Deep Deterministic Policy Gradients | Nov 30, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Final Adaptation Reinforcement Learning for N-Player Games | Nov 29, 2021 | Board GamesQ-Learning | —Unverified | 0 |
| DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks | Nov 29, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning | Nov 28, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Deep Q-Learning based Reinforcement Learning Approach for Network Intrusion Detection | Nov 27, 2021 | Intrusion DetectionNetwork Intrusion Detection | CodeCode Available | 0 |
| Multicrew Scheduling and Routing in Road Network Restoration Based on Deep Q-learning | Nov 24, 2021 | Q-LearningScheduling | —Unverified | 0 |
| Reversible Action Design for Combinatorial Optimization with ReinforcementLearning | Nov 24, 2021 | Combinatorial OptimizationQ-Learning | —Unverified | 0 |
| The Impact of Data Distribution on Q-learning with Function Approximation | Nov 23, 2021 | DiversityQ-Learning | CodeCode Available | 0 |
| Multi-agent Bayesian Deep Reinforcement Learning for Microgrid Energy Management under Communication Failures | Nov 22, 2021 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | Nov 17, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Compressive Features in Offline Reinforcement Learning for Recommender Systems | Nov 16, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 |
| Consecutive Task-oriented Dialog Policy Learning | Nov 16, 2021 | Continual LearningManagement | —Unverified | 0 |
| Where to Look: A Unified Attention Model for Visual Recognition with Reinforcement Learning | Nov 13, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity | Nov 12, 2021 | Q-LearningQuantization | —Unverified | 0 |
| On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods | Nov 8, 2021 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| Supervised Advantage Actor-Critic for Recommender Systems | Nov 5, 2021 | Q-LearningRecommendation Systems | —Unverified | 0 |