| Sequential Learning-based IaaS Composition | Feb 24, 2021 | Clustering, Q-Learning | Unverified | 0 |
| Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments | Feb 24, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| Greedy-Step Off-Policy Reinforcement Learning | Feb 23, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Understanding algorithmic collusion with experience replay | Feb 18, 2021 | Q-Learning | Code Available | 0 |
| A Discrete-Time Switching System Analysis of Q-learning | Feb 17, 2021 | Q-Learning | Unverified | 0 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Cooperation and Reputation Dynamics with Reinforcement Learning | Feb 15, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reversible Action Design for Combinatorial Optimization with Reinforcement Learning | Feb 14, 2021 | Combinatorial Optimization, Q-Learning | Unverified | 0 |
| Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis | Feb 12, 2021 | Natural Questions, Q-Learning | Unverified | 0 |
| Hedging of Financial Derivative Contracts via Monte Carlo Tree Search | Feb 11, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States | Feb 10, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19 | Feb 9, 2021 | Benchmarking, Q-Learning | Code Available | 1 |
| Model-Augmented Q-learning | Feb 7, 2021 | model, Q-Learning | Unverified | 0 |
| Experience-Based Heuristic Search: Robust Motion Planning with Deep Q-Learning | Feb 5, 2021 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Revisiting Prioritized Experience Replay: A Value Perspective | Feb 5, 2021 | Atari Games, Q-Learning | Code Available | 0 |
| Deep reinforcement learning-based image classification achieves perfect testing set accuracy for MRI brain tumors with a training set of only 30 images | Feb 4, 2021 | Classification, Deep Reinforcement Learning | Unverified | 0 |
| A review of motion planning algorithms for intelligent robotics | Feb 4, 2021 | Motion Planning, Q-Learning | Unverified | 0 |
| A step toward a reinforcement learning de novo genome assembler | Feb 2, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants | Feb 2, 2021 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| QoS-Aware Power Minimization of Distributed Many-Core Servers using Transfer Q-Learning | Feb 2, 2021 | Q-Learning | Unverified | 0 |
| Variation-resistant Q-learning: Controlling and Utilizing Estimation Bias in Reinforcement Learning for Better Performance | Feb 1, 2021 | Q-Learning, reinforcement-learning | Code Available | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision Making, Q-Learning | Unverified | 0 |
| Acting in Delayed Environments with Non-Stationary Markov Policies | Jan 28, 2021 | Cloud Computing, Q-Learning | Code Available | 1 |
| Reinforcement Learning based Per-antenna Discrete Power Control for Massive MIMO Systems | Jan 28, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning Assisted Beamforming for Inter-cell Interference Mitigation in 5G Massive MIMO Networks | Jan 27, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Robust Android Malware Detection System against Adversarial Attacks using Q-Learning | Jan 27, 2021 | Adversarial Defense, Android Malware Detection | Unverified | 0 |
| Channel Estimation via Successive Denoising in MIMO OFDM Systems: A Reinforcement Learning Approach | Jan 25, 2021 | Denoising, Q-Learning | Unverified | 0 |
| Solving optimal stopping problems with Deep Q-Learning | Jan 24, 2021 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Fire Threat Detection From Videos with Q-Rough Sets | Jan 21, 2021 | Q-Learning, Segmentation | Unverified | 0 |
| Breaking the Deadly Triad with a Target Network | Jan 21, 2021 | Q-Learning | Unverified | 0 |
| Reinforcement learning based recommender systems: A survey | Jan 15, 2021 | Collaborative Filtering, Deep Reinforcement Learning | Unverified | 0 |
| Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Jan 15, 2021 | MuJoCo, Q-Learning | Code Available | 1 |
| Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems | Jan 13, 2021 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Learning Augmented Index Policy for Optimal Service Placement at the Network Edge | Jan 10, 2021 | Q-Learning | Unverified | 0 |
| Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for MANETs | Jan 9, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Safe Coupled Deep Q-Learning for Recommendation Systems | Jan 8, 2021 | Q-Learning, Recommendation Systems | Unverified | 0 |
| Simulating SQL Injection Vulnerability Exploitation Using Q-Learning Reinforcement Learning Agents | Jan 8, 2021 | Q-Learning, reinforcement-learning | Code Available | 1 |
| Deep Reinforcement Learning-based Anti-jamming Power Allocation in a Two-cell NOMA Network | Jan 1, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Multi-Agent Trust Region Learning | Jan 1, 2021 | Atari Games, MuJoCo | Code Available | 1 |
| Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity | Jan 1, 2021 | Diversity, Q-Learning | Unverified | 0 |
| Learning Movement Strategies for Moving Target Defense | Jan 1, 2021 | Q-Learning | Unverified | 0 |
| Uncertainty Weighted Offline Reinforcement Learning | Jan 1, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Optimistic Exploration with Backward Bootstrapped Bonus for Deep Reinforcement Learning | Jan 1, 2021 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RL, MuJoCo | Unverified | 0 |
| Deep Q Learning from Dynamic Demonstration with Behavioral Cloning | Jan 1, 2021 | Deep Reinforcement Learning, OpenAI Gym | Unverified | 0 |
| Deep Q-Learning with Low Switching Cost | Jan 1, 2021 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Double Q-learning: New Analysis and Sharper Finite-time Bound | Jan 1, 2021 | Q-Learning | Unverified | 0 |
| Success-Rate Targeted Reinforcement Learning by Disorientation Penalty | Jan 1, 2021 | Decision Making, Q-Learning | Unverified | 0 |
| Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates | Jan 1, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Disentangled Planning and Control in Vision Based Robotics via Reward Machines | Dec 28, 2020 | Q-Learning | Unverified | 0 |