| Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control | Mar 11, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Heuristics, Answer Set Programming and Markov Decision Process for Solving a Set of Spatial Puzzles | Feb 16, 2019 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game | Jun 27, 2019 | OpenAI GymQ-Learning | CodeCode Available | 0 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Jan 6, 2024 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks | May 3, 2021 | Q-Learning | CodeCode Available | 0 |
| Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery | Dec 7, 2019 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Combining No-regret and Q-learning | Oct 7, 2019 | counterfactualQ-Learning | CodeCode Available | 0 |
| Playing Doom with SLAM-Augmented Deep Reinforcement Learning | Dec 1, 2016 | Deep Reinforcement Learningobject-detection | CodeCode Available | 0 |
| Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition | May 21, 1999 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Playing FPS Games with Deep Reinforcement Learning | Sep 18, 2016 | Deep Reinforcement LearningFPS Games | CodeCode Available | 0 |
| Regularized Q-learning through Robust Averaging | May 3, 2024 | Q-Learning | CodeCode Available | 0 |
| Policy Learning for Malaria Control | Oct 20, 2019 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| A DQN-based Approach to Finding Precise Evidences for Fact Verification | Aug 1, 2021 | Claim VerificationFact Verification | CodeCode Available | 0 |
| EASpace: Enhanced Action Space for Policy Transfer | Dec 7, 2022 | Q-LearningTransfer Learning | CodeCode Available | 0 |
| Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations | Mar 6, 2024 | Q-LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| A Statistical Analysis of Polyak-Ruppert Averaged Q-learning | Dec 29, 2021 | Q-Learning | CodeCode Available | 0 |
| Augmented Q Imitation Learning (AQIL) | Mar 31, 2020 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Superior Genetic Algorithms for the Target Set Selection Problem Based on Power-Law Parameter Choices and Simple Greedy Heuristics | Apr 5, 2024 | Q-Learning | CodeCode Available | 0 |
| CytonRL: an Efficient Reinforcement Learning Open-source Toolkit Implemented in C++ | Apr 14, 2018 | GPUQ-Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods | Feb 28, 2018 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Combinational Q-Learning for Dou Di Zhu | Jan 24, 2019 | Atari GamesCard Games | CodeCode Available | 0 |
| POPO: Pessimistic Offline Policy Optimization | Dec 26, 2020 | Offline RLQ-Learning | CodeCode Available | 0 |
| Crowd Intelligence for Early Misinformation Prediction on Social Media | Aug 8, 2024 | Fact CheckingMisinformation | CodeCode Available | 0 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement | Oct 22, 2018 | Policy Gradient MethodsQ-Learning | CodeCode Available | 0 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning | Feb 21, 2023 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Robotic Surgery With Lean Reinforcement Learning | May 3, 2021 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Practical Block-wise Neural Network Architecture Generation | Aug 18, 2017 | image-classificationImage Classification | CodeCode Available | 0 |
| Implications of Decentralized Q-learning Resource Allocation in Wireless Networks | May 30, 2017 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Training Transition Policies via Distribution Matching for Complex Tasks | Oct 8, 2021 | Hierarchical Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to ATARI games | Mar 26, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| A Multi-Agent Multi-Environment Mixed Q-Learning for Partially Decentralized Wireless Network Optimization | Sep 24, 2024 | Q-Learning | CodeCode Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Urban Driving with Multi-Objective Deep Reinforcement Learning | Nov 21, 2018 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | May 19, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning | Apr 17, 2023 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| CleanSurvival: Automated data preprocessing for time-to-event models using reinforcement learning | Feb 6, 2025 | ImputationOutlier Detection | CodeCode Available | 0 |
| Pre-training with Synthetic Data Helps Offline Reinforcement Learning | Oct 1, 2023 | D4RLDeep Reinforcement Learning | CodeCode Available | 0 |
| Increasing the Action Gap: New Operators for Reinforcement Learning | Dec 15, 2015 | Atari GamesQ-Learning | CodeCode Available | 0 |
| Understanding algorithmic collusion with experience replay | Feb 18, 2021 | Q-Learning | CodeCode Available | 0 |
| Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization | Feb 8, 2024 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Information-Directed Exploration for Deep Reinforcement Learning | Dec 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| A disembodied developmental robotic agent called Samu Bátfai | Nov 9, 2015 | Q-LearningReinforcement Learning | CodeCode Available | 0 |
| Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments | Sep 16, 2024 | Audio Signal ProcessingDeep Reinforcement Learning | CodeCode Available | 0 |
| Information-Theoretic State Variable Selection for Reinforcement Learning | Jan 21, 2024 | Decision Makingfeature selection | CodeCode Available | 0 |
| VQC-Based Reinforcement Learning with Data Re-uploading: Performance and Trainability | Jan 21, 2024 | Q-Learningreinforcement-learning | CodeCode Available | 0 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems | May 7, 2024 | CPUGPU | CodeCode Available | 0 |