| Reinforcement Learning for Learning of Dynamical Systems in Uncertain Environment: a Tutorial | May 19, 2019 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning for Mean Field Games, with Applications to Economics | Jun 25, 2021 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning for Mixed-Integer Problems Based on MPC | Apr 3, 2020 | Model Predictive Control, Q-Learning | Unverified | 0 |
| Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study | Mar 20, 2024 | Autonomous Driving, Q-Learning | Unverified | 0 |
| Reinforcement Learning for Optimal Control of a District Cooling Energy Plant | Mar 14, 2022 | Model Predictive Control, Q-Learning | Unverified | 0 |
| Reinforcement Learning for Optimal Execution when Liquidity is Time-Varying | Feb 19, 2024 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning for Quantum Circuit Design: Using Matrix Representations | Jan 27, 2025 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Reinforcement Learning for Rate Maximization in IRS-aided OWC Networks | Sep 7, 2024 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning for Resilient Power Grids | Dec 8, 2022 | Q-Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning for Resource Allocation in Steerable Laser-based Optical Wireless Systems | Jun 21, 2021 | Management, Q-Learning | Unverified | 0 |
| Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction | May 15, 2019 | Management, OpenAI Gym | Unverified | 0 |
| Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer | Feb 4, 2025 | Q-Learning, SMAC | Code Available | 0 |
| Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge | Dec 19, 2023 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Dynamic control of self-assembly of quasicrystalline structures through reinforcement learning | Sep 13, 2023 | Q-Learning, reinforcement-learning | Code Available | 0 |
| AFU: Actor-Free critic Updates in off-policy RL for continuous control | Apr 24, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| DynamicLight: Two-Stage Dynamic Traffic Signal Timing | Nov 2, 2022 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Deep Reinforcement Learning Based Parameter Control in Differential Evolution | May 20, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications | Jan 12, 2024 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Learning Heuristics over Large Graphs via Deep Reinforcement Learning | Mar 8, 2019 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 0 |
| Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction | Jun 3, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Reinforcement Learning for Physical Layer Communications | Jun 22, 2021 | Deep Reinforcement Learning, Multi-Armed Bandits | Code Available | 0 |
| Task and Model Agnostic Adversarial Attack on Graph Neural Networks | Dec 25, 2021 | Adversarial Attack, Q-Learning | Code Available | 0 |
| Towards Better Interpretability in Deep Q-Networks | Sep 15, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| A Framework for Automated Cellular Network Tuning with Reinforcement Learning | Aug 13, 2018 | Management, Q-Learning | Code Available | 0 |
| Stabilizing Extreme Q-learning by Maclaurin Expansion | Jun 7, 2024 | D4RL, Offline RL | Code Available | 0 |
| Learning Principle of Least Action with Reinforcement Learning | Nov 24, 2020 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach | Oct 12, 2023 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Scalable Online Exploration via Coverability | Mar 11, 2024 | Efficient Exploration, Q-Learning | Code Available | 0 |
| DRL4AOI: A DRL Framework for Semantic-aware AOI Segmentation in Location-Based Services | Dec 6, 2024 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Feb 18, 2018 | Deep Reinforcement Learning, Management | Code Available | 0 |
| ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations | Oct 26, 2019 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Efficient Model-free Reinforcement Learning in Metric Spaces | May 1, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 |
| Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences | Aug 28, 2023 | Image Reconstruction, Q-Learning | Code Available | 0 |
| Towards Empathic Deep Q-Learning | Jun 26, 2019 | Ethics, Q-Learning | Code Available | 0 |
| Learning Simple Algorithms from Examples | Nov 23, 2015 | Q-Learning | Code Available | 0 |
| Learning State Abstractions for Transfer in Continuous Control | Feb 8, 2020 | continuous-control, Continuous Control | Code Available | 0 |
| Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization | Dec 10, 2023 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera | Mar 13, 2023 | Privacy Preserving, Q-Learning | Code Available | 0 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| Deep Recurrent Q-Learning vs Deep Q-Learning on a simple Partially Observable Markov Decision Process with Minecraft | Mar 11, 2019 | Minecraft, Q-Learning | Code Available | 0 |
| Temporal-Difference Learning Using Distributed Error Signals | Nov 6, 2024 | Q-Learning | Code Available | 0 |
| A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning | Jun 22, 2017 | Action Detection, Position | Code Available | 0 |
| Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis | Oct 31, 2024 | Q-Learning | Code Available | 0 |
| Learning to Communicate with Deep Multi-Agent Reinforcement Learning | May 21, 2016 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 |
| Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi | Aug 20, 2023 | Game of Hanabi, Multi-agent Reinforcement Learning | Code Available | 0 |
| A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem | May 25, 2021 | Position, Q-Learning | Code Available | 0 |
| SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference | Oct 15, 2019 | Q-Learning, Reinforcement Learning | Code Available | 0 |
| Towards Model-based Reinforcement Learning for Industry-near Environments | Jul 27, 2019 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 0 |
| A Deep Q-Learning Agent for the L-Game with Variable Batch Training | Feb 17, 2018 | Q-Learning, Self-Learning | Code Available | 0 |
| A Deep Learning Approach to Grasping the Invisible | Sep 11, 2019 | Deep Learning, Q-Learning | Code Available | 0 |