| Enhancing Robot Assistive Behaviour with Reinforcement Learning and Theory of Mind | Nov 11, 2024 | Q-Learning | Code Available | 0 | 5 |
| Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment | Jul 20, 2023 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| A Fairness-Oriented Reinforcement Learning Approach for the Operation and Control of Shared Micromobility Services | Mar 23, 2024 | Fairness, Q-Learning | Code Available | 0 | 5 |
| Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods | Sep 22, 2021 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| DynamicLight: Two-Stage Dynamic Traffic Signal Timing | Nov 2, 2022 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms | Sep 1, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Feb 18, 2018 | Deep Reinforcement Learning, Management | Code Available | 0 | 5 |
| Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer | Feb 4, 2025 | Q-Learning, SMAC | Code Available | 0 | 5 |
| Dynamic control of self-assembly of quasicrystalline structures through reinforcement learning | Sep 13, 2023 | Q-Learning, reinforcement-learning | Code Available | 0 | 5 |
| Efficient Model-free Reinforcement Learning in Metric Spaces | May 1, 2019 | Q-Learning, reinforcement-learning | Code Available | 0 | 5 |
| Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement | Oct 22, 2018 | Policy Gradient Methods, Q-Learning | Code Available | 0 | 5 |
| GAN Q-learning | May 13, 2018 | Distributional Reinforcement Learning, OpenAI Gym | Code Available | 0 | 5 |
| Double Q-PID algorithm for mobile robot control | Nov 1, 2018 | Active Learning, Q-Learning | Code Available | 0 | 5 |
| Generalized Value Iteration Networks: Life Beyond Lattices | Jun 8, 2017 | Q-Learning | Code Available | 0 | 5 |
| Distributionally Robust Deep Q-Learning | May 25, 2025 | Q-Learning | Code Available | 0 | 5 |
| Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning | Sep 10, 2024 | Deep Reinforcement Learning, OpenAI Gym | Code Available | 0 | 5 |
| A Framework for Automated Cellular Network Tuning with Reinforcement Learning | Aug 13, 2018 | Management, Q-Learning | Code Available | 0 | 5 |
| Group Equivariant Deep Reinforcement Learning | Jul 1, 2020 | Deep Reinforcement Learning, Inductive Bias | Code Available | 0 | 5 |
| Active exploration in parameterized reinforcement learning | Oct 6, 2016 | Meta-Learning, Q-Learning | Code Available | 0 | 5 |
| Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet | Dec 15, 2022 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 | 5 |
| AFU: Actor-Free critic Updates in off-policy RL for continuous control | Apr 24, 2024 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery | Dec 7, 2019 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| DRL4AOI: A DRL Framework for Semantic-aware AOI Segmentation in Location-Based Services | Dec 6, 2024 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization | Dec 10, 2023 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction | Jun 3, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Implications of Decentralized Q-learning Resource Allocation in Wireless Networks | May 30, 2017 | Q-Learning, Reinforcement Learning | Code Available | 0 | 5 |
| A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning | Jun 22, 2017 | Action Detection, Position | Code Available | 0 | 5 |
| A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications | Jan 12, 2024 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Agent Performing Autonomous Stock Trading under Good and Bad Situations | Jun 6, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Information-Theoretic State Variable Selection for Reinforcement Learning | Jan 21, 2024 | Decision Making, feature selection | Code Available | 0 | 5 |
| Inverse Q-Learning Done Right: Offline Imitation Learning in Q^π-Realizable MDPs | May 26, 2025 | Imitation Learning, Q-Learning | Code Available | 0 | 5 |
| Investigating the Performance and Reliability, of the Q-Learning Algorithm in Various Unknown Environments | Dec 19, 2023 | OpenAI Gym, Pathfinder | Code Available | 0 | 5 |
| Mastering Percolation-like Games with Deep Learning | May 12, 2023 | Deep Learning, Q-Learning | Code Available | 0 | 5 |
| Assessing the Potential of Classical Q-learning in General Game Playing | Oct 14, 2018 | Board Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Assumed Density Filtering Q-learning | Dec 9, 2017 | Atari Games, Bayesian Inference | Code Available | 0 | 5 |
| Diagnosing Bottlenecks in Deep Q-learning Algorithms | Feb 26, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines | Jun 4, 2020 | Q-Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Sep 15, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression | May 28, 2024 | Imitation Learning, MuJoCo | Code Available | 0 | 5 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous Driving, Autonomous Navigation | Code Available | 0 | 5 |
| Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach | Oct 12, 2023 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Learning Simple Algorithms from Examples | Nov 23, 2015 | Q-Learning | Code Available | 0 | 5 |
| A DQN-based Approach to Finding Precise Evidences for Fact Verification | Aug 1, 2021 | Claim Verification, Fact Verification | Code Available | 0 | 5 |
| Learning to Communicate with Deep Multi-Agent Reinforcement Learning | May 21, 2016 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening | Nov 5, 2016 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic Arm | Aug 28, 2023 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Active inference: demystified and compared | Sep 24, 2019 | Atari Games, OpenAI Gym | Code Available | 0 | 5 |
| Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Introspective Experience Replay: Look Back When Surprised | Jun 7, 2022 | Q-Learning, reinforcement-learning | Code Available | 0 | 5 |
| Active Collection of Well-Being and Health Data in Mobile Devices | Jul 7, 2023 | Q-Learning, Reinforcement Learning (RL) | Code Available | 0 | 5 |