| Average Reward Reinforcement Learning with Monotonic Policy Improvement | Jan 1, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning | Jul 1, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| A Vision Based Deep Reinforcement Learning Algorithm for UAV Obstacle Avoidance | Mar 11, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| A Visual Communication Map for Multi-Agent Deep Reinforcement Learning | Feb 27, 2020 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Avoidance Navigation Based on Offline Pre-Training Reinforcement Learning | Aug 3, 2023 | Deep Reinforcement Learning, Navigate | —Unverified | 0 | 0 |
| Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| AWD3: Dynamic Reduction of the Estimation Bias | Nov 12, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition | May 5, 2025 | Collaborative Inference, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models | May 30, 2025 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| A Zero-Shot Reinforcement Learning Strategy for Autonomous Guidewire Navigation | Mar 5, 2024 | Deep Reinforcement Learning, Navigate | —Unverified | 0 | 0 |
| Bach2Bach: Generating Music Using A Deep Reinforcement Learning Approach | Dec 3, 2018 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches | Jun 16, 2022 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| BACKDOORL: Backdoor Attack against Competitive Reinforcement Learning | May 2, 2021 | Atari Games, Backdoor Attack | —Unverified | 0 | 0 |
| Backdoors in DRL: Four Environments Focusing on In-distribution Triggers | May 22, 2025 | Backdoor Attack, Data Poisoning | —Unverified | 0 | 0 |
| Balance Between Efficient and Effective Learning: Dense2Sparse Reward Shaping for Robot Manipulation with Environment Uncertainty | Mar 5, 2020 | Deep Reinforcement Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Balancing SoC in Battery Cells using Safe Action Perturbations | Mar 11, 2025 | Deep Reinforcement Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment | Mar 22, 2025 | Deep Reinforcement Learning, Fairness | —Unverified | 0 | 0 |
| Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems | Mar 26, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| BASIL: Best-Action Symbolic Interpretable Learning for Evolving Compact RL Policies | May 31, 2025 | Acrobot, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement Learning, Distributional Reinforcement Learning | —Unverified | 0 | 0 |
| BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading | Apr 27, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Battery and Hydrogen Energy Storage Control in a Smart Energy Network with Flexible Energy Demand using Deep Reinforcement Learning | Aug 26, 2022 | Deep Reinforcement Learning, Scheduling | —Unverified | 0 | 0 |
| Battery Model Calibration with Deep Reinforcement Learning | Dec 7, 2020 | BIG-bench Machine Learning, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics | Jul 21, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |