| Battery and Hydrogen Energy Storage Control in a Smart Energy Network with Flexible Energy Demand using Deep Reinforcement Learning | Aug 26, 2022 | Deep Reinforcement LearningScheduling | —Unverified | 0 |
| A Machine Learning Approach to Routing | Aug 10, 2017 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning | May 13, 2021 | Board GamesDeep Reinforcement Learning | —Unverified | 0 |
| Climate Change Policy Exploration using Reinforcement Learning | Oct 23, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading | Apr 27, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Benchmarking Feature Extractors for Reinforcement Learning-Based Semiconductor Defect Localization | Nov 18, 2023 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning | Sep 22, 2021 | Autonomous DrivingBenchmarking | —Unverified | 0 |
| Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms | Jan 1, 2021 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| A Deep Actor-Critic Reinforcement Learning Framework for Dynamic Multichannel Access | Aug 20, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 |
| Alzheimers Disease Diagnosis using Machine Learning: A Review | Apr 17, 2023 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| BASIL: Best-Action Symbolic Interpretable Learning for Evolving Compact RL Policies | May 31, 2025 | AcrobotDeep Reinforcement Learning | —Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions | Jan 14, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban | Oct 3, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Multi-Agent Deep Reinforcement Learning Approach for a Distributed Energy Marketplace in Smart Grids | Sep 23, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search | Feb 1, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Beyond Traditional DoE: Deep Reinforcement Learning for Optimizing Experiments in Model Identification of Battery Dynamics | Oct 12, 2023 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Adaptive Transit Signal Priority based on Deep Reinforcement Learning and Connected Vehicles in a Traffic Microsimulation Environment | Jul 31, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning | Jul 7, 2025 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning | Sep 1, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios | Jun 19, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems | Mar 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bi-Level Control of Weaving Sections in Mixed Traffic Environments with Connected and Automated Vehicles | Mar 24, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment | Mar 22, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |