| Bounded Exploration with World Model Uncertainty in Soft Actor-Critic Reinforcement Learning Algorithm | Dec 9, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Bounded Myopic Adversaries for Deep Reinforcement Learning Agents | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Branching Dueling Q-Network Based Online Scheduling of a Microgrid With Distributed Energy Storage Systems | May 27, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Breaking (Global) Barriers in Parallel Stochastic Optimization with Wait-Avoiding Group Averaging | Apr 30, 2020 | Deep Reinforcement LearningMachine Translation | —Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning | Oct 29, 2021 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Bridging Declarative, Procedural, and Conditional Metacognitive Knowledge Gap Using Deep Reinforcement Learning | Apr 23, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bridging Econometrics and AI: VaR Estimation via Reinforcement Learning and GARCH Models | Apr 23, 2025 | Deep Reinforcement LearningEconometrics | —Unverified | 0 |
| Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search | Feb 1, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Adaptive Transit Signal Priority based on Deep Reinforcement Learning and Connected Vehicles in a Traffic Microsimulation Environment | Jul 31, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems | Mar 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment | Mar 22, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Bridging the Gap Between Target Networks and Functional Regularization | Oct 21, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| AlphaStock: A Buying-Winners-and-Selling-Losers Investment Strategy using Interpretable Deep Reinforcement Attention Networks | Jul 24, 2019 | Deep AttentionDeep Reinforcement Learning | —Unverified | 0 |
| Bridging Transient and Steady-State Performance in Voltage Control: A Reinforcement Learning Approach with Safe Gradient Flow | Mar 20, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Broad Critic Deep Actor Reinforcement Learning for Continuous Control | Nov 24, 2024 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Combinatorial Keyword Recommendations for Sponsored Search with Deep Reinforcement Learning | Jul 18, 2019 | ClusteringCombinatorial Optimization | —Unverified | 0 |
| Buffer-aware Wireless Scheduling based on Deep Reinforcement Learning | Nov 13, 2019 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Buffer Pool Aware Query Scheduling via Deep Reinforcement Learning | Jul 21, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Combinatorial Rising Bandit | Dec 1, 2024 | Deep Reinforcement LearningRecommendation Systems | —Unverified | 0 |
| The f-Divergence Reinforcement Learning Framework | Sep 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Building Decision Forest via Deep Reinforcement Learning | Apr 1, 2022 | Binary ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation | Oct 11, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Building Safer Autonomous Agents by Leveraging Risky Driving Behavior Knowledge | Mar 16, 2021 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Balancing SoC in Battery Cells using Safe Action Perturbations | Mar 11, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |