| Bounded Exploration with World Model Uncertainty in Soft Actor-Critic Reinforcement Learning Algorithm | Dec 9, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Bounded Myopic Adversaries for Deep Reinforcement Learning Agents | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Branching Dueling Q-Network Based Online Scheduling of a Microgrid With Distributed Energy Storage Systems | May 27, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Breaking (Global) Barriers in Parallel Stochastic Optimization with Wait-Avoiding Group Averaging | Apr 30, 2020 | Deep Reinforcement LearningMachine Translation | —Unverified | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 |
| Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning | Oct 29, 2021 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Bridging Declarative, Procedural, and Conditional Metacognitive Knowledge Gap Using Deep Reinforcement Learning | Apr 23, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bridging Econometrics and AI: VaR Estimation via Reinforcement Learning and GARCH Models | Apr 23, 2025 | Deep Reinforcement LearningEconometrics | —Unverified | 0 |
| Alzheimers Disease Diagnosis using Machine Learning: A Review | Apr 17, 2023 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| BASIL: Best-Action Symbolic Interpretable Learning for Evolving Compact RL Policies | May 31, 2025 | AcrobotDeep Reinforcement Learning | —Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Bridging the gap between Markowitz planning and deep reinforcement learning | Sep 30, 2020 | Asset ManagementAutonomous Driving | —Unverified | 0 |
| Bridging the Gap Between Target Networks and Functional Regularization | Oct 21, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search | Feb 1, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Bridging Transient and Steady-State Performance in Voltage Control: A Reinforcement Learning Approach with Safe Gradient Flow | Mar 20, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Broad Critic Deep Actor Reinforcement Learning for Continuous Control | Nov 24, 2024 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Adaptive Transit Signal Priority based on Deep Reinforcement Learning and Connected Vehicles in a Traffic Microsimulation Environment | Jul 31, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Buffer-aware Wireless Scheduling based on Deep Reinforcement Learning | Nov 13, 2019 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Buffer Pool Aware Query Scheduling via Deep Reinforcement Learning | Jul 21, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems | Mar 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment | Mar 22, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Building Decision Forest via Deep Reinforcement Learning | Apr 1, 2022 | Binary ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation | Oct 11, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Building Safer Autonomous Agents by Leveraging Risky Driving Behavior Knowledge | Mar 16, 2021 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| By Fair Means or Foul: Quantifying Collusion in a Market Simulation with Deep Reinforcement Learning | Jun 4, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Security-Aware Service Acquisition in IoT | Apr 4, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Caching-at-STARS: the Next Generation Edge Caching | Aug 1, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach | Aug 1, 2024 | Deep Reinforcement LearningTraffic Signal Control | —Unverified | 0 |
| AlphaStock: A Buying-Winners-and-Selling-Losers Investment Strategy using Interpretable Deep Reinforcement Attention Networks | Jul 24, 2019 | Deep AttentionDeep Reinforcement Learning | —Unverified | 0 |
| Evading Community Detection via Counterfactual Neighborhood Search | Oct 13, 2023 | Community Detectioncounterfactual | —Unverified | 0 |
| CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning | Mar 23, 2025 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Anderson Acceleration for Reinforcement Learning | Sep 25, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Comparing Approaches to Distributed Control of Fluid Systems based on Multi-Agent Systems | Dec 16, 2022 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 |
| Balancing SoC in Battery Cells using Safe Action Perturbations | Mar 11, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Balance Between Efficient and Effective Learning: Dense2Sparse Reward Shaping for Robot Manipulation with Environment Uncertainty | Mar 5, 2020 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Can a Robot Become a Movie Director? Learning Artistic Principles for Aerial Cinematography | Apr 4, 2019 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 |
| Can a Robot Trust You? A DRL-Based Approach to Trust-Driven Human-Guided Navigation | Nov 1, 2020 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Can Artificial Intelligence Trade the Stock Market? | Jun 5, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| Learning Multi-Agent Coordination through Connectivity-driven Communication | Feb 12, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| AlphaSeq: Sequence Discovery with Deep Reinforcement Learning | Sep 26, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Adaptive trading strategies across liquidity pools | Aug 18, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | Oct 27, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Jun 8, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Dec 1, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Can We Optimize Deep RL Policy Weights as Trajectory Modeling? | Mar 6, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Can You Fix My Neural Network? Real-Time Adaptive Waveform Synthesis for Resilient Wireless Signal Classification | Mar 5, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Carbon emissions and sustainability of launching 5G mobile networks in China | Jun 14, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems | Feb 1, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Carl-Lead: Lidar-based End-to-End Autonomous Driving with Contrastive Deep Reinforcement Learning | Sep 17, 2021 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement LearningScheduling | —Unverified | 0 |