| Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics | Jul 21, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A machine learning pipeline for autonomous numerical analytic continuation of Dyson-Schwinger equations | Dec 24, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Battery Model Calibration with Deep Reinforcement Learning | Dec 7, 2020 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Battery and Hydrogen Energy Storage Control in a Smart Energy Network with Flexible Energy Demand using Deep Reinforcement Learning | Aug 26, 2022 | Deep Reinforcement LearningScheduling | —Unverified | 0 |
| A Machine Learning Approach to Routing | Aug 10, 2017 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Benchmarking Feature Extractors for Reinforcement Learning-Based Semiconductor Defect Localization | Nov 18, 2023 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning | Sep 22, 2021 | Autonomous DrivingBenchmarking | —Unverified | 0 |
| Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms | Jan 1, 2021 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| A Deep Actor-Critic Reinforcement Learning Framework for Dynamic Multichannel Access | Aug 20, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning | May 13, 2021 | Board GamesDeep Reinforcement Learning | —Unverified | 0 |
| Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach | Feb 7, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks | Feb 6, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| BCQQ: Batch-Constraint Quantum Q-Learning with Cyclic Data Re-uploading | Apr 27, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions | Jan 14, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban | Oct 3, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Multi-Agent Deep Reinforcement Learning Approach for a Distributed Energy Marketplace in Smart Grids | Sep 23, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 |
| Beyond Traditional DoE: Deep Reinforcement Learning for Optimizing Experiments in Model Identification of Battery Dynamics | Oct 12, 2023 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Alzheimers Disease Diagnosis using Machine Learning: A Review | Apr 17, 2023 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 |
| Beyond Training-time Poisoning: Component-level and Post-training Backdoors in Deep Reinforcement Learning | Jul 7, 2025 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| BIBI System Description: Building with CNNs and Breaking with Deep Reinforcement Learning | Sep 1, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios | Jun 19, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| BASIL: Best-Action Symbolic Interpretable Learning for Evolving Compact RL Policies | May 31, 2025 | AcrobotDeep Reinforcement Learning | —Unverified | 0 |
| Bi-Level Control of Weaving Sections in Mixed Traffic Environments with Connected and Automated Vehicles | Mar 24, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Bilevel Learning Model Towards Industrial Scheduling | Aug 10, 2020 | Deep Reinforcement Learningmodel | —Unverified | 0 |
| Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning | Mar 27, 2023 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Biologically inspired architectures for sample-efficient deep reinforcement learning | Nov 25, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning | Mar 29, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Biological Neurons Compete with Deep Reinforcement Learning in Sample Efficiency in a Simulated Gameworld | May 27, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Biomechanic Posture Stabilisation via Iterative Training of Multi-policy Deep Reinforcement Learning Agents | Aug 21, 2020 | AI AgentDeep Reinforcement Learning | —Unverified | 0 |
| Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation | May 18, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| BitE : Accelerating Learned Query Optimization in a Mixed-Workload Environment | Jun 1, 2023 | Deep Reinforcement LearningEnsemble Learning | —Unverified | 0 |
| An Actor-Critic-Attention Mechanism for Deep Reinforcement Learning in Multi-view Environments | Jul 19, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning | May 18, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Alphazzle: Jigsaw Puzzle Solver with Deep Monte-Carlo Tree Search | Feb 1, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Blockchain-assisted Demonstration Cloning for Multi-Agent Deep Reinforcement Learning | Jan 19, 2025 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Blockchain-based Pseudonym Management for Vehicle Twin Migrations in Vehicular Edge Metaverse | Mar 22, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Adaptive Transit Signal Priority based on Deep Reinforcement Learning and Connected Vehicles in a Traffic Microsimulation Environment | Jul 31, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| An adaptive synchronization approach for weights of deep reinforcement learning | Aug 16, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Barrier Function-based Safe Reinforcement Learning for Emergency Control of Power Systems | Mar 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| BOOK: Storing Algorithm-Invariant Episodes for Deep Reinforcement Learning | Sep 5, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Boosting 5G on Smart Grid Communication: A Smart RAN Slicing Approach | Aug 30, 2022 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Bandwidth Reservation for Time-Critical Vehicular Applications: A Multi-Operator Environment | Mar 22, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States | Oct 1, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| AlphaStock: A Buying-Winners-and-Selling-Losers Investment Strategy using Interpretable Deep Reinforcement Attention Networks | Jul 24, 2019 | Deep AttentionDeep Reinforcement Learning | —Unverified | 0 |
| Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning | Apr 30, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bootstrapping a DQN Replay Memory with Synthetic Experiences | Feb 4, 2020 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation | Dec 18, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation | Jun 16, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Co-design of Embodied Neural Intelligence via Constrained Evolution | May 21, 2022 | Deep Reinforcement LearningGPU | —Unverified | 0 |