| Modular Recurrence in Contextual MDPs for Universal Morphology Control | Jun 10, 2025 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Where to Look Next: Unsupervised Active Visual Exploration on 360° Input | Sep 23, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification | Oct 1, 2019 | Active LearningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Attention-Augmented Inverse Reinforcement Learning with Graph Convolutions for Multi-Agent Task Allocation | Apr 7, 2025 | Deep Reinforcement LearningGraph Attention | —Unverified | 0 | 0 |
| Deep reinforced active learning for multi-class image classification | Jun 20, 2022 | Active LearningClassification | —Unverified | 0 | 0 |
| DeePref: Deep Reinforcement Learning For Video Prefetching In Content Delivery Networks | Oct 11, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Attend2Pack: Bin Packing through Deep Reinforcement Learning with Attention | Jul 9, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Deep Reactive Planning in Dynamic Environments | Oct 31, 2020 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Hierarchical Primitive Composition: Simultaneous Activation of Skills with Inconsistent Action Dimensions in Multiple Hierarchies | Oct 5, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Randomized Least Squares Value Iteration | Jan 1, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning | Nov 5, 2019 | Autonomous RacingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles | Nov 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment | May 27, 2025 | Adversarial AttackAdversarial Defense | —Unverified | 0 | 0 |
| Adaptive Discretization for Continuous Control using Particle Filtering Policy Network | Sep 28, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Attacking and Defending Deep Reinforcement Learning Policies | May 16, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Deep Q-Networks for Accelerating the Training of Deep Neural Networks | Jun 5, 2016 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| A* Tree Search for Portfolio Management | Jan 7, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 | 0 |
| Deep Q-Learning with Low Switching Cost | Jan 1, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task | Jun 2, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| A transformer-based deep reinforcement learning approach to spatial navigation in a partially observable Morris Water Maze | Oct 1, 2024 | Decision MakingDecoder | —Unverified | 0 | 0 |
| Deep Q Learning from Dynamic Demonstration with Behavioral Cloning | Jan 1, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 | 0 |
| A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules | Dec 7, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | Dec 10, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Adaptive Data Transport Mechanism for UAV Surveillance Missions in Lossy Environments | Sep 30, 2024 | Deep Reinforcement LearningMoving Object Detection | —Unverified | 0 | 0 |
| Deep Q-Learning for Self-Organizing Networks Fault Management and Radio Performance Improvement | Jul 10, 2017 | Deep Reinforcement LearningManagement | —Unverified | 0 | 0 |