| Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers | Nov 22, 2024 | AvgDeep Reinforcement Learning | CodeCode Available | 1 |
| DRL-Based Optimization for AoI and Energy Consumption in C-V2X Enabled IoV | Nov 20, 2024 | Deep Reinforcement LearningScheduling | CodeCode Available | 1 |
| Learning Generalizable Policy for Obstacle-Aware Autonomous Drone Racing | Nov 6, 2024 | Deep Reinforcement LearningDrone navigation | CodeCode Available | 1 |
| Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC | Nov 6, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Successor Features the Simple Way | Oct 29, 2024 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Enhancing Battery Storage Energy Arbitrage with Deep Reinforcement Learning and Time-Series Forecasting | Oct 25, 2024 | Deep Reinforcement LearningTime Series | CodeCode Available | 1 |
| Entity-based Reinforcement Learning for Autonomous Cyber Defence | Oct 23, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| Generative Artificial Intelligence (GAI) for Mobile Communications: A Diffusion Model Perspective | Oct 8, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization | Oct 4, 2024 | Deep Reinforcement LearningQuantization | CodeCode Available | 1 |
| Scalable Multi-Robot Informative Path Planning for Target Mapping via Deep Reinforcement Learning | Sep 25, 2024 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement LearningHeuristic Search | CodeCode Available | 1 |
| Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning | Sep 2, 2024 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 1 |
| Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Aug 30, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 1 |
| Control-Informed Reinforcement Learning for Chemical Processes | Aug 24, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Hologram Reasoning for Solving Algebra Problems with Geometry Diagrams | Aug 20, 2024 | Deep Reinforcement LearningModel Selection | CodeCode Available | 1 |
| Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Aug 13, 2024 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Model-Based Transfer Learning for Contextual Reinforcement Learning | Aug 8, 2024 | Bayesian Optimizationcontinuous-control | CodeCode Available | 1 |
| A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C | Jul 19, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Reconfigurable Intelligent Surface Aided Vehicular Edge Computing: Joint Phase-shift Optimization and Multi-User Power Allocation | Jul 18, 2024 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 |
| Joint Optimization of Age of Information and Energy Consumption in NR-V2X System based on Deep Reinforcement Learning | Jul 11, 2024 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 1 |
| Resource Allocation for Twin Maintenance and Computing Task Processing in Digital Twin Vehicular Edge Computing Network | Jul 10, 2024 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 |
| Deep-Reinforcement-Learning-Based AoI-Aware Resource Allocation for RIS-Aided IoV Networks | Jun 17, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Reconfigurable Intelligent Surface Assisted VEC Based on Multi-Agent Reinforcement Learning | Jun 17, 2024 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 |
| Semantic-Aware Spectrum Sharing in Internet of Vehicles Based on Deep Reinforcement Learning | Jun 11, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning | Jun 1, 2024 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| OpenTensor: Reproducing Faster Matrix Multiplication Discovering Algorithms | May 31, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | May 30, 2024 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning | May 23, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Uncertainty-Aware DRL for Autonomous Vehicle Crowd Navigation in Shared Space | May 22, 2024 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation | May 5, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Hand-Object Interaction Controller (HOIC): Deep Reinforcement Learning for Reconstructing Interactions with Physics | May 4, 2024 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Leveraging Procedural Generation for Learning Autonomous Peg-in-Hole Assembly in Space | May 2, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| DPO Meets PPO: Reinforced Token Optimization for RLHF | Apr 29, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| SwarmRL: Building the Future of Smart Active Systems | Apr 25, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learning | Apr 24, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| Learning Heuristics for Transit Network Design and Improvement with Deep Reinforcement Learning | Apr 8, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning | Mar 26, 2024 | Deep Reinforcement LearningDistributed Computing | CodeCode Available | 1 |
| FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting | Mar 19, 2024 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics | Mar 15, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| A Holistic Power Optimization Approach for Microgrid Control Based on Deep Reinforcement Learning | Mar 1, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | BenchmarkingCPU | CodeCode Available | 1 |
| Flexible Robust Beamforming for Multibeam Satellite Downlink using Reinforcement Learning | Feb 26, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning | Feb 22, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Competition Winning Deep Reinforcement Learning Agent in microRTS | Feb 12, 2024 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| FedAA: A Reinforcement Learning Perspective on Adaptive Aggregation for Fair and Robust Federated Learning | Feb 8, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning | Feb 7, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer | Feb 4, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error | Feb 3, 2024 | Adversarial RobustnessDeep Reinforcement Learning | CodeCode Available | 1 |