| Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning | Jun 1, 2024 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| OpenTensor: Reproducing Faster Matrix Multiplication Discovering Algorithms | May 31, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | May 30, 2024 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning | May 23, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Uncertainty-Aware DRL for Autonomous Vehicle Crowd Navigation in Shared Space | May 22, 2024 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation | May 5, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Hand-Object Interaction Controller (HOIC): Deep Reinforcement Learning for Reconstructing Interactions with Physics | May 4, 2024 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Leveraging Procedural Generation for Learning Autonomous Peg-in-Hole Assembly in Space | May 2, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| DPO Meets PPO: Reinforced Token Optimization for RLHF | Apr 29, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| SwarmRL: Building the Future of Smart Active Systems | Apr 25, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learning | Apr 24, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| Learning Heuristics for Transit Network Design and Improvement with Deep Reinforcement Learning | Apr 8, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning | Mar 26, 2024 | Deep Reinforcement LearningDistributed Computing | CodeCode Available | 1 |
| FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and Forecasting | Mar 19, 2024 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics | Mar 15, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| A Holistic Power Optimization Approach for Microgrid Control Based on Deep Reinforcement Learning | Mar 1, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | BenchmarkingCPU | CodeCode Available | 1 |
| Flexible Robust Beamforming for Multibeam Satellite Downlink using Reinforcement Learning | Feb 26, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning | Feb 22, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Competition Winning Deep Reinforcement Learning Agent in microRTS | Feb 12, 2024 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| FedAA: A Reinforcement Learning Perspective on Adaptive Aggregation for Fair and Robust Federated Learning | Feb 8, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning | Feb 7, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer | Feb 4, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error | Feb 3, 2024 | Adversarial RobustnessDeep Reinforcement Learning | CodeCode Available | 1 |