| Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning | Feb 7, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning | May 26, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning | Oct 12, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| DPO Meets PPO: Reinforced Token Optimization for RLHF | Apr 29, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control | Oct 19, 2020 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Deep reinforcement learning-designed radiofrequency waveform in MRI | May 7, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Deep Reinforcement Trading with Predictable Returns | Apr 29, 2021 | Clustering, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Deep RL Agent for a Real-Time Action Strategy Game | Feb 15, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling | May 15, 2020 | Deep Reinforcement Learning, GPU | Code Available | 1 | 5 |
| Asset Allocation: From Markowitz to Deep Reinforcement Learning | Jul 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Drafting in Collectible Card Games via Reinforcement Learning | Nov 7, 2020 | Card Games, Deep Reinforcement Learning | Code Available | 1 | 5 |
| DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning | Apr 6, 2022 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| DREAM: Deep Regret minimization with Advantage baselines and Model-free learning | Jun 18, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| A Scalable and Reproducible System-on-Chip Simulation for Reinforcement Learning | Apr 27, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay | Aug 17, 2021 | Deep Reinforcement Learning, Diversity | Code Available | 1 | 5 |
| Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation | Feb 10, 2021 | Autonomous Driving, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Distributed Two-tier DRL Framework for Cell-Free Network: Association, Beamforming and Power Allocation | Mar 22, 2023 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving | Nov 5, 2019 | Automated Theorem Proving, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Divergence-Augmented Policy Optimization | Jan 25, 2025 | Atari Games, Deep Reinforcement Learning | Code Available | 1 | 5 |
| DRIFT: Deep Reinforcement Learning for Intelligent Floating Platforms Trajectories | Oct 6, 2023 | Autonomous Navigation, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules | Jan 8, 2021 | Decoder, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Reinforcement Learning Environment For Job-Shop Scheduling | Apr 8, 2021 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Action Space Shaping in Deep Reinforcement Learning | Apr 2, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations | Feb 23, 2020 | Atari Games, Decision Making | Code Available | 1 | 5 |