| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Dec 24, 2024 | Deep Reinforcement LearningImputation | CodeCode Available | 0 |
| Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage Planning | Feb 18, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement Learning | Mar 21, 2022 | Deep Reinforcement Learningfeature selection | CodeCode Available | 0 |
| Self Punishment and Reward Backfill for Deep Q-Learning | Apr 10, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics | Sep 13, 2023 | Common Sense ReasoningDeep Reinforcement Learning | CodeCode Available | 0 |
| Reconciling λ-Returns with Experience Replay | Dec 1, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Intrinsic Rewards from Self-Organizing Feature Maps for Exploration in Reinforcement Learning | Feb 6, 2023 | ClusteringDeep Reinforcement Learning | CodeCode Available | 0 |
| Contrastive Representation for Interactive Recommendation | Dec 24, 2024 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Swarm Systems | Jul 17, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning | Sep 24, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 |
| Nav-Q: Quantum Deep Reinforcement Learning for Collision-Free Navigation of Self-Driving Cars | Nov 20, 2023 | Deep Reinforcement LearningDescriptive | CodeCode Available | 0 |
| Inverse reinforcement learning for video games | Oct 24, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions | Oct 11, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp | Nov 30, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Self Reward Design with Fine-grained Interpretability | Dec 30, 2021 | Deep Reinforcement LearningFairness | CodeCode Available | 0 |
| Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control | Mar 10, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation | Sep 29, 2017 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks | May 18, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| NeSIG: A Neuro-Symbolic Method for Learning to Generate Planning Problems | Jan 24, 2023 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Autoregressive Policies for Continuous Control Deep Reinforcement Learning | Mar 27, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Continuous Control With Ensemble Deep Deterministic Policy Gradients | Nov 30, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Continual Reinforcement Learning for HVAC Systems Control: Integrating Hypernetworks and Transfer Learning | Mar 24, 2025 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation | Nov 25, 2024 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| EX2: Exploration with Exemplar Models for Deep Reinforcement Learning | Mar 3, 2017 | Deep Reinforcement LearningDensity Estimation | CodeCode Available | 0 |
| I Open at the Close: A Deep Reinforcement Learning Evaluation of Open Streets Initiatives | Dec 12, 2023 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games | Sep 27, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation | Mar 16, 2025 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Reinforcement Learning of Self Enhancing Camera Image and Signal Processing | Nov 15, 2021 | BlockingData Augmentation | CodeCode Available | 0 |
| Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist | Feb 28, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 |
| An Automatic Cost Learning Framework for Image Steganography Using Deep Reinforcement Learning | Sep 25, 2020 | Deep Reinforcement LearningImage Steganography | CodeCode Available | 0 |
| Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control | Dec 24, 2018 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Neural Approximate Dynamic Programming for On-Demand Ride-Pooling | Nov 20, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation | Jan 24, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field | Aug 13, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| 5G Routing Interfered Environment | Mar 28, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Why People Skip Music? On Predicting Music Skips using Deep Reinforcement Learning | Jan 10, 2023 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 0 |
| An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents | Dec 17, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? | Jun 10, 2024 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi | Mar 22, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Neural DNF-MT: A Neuro-symbolic Approach for Learning Interpretable and Editable Policies | Jan 7, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Evolution-Guided Policy Gradient in Reinforcement Learning | May 21, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Neural Episodic Control | Mar 6, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning | Jul 11, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment | Dec 22, 2021 | Autonomous DrivingBenchmarking | CodeCode Available | 0 |
| IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness | Jul 18, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks | Aug 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Whenever, Wherever: Towards Orchestrating Crowd Simulations with Spatio-Temporal Spawn Dynamics | Mar 20, 2025 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action SegmentationClassification | CodeCode Available | 0 |
| Synthesis of Biologically Realistic Human Motion Using Joint Torque Actuation | Apr 30, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks | Nov 20, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |