| Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games | Sep 27, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation | Mar 16, 2025 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Reinforcement Learning of Self Enhancing Camera Image and Signal Processing | Nov 15, 2021 | BlockingData Augmentation | CodeCode Available | 0 |
| Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist | Feb 28, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 |
| An Automatic Cost Learning Framework for Image Steganography Using Deep Reinforcement Learning | Sep 25, 2020 | Deep Reinforcement LearningImage Steganography | CodeCode Available | 0 |
| Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control | Dec 24, 2018 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 |
| Neural Approximate Dynamic Programming for On-Demand Ride-Pooling | Nov 20, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation | Jan 24, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field | Aug 13, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| 5G Routing Interfered Environment | Mar 28, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Why People Skip Music? On Predicting Music Skips using Deep Reinforcement Learning | Jan 10, 2023 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 0 |
| An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents | Dec 17, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? | Jun 10, 2024 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi | Mar 22, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Neural DNF-MT: A Neuro-symbolic Approach for Learning Interpretable and Editable Policies | Jan 7, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Evolution-Guided Policy Gradient in Reinforcement Learning | May 21, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Neural Episodic Control | Mar 6, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning | Jul 11, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment | Dec 22, 2021 | Autonomous DrivingBenchmarking | CodeCode Available | 0 |
| IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness | Jul 18, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks | Aug 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Whenever, Wherever: Towards Orchestrating Crowd Simulations with Spatio-Temporal Spawn Dynamics | Mar 20, 2025 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action SegmentationClassification | CodeCode Available | 0 |
| Synthesis of Biologically Realistic Human Motion Using Joint Torque Actuation | Apr 30, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks | Nov 20, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |