| Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning | Aug 5, 2024 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| AI Safety Gridworlds | Nov 27, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| 3D Traffic Simulation for Autonomous Vehicles in Unity and Python | Oct 30, 2018 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Jul 2, 2025 | Atari Games, Chatbot | Code Available | 0 | 5 |
| Generative Market Equilibrium Models with Stable Adversarial Learning via Reinforcement | Apr 5, 2025 | Deep Reinforcement Learning | Code Available | 0 | 5 |
| Graph Attention-based Deep Reinforcement Learning for solving the Chinese Postman Problem with Load-dependent costs | Oct 24, 2023 | ARC, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation | Apr 20, 2016 | Deep Reinforcement Learning, Montezuma's Revenge | Code Available | 0 | 5 |
| Implementing the Deep Q-Network | Nov 20, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| GAC: A Deep Reinforcement Learning Model Toward User Incentivization in Unknown Social Networks | Mar 17, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Generalizable Resource Allocation in Stream Processing via Deep Reinforcement Learning | Nov 19, 2019 | Decoder, Deep Reinforcement Learning | Code Available | 0 | 5 |
| From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks | Jun 9, 2025 | Deep Reinforcement Learning | Code Available | 0 | 5 |
| AI Olympics challenge with Evolutionary Soft Actor Critic | Sep 2, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| From Video Game to Real Robot: The Transfer between Action Spaces | May 2, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 | 5 |
| Free-Lunch Saliency via Attention in Atari Agents | Aug 7, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| From Gameplay to Symbolic Reasoning: Learning SAT Solver Heuristics in the Style of Alpha(Go) Zero | Feb 14, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System | Dec 16, 2023 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| ForestProtector: An IoT Architecture Integrating Machine Vision and Deep Reinforcement Learning for Efficient Wildfire Monitoring | Jan 17, 2025 | Deep Reinforcement Learning, Fire Detection | Code Available | 0 | 5 |
| Action Advising with Advice Imitation in Deep Reinforcement Learning | Apr 17, 2021 | Atari Games, Behavioural cloning | Code Available | 0 | 5 |
| Autonomous Navigation via Deep Reinforcement Learning for Resource Constraint Edge Nodes using Transfer Learning | Oct 12, 2019 | Autonomous Navigation, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates | May 22, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Autonomous Management of Energy-Harvesting IoT Nodes Using Deep Reinforcement Learning | May 10, 2019 | Deep Reinforcement Learning, Management | Code Available | 0 | 5 |
| Flight Controller Synthesis Via Deep Reinforcement Learning | Sep 14, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Learning Humanoid Robot Running Skills through Proximal Policy Optimization | Oct 22, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 | 5 |
| Flexible Option Learning | Dec 6, 2021 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Code Available | 0 | 5 |
| Generalization and Regularization in DQN | Sep 29, 2018 | Atari Games, Benchmarking | Code Available | 0 | 5 |
| C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning | Sep 25, 2019 | Deep Reinforcement Learning, motion retargeting | Code Available | 0 | 5 |
| An Automatic Cost Learning Framework for Image Steganography Using Deep Reinforcement Learning | Sep 25, 2020 | Deep Reinforcement Learning, Image Steganography | Code Available | 0 | 5 |
| CAD2RL: Real Single-Image Flight without a Single Real Image | Nov 13, 2016 | 3D geometry, Collision Avoidance | Code Available | 0 | 5 |
| Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based Games | May 1, 2022 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 0 | 5 |
| Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and Animals | Feb 25, 2019 | Deep Reinforcement Learning, OpenAI Gym | Code Available | 0 | 5 |
| Task and Domain Adaptive Reinforcement Learning for Robot Control | Apr 29, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Calibrated Model-Based Deep Reinforcement Learning | Jun 19, 2019 | Deep Reinforcement Learning, model | Code Available | 0 | 5 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization | Jan 29, 2025 | Adversarial Robustness, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation | Apr 19, 2024 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Learning on a Budget via Teacher Imitation | Apr 17, 2021 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI | Feb 19, 2025 | counterfactual, Decision Making | Code Available | 0 | 5 |
| Financial Trading as a Game: A Deep Reinforcement Learning Approach | Jul 8, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? | Nov 7, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| AI2STOW: End-to-End Deep Reinforcement Learning to Construct Master Stowage Plans under Demand Uncertainty | Apr 6, 2025 | Computational Efficiency, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Autonomous Braking System via Deep Reinforcement Learning | Feb 8, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| FedMRL: Data Heterogeneity Aware Federated Multi-agent Deep Reinforcement Learning for Medical Imaging | Jul 8, 2024 | Deep Reinforcement Learning, Fairness | Code Available | 0 | 5 |
| Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning | Dec 22, 2017 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 0 | 5 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Apr 1, 2020 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach | Jan 1, 2021 | Deep Reinforcement Learning, Fault Detection | Code Available | 0 | 5 |
| Learning Symbolic Task Decompositions for Multi-Agent Teams | Feb 19, 2025 | Deep Reinforcement Learning | Code Available | 0 | 5 |
| FedSlate:A Federated Deep Reinforcement Learning Recommender System | Sep 23, 2024 | Deep Reinforcement Learning, Federated Learning | Code Available | 0 | 5 |
| Fast deep reinforcement learning using online adjustments from the past | Oct 18, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Generalization of Reinforcement Learners with Working and Episodic Memory | Oct 29, 2019 | Deep Reinforcement Learning, Holdout Set | Code Available | 0 | 5 |
| Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models | Jun 11, 2024 | Deep Reinforcement Learning | Code Available | 0 | 5 |