| C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning | Sep 25, 2019 | Deep Reinforcement Learningmotion retargeting | CodeCode Available | 0 | 5 |
| An Automatic Cost Learning Framework for Image Steganography Using Deep Reinforcement Learning | Sep 25, 2020 | Deep Reinforcement LearningImage Steganography | CodeCode Available | 0 | 5 |
| CAD2RL: Real Single-Image Flight without a Single Real Image | Nov 13, 2016 | 3D geometryCollision Avoidance | CodeCode Available | 0 | 5 |
| Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based Games | May 1, 2022 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 | 5 |
| Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and Animals | Feb 25, 2019 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| Task and Domain Adaptive Reinforcement Learning for Robot Control | Apr 29, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Calibrated Model-Based Deep Reinforcement Learning | Jun 19, 2019 | Deep Reinforcement Learningmodel | CodeCode Available | 0 | 5 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius Maximization | Jan 29, 2025 | Adversarial RobustnessDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation | Apr 19, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Learning on a Budget via Teacher Imitation | Apr 17, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI | Feb 19, 2025 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Financial Trading as a Game: A Deep Reinforcement Learning Approach | Jul 8, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? | Nov 7, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| AI2STOW: End-to-End Deep Reinforcement Learning to Construct Master Stowage Plans under Demand Uncertainty | Apr 6, 2025 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Autonomous Braking System via Deep Reinforcement Learning | Feb 8, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| FedMRL: Data Heterogeneity Aware Federated Multi-agent Deep Reinforcement Learning for Medical Imaging | Jul 8, 2024 | Deep Reinforcement LearningFairness | CodeCode Available | 0 | 5 |
| Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning | Dec 22, 2017 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 | 5 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Apr 1, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach | Jan 1, 2021 | Deep Reinforcement LearningFault Detection | CodeCode Available | 0 | 5 |
| Learning Symbolic Task Decompositions for Multi-Agent Teams | Feb 19, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 | 5 |
| FedSlate:A Federated Deep Reinforcement Learning Recommender System | Sep 23, 2024 | Deep Reinforcement LearningFederated Learning | CodeCode Available | 0 | 5 |
| Fast deep reinforcement learning using online adjustments from the past | Oct 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Generalization of Reinforcement Learners with Working and Episodic Memory | Oct 29, 2019 | Deep Reinforcement LearningHoldout Set | CodeCode Available | 0 | 5 |
| Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models | Jun 11, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 | 5 |