| D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing | Apr 14, 2022 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to Rank | Mar 31, 2022 | counterfactualGeneral Reinforcement Learning | CodeCode Available | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision MakingGeneral Reinforcement Learning | —Unverified | 0 |
| Abstractions of General Reinforcement Learning | Dec 26, 2021 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach | Nov 14, 2021 | Algorithmic TradingGeneral Reinforcement Learning | CodeCode Available | 1 |
| ^2-exploration for Reinforcement Learning | Sep 29, 2021 | General Reinforcement LearningQ-Learning | —Unverified | 0 |
| Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning | Sep 29, 2021 | FPS GamesGeneral Reinforcement Learning | —Unverified | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 0 |
| A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning | Aug 29, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 |
| Low-Resource Machine Translation based on Asynchronous Dynamic Programming | Aug 1, 2021 | General Reinforcement LearningLow Resource Neural Machine Translation | —Unverified | 0 |
| QKSA: Quantum Knowledge Seeking Agent | Jul 3, 2021 | Artificial LifeGeneral Reinforcement Learning | CodeCode Available | 0 |
| Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning | Apr 30, 2021 | Collision AvoidanceGeneral Reinforcement Learning | —Unverified | 0 |
| FaiR-IoT: Fairness-aware Human-in-the-Loop Reinforcement Learning for Harnessing Human Variability in Personalized IoT | Mar 30, 2021 | FairnessGeneral Reinforcement Learning | —Unverified | 0 |
| Adaptive Rational Activations to Boost Deep Reinforcement Learning | Feb 18, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| End-to-End Egospheric Spatial Memory | Feb 15, 2021 | General Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| Interactive Learning from Activity Description | Feb 13, 2021 | General Reinforcement LearningGrounded language learning | CodeCode Available | 0 |
| A State Representation Dueling Network for Deep Reinforcement Learning | Dec 24, 2020 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 |
| Exact Reduction of Huge Action Spaces in General Reinforcement Learning | Dec 18, 2020 | BinarizationGeneral Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning of Causal Variables Using Mediation Analysis | Oct 29, 2020 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Represent Action Values as a Hypergraph on the Action Vertices | Oct 28, 2020 | Atari GamesContinuous Control | CodeCode Available | 0 |
| Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Sep 29, 2020 | General Reinforcement LearningMinecraft | CodeCode Available | 1 |
| Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors | Jul 15, 2020 | Developmental LearningDrone Controller | CodeCode Available | 1 |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Jul 12, 2020 | Atari Games 100kData Augmentation | CodeCode Available | 1 |
| The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning | Jul 7, 2020 | General Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 0 |
| Counterfactual Data Augmentation using Locally Factored Dynamics | Jul 6, 2020 | counterfactualData Augmentation | CodeCode Available | 1 |