| Memory-based Deep Reinforcement Learning for POMDPs | Feb 24, 2021 | Deep Reinforcement LearningFeature Engineering | CodeCode Available | 1 |
| Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic | Feb 24, 2021 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 0 |
| FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism | Feb 24, 2021 | CPUDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Safe Landing Site Selection with Concurrent Consideration of Divert Maneuvers | Feb 24, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Domain-invariant NBV Planner for Active Cross-domain Self-localization | Feb 23, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning | Feb 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Dynamic Spectrum Sharing of LTE and NR | Feb 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Improved Learning of Robot Manipulation Tasks via Tactile Intrinsic Motivation | Feb 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Return-Based Contrastive Representation Learning for Reinforcement Learning | Feb 22, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| QoE Optimization for Live Video Streaming in UAV-to-UAV Communications via Deep Reinforcement Learning | Feb 21, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player | Feb 21, 2021 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 1 |
| Decoupling Value and Policy for Generalization in Reinforcement Learning | Feb 20, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| How To Train Your HERON | Feb 20, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning | Feb 19, 2021 | Automated Theorem ProvingDeep Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning for Beam Pattern Design in Millimeter Wave and Massive MIMO Systems | Feb 18, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Training a Resilient Q-Network against Observational Interference | Feb 18, 2021 | Causal InferenceDeep Reinforcement Learning | CodeCode Available | 1 |
| State Entropy Maximization with Random Encoders for Efficient Exploration | Feb 18, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 |
| Strategic bidding in freight transport using deep reinforcement learning | Feb 18, 2021 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Adaptive Rational Activations to Boost Deep Reinforcement Learning | Feb 18, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Privacy-Preserving Kickstarting Deep Reinforcement Learning with Privacy-Aware Learners | Feb 18, 2021 | Deep Reinforcement LearningPrivacy Preserving | —Unverified | 0 |
| Smart Feasibility Pump: Reinforcement Learning for (Mixed) Integer Programming | Feb 18, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning | Feb 17, 2021 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| Active Privacy-utility Trade-off Against a Hypothesis Testing Adversary | Feb 16, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Training Larger Networks for Deep Reinforcement Learning | Feb 16, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep Reinforcement Learning | Feb 15, 2021 | Bayesian InferenceDeep Reinforcement Learning | CodeCode Available | 0 |
| Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement Learning | Feb 15, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing | Feb 15, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning | Feb 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for IoT Security: A Comprehensive Survey | Feb 14, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| LTL2Action: Generalizing LTL Instructions for Multi-Task RL | Feb 13, 2021 | Deep Reinforcement LearningDiversity | CodeCode Available | 1 |
| Modelling Cooperation in Network Games with Spatio-Temporal Complexity | Feb 13, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Generalizing Decision Making for Automated Driving with an Invariant Environment Representation using Deep Reinforcement Learning | Feb 12, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| Deep Reinforcement Learning for Backup Strategies against Adversaries | Feb 12, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module | Feb 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Agent for Scheduling in HPC | Feb 11, 2021 | Deep Reinforcement LearningScheduling | CodeCode Available | 1 |
| Deep Reinforcement Learning for Combinatorial Optimization: Covering Salesman Problems | Feb 11, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning with Symmetric Prior for Predictive Power Allocation to Mobile Users | Feb 10, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms | Feb 10, 2021 | Deep Reinforcement LearningMatrix Completion | CodeCode Available | 0 |
| Learning Equational Theorem Proving | Feb 10, 2021 | Automated Theorem ProvingDeep Reinforcement Learning | —Unverified | 0 |
| Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation | Feb 10, 2021 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 1 |
| Adaptive Processor Frequency Adjustment for Mobile Edge Computing with Intermittent Energy Supply | Feb 10, 2021 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Scheduling the NASA Deep Space Network with Deep Reinforcement Learning | Feb 9, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Measuring Progress in Deep Reinforcement Learning Sample Efficiency | Feb 9, 2021 | Atari Gamescontinuous-control | —Unverified | 0 |
| Adversarially Guided Actor-Critic | Feb 8, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 |
| RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads | Feb 8, 2021 | CPUDeep Reinforcement Learning | CodeCode Available | 1 |
| Tactical Optimism and Pessimism for Deep Reinforcement Learning | Feb 7, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Multi-Agent Deep Reinforcement Learning for Request Dispatching in Distributed-Controller Software-Defined Networking | Feb 6, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Explainable Reinforcement Learning for Longitudinal Control | Feb 6, 2021 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 1 |
| Improving Model and Search for Computer Go | Feb 6, 2021 | Deep Reinforcement Learningmodel | —Unverified | 0 |