| Deep Reinforcement Learning for Conservation Decisions | Jun 15, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning | Jun 15, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers | Jun 14, 2021 | Data PoisoningDeep Reinforcement Learning | —Unverified | 0 |
| Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation | Jun 14, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning | Jun 14, 2021 | Deep Reinforcement LearningImage Enhancement | —Unverified | 0 |
| On-Policy Deep Reinforcement Learning for the Average-Reward Criterion | Jun 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning-Aided Heuristics Design for Storage System | Jun 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Density-Based Bonuses on Learned Representations for Reward-Free Exploration in Deep Reinforcement Learning | Jun 13, 2021 | Deep Reinforcement LearningDensity Estimation | —Unverified | 0 |
| Intrinsic Control of Variational Beliefs in Dynamic Partially-Observed Visual Environments | Jun 13, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning based Group Recommender System | Jun 13, 2021 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |
| Learning on Abstract Domains: A New Approach for Verifiable Guarantee in Reinforcement Learning | Jun 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning | Jun 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Jun 11, 2021 | Card GamesDeep Reinforcement Learning | CodeCode Available | 2 |
| DRLD-SP: A Deep Reinforcement Learning-based Dynamic Service Placement in Edge-Enabled Internet of Vehicles | Jun 11, 2021 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | Jun 11, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Courteous Behavior of Automated Vehicles at Unsignalized Intersections via Reinforcement Learning | Jun 11, 2021 | Autonomous VehiclesCollision Avoidance | —Unverified | 0 |
| AI-driven Prices for Externalities and Sustainability in Production Markets | Jun 10, 2021 | Deep Reinforcement LearningFairness | CodeCode Available | 0 |
| Data-driven battery operation for energy arbitrage using rainbow deep reinforcement learning | Jun 10, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Simplifying Deep Reinforcement Learning via Self-Supervision | Jun 10, 2021 | Deep Reinforcement Learningregression | CodeCode Available | 0 |
| Reinforcement Learning for Industrial Control Network Cyber Security Orchestration | Jun 9, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Pretrained Encoders are All You Need | Jun 9, 2021 | AllContrastive Learning | CodeCode Available | 1 |
| Pretraining Representations for Data-Efficient Reinforcement Learning | Jun 9, 2021 | Atari GamesAtari Games 100k | CodeCode Available | 1 |
| PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning | Jun 8, 2021 | Continuous Control (100k environment steps)Continuous Control (500k environment steps) | CodeCode Available | 1 |
| Don't Get Yourself into Trouble! Risk-aware Decision-Making for Autonomous Vehicles | Jun 8, 2021 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| A Deep Value-network Based Approach for Multi-Driver Order Dispatching | Jun 8, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning Markov State Abstractions for Deep Reinforcement Learning | Jun 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Dynamic Sparse Training for Deep Reinforcement Learning | Jun 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Towards Practical Credit Assignment for Deep Reinforcement Learning | Jun 8, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Left Ventricle Contouring in Cardiac Images Based on Deep Reinforcement Learning | Jun 8, 2021 | Deep Reinforcement LearningImage Segmentation | CodeCode Available | 0 |
| Correcting Momentum in Temporal Difference Learning | Jun 7, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Explainable Artificial Intelligence (XAI) for Increasing User Trust in Deep Reinforcement Learning Driven Autonomous Systems | Jun 7, 2021 | Deep Reinforcement LearningExplainable artificial intelligence | —Unverified | 0 |
| DisTop: Discovering a Topological representation to learn diverse and rewarding skills | Jun 6, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| 3D UAV Trajectory and Data Collection Optimisation via Deep Reinforcement Learning | Jun 6, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Online Trading Models with Deep Reinforcement Learning in the Forex Market Considering Transaction Costs | Jun 6, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Bridging the Gap Between Target Networks and Functional Regularization | Jun 4, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning | Jun 4, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 2 |
| Feeling of Presence Maximization: mmWave-Enabled Virtual Reality Meets Deep Reinforcement Learning | Jun 3, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| MICo: Improved representations via sampling-based state similarity for Markov decision processes | Jun 3, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning | Jun 3, 2021 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| High-Quality Diversification for Task-Oriented Dialogue Systems | Jun 2, 2021 | Conversational SearchDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning-based UAV Navigation and Control: A Soft Actor-Critic with Hindsight Experience Replay Approach | Jun 2, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Towards Deeper Deep Reinforcement Learning with Spectral Normalization | Jun 2, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Quantitative Day Trading from Natural Language using Reinforcement Learning | Jun 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Coarse to Fine Question Answering System based on Reinforcement Learning | Jun 1, 2021 | Deep Reinforcement LearningQuestion Answering | —Unverified | 0 |
| Policies for the Dynamic Traveling Maintainer Problem with Alerts | May 31, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning in Quantitative Algorithmic Trading: A Review | May 31, 2021 | Algorithmic TradingDeep Reinforcement Learning | CodeCode Available | 0 |
| Reducing the Deployment-Time Inference Control Costs of Deep Reinforcement Learning Agents via an Asymmetric Architecture | May 30, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Survey of Deep Reinforcement Learning Algorithms for Motion Planning and Control of Autonomous Vehicles | May 29, 2021 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Reconfigurable Intelligent Surface-assisted Multi-UAV Networks: Efficient Resource Allocation with Deep Reinforcement Learning | May 28, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |