| Deep Reinforcement Learning for Conservation Decisions | Jun 15, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning | Jun 15, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers | Jun 14, 2021 | Data PoisoningDeep Reinforcement Learning | —Unverified | 0 |
| Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation | Jun 14, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning | Jun 14, 2021 | Deep Reinforcement LearningImage Enhancement | —Unverified | 0 |
| On-Policy Deep Reinforcement Learning for the Average-Reward Criterion | Jun 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning-Aided Heuristics Design for Storage System | Jun 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Density-Based Bonuses on Learned Representations for Reward-Free Exploration in Deep Reinforcement Learning | Jun 13, 2021 | Deep Reinforcement LearningDensity Estimation | —Unverified | 0 |
| Intrinsic Control of Variational Beliefs in Dynamic Partially-Observed Visual Environments | Jun 13, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning based Group Recommender System | Jun 13, 2021 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |
| Learning on Abstract Domains: A New Approach for Verifiable Guarantee in Reinforcement Learning | Jun 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning | Jun 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Jun 11, 2021 | Card GamesDeep Reinforcement Learning | CodeCode Available | 2 |
| DRLD-SP: A Deep Reinforcement Learning-based Dynamic Service Placement in Edge-Enabled Internet of Vehicles | Jun 11, 2021 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | Jun 11, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Courteous Behavior of Automated Vehicles at Unsignalized Intersections via Reinforcement Learning | Jun 11, 2021 | Autonomous VehiclesCollision Avoidance | —Unverified | 0 |
| AI-driven Prices for Externalities and Sustainability in Production Markets | Jun 10, 2021 | Deep Reinforcement LearningFairness | CodeCode Available | 0 |
| Data-driven battery operation for energy arbitrage using rainbow deep reinforcement learning | Jun 10, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Simplifying Deep Reinforcement Learning via Self-Supervision | Jun 10, 2021 | Deep Reinforcement Learningregression | CodeCode Available | 0 |
| Reinforcement Learning for Industrial Control Network Cyber Security Orchestration | Jun 9, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Pretrained Encoders are All You Need | Jun 9, 2021 | AllContrastive Learning | CodeCode Available | 1 |
| Pretraining Representations for Data-Efficient Reinforcement Learning | Jun 9, 2021 | Atari GamesAtari Games 100k | CodeCode Available | 1 |
| PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning | Jun 8, 2021 | Continuous Control (100k environment steps)Continuous Control (500k environment steps) | CodeCode Available | 1 |
| Don't Get Yourself into Trouble! Risk-aware Decision-Making for Autonomous Vehicles | Jun 8, 2021 | Autonomous VehiclesDecision Making | —Unverified | 0 |