| GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving | Nov 16, 2021 | Autonomous DrivingCARLA MAP Leaderboard | —Unverified | 0 |
| Reinforcement Learning of Self Enhancing Camera Image and Signal Processing | Nov 15, 2021 | BlockingData Augmentation | CodeCode Available | 0 |
| Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning | Nov 13, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep Reinforcement Learning with Shallow Controllers: An Experimental Application to PID Tuning | Nov 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Robust Deep Reinforcement Learning for Extractive Legal Summarization | Nov 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AWD3: Dynamic Reduction of the Estimation Bias | Nov 12, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Expert Human-Level Driving in Gran Turismo Sport Using Deep Reinforcement Learning with Image-based Representation | Nov 11, 2021 | AI AgentDeep Reinforcement Learning | —Unverified | 0 |
| Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention | Nov 10, 2021 | BlockingDecision Making | —Unverified | 0 |
| Explainable Deep Reinforcement Learning for Portfolio Management: An Empirical Approach | Nov 7, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| FinRL-Podracer: High Performance and Scalable Deep Reinforcement Learning for Quantitative Finance | Nov 7, 2021 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance | Nov 7, 2021 | Deep Reinforcement LearningFriction | —Unverified | 0 |
| d3rlpy: An Offline Deep Reinforcement Learning Library | Nov 6, 2021 | D4RLDeep Reinforcement Learning | CodeCode Available | 0 |
| Development of collective behavior in newborn artificial agents | Nov 6, 2021 | Deep Reinforcement LearningObject Recognition | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Composing Moving IoT Services | Nov 6, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Improving RNA Secondary Structure Design using Deep Reinforcement Learning | Nov 5, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles | Nov 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Control of a fly-mimicking flyer in complex flow using deep reinforcement learning | Nov 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning | Nov 4, 2021 | Deep Reinforcement LearningExplainable artificial intelligence | —Unverified | 0 |
| Towards an Understanding of Default Policies in Multitask Policy Optimization | Nov 4, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Autonomous Attack Mitigation for Industrial Control Systems | Nov 3, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Weighted Quantum Channel Compiling through Proximal Policy Optimization | Nov 3, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Multi-Agent Deep Reinforcement Learning For Optimising Energy Efficiency of Fixed-Wing UAV Cellular Access Points | Nov 3, 2021 | Deep Reinforcement LearningTrajectory Planning | —Unverified | 0 |
| Online Service Provisioning in NFV-enabled Networks Using Deep Reinforcement Learning | Nov 3, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| What Robot do I Need? Fast Co-Adaptation of Morphology and Control using Graph Neural Networks | Nov 3, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deployment Optimization for Shared e-Mobility Systems with Multi-agent Deep Neural Search | Nov 3, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| FedGraph: Federated Graph Learning with Intelligent Sampling | Nov 2, 2021 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning | Nov 2, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Efficient Dialogue Complementary Policy Learning via Deep Q-network Policy and Episodic Memory Policy | Nov 1, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Human-Level Control without Server-Grade Hardware | Nov 1, 2021 | Cloud ComputingCPU | CodeCode Available | 0 |
| Machine Learning aided Crop Yield Optimization | Nov 1, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| Rewards with Negative Examples for Reinforced Topic-Focused Abstractive Summarization | Nov 1, 2021 | Abstractive Text SummarizationDeep Reinforcement Learning | —Unverified | 0 |
| Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning | Oct 29, 2021 | Deep Reinforcement LearningObject | —Unverified | 0 |
| DeF-DReL: Systematic Deployment of Serverless Functions in Fog and Cloud environments using Deep Reinforcement Learning | Oct 29, 2021 | Cloud ComputingDeep Reinforcement Learning | —Unverified | 0 |
| Cooperative Deep Q-learning Framework for Environments Providing Image Feedback | Oct 28, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning Aided Packet-Routing For Aeronautical Ad-Hoc Networks Formed by Passenger Planes | Oct 28, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| A Novel Sample-efficient Deep Reinforcement Learning with Episodic Policy Transfer for PID-Based Control in Cardiac Catheterization Robots | Oct 28, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| D2RLIR : an improved and diversified ranking function in interactive recommendation systems based on deep reinforcement learning | Oct 28, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Learning Diverse Policies in MOBA Games via Macro-Goals | Oct 27, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem | Oct 27, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling | Oct 27, 2021 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| The Difficulty of Passive Learning in Deep Reinforcement Learning | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Applications of Multi-Agent Reinforcement Learning in Future Internet: A Comprehensive Survey | Oct 26, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective | Oct 26, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments | Oct 25, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks | Oct 24, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Distributed Deep Reinforcement Learning Technique for Application Placement in Edge and Fog Computing Environments | Oct 24, 2021 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning | Oct 23, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models | Oct 22, 2021 | counterfactualDecision Making | CodeCode Available | 0 |