| Component Transfer Learning for Deep RL Based on Abstract Representations | Nov 22, 2021 | Deep Reinforcement LearningTransfer Learning | CodeCode Available | 0 |
| Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning | Nov 21, 2021 | Deep Reinforcement Learningenergy trading | —Unverified | 0 |
| Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval | Nov 21, 2021 | Deep Reinforcement LearningImage Retrieval | CodeCode Available | 0 |
| Vulcan: Solving the Steiner Tree Problem with Graph Neural Networks and Deep Reinforcement Learning | Nov 21, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning | Nov 19, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| A Survey of Zero-shot Generalisation in Deep Reinforcement Learning | Nov 18, 2021 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | Nov 17, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep Reinforcement Learning for Entity Alignment | Nov 16, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving | Nov 16, 2021 | Autonomous DrivingCARLA MAP Leaderboard | —Unverified | 0 |
| CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms | Nov 16, 2021 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 3 |
| Reinforcement Learning of Self Enhancing Camera Image and Signal Processing | Nov 15, 2021 | BlockingData Augmentation | CodeCode Available | 0 |
| Deep Reinforcement Learning with Shallow Controllers: An Experimental Application to PID Tuning | Nov 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning | Nov 13, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Robust Deep Reinforcement Learning for Extractive Legal Summarization | Nov 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AWD3: Dynamic Reduction of the Estimation Bias | Nov 12, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| User Allocation in Mobile Edge Computing: A Deep Reinforcement Learning Approach | Nov 11, 2021 | CPUDeep Reinforcement Learning | CodeCode Available | 1 |
| Expert Human-Level Driving in Gran Turismo Sport Using Deep Reinforcement Learning with Image-based Representation | Nov 11, 2021 | AI AgentDeep Reinforcement Learning | —Unverified | 0 |
| Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention | Nov 10, 2021 | BlockingDecision Making | —Unverified | 0 |
| FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance | Nov 7, 2021 | Deep Reinforcement LearningFriction | —Unverified | 0 |
| FinRL-Podracer: High Performance and Scalable Deep Reinforcement Learning for Quantitative Finance | Nov 7, 2021 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments | Nov 7, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Explainable Deep Reinforcement Learning for Portfolio Management: An Empirical Approach | Nov 7, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Development of collective behavior in newborn artificial agents | Nov 6, 2021 | Deep Reinforcement LearningObject Recognition | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for Composing Moving IoT Services | Nov 6, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Robust Deep Reinforcement Learning for Quadcopter Control | Nov 6, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| d3rlpy: An Offline Deep Reinforcement Learning Library | Nov 6, 2021 | D4RLDeep Reinforcement Learning | CodeCode Available | 0 |
| Improving RNA Secondary Structure Design using Deep Reinforcement Learning | Nov 5, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Control of a fly-mimicking flyer in complex flow using deep reinforcement learning | Nov 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning | Nov 4, 2021 | Deep Reinforcement LearningExplainable artificial intelligence | —Unverified | 0 |
| Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles | Nov 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Towards an Understanding of Default Policies in Multitask Policy Optimization | Nov 4, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Autonomous Attack Mitigation for Industrial Control Systems | Nov 3, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Weighted Quantum Channel Compiling through Proximal Policy Optimization | Nov 3, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Multi-Agent Deep Reinforcement Learning For Optimising Energy Efficiency of Fixed-Wing UAV Cellular Access Points | Nov 3, 2021 | Deep Reinforcement LearningTrajectory Planning | —Unverified | 0 |
| Online Service Provisioning in NFV-enabled Networks Using Deep Reinforcement Learning | Nov 3, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deployment Optimization for Shared e-Mobility Systems with Multi-agent Deep Neural Search | Nov 3, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| What Robot do I Need? Fast Co-Adaptation of Morphology and Control using Graph Neural Networks | Nov 3, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| FedGraph: Federated Graph Learning with Intelligent Sampling | Nov 2, 2021 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning | Nov 2, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning Large Neighborhood Search Policy for Integer Programming | Nov 1, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Rewards with Negative Examples for Reinforced Topic-Focused Abstractive Summarization | Nov 1, 2021 | Abstractive Text SummarizationDeep Reinforcement Learning | —Unverified | 0 |
| Efficient Dialogue Complementary Policy Learning via Deep Q-network Policy and Episodic Memory Policy | Nov 1, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Human-Level Control without Server-Grade Hardware | Nov 1, 2021 | Cloud ComputingCPU | CodeCode Available | 0 |
| Machine Learning aided Crop Yield Optimization | Nov 1, 2021 | BIG-bench Machine LearningDeep Reinforcement Learning | —Unverified | 0 |
| DeF-DReL: Systematic Deployment of Serverless Functions in Fog and Cloud environments using Deep Reinforcement Learning | Oct 29, 2021 | Cloud ComputingDeep Reinforcement Learning | —Unverified | 0 |
| Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning | Oct 29, 2021 | Deep Reinforcement LearningObject | —Unverified | 0 |
| A Novel Sample-efficient Deep Reinforcement Learning with Episodic Policy Transfer for PID-Based Control in Cardiac Catheterization Robots | Oct 28, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning Aided Packet-Routing For Aeronautical Ad-Hoc Networks Formed by Passenger Planes | Oct 28, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| URLB: Unsupervised Reinforcement Learning Benchmark | Oct 28, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| D2RLIR : an improved and diversified ranking function in interactive recommendation systems based on deep reinforcement learning | Oct 28, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |