| URLB: Unsupervised Reinforcement Learning Benchmark | Oct 28, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Learning Domain Invariant Representations in Goal-conditioned Block MDPs | Oct 27, 2021 | Deep Reinforcement Learning, Domain Generalization | Code Available | 1 |
| Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling | Oct 27, 2021 | Autonomous Driving, Decision Making | Unverified | 0 |
| Learning Diverse Policies in MOBA Games via Macro-Goals | Oct 27, 2021 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| Comparing Heuristics, Constraint Optimization, and Reinforcement Learning for an Industrial 2D Packing Problem | Oct 27, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Towards Robust Bisimulation Metric Learning | Oct 27, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Learning Collaborative Policies to Solve NP-hard Routing Problems | Oct 26, 2021 | Deep Reinforcement Learning, Traveling Salesman Problem | Code Available | 1 |
| The Difficulty of Passive Learning in Deep Reinforcement Learning | Oct 26, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling | Oct 26, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective | Oct 26, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Applications of Multi-Agent Reinforcement Learning in Future Internet: A Comprehensive Survey | Oct 26, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control | Oct 26, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments | Oct 25, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Uniformly Conservative Exploration in Reinforcement Learning | Oct 25, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Recurrent Off-policy Baselines for Memory-based Continuous Control | Oct 25, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks | Oct 24, 2021 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| A Distributed Deep Reinforcement Learning Technique for Application Placement in Edge and Fog Computing Environments | Oct 24, 2021 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning | Oct 23, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models | Oct 22, 2021 | counterfactual, Decision Making | Code Available | 0 |
| Anti-Concentrated Confidence Bonuses for Scalable Exploration | Oct 21, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Online Control of Stochastic Partial Differential Equations | Oct 21, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Locality-Sensitive Experience Replay for Online Recommendation | Oct 21, 2021 | Deep Reinforcement Learning, Recommendation Systems | Unverified | 0 |
| Neuro-Symbolic Reinforcement Learning with First-Order Logic | Oct 21, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Generative Models in Engineering Design: A Review | Oct 21, 2021 | Deep Reinforcement Learning, Design Synthesis | Unverified | 0 |
| CIM-PPO: Proximal Policy Optimization with Liu-Correntropy Induced Metric | Oct 20, 2021 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Transferring Reinforcement Learning for DC-DC Buck Converter Control via Duty Ratio Mapping: From Simulation to Implementation | Oct 20, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| FedParking: A Federated Learning based Parking Space Estimation with Parked Vehicle assisted Edge Computing | Oct 19, 2021 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Aesthetic Photo Collage with Deep Reinforcement Learning | Oct 19, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-control, Continuous Control | Code Available | 0 |
| Embracing advanced AI/ML to help investors achieve success: Vanguard Reinforcement Learning for Financial Goal Planning | Oct 18, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement Learning, Job Shop Scheduling | Code Available | 1 |
| In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications | Oct 18, 2021 | Deep Reinforcement Learning | Code Available | 0 |
| Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization | Oct 17, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Emotion Style Transfer with a Specified Intensity Using Deep Reinforcement Learning | Oct 16, 2021 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| Lifting the veil on hyper-parameters for value-based deep reinforcement learning | Oct 16, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Case-based Reasoning for Better Generalization in Textual Reinforcement Learning | Oct 16, 2021 | Deep Reinforcement Learning, Out-of-Distribution Generalization | Unverified | 0 |
| Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning | Oct 16, 2021 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| A Broad-persistent Advising Approach for Deep Interactive Reinforcement Learning in Robotic Environments | Oct 15, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification | Oct 13, 2021 | Deep Reinforcement Learning, Object | Code Available | 0 |
| Scalable Traffic Signal Controls using Fog-Cloud Based Multiagent Reinforcement Learning | Oct 11, 2021 | Deep Reinforcement Learning, Graph Attention | Unverified | 0 |
| Navigation In Urban Environments Amongst Pedestrians Using Multi-Objective Deep Reinforcement Learning | Oct 11, 2021 | Autonomous Driving, Autonomous Navigation | Unverified | 0 |
| REIN-2: Giving Birth to Prepared Reinforcement Learning Agents Using Reinforcement Learning Agents | Oct 11, 2021 | Deep Reinforcement Learning, Meta-Learning | Unverified | 0 |
| Learning a subspace of policies for online adaptation in Reinforcement Learning | Oct 11, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning | Oct 11, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning for Optimizing RIS-Assisted HD-FD Wireless Systems | Oct 10, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Multi-condition multi-objective optimization using deep reinforcement learning | Oct 10, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Hard instance learning for quantum adiabatic prime factorization | Oct 10, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| MARVEL: Raster Manga Vectorization via Primitive-wise Deep Reinforcement Learning | Oct 10, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement Learning, Starcraft | Code Available | 1 |
| Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation | Oct 9, 2021 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |