| Distributed Online Service Coordination Using Deep Reinforcement Learning | Jul 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Effects of Smart Traffic Signal Control on Air Quality | Jul 6, 2021 | Deep Reinforcement LearningTraffic Signal Control | —Unverified | 0 |
| Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning | Jul 6, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation | Jul 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Winning at Any Cost -- Infringing the Cartel Prohibition With Reinforcement Learning | Jul 5, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Jul 5, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Control of rough terrain vehicles using deep reinforcement learning | Jul 5, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning | Jul 4, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics | Jul 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Traffic Signal Control with Communicative Deep Reinforcement Learning Agents: a Case Study | Jul 3, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | Jul 2, 2021 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents | Jul 2, 2021 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Drone swarm patrolling with uneven coverage requirements | Jul 1, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Optimal Power Allocation for Rate Splitting Communications with Deep Reinforcement Learning | Jul 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Applications of the Free Energy Principle to Machine Learning and Neuroscience | Jun 30, 2021 | Bayesian InferenceBIG-bench Machine Learning | —Unverified | 0 |
| Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning | Jun 30, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| UAV-assisted Online Machine Learning over Multi-Tiered Networks: A Hierarchical Nested Personalized Federated Learning Approach | Jun 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC | Jun 29, 2021 | Deep Reinforcement LearningKnowledge Graphs | —Unverified | 0 |
| Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Jun 28, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 2 |
| Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples | Jun 28, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Continuous Control with Deep Reinforcement Learning for Autonomous Vessels | Jun 27, 2021 | Collision Avoidancecontinuous-control | —Unverified | 0 |
| A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation | Jun 25, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Hierarchically Integrated Models: Learning to Navigate from Heterogeneous Robots | Jun 24, 2021 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for Physical Layer Communications | Jun 22, 2021 | Deep Reinforcement LearningMulti-Armed Bandits | CodeCode Available | 0 |
| Emphatic Algorithms for Deep Reinforcement Learning | Jun 21, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming | Jun 21, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Learning the Non-Differentiable Optimization for Blind Super-Resolution | Jun 19, 2021 | Blind Super-ResolutionDeep Reinforcement Learning | —Unverified | 0 |
| ColorRL: Reinforced Coloring for End-to-End Instance Segmentation | Jun 19, 2021 | Deep Reinforcement LearningInstance Segmentation | —Unverified | 0 |
| Predicting Human Scanpaths in Visual Question Answering | Jun 19, 2021 | Deep Reinforcement LearningQuestion Answering | CodeCode Available | 1 |
| LAU-Net: Latitude Adaptive Upscaling Network for Omnidirectional Image Super-Resolution | Jun 19, 2021 | Deep Reinforcement LearningImage Super-Resolution | —Unverified | 0 |
| Adversarially Trained Neural Policies in the Fourier Domain | Jun 18, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Non-Robust Feature Mapping in Deep Reinforcement Learning | Jun 18, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk | Jun 18, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Strategically-timed State-Observation Attacks on Deep Reinforcement Learning Agents | Jun 18, 2021 | Adversarial Attackcontinuous-control | —Unverified | 0 |
| Prediction-Free, Real-Time Flexible Control of Tidal Lagoons through Proximal Policy Optimisation: A Case Study for the Swansea Lagoon | Jun 18, 2021 | Deep Reinforcement LearningUnity | —Unverified | 0 |
| Deep Reinforcement Learning Models Predict Visual Responses in the Brain: A Preliminary Result | Jun 18, 2021 | Deep Reinforcement LearningObject Recognition | —Unverified | 0 |
| Multi-Task Learning for User Engagement and Adoption in Live Video Streaming Events | Jun 18, 2021 | Deep Reinforcement LearningMulti-Task Learning | CodeCode Available | 0 |
| Many Agent Reinforcement Learning Under Partial Observability | Jun 17, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Deep reinforcement learning with automated label extraction from clinical reports accurately classifies 3D MRI brain volumes | Jun 17, 2021 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| A Reinforcement Learning Approach for an IRS-assisted NOMA Network | Jun 17, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning Based Optimization for IRS Based UAV-NOMA Downlink Networks | Jun 17, 2021 | Deep Reinforcement LearningPosition | —Unverified | 0 |
| A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents | Jun 17, 2021 | Deep Reinforcement LearningPosition | —Unverified | 0 |
| Modelling resource allocation in uncertain system environment through deep reinforcement learning | Jun 17, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Tactile Sim-to-Real Policy Transfer via Real-to-Sim Image Translation | Jun 16, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Real-time Adversarial Perturbations against Deep Reinforcement Learning Policies: Attacks and Defenses | Jun 16, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation | Jun 16, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep reinforcement learning on a multi-asset environment for trading | Jun 15, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |