| SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents | Jul 2, 2021 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | Jul 2, 2021 | Deep Reinforcement LearningPrediction | —Unverified | 0 |
| Drone swarm patrolling with uneven coverage requirements | Jul 1, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Optimal Power Allocation for Rate Splitting Communications with Deep Reinforcement Learning | Jul 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Applications of the Free Energy Principle to Machine Learning and Neuroscience | Jun 30, 2021 | Bayesian InferenceBIG-bench Machine Learning | —Unverified | 0 |
| Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning | Jun 30, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC | Jun 29, 2021 | Deep Reinforcement LearningKnowledge Graphs | —Unverified | 0 |
| UAV-assisted Online Machine Learning over Multi-Tiered Networks: A Hierarchical Nested Personalized Federated Learning Approach | Jun 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples | Jun 28, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Continuous Control with Deep Reinforcement Learning for Autonomous Vessels | Jun 27, 2021 | Collision Avoidancecontinuous-control | —Unverified | 0 |
| A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation | Jun 25, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Hierarchically Integrated Models: Learning to Navigate from Heterogeneous Robots | Jun 24, 2021 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Reinforcement Learning for Physical Layer Communications | Jun 22, 2021 | Deep Reinforcement LearningMulti-Armed Bandits | CodeCode Available | 0 |
| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Interpretable Model-based Hierarchical Reinforcement Learning using Inductive Logic Programming | Jun 21, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Emphatic Algorithms for Deep Reinforcement Learning | Jun 21, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| LAU-Net: Latitude Adaptive Upscaling Network for Omnidirectional Image Super-Resolution | Jun 19, 2021 | Deep Reinforcement LearningImage Super-Resolution | —Unverified | 0 |
| ColorRL: Reinforced Coloring for End-to-End Instance Segmentation | Jun 19, 2021 | Deep Reinforcement LearningInstance Segmentation | —Unverified | 0 |
| Learning the Non-Differentiable Optimization for Blind Super-Resolution | Jun 19, 2021 | Blind Super-ResolutionDeep Reinforcement Learning | —Unverified | 0 |
| Non-Robust Feature Mapping in Deep Reinforcement Learning | Jun 18, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Multi-Task Learning for User Engagement and Adoption in Live Video Streaming Events | Jun 18, 2021 | Deep Reinforcement LearningMulti-Task Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning Models Predict Visual Responses in the Brain: A Preliminary Result | Jun 18, 2021 | Deep Reinforcement LearningObject Recognition | —Unverified | 0 |
| Prediction-Free, Real-Time Flexible Control of Tidal Lagoons through Proximal Policy Optimisation: A Case Study for the Swansea Lagoon | Jun 18, 2021 | Deep Reinforcement LearningUnity | —Unverified | 0 |
| Strategically-timed State-Observation Attacks on Deep Reinforcement Learning Agents | Jun 18, 2021 | Adversarial Attackcontinuous-control | —Unverified | 0 |
| Adversarially Trained Neural Policies in the Fourier Domain | Jun 18, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning Based Optimization for IRS Based UAV-NOMA Downlink Networks | Jun 17, 2021 | Deep Reinforcement LearningPosition | —Unverified | 0 |
| Modelling resource allocation in uncertain system environment through deep reinforcement learning | Jun 17, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Many Agent Reinforcement Learning Under Partial Observability | Jun 17, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Deep reinforcement learning with automated label extraction from clinical reports accurately classifies 3D MRI brain volumes | Jun 17, 2021 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents | Jun 17, 2021 | Deep Reinforcement LearningPosition | —Unverified | 0 |
| A Reinforcement Learning Approach for an IRS-assisted NOMA Network | Jun 17, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Real-time Adversarial Perturbations against Deep Reinforcement Learning Policies: Attacks and Defenses | Jun 16, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation | Jun 16, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep reinforcement learning on a multi-asset environment for trading | Jun 15, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning | Jun 15, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |
| Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation | Jun 14, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers | Jun 14, 2021 | Data PoisoningDeep Reinforcement Learning | —Unverified | 0 |
| User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning | Jun 14, 2021 | Deep Reinforcement LearningImage Enhancement | —Unverified | 0 |
| On-Policy Deep Reinforcement Learning for the Average-Reward Criterion | Jun 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning-Aided Heuristics Design for Storage System | Jun 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning on Abstract Domains: A New Approach for Verifiable Guarantee in Reinforcement Learning | Jun 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Intrinsic Control of Variational Beliefs in Dynamic Partially-Observed Visual Environments | Jun 13, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Density-Based Bonuses on Learned Representations for Reward-Free Exploration in Deep Reinforcement Learning | Jun 13, 2021 | Deep Reinforcement LearningDensity Estimation | —Unverified | 0 |
| GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | Jun 11, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning | Jun 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| DRLD-SP: A Deep Reinforcement Learning-based Dynamic Service Placement in Edge-Enabled Internet of Vehicles | Jun 11, 2021 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| Courteous Behavior of Automated Vehicles at Unsignalized Intersections via Reinforcement Learning | Jun 11, 2021 | Autonomous VehiclesCollision Avoidance | —Unverified | 0 |
| Data-driven battery operation for energy arbitrage using rainbow deep reinforcement learning | Jun 10, 2021 | continuous-controlContinuous Control | —Unverified | 0 |