| Distributed Online Service Coordination Using Deep Reinforcement Learning | Jul 7, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Effects of Smart Traffic Signal Control on Air Quality | Jul 6, 2021 | Deep Reinforcement Learning, Traffic Signal Control | Unverified | 0 |
| Multi-Modal Mutual Information (MuMMI) Training for Robust Self-Supervised Deep Reinforcement Learning | Jul 6, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation | Jul 5, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Winning at Any Cost -- Infringing the Cartel Prohibition With Reinforcement Learning | Jul 5, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Jul 5, 2021 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Control of rough terrain vehicles using deep reinforcement learning | Jul 5, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning | Jul 4, 2021 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics | Jul 4, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Traffic Signal Control with Communicative Deep Reinforcement Learning Agents: a Case Study | Jul 3, 2021 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| A Novel Deep Reinforcement Learning Based Stock Direction Prediction using Knowledge Graph and Community Aware Sentiments | Jul 2, 2021 | Deep Reinforcement Learning, Prediction | Unverified | 0 |
| SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents | Jul 2, 2021 | Benchmarking, Deep Reinforcement Learning | Unverified | 0 |
| Drone swarm patrolling with uneven coverage requirements | Jul 1, 2021 | Deep Reinforcement Learning | Unverified | 0 |
| Optimal Power Allocation for Rate Splitting Communications with Deep Reinforcement Learning | Jul 1, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Applications of the Free Energy Principle to Machine Learning and Neuroscience | Jun 30, 2021 | Bayesian Inference, BIG-bench Machine Learning | Unverified | 0 |
| Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning | Jun 30, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| UAV-assisted Online Machine Learning over Multi-Tiered Networks: A Hierarchical Nested Personalized Federated Learning Approach | Jun 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC | Jun 29, 2021 | Deep Reinforcement Learning, Knowledge Graphs | Unverified | 0 |
| Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Jun 28, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 |
| Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples | Jun 28, 2021 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Continuous Control with Deep Reinforcement Learning for Autonomous Vessels | Jun 27, 2021 | Collision Avoidance, continuous-control | Unverified | 0 |
| A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation | Jun 25, 2021 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| Hierarchically Integrated Models: Learning to Navigate from Heterogeneous Robots | Jun 24, 2021 | Deep Reinforcement Learning, Navigate | Unverified | 0 |
| Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement Learning, Offline RL | Unverified | 0 |
| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |