| Deep Reinforcement Learning for Synthesizing Functions in Higher-Order Logic | Oct 25, 2019 | Automated Theorem Proving, BIG-bench Machine Learning | Code Available | 0 | 5 |
| Collaborative Deep Reinforcement Learning | Feb 19, 2017 | Deep Reinforcement Learning, Knowledge Distillation | Code Available | 0 | 5 |
| Deep Reinforcement Learning in Large Discrete Action Spaces | Dec 24, 2015 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 0 | 5 |
| A Hierarchical Approach to Population Training for Human-AI Collaboration | May 26, 2023 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use case | Oct 16, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep reinforcement learning from human preferences | Jun 12, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning Framework for Thoracic Diseases Classification via Prior Knowledge Guidance | Jun 2, 2023 | Deep Reinforcement Learning, Diagnostic | Code Available | 0 | 5 |
| Deep Reinforcement Learning framework for Autonomous Driving | Apr 8, 2017 | Atari Games, Autonomous Driving | Code Available | 0 | 5 |
| Deep Reinforcement Learning from Hierarchical Preference Design | Sep 6, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling | Jul 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement Learning | Jun 20, 2019 | Autonomous Driving, Decision Making | Code Available | 0 | 5 |
| Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning | Dec 21, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Swarm Systems | Jul 17, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action Segmentation, Classification | Code Available | 0 | 5 |
| Adaptive Power System Emergency Control using Deep Reinforcement Learning | Mar 9, 2019 | Benchmarking, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard | Aug 6, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning | Feb 15, 2019 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 | 5 |
| Propagation Networks for Model-Based Control Under Partial Observation | Sep 28, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 | 5 |
| Automating Reinforcement Learning with Example-based Resets | Apr 5, 2022 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Prosocial learning agents solve generalized Stag Hunts better than selfish ones | Sep 8, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 | 5 |
| Deep reinforcement learning for smart calibration of radio telescopes | Feb 5, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration | May 22, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |