| Deep Reinforcement Learning with Function Properties in Mean Reversion Strategies | Jan 9, 2021 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Parameter Space Noise for Exploration | Jun 6, 2017 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games | Oct 22, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Experimental Study on The Effect of Multi-step Deep Reinforcement Learning in POMDPs | Sep 12, 2022 | Deep Reinforcement Learning | Code Available | 0 | 5 |
| DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation | Jan 9, 2018 | Autonomous Driving, Autonomous Navigation | Code Available | 0 | 5 |
| DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation | Aug 23, 2024 | Deep Reinforcement Learning, Mixture-of-Experts | Code Available | 0 | 5 |
| Deep Reinforcement Learning via Object-Centric Attention | Apr 3, 2025 | Deep Reinforcement Learning, Inductive Bias | Code Available | 0 | 5 |
| Deep reinforcement learning uncovers processes for separating azeotropic mixtures without prior knowledge | Oct 10, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads | Jun 12, 2016 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning that Matters | Sep 19, 2017 | Atari Games, Continuous Control | Code Available | 0 | 5 |
| Deep Reinforcement Learning with a Natural Language Action Space | Nov 14, 2015 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning | Apr 17, 2023 | Deep Reinforcement Learning, Management | Code Available | 0 | 5 |
| A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation | Jun 25, 2021 | Deep Reinforcement Learning, Navigate | Code Available | 0 | 5 |
| PixelRL: Fully Convolutional Network with Reinforcement Learning for Image Processing | Dec 16, 2019 | Deep Reinforcement Learning, Denoising | Code Available | 0 | 5 |
| Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer | Apr 3, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 | 5 |
| Playing Atari with Six Neurons | Jun 4, 2018 | Atari Games, Decision Making | Code Available | 0 | 5 |
| Deep Reinforcement Learning of Marked Temporal Point Processes | May 23, 2018 | Deep Reinforcement Learning, Marketing | Code Available | 0 | 5 |
| Collaborative Evolutionary Reinforcement Learning | May 2, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Deep Reinforcement Learning of Region Proposal Networks for Object Detection | Jun 1, 2018 | Deep Reinforcement Learning, Object | Code Available | 0 | 5 |
| Deep Reinforcement Learning Radio Control and Signal Detection with KeRLym, a Gym RL Agent | May 30, 2016 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning in Quantitative Algorithmic Trading: A Review | May 31, 2021 | Algorithmic Trading, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Policy Abstraction and Nash Refinement in Tree-Exploiting PSRO | Feb 5, 2025 | Deep Reinforcement Learning | Code Available | 0 | 5 |
| Adaptive Ordered Information Extraction with Deep Reinforcement Learning | Jun 19, 2023 | Deep Reinforcement Learning, Event Extraction | Code Available | 0 | 5 |
| Policy Consolidation for Continual Reinforcement Learning | Feb 1, 2019 | Continual Learning, continuous-control | Code Available | 0 | 5 |
| Deep reinforcement learning in World-Earth system models to discover sustainable management strategies | Aug 15, 2019 | Deep Reinforcement Learning, Management | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Synthesizing Functions in Higher-Order Logic | Oct 25, 2019 | Automated Theorem Proving, BIG-bench Machine Learning | Code Available | 0 | 5 |
| Collaborative Deep Reinforcement Learning | Feb 19, 2017 | Deep Reinforcement Learning, Knowledge Distillation | Code Available | 0 | 5 |
| Deep Reinforcement Learning in Large Discrete Action Spaces | Dec 24, 2015 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 0 | 5 |
| A Hierarchical Approach to Population Training for Human-AI Collaboration | May 26, 2023 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use case | Oct 16, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep reinforcement learning from human preferences | Jun 12, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning Framework for Thoracic Diseases Classification via Prior Knowledge Guidance | Jun 2, 2023 | Deep Reinforcement Learning, Diagnostic | Code Available | 0 | 5 |
| Deep Reinforcement Learning framework for Autonomous Driving | Apr 8, 2017 | Atari Games, Autonomous Driving | Code Available | 0 | 5 |
| Deep Reinforcement Learning from Hierarchical Preference Design | Sep 6, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling | Jul 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks | Mar 29, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement Learning | Jun 20, 2019 | Autonomous Driving, Decision Making | Code Available | 0 | 5 |
| Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning | Dec 21, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Deep reinforcement learning for time series: playing idealized trading games | Mar 11, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Swarm Systems | Jul 17, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action Segmentation, Classification | Code Available | 0 | 5 |
| Adaptive Power System Emergency Control using Deep Reinforcement Learning | Mar 9, 2019 | Benchmarking, Deep Reinforcement Learning | Code Available | 0 | 5 |
| Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard | Aug 6, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning | Feb 15, 2019 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 | 5 |
| Propagation Networks for Model-Based Control Under Partial Observation | Sep 28, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 | 5 |
| Automating Reinforcement Learning with Example-based Resets | Apr 5, 2022 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Prosocial learning agents solve generalized Stag Hunts better than selfish ones | Sep 8, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 0 | 5 |
| Deep reinforcement learning for smart calibration of radio telescopes | Feb 5, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration | May 22, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |