| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 |
| Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations | Sep 28, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Learning Decision Trees as Amortized Structure Inference | Mar 10, 2025 | Anomaly Detection, Deep Reinforcement Learning | Code Available | 1 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous Navigation, Benchmarking | Code Available | 1 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| Learning Generalizable Policy for Obstacle-Aware Autonomous Drone Racing | Nov 6, 2024 | Deep Reinforcement Learning, Drone navigation | Code Available | 1 |
| Learning Guidance Rewards with Trajectory-space Smoothing | Oct 23, 2020 | Attribute, Deep Reinforcement Learning | Code Available | 1 |
| A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch | Jul 26, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| Learning Large Neighborhood Search Policy for Integer Programming | Nov 1, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 |
| Learning Multi-Agent Communication through Structured Attentive Reasoning | Dec 1, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Learning Selective Communication for Multi-Agent Path Finding | Sep 12, 2021 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts | Jul 24, 2022 | Deep Reinforcement Learning, Humanoid Control | Code Available | 1 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement Learning, Job Shop Scheduling | Code Available | 1 |
| Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning | Jul 5, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems | Feb 9, 2020 | Combinatorial Optimization, Decoder | Code Available | 1 |
| Learning to Identify Critical States for Reinforcement Learning from Videos | Aug 15, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning | Jun 1, 2024 | Deep Reinforcement Learning, Position | Code Available | 1 |
| Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Jun 8, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Bridging State and History Representations: Understanding Self-Predictive RL | Jan 17, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Learning to Solve Multiple-TSP with Time Window and Rejections via Deep Reinforcement Learning | Sep 13, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 |
| Learning to Track Dynamic Targets in Partially Known Environments | Jun 17, 2020 | Deep Reinforcement Learning, Navigate | Code Available | 1 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |