| Causal Explanations for Sequential Decision-Making in Multi-Agent Systems | Feb 21, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 0 |
| Preserving the Privacy of Reward Functions in MDPs through Deception | Jul 13, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Risk-Aware Continuous Control with Neural Contextual Bandits | Dec 15, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks | Mar 9, 2023 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Improving Generalization in Reinforcement Learning Training Regimes for Social Robot Navigation | Aug 29, 2023 | Decision MakingNavigate | CodeCode Available | 0 |
| Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients | Jun 21, 2024 | Decision MakingManagement | CodeCode Available | 0 |
| Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning | Jul 1, 2019 | Decision MakingImage Captioning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Jun 12, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Decomposition Methods with Deep Corrections for Reinforcement Learning | Feb 6, 2018 | Autonomous DrivingDecision Making | CodeCode Available | 0 |