| A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making | Apr 20, 2023 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop | Oct 7, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learningcontinuous-control | —Unverified | 0 | 0 |
| Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning | Jul 29, 2020 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| A Review of Cooperation in Multi-agent Learning | Dec 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement LearningScheduling | —Unverified | 0 | 0 |
| Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs | Jul 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System | Feb 5, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| A Reinforcement Learning Approach for Sequential Spatial Transformer Networks | Jun 27, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| The f-Divergence Reinforcement Learning Framework | Sep 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | Sep 25, 2019 | Decision MakingKnowledge Distillation | —Unverified | 0 | 0 |
| A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints | Dec 24, 2020 | Decision MakingFairness | —Unverified | 0 | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Accelerating exploration and representation learning with offline pre-training | Mar 31, 2023 | Decision MakingNetHack | —Unverified | 0 | 0 |
| How to Provably Improve Return Conditioned Supervised Learning? | Jun 10, 2025 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Nov 12, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| A Reduction-based Framework for Sequential Decision Making with Delayed Feedback | Feb 3, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Code Models are Zero-shot Precondition Reasoners | Nov 16, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| ARDuP: Active Region Video Diffusion for Universal Policies | Jun 19, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adversarial Deep Learning for Online Resource Allocation | Nov 19, 2021 | Decision MakingDeep Learning | —Unverified | 0 | 0 |
| A Practical Introduction to Deep Reinforcement Learning | May 13, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks | Jun 24, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Chasing Ghosts: Competing with Stateful Policies | Jul 29, 2014 | AttributeDecision Making | —Unverified | 0 | 0 |
| Adversarial Attacks on Online Learning to Rank with Click Feedback | May 26, 2023 | Decision MakingLearning-To-Rank | —Unverified | 0 | 0 |
| Active Learning for Accurate Estimation of Linear Models | Mar 2, 2017 | Active LearningDecision Making | —Unverified | 0 | 0 |