| Assessing AI Utility: The Random Guesser Test for Sequential Decision-Making Systems | Jul 25, 2024 | Decision MakingRecommendation Systems | —Unverified | 0 | 0 |
| A General Framework for Sequential Decision-Making under Adaptivity Constraints | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Active Sensing as Bayes-Optimal Sequential Decision Making | Aug 9, 2014 | Decision MakingSensitivity | —Unverified | 0 | 0 |
| Continuous Episodic Control | Nov 28, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| A Short Survey On Memory Based Reinforcement Learning | Apr 14, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Continual Vision-and-Language Navigation | Mar 22, 2024 | Continual LearningNavigate | —Unverified | 0 | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| A storage expansion planning framework using reinforcement learning and simulation-based optimization | Jan 10, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| A Unifying Framework for Reinforcement Learning and Planning | Jun 26, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Contextual Experience Replay for Self-Improvement of Language Agents | Jun 7, 2025 | Decision MakingLarge Language Model | —Unverified | 0 | 0 |
| A Sequential Decision-Making Model for Perimeter Identification | Sep 4, 2024 | Decision Makingmodel | —Unverified | 0 | 0 |
| Contextual Bandits for Unbounded Context Distributions | Aug 19, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| A Scheme for Dynamic Risk-Sensitive Sequential Decision Making | Jul 9, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | Jun 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Jun 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Constrained Online Decision-Making: A Unified Framework | May 11, 2025 | Active Learningcounterfactual | —Unverified | 0 | 0 |
| A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis | Feb 8, 2023 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 | 0 |
| Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 | 0 |
| Consensus in Motion: A Case of Dynamic Rationality of Sequential Learning in Probability Aggregation | Apr 20, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints | Sep 23, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option | Mar 6, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Computing Preimages of Deep Neural Networks with Applications to Safety | Jan 1, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 | 0 |
| A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments | Aug 18, 2023 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 | 0 |
| Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks | Apr 3, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making | Apr 20, 2023 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop | Oct 7, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learningcontinuous-control | —Unverified | 0 | 0 |
| Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning | Jul 29, 2020 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| A Review of Cooperation in Multi-agent Learning | Dec 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement LearningScheduling | —Unverified | 0 | 0 |
| Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs | Jul 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System | Feb 5, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| A Reinforcement Learning Approach for Sequential Spatial Transformer Networks | Jun 27, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| The f-Divergence Reinforcement Learning Framework | Sep 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | Sep 25, 2019 | Decision MakingKnowledge Distillation | —Unverified | 0 | 0 |
| A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints | Dec 24, 2020 | Decision MakingFairness | —Unverified | 0 | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Accelerating exploration and representation learning with offline pre-training | Mar 31, 2023 | Decision MakingNetHack | —Unverified | 0 | 0 |
| How to Provably Improve Return Conditioned Supervised Learning? | Jun 10, 2025 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Nov 12, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| A Reduction-based Framework for Sequential Decision Making with Delayed Feedback | Feb 3, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Code Models are Zero-shot Precondition Reasoners | Nov 16, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| ARDuP: Active Region Video Diffusion for Universal Policies | Jun 19, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adversarial Deep Learning for Online Resource Allocation | Nov 19, 2021 | Decision MakingDeep Learning | —Unverified | 0 | 0 |
| A Practical Introduction to Deep Reinforcement Learning | May 13, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks | Jun 24, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Chasing Ghosts: Competing with Stateful Policies | Jul 29, 2014 | AttributeDecision Making | —Unverified | 0 | 0 |
| Adversarial Attacks on Online Learning to Rank with Click Feedback | May 26, 2023 | Decision MakingLearning-To-Rank | —Unverified | 0 | 0 |
| Active Learning for Accurate Estimation of Linear Models | Mar 2, 2017 | Active LearningDecision Making | —Unverified | 0 | 0 |