| Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | Feb 4, 2020 | Decision Making, Montezuma's Revenge | —Unverified | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jan 21, 2025 | Combinatorial Optimization, Sequential Decision Making | —Unverified | 0 |
| Building Intelligent Autonomous Navigation Agents | Jun 25, 2021 | Autonomous Navigation, Decision Making | —Unverified | 0 |
| Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes | Oct 14, 2024 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Can A User Anticipate What Her Followers Want? | Sep 1, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Bayesian Exploration Networks | Aug 24, 2023 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Bayesian decision-making under misspecified priors with applications to meta-learning | Jul 3, 2021 | Decision Making, Meta-Learning | —Unverified | 0 |
| A naive aggregation algorithm for improving generalization in a class of learning problems | Sep 6, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding | Apr 13, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Causal Bayesian Optimization | May 24, 2020 | Bayesian Optimization, Causal Inference | —Unverified | 0 |
| Adaptive Sampling for Discovery | May 30, 2022 | Decision Making, Drug Discovery | —Unverified | 0 |
| Causal Markov Decision Processes: Learning Good Interventions Efficiently | Feb 15, 2021 | Decision Making, Marketing | —Unverified | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Learning "What-if" Explanations for Sequential Decision-Making | Jul 2, 2020 | counterfactual, Counterfactual Reasoning | —Unverified | 0 |
| Batched Nonparametric Bandits via k-Nearest Neighbor UCB | May 15, 2025 | Decision Making, Marketing | —Unverified | 0 |
| An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | continuous-control, Continuous Control | —Unverified | 0 |
| A Practical Introduction to Deep Reinforcement Learning | May 13, 2025 | Autonomous Driving, Decision Making | —Unverified | 0 |
| Adversarial Attacks on Online Learning to Rank with Click Feedback | May 26, 2023 | Decision Making, Learning-To-Rank | —Unverified | 0 |
| Batched Neural Bandits | Feb 25, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Code Models are Zero-shot Precondition Reasoners | Nov 16, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Nov 12, 2024 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| A Reduction-based Framework for Sequential Decision Making with Delayed Feedback | Feb 3, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | May 27, 2025 | Sequential Decision Making | —Unverified | 0 |
| A Computational Framework for Motor Skill Acquisition | Jan 3, 2019 | Decision Making, Reinforcement Learning | —Unverified | 0 |
| A Reinforcement Learning Approach for Sequential Spatial Transformer Networks | Jun 27, 2021 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision Making, Q-Learning | —Unverified | 0 |
| Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs | Jul 6, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement Learning, Scheduling | —Unverified | 0 |
| A Review of Cooperation in Multi-agent Learning | Dec 8, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning | Jul 29, 2020 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Making, model | —Unverified | 0 |
| Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks | Apr 3, 2024 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Computing Preimages of Deep Neural Networks with Applications to Safety | Jan 1, 2021 | Collision Avoidance, Decision Making | —Unverified | 0 |
| A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments | Aug 18, 2023 | Decision Making, Multi-Objective Reinforcement Learning | —Unverified | 0 |
| Consensus in Motion: A Case of Dynamic Rationality of Sequential Learning in Probability Aggregation | Apr 20, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed Bandits, Sequential Decision Making | —Unverified | 0 |
| Constrained Online Decision-Making: A Unified Framework | May 11, 2025 | Active Learning, counterfactual | —Unverified | 0 |
| Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Jun 17, 2024 | Autonomous Driving, Decision Making | —Unverified | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | Jun 9, 2024 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Bandits with Unobserved Confounders: A Causal Approach | Dec 1, 2015 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| A Sequential Decision-Making Model for Perimeter Identification | Sep 4, 2024 | Decision Making, model | —Unverified | 0 |
| Contextual Experience Replay for Self-Improvement of Language Agents | Jun 7, 2025 | Decision Making, Large Language Model | —Unverified | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Continual Vision-and-Language Navigation | Mar 22, 2024 | Continual Learning, Navigate | —Unverified | 0 |
| Continuous Episodic Control | Nov 28, 2022 | continuous-control, Continuous Control | —Unverified | 0 |
| A Short Survey On Memory Based Reinforcement Learning | Apr 14, 2019 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs | Jun 6, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning | Oct 22, 2024 | Decision Making, Diversity | —Unverified | 0 |
| Convex Regularization in Monte-Carlo Tree Search | Jul 1, 2020 | Atari Games, Decision Making | —Unverified | 0 |
| Bandits in Matching Markets: Ideas and Proposals for Peer Lending | Oct 30, 2020 | Decision Making, Fairness | —Unverified | 0 |