| Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards | Jun 20, 2025 | Decision Making Under UncertaintyMulti-Armed Bandits | —Unverified | 0 |
| Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning | Jan 12, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Multi-Environment Pretraining Enables Transfer to Action Limited Datasets | Nov 23, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Multi-granular Adversarial Attacks against Black-box Neural Ranking Models | Apr 2, 2024 | Adversarial AttackDecision Making | —Unverified | 0 |
| Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception | Aug 10, 2023 | Decision MakingRobot Manipulation | —Unverified | 0 |
| Multi-Player Zero-Sum Markov Games with Networked Separable Interactions | Jul 13, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Multi-shot Pedestrian Re-identification via Sequential Decision Making | Dec 19, 2017 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Multi-Task Generative Adversarial Nets with Shared Memory for Cross-Domain Coordination Control | Jul 1, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Multi-task Representation Learning for Pure Exploration in Linear Bandits | Feb 9, 2023 | Decision MakingRepresentation Learning | —Unverified | 0 |
| MuZero with Self-competition for Rate Control in VP9 Video Compression | Feb 14, 2022 | Decision MakingQuantization | —Unverified | 0 |
| Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | Mar 11, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making | Dec 1, 2018 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations | Oct 6, 2021 | Decision MakingNavigate | —Unverified | 0 |
| Network Offloading Policies for Cloud Robotics: a Learning-based Approach | Feb 15, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Neural Bootstrapping Attention for Neural Processes | Sep 29, 2021 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Neural Column Generation for Capacitated Vehicle Routing | Nov 24, 2021 | Decision MakingImitation Learning | —Unverified | 0 |
| Neural Heterogeneous Scheduler | Jun 9, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Neuro-symbolic Meta Reinforcement Learning for Trading | Jan 15, 2023 | Decision MakingMeta Reinforcement Learning | —Unverified | 0 |
| Neuro-Symbolic World Models for Adapting to Open World Novelty | Jan 16, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees | Aug 23, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Non-Deterministic Policies in Markovian Decision Processes | Jan 16, 2014 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Non-maximizing policies that fulfill multi-criterion aspirations in expectation | Aug 8, 2024 | Sequential Decision Making | —Unverified | 0 |
| Non-Stationary Bandits with Habituation and Recovery Dynamics | Jul 26, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards | Jul 18, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Not all users are the same: Providing personalized explanations for sequential decision making problems | Jun 23, 2021 | AllClustering | —Unverified | 0 |
| Novel Approaches to Accelerating the Convergence Rate of Markov Decision Process for Search Result Diversification | Feb 23, 2018 | Decision MakingInformation Retrieval | —Unverified | 0 |
| NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty | Mar 23, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models | Oct 22, 2023 | Decision MakingIn-Context Learning | —Unverified | 0 |
| Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | Mar 25, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets | Mar 26, 2025 | Representation LearningSequential Decision Making | —Unverified | 0 |
| Offline Hierarchical Reinforcement Learning via Inverse Optimization | Oct 10, 2024 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | Mar 19, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | Feb 8, 2024 | Decision MakingPhysiological Computing | —Unverified | 0 |
| Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding | Apr 1, 2025 | Decision MakingOff-policy evaluation | —Unverified | 0 |
| OMGPT: A Sequence Modeling Framework for Data-driven Operational Decision Making | May 19, 2025 | Decision MakingManagement | —Unverified | 0 |
| On adaptivity and minimax optimality of two-sided nearest neighbors | Nov 20, 2024 | Decision MakingMatrix Completion | —Unverified | 0 |
| On Bellman's Optimality Principle for zs-POSGs | Jun 29, 2020 | Decision MakingHeuristic Search | —Unverified | 0 |
| On Blame Attribution for Accountable Multi-Agent Sequential Decision Making | Jul 26, 2021 | Decision MakingFairness | —Unverified | 0 |
| On Computation and Generalization of Generative Adversarial Imitation Learning | Jan 9, 2020 | Decision MakingImitation Learning | —Unverified | 0 |
| On Efficiency in Hierarchical Reinforcement Learning | Dec 1, 2020 | Computational EfficiencyDecision Making | —Unverified | 0 |
| On Efficient Online Imitation Learning via Classification | Sep 26, 2022 | ClassificationDecision Making | —Unverified | 0 |
| One-shot learning and behavioral eligibility traces in sequential decision making | Nov 12, 2019 | Decision MakingLearning Theory | —Unverified | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 17, 2018 | Atari GamesDecision Making | —Unverified | 0 |
| Online Batch Decision-Making with High-Dimensional Covariates | Feb 21, 2020 | Decision MakingMarketing | —Unverified | 0 |
| Online Clustering of Dueling Bandits | Feb 4, 2025 | ClusteringDecision Making | —Unverified | 0 |
| Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games | Sep 10, 2018 | counterfactualDecision Making | —Unverified | 0 |
| Online Convex Optimization with Continuous Switching Constraint | Mar 21, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |