| Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards | Jun 20, 2025 | Decision Making Under Uncertainty, Multi-Armed Bandits | —Unverified | 0 |
| Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning | Jan 12, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Multi-Environment Pretraining Enables Transfer to Action Limited Datasets | Nov 23, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Multi-granular Adversarial Attacks against Black-box Neural Ranking Models | Apr 2, 2024 | Adversarial Attack, Decision Making | —Unverified | 0 |
| Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception | Aug 10, 2023 | Decision Making, Robot Manipulation | —Unverified | 0 |
| Multi-Player Zero-Sum Markov Games with Networked Separable Interactions | Jul 13, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Multi-shot Pedestrian Re-identification via Sequential Decision Making | Dec 19, 2017 | Decision Making, Reinforcement Learning | —Unverified | 0 |
| Multi-Task Generative Adversarial Nets with Shared Memory for Cross-Domain Coordination Control | Jul 1, 2018 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Multi-task Representation Learning for Pure Exploration in Linear Bandits | Feb 9, 2023 | Decision Making, Representation Learning | —Unverified | 0 |
| MuZero with Self-competition for Rate Control in VP9 Video Compression | Feb 14, 2022 | Decision Making, Quantization | —Unverified | 0 |
| Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes | Dec 1, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | Mar 11, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making | Dec 1, 2018 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations | Oct 6, 2021 | Decision Making, Navigate | —Unverified | 0 |
| Network Offloading Policies for Cloud Robotics: a Learning-based Approach | Feb 15, 2019 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Neural Bootstrapping Attention for Neural Processes | Sep 29, 2021 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Neural Column Generation for Capacitated Vehicle Routing | Nov 24, 2021 | Decision Making, Imitation Learning | —Unverified | 0 |
| Neural Heterogeneous Scheduler | Jun 9, 2019 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Neuro-symbolic Meta Reinforcement Learning for Trading | Jan 15, 2023 | Decision Making, Meta Reinforcement Learning | —Unverified | 0 |
| Neuro-Symbolic World Models for Adapting to Open World Novelty | Jan 16, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 |
| No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees | Aug 23, 2021 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Non-Deterministic Policies in Markovian Decision Processes | Jan 16, 2014 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Non-maximizing policies that fulfill multi-criterion aspirations in expectation | Aug 8, 2024 | Sequential Decision Making | —Unverified | 0 |
| Non-Stationary Bandits with Habituation and Recovery Dynamics | Jul 26, 2017 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards | Jul 18, 2023 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |