| Action Set Based Policy Optimization for Safe Power Grid Management | Jun 29, 2021 | Decision Making, Management | —Unverified | 0 |
| Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Oct 26, 2024 | Active Learning, Blocking | —Unverified | 0 |
| Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps | Aug 6, 2024 | Bayesian Optimization, Meta-Learning | —Unverified | 0 |
| An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making | Nov 12, 2023 | Combinatorial Optimization, Decision Making | —Unverified | 0 |
| Boltzmann Exploration Done Right | May 29, 2017 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | May 16, 2025 | Equation Discovery, reinforcement-learning | —Unverified | 0 |
| Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks | Apr 3, 2024 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Computing Preimages of Deep Neural Networks with Applications to Safety | Jan 1, 2021 | Collision Avoidance, Decision Making | —Unverified | 0 |
| Constrained Online Decision-Making: A Unified Framework | May 11, 2025 | Active Learning, counterfactual | —Unverified | 0 |
| Contextual Experience Replay for Self-Improvement of Language Agents | Jun 7, 2025 | Decision Making, Large Language Model | —Unverified | 0 |
| Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Mar 3, 2025 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 |
| Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits | Feb 12, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| The f-Divergence Reinforcement Learning Framework | Sep 24, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Jun 1, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL | Jun 6, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | Sep 25, 2019 | Decision Making, Knowledge Distillation | —Unverified | 0 |
| Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs | Jul 6, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | Sep 5, 2019 | Decision Making, Recommendation Systems | —Unverified | 0 |
| A Contextual Bandit Approach for Stream-Based Active Learning | Jan 24, 2017 | Active Learning, Decision Making | —Unverified | 0 |
| Code Models are Zero-shot Precondition Reasoners | Nov 16, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | Feb 7, 2021 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Bayesian learning of the optimal action-value function in a Markov decision process | May 3, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Bayesian Inverse Transition Learning for Offline Settings | Aug 9, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | Mar 1, 2024 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | Nov 12, 2024 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Bayesian optimization explains human active search | Dec 1, 2013 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Be Considerate: Objectives, Side Effects, and Deciding How to Act | Jun 4, 2021 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| An Anytime Algorithm for Task and Motion MDPs | Feb 16, 2018 | Decision Making, Motion Planning | —Unverified | 0 |
| Adaptive Robust Online Portfolio Selection | Jun 2, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | Nov 2, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | Feb 22, 2024 | Autonomous Racing, Decision Making | —Unverified | 0 |
| Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning | Jun 4, 2022 | Decision Making, Model-based Reinforcement Learning | —Unverified | 0 |
| Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio | Apr 24, 2019 | Decision Making, feature selection | —Unverified | 0 |
| Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | Jan 6, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Oct 4, 2023 | Decision Making, Policy Gradient Methods | —Unverified | 0 |
| Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | Oct 15, 2024 | Deep Reinforcement Learning, Scheduling | —Unverified | 0 |
| Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments | Sep 29, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 |
| An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Aug 20, 2024 | Combinatorial Optimization, Decision Making | —Unverified | 0 |
| BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Jul 7, 2023 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Boosting Reinforcement Learning and Planning with Demonstrations: A Survey | Mar 23, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning | Sep 7, 2023 | Brain Computer Interface, Decision Making | —Unverified | 0 |
| An Introduction to Quantum Reinforcement Learning (QRL) | Sep 9, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning | Jul 26, 2022 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Federated Learning with Uncertainty via Distilled Predictive Distributions | Jun 15, 2022 | Active Learning, Decision Making | —Unverified | 0 |
| An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services | Mar 28, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language | Jul 31, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Bayesian Exploration Networks | Aug 24, 2023 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Bayesian decision-making under misspecified priors with applications to meta-learning | Jul 3, 2021 | Decision Making, Meta-Learning | —Unverified | 0 |