| Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Oct 26, 2024 | Active LearningBlocking | —Unverified | 0 | 0 |
| Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches | Jun 26, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Entropy Regularization for Population Estimation | Aug 24, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning | Sep 7, 2023 | Brain Computer InterfaceDecision Making | —Unverified | 0 | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 | 0 |
| Boosting Reinforcement Learning and Planning with Demonstrations: A Survey | Mar 23, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Boltzmann Exploration Done Right | May 29, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| An Introduction to Quantum Reinforcement Learning (QRL) | Sep 9, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making | Nov 12, 2023 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| Action Set Based Policy Optimization for Safe Power Grid Management | Jun 29, 2021 | Decision MakingManagement | —Unverified | 0 | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | May 16, 2025 | Equation Discoveryreinforcement-learning | —Unverified | 0 | 0 |
| Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps | Aug 6, 2024 | Bayesian OptimizationMeta-Learning | —Unverified | 0 | 0 |
| Emergent Risk Awareness in Rational Agents under Resource Constraints | May 29, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | Jan 15, 2025 | Decision MakingQuestion Answering | —Unverified | 0 | 0 |
| BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Jul 7, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Efficient Strategy Synthesis for MDPs via Hierarchical Block Decomposition | Jun 21, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments | Sep 29, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Efficient Sequential Decision Making with Large Language Models | Jun 17, 2024 | Decision MakingModel Selection | —Unverified | 0 | 0 |
| Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits | Feb 12, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Aug 20, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| Efficient Reinforcement Learning with Large Language Model Priors | Oct 10, 2024 | Bayesian InferenceDecision Making | —Unverified | 0 | 0 |
| Efficient quantum recurrent reinforcement learning via quantum reservoir computing | Sep 13, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Oct 4, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 | 0 |
| Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | Jan 6, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability | Nov 24, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |