| Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints | Nov 2, 2019 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| Thompson sampling for improved exploration in GFlowNets | Jun 30, 2023 | Active LearningDecision Making | —Unverified | 0 | 0 |
| Thompson Sampling on Symmetric α-Stable Bandits | Jul 8, 2019 | Bayesian InferenceDecision Making | —Unverified | 0 | 0 |
| Thompson Sampling with Virtual Helping Agents | Sep 16, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves | Dec 18, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Tight Bayesian Ambiguity Sets for Robust MDPs | Nov 15, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| Tight Bounds for Bandit Combinatorial Optimization | Feb 24, 2017 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| Tight Lower Bounds for Combinatorial Multi-Armed Bandits | Feb 13, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Tight Regret Bounds for Infinite-armed Linear Contextual Bandits | May 4, 2019 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control | Dec 3, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| TopRank: A practical algorithm for online stochastic ranking | Jun 6, 2018 | Decision MakingLearning-To-Rank | —Unverified | 0 | 0 |
| Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making | Jan 5, 2017 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 | 0 |
| Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach | Jan 4, 2024 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Towards a Unified Framework for Sequential Decision Making | Oct 3, 2023 | Bayesian InferenceDecision Making | —Unverified | 0 | 0 |
| Towards Enabling Learning for Time-Varying finite horizon Sequential Decision-Making Problems* | Apr 2, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | Apr 28, 2022 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback | Jan 17, 2024 | Decision MakingLearning-To-Rank | —Unverified | 0 | 0 |
| Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems | Jun 11, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 | 0 |
| Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Nov 15, 2024 | Reinforcement Learning (RL)Sequential Decision Making | —Unverified | 0 | 0 |
| Toward the Fundamental Limits of Imitation Learning | Sep 13, 2020 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers | Feb 20, 2024 | Decision MakingDecoder | —Unverified | 0 | 0 |
| Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version | Aug 3, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Tradeoffs When Considering Deep Reinforcement Learning for Contingency Management in Advanced Air Mobility | Jun 28, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models | Apr 18, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Trajectory VAE for multi-modal imitation | May 1, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Transfer Learning in Deep Reinforcement Learning: A Survey | Sep 16, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Truncated Matrix Completion - An Empirical Study | Apr 14, 2025 | Decision MakingLow-Rank Matrix Completion | —Unverified | 0 | 0 |
| Two Phase Q-learning for Bidding-based Vehicle Sharing | Sep 29, 2015 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Efficiently Training Deep-Learning Parametric Policies using Lagrangian Duality | May 23, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| UAV-assisted Online Machine Learning over Multi-Tiered Networks: A Hierarchical Nested Personalized Federated Learning Approach | Jun 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits | Apr 16, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Understanding and Leveraging Causal Relations in Deep Reinforcement Learning | Jan 1, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Understanding & Generalizing AlphaGo Zero | May 1, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Understanding Human Behaviors in Crowds by Imitating the Decision-Making Process | Jan 25, 2018 | Collision AvoidanceDecision Making | —Unverified | 0 | 0 |
| Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits | Aug 11, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Understanding the Training and Generalization of Pretrained Transformer for Sequential Decision Making | May 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | Feb 6, 2025 | Data ValuationDecision Making | —Unverified | 0 | 0 |
| Unlocking the Potential of Simulators: Design with RL in Mind | Jun 8, 2017 | Decision MakingFriction | —Unverified | 0 | 0 |
| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Jun 24, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL | Jul 24, 2019 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Using General Value Functions to Learn Domain-Backed Inventory Management Policies | Nov 3, 2023 | Decision MakingManagement | —Unverified | 0 | 0 |
| Using Reinforcement Learning for Demand Response of Domestic Hot Water Buffers: a Real-Life Demonstration | Mar 16, 2017 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Utility-based Dueling Bandits as a Partial Monitoring Game | Jul 10, 2015 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Utility Theory for Sequential Decision Making | Jun 27, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization | Jan 5, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards | Mar 9, 2023 | Decision Makingregression | —Unverified | 0 | 0 |
| Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs | Mar 25, 2014 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Variational Deep Learning via Implicit Regularization | May 26, 2025 | Deep LearningInductive Bias | —Unverified | 0 | 0 |
| Variational Offline Multi-agent Skill Discovery | May 26, 2024 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |