| Soft Q-Learning with Mutual-Information Regularization | May 1, 2019 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Solving Robust Markov Decision Processes: Generic, Reliable, Efficient | Dec 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version) | Nov 29, 2015 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces | Aug 25, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| SS-MAIL: Self-Supervised Multi-Agent Imitation Learning | Oct 18, 2021 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds | May 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Stagewise Safe Bayesian Optimization with Gaussian Processes | Jun 20, 2018 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding | Sep 21, 2023 | Decision MakingSelf-Learning | —Unverified | 0 | 0 |
| State of the Art of User Simulation approaches for conversational information retrieval | Jan 10, 2022 | Decision MakingInformation Retrieval | —Unverified | 0 | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Exploring Offline Policy Evaluation for the Continuous-Armed Bandit Problem | Aug 21, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft | Dec 1, 2024 | Decision MakingMinecraft | —Unverified | 0 | 0 |
| Stochastic Contextual Bandits with Known Reward Functions | Apr 30, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning | Jan 19, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Stochastic Planning and Lifted Inference | Jan 4, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Strategising template-guided needle placement for MR-targeted prostate biopsy | Jul 21, 2022 | AnatomyDecision Making | —Unverified | 0 | 0 |
| Streaming Adaptive Submodular Maximization | Aug 17, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Structure-Adaptive Sequential Testing for Online False Discovery Rate Control | Feb 28, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Structure and Reduction of MCTS for Explainable-AI | Aug 10, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Structure Learning in Human Sequential Decision-Making | Dec 1, 2008 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Subgoal-Based Explanations for Unreliable Intelligent Decision Support Systems | Jan 11, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Subgoal Discovery Using a Free Energy Paradigm and State Aggregations | Dec 21, 2024 | Reinforcement Learning (RL)Sequential Decision Making | —Unverified | 0 | 0 |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Feb 26, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Supervised Fine-Tuning as Inverse Reinforcement Learning | Mar 18, 2024 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Survey on Fair Reinforcement Learning: Theory and Practice | May 20, 2022 | ArticlesDecision Making | —Unverified | 0 | 0 |
| Swarm Behavior Cloning | Dec 10, 2024 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Symbolic Dynamic Programming for Continuous State and Observation POMDPs | Dec 1, 2012 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning | Apr 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Tableaux for Policy Synthesis for MDPs with PCTL* Constraints | Jun 30, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| TALES: Text Adventure Learning Environment Suite | Apr 19, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| TDM: Trustworthy Decision-Making via Interpretability Enhancement | Aug 13, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Teacher-student curriculum learning for reinforcement learning | Oct 31, 2022 | Board GamesDecision Making | —Unverified | 0 | 0 |
| Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum | Dec 3, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning | May 21, 2021 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| Temporal Elections: Welfare, Strategyproofness, and Proportionality | Aug 24, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning | Dec 6, 2021 | Causal DiscoveryDecision Making | —Unverified | 0 | 0 |
| Testing Optimality of Sequential Decision-Making | Jan 4, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches | Jul 12, 2020 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| rfPG: Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs | May 14, 2025 | Decision Making Under UncertaintySequential Decision Making | —Unverified | 0 | 0 |
| TGRL: An Algorithm for Teacher Guided Reinforcement Learning | Jul 6, 2023 | counterfactualDecision Making | —Unverified | 0 | 0 |
| The Bayesian Linear Information Filtering Problem | May 30, 2016 | ArticlesDecision Making | —Unverified | 0 | 0 |
| The Choice Function Framework for Online Policy Improvement | Oct 1, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning | Feb 21, 2025 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| The Extended UCB Policies for Frequentist Multi-armed Bandit Problems | Dec 8, 2011 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| The Knowledge Gradient with Logistic Belief Models for Binary Classification | Oct 8, 2015 | Binary ClassificationClassification | —Unverified | 0 | 0 |
| The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors | Jan 26, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| The price of unfairness in linear bandits with biased feedback | Mar 18, 2022 | AttributeDecision Making | —Unverified | 0 | 0 |
| The Theory is Predictive, but is it Complete? An Application to Human Perception of Randomness | Jun 21, 2017 | BIG-bench Machine LearningDecision Making | —Unverified | 0 | 0 |
| The Value of Information When Deciding What to Learn | Oct 26, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |