| How to Provably Improve Return Conditioned Supervised Learning? | Jun 10, 2025 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection | Mar 14, 2018 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| Accelerating exploration and representation learning with offline pre-training | Mar 31, 2023 | Decision MakingNetHack | —Unverified | 0 | 0 |
| Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | Jun 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | Sep 17, 2021 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | Feb 7, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| A Classification View on Meta Learning Bandits | Apr 6, 2025 | ClassificationMeta-Learning | —Unverified | 0 | 0 |
| A Computational Framework for Motor Skill Acquisition | Jan 3, 2019 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| A Contextual Bandit Approach for Stream-Based Active Learning | Jan 24, 2017 | Active LearningDecision Making | —Unverified | 0 | 0 |
| Action Set Based Policy Optimization for Safe Power Grid Management | Jun 29, 2021 | Decision MakingManagement | —Unverified | 0 | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | May 29, 2025 | Decision MakingHallucination | —Unverified | 0 | 0 |
| Active Learning-Based Multistage Sequential Decision-Making Model with Application on Common Bile Duct Stone Evaluation | Jan 13, 2022 | Active LearningDecision Making | —Unverified | 0 | 0 |
| Active Learning for Accurate Estimation of Linear Models | Mar 2, 2017 | Active LearningDecision Making | —Unverified | 0 | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learningcontinuous-control | —Unverified | 0 | 0 |
| Active Sensing as Bayes-Optimal Sequential Decision Making | Aug 9, 2014 | Decision MakingSensitivity | —Unverified | 0 | 0 |
| Active Sensing as Bayes-Optimal Sequential Decision Making | May 28, 2013 | Decision MakingSensitivity | —Unverified | 0 | 0 |
| Actor-Critic Algorithms for Risk-Sensitive MDPs | Dec 1, 2013 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems | Apr 14, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes | Apr 1, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| A Customizable Generator for Comic-Style Visual Narrative | Dec 14, 2023 | ARCDecision Making | —Unverified | 0 | 0 |
| adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | Mar 7, 2023 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Adaptive Exploration in Linear Contextual Bandit | Oct 15, 2019 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | May 27, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |