| Adaptive Sampling for Discovery | May 30, 2022 | Decision MakingDrug Discovery | —Unverified | 0 | 0 |
| Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | Mar 1, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Adaptive Robust Online Portfolio Selection | Jun 2, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL | Jun 6, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Adaptivity in Adaptive Submodularity | Nov 9, 2019 | Active LearningDecision Making | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Approach to Rare Event Estimation | Nov 22, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding | Apr 13, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Adversarial Attacks on Online Learning to Rank with Click Feedback | May 26, 2023 | Decision MakingLearning-To-Rank | —Unverified | 0 | 0 |
| Adversarial Deep Learning for Online Resource Allocation | Nov 19, 2021 | Decision MakingDeep Learning | —Unverified | 0 | 0 |
| Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop | Oct 7, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option | Mar 6, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| A finite time analysis of distributed Q-learning | May 23, 2024 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| A Unifying Framework for Reinforcement Learning and Planning | Jun 26, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| A General Framework for Sequential Decision-Making under Adaptivity Constraints | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games | Oct 4, 2020 | BIG-bench Machine LearningDecision Making | —Unverified | 0 | 0 |
| AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments | May 13, 2024 | Decision MakingDiagnostic | —Unverified | 0 | 0 |
| AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities | Nov 30, 2023 | Decision MakingDrug Discovery | —Unverified | 0 | 0 |
| AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning | Jul 13, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air | Jul 15, 2025 | DenoisingSequential Decision Making | —Unverified | 0 | 0 |
| A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning | Oct 27, 2021 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Algorithms for CVaR Optimization in MDPs | Jun 12, 2014 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 | 0 |
| All AI Models are Wrong, but Some are Optimal | Jan 10, 2025 | AllDecision Making | —Unverified | 0 | 0 |
| A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning | Aug 7, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming | Jun 5, 2017 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision MakingPrediction Intervals | —Unverified | 0 | 0 |
| A Mini Review on the utilization of Reinforcement Learning with OPC UA | May 24, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| A modular framework for object-based saccadic decisions in dynamic scenes | Jun 10, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision MakingManagement | —Unverified | 0 | 0 |
| A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue | May 11, 2025 | Decision Making Under UncertaintyMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| A naive aggregation algorithm for improving generalization in a class of learning problems | Sep 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | Feb 7, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| An Anytime Algorithm for Task and Motion MDPs | Feb 16, 2018 | Decision MakingMotion Planning | —Unverified | 0 | 0 |
| An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | Sep 5, 2019 | Decision MakingRecommendation Systems | —Unverified | 0 | 0 |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Jun 1, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Aug 20, 2024 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making | Nov 12, 2023 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| An Introduction to Quantum Reinforcement Learning (QRL) | Sep 9, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Oct 26, 2024 | Active LearningBlocking | —Unverified | 0 | 0 |
| An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services | Mar 28, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | Nov 26, 2019 | Decision MakingGPU | —Unverified | 0 | 0 |
| A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Dec 9, 2024 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Answer Set Programming for Non-Stationary Markov Decision Processes | May 3, 2017 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Anti-Concentrated Confidence Bonuses for Scalable Exploration | Oct 21, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| AoI-Delay Tradeoff in Mobile Edge Caching: A Mixed-Order Drift-Plus-Penalty Algorithm | Apr 18, 2023 | Decision MakingScheduling | —Unverified | 0 | 0 |
| A POMDP Extension with Belief-dependent Rewards | Dec 1, 2010 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| A Practical Introduction to Deep Reinforcement Learning | May 13, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |