| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision Making, Prediction Intervals | —Unverified | 0 | 0 |
| A Mini Review on the utilization of Reinforcement Learning with OPC UA | May 24, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| A modular framework for object-based saccadic decisions in dynamic scenes | Jun 10, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision Making, Management | —Unverified | 0 | 0 |
| A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue | May 11, 2025 | Decision Making Under Uncertainty, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| A naive aggregation algorithm for improving generalization in a class of learning problems | Sep 6, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | Feb 7, 2021 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| An Anytime Algorithm for Task and Motion MDPs | Feb 16, 2018 | Decision Making, Motion Planning | —Unverified | 0 | 0 |
| An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | Sep 5, 2019 | Decision Making, Recommendation Systems | —Unverified | 0 | 0 |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Jun 1, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Aug 20, 2024 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |
| An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making | Nov 12, 2023 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |
| An Introduction to Quantum Reinforcement Learning (QRL) | Sep 9, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Oct 26, 2024 | Active Learning, Blocking | —Unverified | 0 | 0 |
| An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services | Mar 28, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | Nov 26, 2019 | Decision Making, GPU | —Unverified | 0 | 0 |
| A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Dec 9, 2024 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Answer Set Programming for Non-Stationary Markov Decision Processes | May 3, 2017 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Anti-Concentrated Confidence Bonuses for Scalable Exploration | Oct 21, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| AoI-Delay Tradeoff in Mobile Edge Caching: A Mixed-Order Drift-Plus-Penalty Algorithm | Apr 18, 2023 | Decision Making, Scheduling | —Unverified | 0 | 0 |
| A POMDP Extension with Belief-dependent Rewards | Dec 1, 2010 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| A Practical Introduction to Deep Reinforcement Learning | May 13, 2025 | Autonomous Driving, Decision Making | —Unverified | 0 | 0 |