| Human AI interaction loop training: New approach for interactive reinforcement learning | Mar 9, 2020 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning | Oct 9, 2021 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Dec 9, 2024 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets | Feb 26, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI | May 13, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach | Nov 2, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Hyperbolic Deep Reinforcement Learning | Oct 4, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Hyperparameter Transfer Learning with Adaptive Complexity | Feb 25, 2021 | Bayesian Optimization, Decision Making | —Unverified | 0 | 0 |
| Hyper-parameter Tuning under a Budget Constraint | Feb 1, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | Dec 23, 2024 | Bayesian Optimization, Hyperparameter Optimization | —Unverified | 0 | 0 |
| Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams | Jan 18, 2014 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting | Sep 9, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Explainable Reinforcement Learning via Temporal Policy Decomposition | Jan 7, 2025 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey | Aug 20, 2021 | Decision Making, Explainable artificial intelligence | —Unverified | 0 | 0 |
| An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | Nov 26, 2019 | Decision Making, GPU | —Unverified | 0 | 0 |
| Adaptivity in Adaptive Submodularity | Nov 9, 2019 | Active Learning, Decision Making | —Unverified | 0 | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | May 29, 2025 | Decision Making, Hallucination | —Unverified | 0 | 0 |
| Explainable Reinforcement Learning Agents Using World Models | May 12, 2025 | counterfactual, reinforcement-learning | —Unverified | 0 | 0 |
| Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language | Jul 31, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring | Apr 2, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Experimental analysis of data-driven control for a building heating system | Jul 13, 2015 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services | Mar 28, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Evaluating Explanation Methods for Vision-and-Language Navigation | Oct 10, 2023 | Decision Making, Navigate | —Unverified | 0 | 0 |
| Evaluating Dynamic Conditional Quantile Treatment Effects with Applications in Ridesharing | May 17, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning | Jul 26, 2022 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits | Oct 26, 2024 | Active Learning, Blocking | —Unverified | 0 | 0 |
| Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches | Jun 26, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Entropy Regularization for Population Estimation | Aug 24, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning | Sep 7, 2023 | Brain Computer Interface, Decision Making | —Unverified | 0 | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision Making, Language Modeling | —Unverified | 0 | 0 |
| Boosting Reinforcement Learning and Planning with Demonstrations: A Survey | Mar 23, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Boltzmann Exploration Done Right | May 29, 2017 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| An Introduction to Quantum Reinforcement Learning (QRL) | Sep 9, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making | Nov 12, 2023 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |
| Action Set Based Policy Optimization for Safe Power Grid Management | Jun 29, 2021 | Decision Making, Management | —Unverified | 0 | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | May 16, 2025 | Equation Discovery, reinforcement-learning | —Unverified | 0 | 0 |
| Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps | Aug 6, 2024 | Bayesian Optimization, Meta-Learning | —Unverified | 0 | 0 |
| Emergent Risk Awareness in Rational Agents under Resource Constraints | May 29, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | Jan 15, 2025 | Decision Making, Question Answering | —Unverified | 0 | 0 |
| BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Jul 7, 2023 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| Efficient Strategy Synthesis for MDPs via Hierarchical Block Decomposition | Jun 21, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments | Sep 29, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Efficient Sequential Decision Making with Large Language Models | Jun 17, 2024 | Decision Making, Model Selection | —Unverified | 0 | 0 |
| Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits | Feb 12, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing | Aug 20, 2024 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |
| Efficient Reinforcement Learning with Large Language Model Priors | Oct 10, 2024 | Bayesian Inference, Decision Making | —Unverified | 0 | 0 |
| Efficient quantum recurrent reinforcement learning via quantum reservoir computing | Sep 13, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Oct 4, 2023 | Decision Making, Policy Gradient Methods | —Unverified | 0 | 0 |
| Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | Jan 6, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability | Nov 24, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |