| Human AI interaction loop training: New approach for interactive reinforcement learning | Mar 9, 2020 | Decision Making, Imitation Learning | —Unverified | 0 |
| Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning | Oct 9, 2021 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Human collective intelligence as distributed Bayesian inference | Aug 5, 2016 | Bayesian Inference, Decision Making | —Unverified | 0 |
| Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets | Feb 26, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI | May 13, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances | Dec 9, 2019 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Hyperbolic Deep Reinforcement Learning | Oct 4, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Hyperparameter Transfer Learning with Adaptive Complexity | Feb 25, 2021 | Bayesian Optimization, Decision Making | —Unverified | 0 |
| Hyper-parameter Tuning under a Budget Constraint | Feb 1, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | Dec 23, 2024 | Bayesian Optimization, Hyperparameter Optimization | —Unverified | 0 |
| A Theoretical Connection Between Statistical Physics and Reinforcement Learning | Jun 24, 2019 | Decision Making, reinforcement-learning | —Unverified | 0 |
| AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning | Jul 13, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Large Sequence Models for Sequential Decision-Making: A Survey | Jun 24, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Decentralized Cross-Entropy Method for Model-Based Reinforcement Learning | Sep 29, 2021 | continuous-control, Continuous Control | —Unverified | 0 |
| Asynchronous training of quantum reinforcement learning | Jan 12, 2023 | Decision Making, Quantum Machine Learning | —Unverified | 0 |
| Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems | Apr 14, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation | May 24, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees | Oct 19, 2020 | Attribute, Decision Making | —Unverified | 0 |
| Asymmetric Actor Critic for Image-Based Robot Learning | Oct 18, 2017 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities | Nov 30, 2023 | Decision Making, Drug Discovery | —Unverified | 0 |
| A Survey on Reinforcement Learning in Aviation Applications | Nov 3, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | Feb 7, 2020 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Invariant Lipschitz Bandits: A Side Observation Approach | Dec 14, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent | Jul 16, 2024 | Decision Making, Minecraft | —Unverified | 0 |
| Data-efficient visuomotor policy training using reinforcement learning and generative models | Jul 26, 2020 | Decision Making, Disentanglement | —Unverified | 0 |
| A Survey on Model-based Reinforcement Learning | Jun 19, 2022 | Decision Making, model | —Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision Making, Language Modeling | —Unverified | 0 |
| Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Jul 15, 2024 | continuous-control, Continuous Control | —Unverified | 0 |
| Joint AP Probing and Scheduling: A Contextual Bandit Approach | Aug 6, 2021 | Decision Making, Scheduling | —Unverified | 0 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | Sep 17, 2021 | Decision Making, Offline RL | —Unverified | 0 |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | continuous-control, Continuous Control | —Unverified | 0 |
| Interactions between dynamic team composition and coordination: An agent-based modeling approach | Jan 11, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection | Apr 25, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Making, model | —Unverified | 0 |
| Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | Oct 21, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Dec 5, 2024 | Collision Avoidance, Deep Reinforcement Learning | —Unverified | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal Discovery, Decision Making | —Unverified | 0 |
| A Survey on Interpretable Reinforcement Learning | Dec 24, 2021 | Autonomous Driving, Decision Making | —Unverified | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | Jun 9, 2024 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning | Sep 8, 2022 | Decision Making, Epidemiology | —Unverified | 0 |
| Data-Efficient Reinforcement Learning for Malaria Control | May 4, 2021 | Decision Making, Model-based Reinforcement Learning | —Unverified | 0 |
| Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration | Jun 4, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Knowledge-Based Sequential Decision-Making Under Uncertainty | May 16, 2019 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes | Aug 18, 2023 | Decision Making, Multi-Objective Reinforcement Learning | —Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial Robustness, Decision Making | —Unverified | 0 |
| Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Mar 8, 2024 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | —Unverified | 0 |
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | May 24, 2024 | Decision Making, Language Modeling | —Unverified | 0 |
| Investigating Order Effects in Multidimensional Relevance Judgment using Query Logs | Jul 14, 2018 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | Sep 5, 2024 | Benchmarking, Computational Efficiency | —Unverified | 0 |