| A Survey on Model-based Reinforcement Learning | Jun 19, 2022 | Decision Making, model | —Unverified | 0 |
| Language Guided Exploration for RL Agents in Text Environments | Mar 5, 2024 | Decision Making, Language Modeling | —Unverified | 0 |
| Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Jul 15, 2024 | continuous-control, Continuous Control | —Unverified | 0 |
| Joint AP Probing and Scheduling: A Contextual Bandit Approach | Aug 6, 2021 | Decision Making, Scheduling | —Unverified | 0 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | Sep 17, 2021 | Decision Making, Offline RL | —Unverified | 0 |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | continuous-control, Continuous Control | —Unverified | 0 |
| Interactions between dynamic team composition and coordination: An agent-based modeling approach | Jan 11, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection | Apr 25, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Making, model | —Unverified | 0 |
| Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | Oct 21, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Dec 5, 2024 | Collision Avoidance, Deep Reinforcement Learning | —Unverified | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal Discovery, Decision Making | —Unverified | 0 |
| A Survey on Interpretable Reinforcement Learning | Dec 24, 2021 | Autonomous Driving, Decision Making | —Unverified | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | Jun 9, 2024 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning | Sep 8, 2022 | Decision Making, Epidemiology | —Unverified | 0 |
| Data-Efficient Reinforcement Learning for Malaria Control | May 4, 2021 | Decision Making, Model-based Reinforcement Learning | —Unverified | 0 |
| Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration | Jun 4, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Knowledge-Based Sequential Decision-Making Under Uncertainty | May 16, 2019 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes | Aug 18, 2023 | Decision Making, Multi-Objective Reinforcement Learning | —Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial Robustness, Decision Making | —Unverified | 0 |
| Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Mar 8, 2024 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | —Unverified | 0 |
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | May 24, 2024 | Decision Making, Language Modeling | —Unverified | 0 |
| Investigating Order Effects in Multidimensional Relevance Judgment using Query Logs | Jul 14, 2018 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | Sep 5, 2024 | Benchmarking, Computational Efficiency | —Unverified | 0 |