| Explainable Reinforcement Learning Agents Using World Models | May 12, 2025 | counterfactual, reinforcement-learning | Unverified | 0 |
| A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue | May 11, 2025 | Decision Making Under Uncertainty, Multi-agent Reinforcement Learning | Unverified | 0 |
| Constrained Online Decision-Making: A Unified Framework | May 11, 2025 | Active Learning, counterfactual | Unverified | 0 |
| RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles | May 8, 2025 | Computational Efficiency, Reinforcement Learning (RL) | Unverified | 0 |
| Active Sampling for MRI-based Sequential Decision Making | May 7, 2025 | Decision Making, Diagnostic | Code Available | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-control, Continuous Control | Unverified | 0 |
| MDPs with a State Sensing Cost | May 6, 2025 | Sequential Decision Making | Unverified | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal Discovery, Decision Making | Unverified | 0 |
| Bayesian learning of the optimal action-value function in a Markov decision process | May 3, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision Making, Prediction Intervals | Unverified | 0 |