| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| MDPs with a State Sensing Cost | May 6, 2025 | Sequential Decision Making | —Unverified | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal DiscoveryDecision Making | —Unverified | 0 |
| Bayesian learning of the optimal action-value function in a Markov decision process | May 3, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision MakingPrediction Intervals | —Unverified | 0 |
| Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks | May 1, 2025 | Decision MakingLarge Language Model | —Unverified | 0 |
| Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments | Apr 27, 2025 | Decision MakingDiversity | —Unverified | 0 |
| SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning | Apr 24, 2025 | Decision MakingQ-Learning | —Unverified | 0 |
| Hierarchical Attention Fusion of Visual and Textual Representations for Cross-Domain Sequential Recommendation | Apr 21, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Consensus in Motion: A Case of Dynamic Rationality of Sequential Learning in Probability Aggregation | Apr 20, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |