| Vid2World: Crafting Video Diffusion Models to Interactive World Models | May 20, 2025 | Robot ManipulationSequential Decision Making | —Unverified | 0 |
| Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation | May 20, 2025 | Computational Efficiencycontinuous-control | CodeCode Available | 0 |
| OMGPT: A Sequence Modeling Framework for Data-driven Operational Decision Making | May 19, 2025 | Decision MakingManagement | —Unverified | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | May 16, 2025 | Equation Discoveryreinforcement-learning | —Unverified | 0 |
| Generalization Guarantees for Learning Branch-and-Cut Policies in Integer Programming | May 16, 2025 | Sequential Decision MakingVariable Selection | —Unverified | 0 |
| Batched Nonparametric Bandits via k-Nearest Neighbor UCB | May 15, 2025 | Decision MakingMarketing | —Unverified | 0 |
| Sequential Treatment Effect Estimation with Unmeasured Confounders | May 14, 2025 | counterfactualSequential Decision Making | —Unverified | 0 |
| Counterfactual Strategies for Markov Decision Processes | May 14, 2025 | counterfactualDecision Making | —Unverified | 0 |
| rfPG: Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs | May 14, 2025 | Decision Making Under UncertaintySequential Decision Making | —Unverified | 0 |
| A Practical Introduction to Deep Reinforcement Learning | May 13, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Explainable Reinforcement Learning Agents Using World Models | May 12, 2025 | counterfactualreinforcement-learning | —Unverified | 0 |
| Constrained Online Decision-Making: A Unified Framework | May 11, 2025 | Active Learningcounterfactual | —Unverified | 0 |
| A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue | May 11, 2025 | Decision Making Under UncertaintyMulti-agent Reinforcement Learning | —Unverified | 0 |
| RL-DAUNCE: Reinforcement Learning-Driven Data Assimilation with Uncertainty-Aware Constrained Ensembles | May 8, 2025 | Computational EfficiencyReinforcement Learning (RL) | —Unverified | 0 |
| Active Sampling for MRI-based Sequential Decision Making | May 7, 2025 | Decision MakingDiagnostic | CodeCode Available | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| MDPs with a State Sensing Cost | May 6, 2025 | Sequential Decision Making | —Unverified | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal DiscoveryDecision Making | —Unverified | 0 |
| Bayesian learning of the optimal action-value function in a Markov decision process | May 3, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision MakingPrediction Intervals | —Unverified | 0 |
| Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks | May 1, 2025 | Decision MakingLarge Language Model | —Unverified | 0 |
| Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments | Apr 27, 2025 | Decision MakingDiversity | —Unverified | 0 |
| SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning | Apr 24, 2025 | Decision MakingQ-Learning | —Unverified | 0 |
| Hierarchical Attention Fusion of Visual and Textual Representations for Cross-Domain Sequential Recommendation | Apr 21, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Consensus in Motion: A Case of Dynamic Rationality of Sequential Learning in Probability Aggregation | Apr 20, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |