| Vid2World: Crafting Video Diffusion Models to Interactive World Models | May 20, 2025 | Robot ManipulationSequential Decision Making | —Unverified | 0 |
| OMGPT: A Sequence Modeling Framework for Data-driven Operational Decision Making | May 19, 2025 | Decision MakingManagement | —Unverified | 0 |
| Generalization Guarantees for Learning Branch-and-Cut Policies in Integer Programming | May 16, 2025 | Sequential Decision MakingVariable Selection | —Unverified | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | May 16, 2025 | Equation Discoveryreinforcement-learning | —Unverified | 0 |
| Batched Nonparametric Bandits via k-Nearest Neighbor UCB | May 15, 2025 | Decision MakingMarketing | —Unverified | 0 |
| Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks | May 15, 2025 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Counterfactual Strategies for Markov Decision Processes | May 14, 2025 | counterfactualDecision Making | —Unverified | 0 |
| Sequential Treatment Effect Estimation with Unmeasured Confounders | May 14, 2025 | counterfactualSequential Decision Making | —Unverified | 0 |
| rfPG: Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs | May 14, 2025 | Decision Making Under UncertaintySequential Decision Making | —Unverified | 0 |
| A Practical Introduction to Deep Reinforcement Learning | May 13, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |