| Shaping Laser Pulses with Reinforcement Learning | Mar 1, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | Mar 1, 2025 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | Continuous Control | Code Available | 0 |
| WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | Feb 26, 2025 | Decision Making, Management | Code Available | 0 |
| Training a Generally Curious Agent | Feb 24, 2025 | Decision Making, Efficient Exploration | Code Available | 1 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action Generation, Decision Making | Code Available | 0 |
| The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning | Feb 21, 2025 | Decision Making, Reinforcement Learning | Unverified | 0 |
| Reinforcement Learning for Ultrasound Image Analysis: A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Making Universal Policies Universal | Feb 20, 2025 | Imitation Learning, Sequential Decision Making | Code Available | 0 |
| AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO | Feb 20, 2025 | Autonomous Navigation, Navigate | Code Available | 2 |