| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation LearningQuantization | CodeCode Available | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | NavigateSequential Decision Making | —Unverified | 0 |
| Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Mar 10, 2025 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks | Mar 9, 2025 | Card GamesDiversity | —Unverified | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Mar 3, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Shaping Laser Pulses with Reinforcement Learning | Mar 1, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | Mar 1, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | Feb 26, 2025 | Decision MakingManagement | CodeCode Available | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action GenerationDecision Making | CodeCode Available | 0 |
| The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning | Feb 21, 2025 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Making Universal Policies Universal | Feb 20, 2025 | Imitation LearningSequential Decision Making | CodeCode Available | 0 |
| Value Gradient Sampler: Sampling as Sequential Decision Making | Feb 18, 2025 | Anomaly DetectionDecision Making | CodeCode Available | 0 |
| Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Feb 14, 2025 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control | Feb 14, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Self-Evaluation for Job-Shop Scheduling | Feb 12, 2025 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning | Feb 11, 2025 | Decision Makingreinforcement-learning | —Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial RobustnessDecision Making | —Unverified | 0 |
| Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | Feb 6, 2025 | Data ValuationDecision Making | —Unverified | 0 |
| Online Clustering of Dueling Bandits | Feb 4, 2025 | ClusteringDecision Making | —Unverified | 0 |
| VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | Feb 4, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |