| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation LearningQuantization | CodeCode Available | 0 |
| Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | NavigateSequential Decision Making | —Unverified | 0 |
| Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Mar 10, 2025 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks | Mar 9, 2025 | Card GamesDiversity | —Unverified | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Mar 3, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| On Generalization Across Environments In Multi-Objective Reinforcement Learning | Mar 2, 2025 | Decision MakingMulti-Objective Reinforcement Learning | CodeCode Available | 1 |
| Reinforcement learning with combinatorial actions for coupled restless bandits | Mar 1, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |