| Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot Navigation | Mar 26, 2025 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 |
| Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | Mar 25, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Depth Matters: Multimodal RGB-D Perception for Robust Autonomous Agents | Mar 20, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | Mar 19, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Mar 18, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation LearningQuantization | CodeCode Available | 0 |
| Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | NavigateSequential Decision Making | —Unverified | 0 |
| Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Mar 10, 2025 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks | Mar 9, 2025 | Card GamesDiversity | —Unverified | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Mar 3, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| On Generalization Across Environments In Multi-Objective Reinforcement Learning | Mar 2, 2025 | Decision MakingMulti-Objective Reinforcement Learning | CodeCode Available | 1 |
| Reinforcement learning with combinatorial actions for coupled restless bandits | Mar 1, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Shaping Laser Pulses with Reinforcement Learning | Mar 1, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | Mar 1, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | Feb 26, 2025 | Decision MakingManagement | CodeCode Available | 0 |
| Training a Generally Curious Agent | Feb 24, 2025 | Decision MakingEfficient Exploration | CodeCode Available | 1 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action GenerationDecision Making | CodeCode Available | 0 |
| The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning | Feb 21, 2025 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Making Universal Policies Universal | Feb 20, 2025 | Imitation LearningSequential Decision Making | CodeCode Available | 0 |
| AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO | Feb 20, 2025 | Autonomous NavigationNavigate | CodeCode Available | 2 |