| Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets | Mar 26, 2025 | Representation Learning, Sequential Decision Making | Unverified | 0 |
| Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | Mar 25, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Depth Matters: Multimodal RGB-D Perception for Robust Autonomous Agents | Mar 20, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | Mar 19, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Mar 18, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation Learning, Quantization | Code Available | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | Navigate, Sequential Decision Making | Unverified | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Mar 10, 2025 | Multi-Armed Bandits, Sequential Decision Making | Unverified | 0 |