| TALES: Text Adventure Learning Environment Suite | Apr 19, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs | Apr 15, 2025 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand | Apr 14, 2025 | Sequential Decision Making, Survival Analysis | Code Available | 0 |
| Truncated Matrix Completion - An Empirical Study | Apr 14, 2025 | Decision Making, Low-Rank Matrix Completion | Unverified | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| A Framework of decision-relevant observability: Reinforcement Learning converges under relative ignorability | Apr 10, 2025 | Causal Inference, Decision Making | Unverified | 0 |
| RAISE: Reinforenced Adaptive Instruction Selection For Large Language Models | Apr 9, 2025 | Sequential Decision Making | Unverified | 0 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement Learning, Q-Learning | Code Available | 0 |
| A Classification View on Meta Learning Bandits | Apr 6, 2025 | Classification, Meta-Learning | Unverified | 0 |
| From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making | Apr 5, 2025 | Bayesian Optimization, Data Integration | Unverified | 0 |
| MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories | Apr 4, 2025 | Decision Making, Image Captioning | Unverified | 0 |
| Counterfactual Inference under Thompson Sampling | Apr 3, 2025 | Causal Inference, counterfactual | Unverified | 0 |
| Towards Enabling Learning for Time-Varying finite horizon Sequential Decision-Making Problems* | Apr 2, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding | Apr 1, 2025 | Decision Making, Off-policy evaluation | Unverified | 0 |
| Remember, but also, Forget: Bridging Myopic and Perfect Recall Fairness with Past-Discounting | Apr 1, 2025 | Decision Making, Fairness | Unverified | 0 |
| Towards Trustworthy GUI Agents: A Survey | Mar 30, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game Approach | Mar 30, 2025 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| Exploring Explainable Multi-player MCTS-minimax Hybrids in Board Game Using Process Mining | Mar 30, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot Navigation | Mar 26, 2025 | Decision Making, Model-based Reinforcement Learning | Code Available | 0 |
| Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets | Mar 26, 2025 | Representation Learning, Sequential Decision Making | Unverified | 0 |
| Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework | Mar 26, 2025 | Bayesian Optimization, Sequential Decision Making | Code Available | 0 |
| Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | Mar 25, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Depth Matters: Multimodal RGB-D Perception for Robust Autonomous Agents | Mar 20, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | Mar 19, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Mar 18, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation Learning, Quantization | Code Available | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | Navigate, Sequential Decision Making | Unverified | 0 |
| Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Mar 10, 2025 | Multi-Armed Bandits, Sequential Decision Making | Unverified | 0 |
| GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks | Mar 9, 2025 | Card Games, Diversity | Unverified | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Mar 3, 2025 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Shaping Laser Pulses with Reinforcement Learning | Mar 1, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | Mar 1, 2025 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-control, Continuous Control | Code Available | 0 |
| WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | Feb 26, 2025 | Decision Making, Management | Code Available | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action Generation, Decision Making | Code Available | 0 |
| The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning | Feb 21, 2025 | Decision Making, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Making Universal Policies Universal | Feb 20, 2025 | Imitation Learning, Sequential Decision Making | Code Available | 0 |
| Value Gradient Sampler: Sampling as Sequential Decision Making | Feb 18, 2025 | Anomaly Detection, Decision Making | Code Available | 0 |
| Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Feb 14, 2025 | Decision Making, Multi-agent Reinforcement Learning | Code Available | 0 |
| Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control | Feb 14, 2025 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Self-Evaluation for Job-Shop Scheduling | Feb 12, 2025 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning | Feb 11, 2025 | Decision Making, reinforcement-learning | Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial Robustness, Decision Making | Unverified | 0 |
| Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | Feb 6, 2025 | Data Valuation, Decision Making | Unverified | 0 |
| Online Clustering of Dueling Bandits | Feb 4, 2025 | Clustering, Decision Making | Unverified | 0 |
| VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | Feb 4, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |