| Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks | May 1, 2025 | Decision MakingLarge Language Model | —Unverified | 0 |
| Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments | Apr 27, 2025 | Decision MakingDiversity | —Unverified | 0 |
| SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning | Apr 24, 2025 | Decision MakingQ-Learning | —Unverified | 0 |
| Hierarchical Attention Fusion of Visual and Textual Representations for Cross-Domain Sequential Recommendation | Apr 21, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Consensus in Motion: A Case of Dynamic Rationality of Sequential Learning in Probability Aggregation | Apr 20, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| TALES: Text Adventure Learning Environment Suite | Apr 19, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs | Apr 15, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand | Apr 14, 2025 | Sequential Decision MakingSurvival Analysis | CodeCode Available | 0 |
| Truncated Matrix Completion - An Empirical Study | Apr 14, 2025 | Decision MakingLow-Rank Matrix Completion | —Unverified | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| A Framework of decision-relevant observability: Reinforcement Learning converges under relative ignorability | Apr 10, 2025 | Causal InferenceDecision Making | —Unverified | 0 |
| RAISE: Reinforenced Adaptive Instruction Selection For Large Language Models | Apr 9, 2025 | Sequential Decision Making | —Unverified | 0 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| A Classification View on Meta Learning Bandits | Apr 6, 2025 | ClassificationMeta-Learning | —Unverified | 0 |
| From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making | Apr 5, 2025 | Bayesian OptimizationData Integration | —Unverified | 0 |
| MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories | Apr 4, 2025 | Decision MakingImage Captioning | —Unverified | 0 |
| Counterfactual Inference under Thompson Sampling | Apr 3, 2025 | Causal Inferencecounterfactual | —Unverified | 0 |
| Towards Enabling Learning for Time-Varying finite horizon Sequential Decision-Making Problems* | Apr 2, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Remember, but also, Forget: Bridging Myopic and Perfect Recall Fairness with Past-Discounting | Apr 1, 2025 | Decision MakingFairness | —Unverified | 0 |
| Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding | Apr 1, 2025 | Decision MakingOff-policy evaluation | —Unverified | 0 |
| Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game Approach | Mar 30, 2025 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Towards Trustworthy GUI Agents: A Survey | Mar 30, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Exploring Explainable Multi-player MCTS-minimax Hybrids in Board Game Using Process Mining | Mar 30, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework | Mar 26, 2025 | Bayesian OptimizationSequential Decision Making | CodeCode Available | 0 |
| Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets | Mar 26, 2025 | Representation LearningSequential Decision Making | —Unverified | 0 |
| Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot Navigation | Mar 26, 2025 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 |
| Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | Mar 25, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Depth Matters: Multimodal RGB-D Perception for Robust Autonomous Agents | Mar 20, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | Mar 19, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Mar 18, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation LearningQuantization | CodeCode Available | 0 |
| Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | NavigateSequential Decision Making | —Unverified | 0 |
| Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Mar 10, 2025 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks | Mar 9, 2025 | Card GamesDiversity | —Unverified | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Mar 3, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| On Generalization Across Environments In Multi-Objective Reinforcement Learning | Mar 2, 2025 | Decision MakingMulti-Objective Reinforcement Learning | CodeCode Available | 1 |
| Reinforcement learning with combinatorial actions for coupled restless bandits | Mar 1, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Shaping Laser Pulses with Reinforcement Learning | Mar 1, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | Mar 1, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | Feb 26, 2025 | Decision MakingManagement | CodeCode Available | 0 |
| Training a Generally Curious Agent | Feb 24, 2025 | Decision MakingEfficient Exploration | CodeCode Available | 1 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action GenerationDecision Making | CodeCode Available | 0 |
| The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning | Feb 21, 2025 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications | Feb 20, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Making Universal Policies Universal | Feb 20, 2025 | Imitation LearningSequential Decision Making | CodeCode Available | 0 |
| AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO | Feb 20, 2025 | Autonomous NavigationNavigate | CodeCode Available | 2 |