| Value Gradient Sampler: Sampling as Sequential Decision Making | Feb 18, 2025 | Anomaly DetectionDecision Making | CodeCode Available | 0 |
| Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Feb 14, 2025 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control | Feb 14, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Self-Evaluation for Job-Shop Scheduling | Feb 12, 2025 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning | Feb 11, 2025 | Decision Makingreinforcement-learning | —Unverified | 0 |
| A Survey on Explainable Deep Reinforcement Learning | Feb 8, 2025 | Adversarial RobustnessDecision Making | —Unverified | 0 |
| Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | Feb 6, 2025 | Data ValuationDecision Making | —Unverified | 0 |
| Online Clustering of Dueling Bandits | Feb 4, 2025 | ClusteringDecision Making | —Unverified | 0 |
| VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | Feb 4, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant | Feb 3, 2025 | Sequential Decision Making | CodeCode Available | 0 |
| Meta-Prompt Optimization for LLM-Based Sequential Decision Making | Feb 2, 2025 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Deceptive Sequential Decision-Making via Regularized Policy Optimization | Jan 30, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems | Jan 28, 2025 | Computational EfficiencyMulti-Armed Bandits | —Unverified | 0 |
| Sample-Efficient Behavior Cloning Using General Domain Knowledge | Jan 27, 2025 | Car RacingFeature Engineering | —Unverified | 0 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement LearningSequential Decision Making | CodeCode Available | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jan 21, 2025 | Combinatorial OptimizationSequential Decision Making | —Unverified | 0 |
| On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Jan 16, 2025 | Autonomous DrivingClustering | CodeCode Available | 0 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Jan 16, 2025 | DiagnosticSequential Decision Making | CodeCode Available | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | Jan 15, 2025 | Decision MakingQuestion Answering | —Unverified | 0 |
| Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | Jan 10, 2025 | Causal Inferencecounterfactual | —Unverified | 0 |
| All AI Models are Wrong, but Some are Optimal | Jan 10, 2025 | AllDecision Making | —Unverified | 0 |
| Generative Flow Networks: Theory and Applications to Structure Learning | Jan 9, 2025 | Sequential Decision MakingVariational Inference | —Unverified | 0 |
| Explainable Reinforcement Learning via Temporal Policy Decomposition | Jan 7, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | Jan 6, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning | Jan 1, 2025 | Representation LearningSequential Decision Making | —Unverified | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | Jan 1, 2025 | ClusteringOnline Clustering | —Unverified | 0 |
| Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | Dec 26, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Dec 24, 2024 | Deep Reinforcement LearningImputation | CodeCode Available | 0 |
| MineStudio: A Streamlined Package for Minecraft AI Agent Development | Dec 24, 2024 | AI AgentDecision Making | CodeCode Available | 3 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | Dec 23, 2024 | Bayesian OptimizationHyperparameter Optimization | —Unverified | 0 |
| Fairness in Reinforcement Learning with Bisimulation Metrics | Dec 22, 2024 | Decision MakingFairness | —Unverified | 0 |
| GAS: Generative Auto-bidding with Post-training Search | Dec 22, 2024 | Computational EfficiencySequential Decision Making | —Unverified | 0 |
| Subgoal Discovery Using a Free Energy Paradigm and State Aggregations | Dec 21, 2024 | Reinforcement Learning (RL)Sequential Decision Making | —Unverified | 0 |
| DriveGPT: Scaling Autoregressive Behavior Models for Driving | Dec 19, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves | Dec 18, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learningcontinuous-control | —Unverified | 0 |
| Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Dec 16, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Solving Robust Markov Decision Processes: Generic, Reliable, Efficient | Dec 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| How Should We Represent History in Interpretable Models of Clinical Policies? | Dec 10, 2024 | Decision MakingRepresentation Learning | CodeCode Available | 0 |
| Effective Reward Specification in Deep Reinforcement Learning | Dec 10, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Swarm Behavior Cloning | Dec 10, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Optimizing Sensor Redundancy in Sequential Decision-Making Problems | Dec 10, 2024 | Decision MakingOpenAI Gym | —Unverified | 0 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Dec 9, 2024 | Decision MakingImitation Learning | —Unverified | 0 |
| Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| Reinforcement Learning: An Overview | Dec 6, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Nonmyopic Global Optimisation via Approximate Dynamic Programming | Dec 6, 2024 | Bayesian OptimisationGaussian Processes | CodeCode Available | 0 |