| Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant | Feb 3, 2025 | Sequential Decision Making | Code Available | 0 |
| Meta-Prompt Optimization for LLM-Based Sequential Decision Making | Feb 2, 2025 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| Deceptive Sequential Decision-Making via Regularized Policy Optimization | Jan 30, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems | Jan 28, 2025 | Computational Efficiency, Multi-Armed Bandits | Unverified | 0 |
| Sample-Efficient Behavior Cloning Using General Domain Knowledge | Jan 27, 2025 | Car Racing, Feature Engineering | Unverified | 0 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement Learning, Sequential Decision Making | Code Available | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jan 21, 2025 | Combinatorial Optimization, Sequential Decision Making | Unverified | 0 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Jan 16, 2025 | Diagnostic, Sequential Decision Making | Code Available | 0 |
| On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Jan 16, 2025 | Autonomous Driving, Clustering | Code Available | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | Jan 15, 2025 | Decision Making, Question Answering | Unverified | 0 |
| Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | Jan 10, 2025 | Causal Inference, counterfactual | Unverified | 0 |
| All AI Models are Wrong, but Some are Optimal | Jan 10, 2025 | All, Decision Making | Unverified | 0 |
| Generative Flow Networks: Theory and Applications to Structure Learning | Jan 9, 2025 | Sequential Decision Making, Variational Inference | Unverified | 0 |
| Explainable Reinforcement Learning via Temporal Policy Decomposition | Jan 7, 2025 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | Jan 6, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning | Jan 1, 2025 | Representation Learning, Sequential Decision Making | Unverified | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | Jan 1, 2025 | Clustering, Online Clustering | Unverified | 0 |
| Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | Dec 26, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Dec 24, 2024 | Deep Reinforcement Learning, Imputation | Code Available | 0 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | Dec 23, 2024 | Bayesian Optimization, Hyperparameter Optimization | Unverified | 0 |
| Fairness in Reinforcement Learning with Bisimulation Metrics | Dec 22, 2024 | Decision Making, Fairness | Unverified | 0 |
| GAS: Generative Auto-bidding with Post-training Search | Dec 22, 2024 | Computational Efficiency, Sequential Decision Making | Unverified | 0 |
| Subgoal Discovery Using a Free Energy Paradigm and State Aggregations | Dec 21, 2024 | Reinforcement Learning (RL), Sequential Decision Making | Unverified | 0 |