| Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant | Feb 3, 2025 | Sequential Decision Making | Code Available | 0 |
| Meta-Prompt Optimization for LLM-Based Sequential Decision Making | Feb 2, 2025 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Deceptive Sequential Decision-Making via Regularized Policy Optimization | Jan 30, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems | Jan 28, 2025 | Computational Efficiency, Multi-Armed Bandits | Unverified | 0 |
| Sample-Efficient Behavior Cloning Using General Domain Knowledge | Jan 27, 2025 | Car Racing, Feature Engineering | Unverified | 0 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement Learning, Sequential Decision Making | Code Available | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jan 21, 2025 | Combinatorial Optimization, Sequential Decision Making | Unverified | 0 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Jan 16, 2025 | Diagnostic, Sequential Decision Making | Code Available | 0 |
| On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Jan 16, 2025 | Autonomous Driving, Clustering | Code Available | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | Jan 15, 2025 | Decision Making, Question Answering | Unverified | 0 |
| Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | Jan 10, 2025 | Causal Inference, counterfactual | Unverified | 0 |
| All AI Models are Wrong, but Some are Optimal | Jan 10, 2025 | All, Decision Making | Unverified | 0 |
| Generative Flow Networks: Theory and Applications to Structure Learning | Jan 9, 2025 | Sequential Decision Making, Variational Inference | Unverified | 0 |
| Explainable Reinforcement Learning via Temporal Policy Decomposition | Jan 7, 2025 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | Jan 6, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 |
| SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning | Jan 1, 2025 | Representation Learning, Sequential Decision Making | Unverified | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | Jan 1, 2025 | Clustering, Online Clustering | Unverified | 0 |
| Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | Dec 26, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Dec 24, 2024 | Deep Reinforcement Learning, Imputation | Code Available | 0 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | Dec 23, 2024 | Bayesian Optimization, Hyperparameter Optimization | Unverified | 0 |
| Fairness in Reinforcement Learning with Bisimulation Metrics | Dec 22, 2024 | Decision Making, Fairness | Unverified | 0 |
| GAS: Generative Auto-bidding with Post-training Search | Dec 22, 2024 | Computational Efficiency, Sequential Decision Making | Unverified | 0 |
| Subgoal Discovery Using a Free Energy Paradigm and State Aggregations | Dec 21, 2024 | Reinforcement Learning (RL), Sequential Decision Making | Unverified | 0 |
| DriveGPT: Scaling Autoregressive Behavior Models for Driving | Dec 19, 2024 | Autonomous Driving, Decision Making | Unverified | 0 |
| Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves | Dec 18, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learning, continuous-control | Unverified | 0 |
| Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Dec 16, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Solving Robust Markov Decision Processes: Generic, Reliable, Efficient | Dec 13, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Swarm Behavior Cloning | Dec 10, 2024 | Decision Making, Imitation Learning | Unverified | 0 |
| How Should We Represent History in Interpretable Models of Clinical Policies? | Dec 10, 2024 | Decision Making, Representation Learning | Code Available | 0 |
| Effective Reward Specification in Deep Reinforcement Learning | Dec 10, 2024 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Optimizing Sensor Redundancy in Sequential Decision-Making Problems | Dec 10, 2024 | Decision Making, OpenAI Gym | Unverified | 0 |
| Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed Bandits, Sequential Decision Making | Unverified | 0 |
| A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Dec 9, 2024 | Decision Making, Imitation Learning | Unverified | 0 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Nonmyopic Global Optimisation via Approximate Dynamic Programming | Dec 6, 2024 | Bayesian Optimisation, Gaussian Processes | Code Available | 0 |
| Reinforcement Learning: An Overview | Dec 6, 2024 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Dec 5, 2024 | Collision Avoidance, Deep Reinforcement Learning | Unverified | 0 |
| Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals | Dec 3, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum | Dec 3, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control | Dec 3, 2024 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Selective Reviews of Bandit Problems in AI via a Statistical View | Dec 3, 2024 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games | Dec 1, 2024 | Atari Games, Decision Making | Code Available | 0 |
| STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft | Dec 1, 2024 | Decision Making, Minecraft | Unverified | 0 |
| Market Making without Regret | Nov 21, 2024 | Sequential Decision Making | Unverified | 0 |
| On adaptivity and minimax optimality of two-sided nearest neighbors | Nov 20, 2024 | Decision Making, Matrix Completion | Code Available | 0 |
| Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet | Nov 18, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Nov 15, 2024 | Reinforcement Learning (RL), Sequential Decision Making | Unverified | 0 |