SOTAVerified

Sequential Decision Making

Papers

Showing 201–225 of 1210 papers

| Title | Status | Hype |
|---|---|---|
| Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant | Code | 0 |
| Meta-Prompt Optimization for LLM-Based Sequential Decision Making | | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | | 0 |
| Deceptive Sequential Decision-Making via Regularized Policy Optimization | | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | | 0 |
| HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems | | 0 |
| Sample-Efficient Behavior Cloning Using General Domain Knowledge | | 0 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Code | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | | 0 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Code | 0 |
| On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Code | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | | 0 |
| Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | | 0 |
| All AI Models are Wrong, but Some are Optimal | | 0 |
| Generative Flow Networks: Theory and Applications to Structure Learning | | 0 |
| Explainable Reinforcement Learning via Temporal Policy Decomposition | | 0 |
| Beyond O(√T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | | 0 |
| SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning | | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | | 0 |
| Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | | 0 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Code | 0 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | | 0 |
| Fairness in Reinforcement Learning with Bisimulation Metrics | | 0 |
| GAS: Generative Auto-bidding with Post-training Search | | 0 |
| Subgoal Discovery Using a Free Energy Paradigm and State Aggregations | | 0 |
