SOTAVerified

Sequential Decision Making

Papers

Showing 101125 of 1210 papers

TitleStatusHype
Value Gradient Sampler: Sampling as Sequential Decision MakingCode0
Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control0
Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel DecodingCode0
Self-Evaluation for Job-Shop Scheduling0
Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning0
A Survey on Explainable Deep Reinforcement Learning0
Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making0
Online Clustering of Dueling Bandits0
VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation0
Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily AssistantCode0
Meta-Prompt Optimization for LLM-Based Sequential Decision Making0
Offline Learning for Combinatorial Multi-armed Bandits0
Deceptive Sequential Decision-Making via Regularized Policy Optimization0
Contextual Online Decision Making with Infinite-Dimensional Functional Regression0
HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems0
Sample-Efficient Behavior Cloning Using General Domain Knowledge0
An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online AdvertisingCode0
Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization0
On Learning Informative Trajectory Embeddings for Imitation, Classification and RegressionCode0
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient SimulatorsCode0
Embodied Scene Understanding for Vision Language Models via MetaVQA0
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing0
All AI Models are Wrong, but Some are Optimal0
Generative Flow Networks: Theory and Applications to Structure Learning0
Explainable Reinforcement Learning via Temporal Policy Decomposition0
Show:102550
← PrevPage 5 of 49Next →

No leaderboard results yet.