SOTAVerified

Sequential Decision Making

Papers

Showing 101-150 of 1210 papers

| Title | Status | Hype |
| --- | --- | --- |
| Value Gradient Sampler: Sampling as Sequential Decision Making | Code | 0 |
| Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Code | 0 |
| Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control | | 0 |
| Self-Evaluation for Job-Shop Scheduling | | 0 |
| Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning | | 0 |
| A Survey on Explainable Deep Reinforcement Learning | | 0 |
| Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | | 0 |
| Online Clustering of Dueling Bandits | | 0 |
| VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | | 0 |
| Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant | Code | 0 |
| Meta-Prompt Optimization for LLM-Based Sequential Decision Making | | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | | 0 |
| Deceptive Sequential Decision-Making via Regularized Policy Optimization | | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | | 0 |
| HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems | | 0 |
| Sample-Efficient Behavior Cloning Using General Domain Knowledge | | 0 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Code | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | | 0 |
| On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Code | 0 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Code | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | | 0 |
| Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | | 0 |
| All AI Models are Wrong, but Some are Optimal | | 0 |
| Generative Flow Networks: Theory and Applications to Structure Learning | | 0 |
| Explainable Reinforcement Learning via Temporal Policy Decomposition | | 0 |
| Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | | 0 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Code | 1 |
| SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning | | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | | 0 |
| Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | | 0 |
| Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies | Code | 0 |
| MineStudio: A Streamlined Package for Minecraft AI Agent Development | Code | 3 |
| HyperQ-Opt: Q-learning for Hyperparameter Optimization | | 0 |
| Fairness in Reinforcement Learning with Bisimulation Metrics | | 0 |
| GAS: Generative Auto-bidding with Post-training Search | | 0 |
| Subgoal Discovery Using a Free Energy Paradigm and State Aggregations | | 0 |
| DriveGPT: Scaling Autoregressive Behavior Models for Driving | | 0 |
| Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves | | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | | 0 |
| Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives | Code | 0 |
| Solving Robust Markov Decision Processes: Generic, Reliable, Efficient | | 0 |
| How Should We Represent History in Interpretable Models of Clinical Policies? | Code | 0 |
| Effective Reward Specification in Deep Reinforcement Learning | | 0 |
| Swarm Behavior Cloning | | 0 |
| Optimizing Sensor Redundancy in Sequential Decision-Making Problems | | 0 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Code | 0 |
| A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | | 0 |
| Conservative Contextual Bandits: Beyond Linear Representations | | 0 |
| Reinforcement Learning: An Overview | Code | 0 |
| Nonmyopic Global Optimisation via Approximate Dynamic Programming | Code | 0 |
Page 3 of 25

No leaderboard results yet.