SOTAVerified

Sequential Decision Making

Papers

Showing 201250 of 1210 papers

TitleStatusHype
Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily AssistantCode0
Meta-Prompt Optimization for LLM-Based Sequential Decision Making0
Offline Learning for Combinatorial Multi-armed Bandits0
Contextual Online Decision Making with Infinite-Dimensional Functional Regression0
Deceptive Sequential Decision-Making via Regularized Policy Optimization0
HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems0
Sample-Efficient Behavior Cloning Using General Domain Knowledge0
An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online AdvertisingCode0
Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization0
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient SimulatorsCode0
On Learning Informative Trajectory Embeddings for Imitation, Classification and RegressionCode0
Embodied Scene Understanding for Vision Language Models via MetaVQA0
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing0
All AI Models are Wrong, but Some are Optimal0
Generative Flow Networks: Theory and Applications to Structure Learning0
Explainable Reinforcement Learning via Temporal Policy Decomposition0
Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming0
SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning0
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts0
Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning0
Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation StrategiesCode0
HyperQ-Opt: Q-learning for Hyperparameter Optimization0
Fairness in Reinforcement Learning with Bisimulation Metrics0
GAS: Generative Auto-bidding with Post-training Search0
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations0
DriveGPT: Scaling Autoregressive Behavior Models for Driving0
Threshold UCT: Cost-Constrained Monte Carlo Tree Search with Pareto Curves0
Active Reinforcement Learning Strategies for Offline Policy Improvement0
Revelations: A Decidable Class of POMDPs with Omega-Regular ObjectivesCode0
Solving Robust Markov Decision Processes: Generic, Reliable, Efficient0
Swarm Behavior Cloning0
How Should We Represent History in Interpretable Models of Clinical Policies?Code0
Effective Reward Specification in Deep Reinforcement Learning0
Optimizing Sensor Redundancy in Sequential Decision-Making Problems0
Conservative Contextual Bandits: Beyond Linear Representations0
A Note on Sample Complexity of Interactive Imitation Learning with Log Loss0
Discrete-Time Distribution Steering using Monte Carlo Tree SearchCode0
Nonmyopic Global Optimisation via Approximate Dynamic ProgrammingCode0
Reinforcement Learning: An OverviewCode0
Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach0
Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals0
Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum0
Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control0
Selective Reviews of Bandit Problems in AI via a Statistical View0
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari GamesCode0
STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft0
Market Making without Regret0
On adaptivity and minimax optimality of two-sided nearest neighborsCode0
Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet0
Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review0
Show:102550
← PrevPage 5 of 25Next →

No leaderboard results yet.