SOTAVerified

Sequential Decision Making

Papers

Showing 851-900 of 1210 papers

| Title | Status | Hype |
| --- | --- | --- |
| Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards | | 0 |
| Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning | | 0 |
| Multi-Environment Pretraining Enables Transfer to Action Limited Datasets | | 0 |
| Multi-granular Adversarial Attacks against Black-box Neural Ranking Models | | 0 |
| Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception | | 0 |
| Multi-Player Zero-Sum Markov Games with Networked Separable Interactions | | 0 |
| Multi-shot Pedestrian Re-identification via Sequential Decision Making | | 0 |
| Multi-Task Generative Adversarial Nets with Shared Memory for Cross-Domain Coordination Control | | 0 |
| Multi-task Representation Learning for Pure Exploration in Linear Bandits | | 0 |
| MuZero with Self-competition for Rate Control in VP9 Video Compression | | 0 |
| Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes | | 0 |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | | 0 |
| Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making | | 0 |
| Compositional Q-learning for electrolyte repletion with imbalanced patient sub-populations | | 0 |
| Network Offloading Policies for Cloud Robotics: a Learning-based Approach | | 0 |
| Neural Bootstrapping Attention for Neural Processes | | 0 |
| Neural Column Generation for Capacitated Vehicle Routing | | 0 |
| Neural Heterogeneous Scheduler | | 0 |
| Neuro-symbolic Meta Reinforcement Learning for Trading | | 0 |
| Neuro-Symbolic World Models for Adapting to Open World Novelty | | 0 |
| No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees | | 0 |
| Non-Deterministic Policies in Markovian Decision Processes | | 0 |
| Non-maximizing policies that fulfill multi-criterion aspirations in expectation | | 0 |
| Non-Stationary Bandits with Habituation and Recovery Dynamics | | 0 |
| Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards | | 0 |
| Not all users are the same: Providing personalized explanations for sequential decision making problems | | 0 |
| Novel Approaches to Accelerating the Convergence Rate of Markov Decision Process for Search Result Diversification | | 0 |
| NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty | | 0 |
| O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models | | 0 |
| Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | | 0 |
| Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets | | 0 |
| Offline Hierarchical Reinforcement Learning via Inverse Optimization | | 0 |
| Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | | 0 |
| Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | | 0 |
| Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding | | 0 |
| OMGPT: A Sequence Modeling Framework for Data-driven Operational Decision Making | | 0 |
| On adaptivity and minimax optimality of two-sided nearest neighbors | | 0 |
| On Bellman's Optimality Principle for zs-POSGs | | 0 |
| On Blame Attribution for Accountable Multi-Agent Sequential Decision Making | | 0 |
| On Computation and Generalization of Generative Adversarial Imitation Learning | | 0 |
| On Efficiency in Hierarchical Reinforcement Learning | | 0 |
| On Efficient Online Imitation Learning via Classification | | 0 |
| One-shot learning and behavioral eligibility traces in sequential decision making | | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | | 0 |
| Online Batch Decision-Making with High-Dimensional Covariates | | 0 |
| Online Clustering of Dueling Bandits | | 0 |
| Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games | | 0 |
| Online Convex Optimization with Continuous Switching Constraint | | 0 |
Page 18 of 25

No leaderboard results yet.