SOTAVerified

Sequential Decision Making

Papers

Showing 701750 of 1210 papers

TitleStatusHype
Route Optimization via Environment-Aware Deep Network and Reinforcement Learning0
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics0
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling0
Safe Policy Improvement by Minimizing Robust Baseline Regret0
Safe POMDP Online Planning via Shielding0
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation0
Safe Sequential Optimization for Switching Environments0
Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel0
Safety-Aware Algorithms for Adversarial Contextual Bandit0
Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving0
Sample-efficient Adversarial Imitation Learning0
Sample-Efficient Behavior Cloning Using General Domain Knowledge0
Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions0
Sampling Through the Lens of Sequential Decision Making0
SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning0
Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning0
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks0
Scalable First-Order Methods for Robust MDPs0
Scalable Thompson Sampling via Optimal Transport0
Scaling Multi-Armed Bandit Algorithms0
Scaling up ML-based Black-box Planning with Partial STRIPS Models0
Second-order Quantile Methods for Experts and Combinatorial Games0
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors0
Selective Reviews of Bandit Problems in AI via a Statistical View0
Self-Evaluation for Job-Shop Scheduling0
Self-evolving Autoencoder Embedded Q-Network0
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks0
Self-Supervised Reinforcement Learning that Transfers using Random Features0
Semi-Parametric Batched Global Multi-Armed Bandits with Covariates0
SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning0
SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning0
Sequential Batch Learning in Finite-Action Linear Contextual Bandits0
Sequential Bayesian experimental designs via reinforcement learning0
Sequential Decision-Making for Inline Text Autocomplete0
Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings0
Sequential Fair Resource Allocation under a Markov Decision Process Framework0
Sequential Information Design: Learning to Persuade in the Dark0
Sequential Stochastic Optimization in Separable Learning Environments0
Sequential Treatment Effect Estimation with Unmeasured Confounders0
Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making0
Shaping Laser Pulses with Reinforcement Learning0
Sharp Thresholds of the Information Cascade Fragility Under a Mismatched Model0
Short-Long Policy Evaluation with Novel Actions0
Similarities between policy gradient methods (PGM) in Reinforcement learning (RL) and supervised learning (SL)0
Simulating Network Paths with Recurrent Buffering Units0
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial0
Situated Language Learning via Interactive Narratives0
Sliding-Window Thompson Sampling for Non-Stationary Settings0
SMART: Self-supervised Multi-task pretrAining with contRol Transformers0
Socially-Optimal Mechanism Design for Incentivized Online Learning0
Show:102550
← PrevPage 15 of 25Next →

No leaderboard results yet.