SOTAVerified

Sequential Decision Making

Papers

Showing 701–725 of 1210 papers

Title | Status | Hype
Representation Learning for Context-Dependent Decision-Making |  | 0
Hierarchical Constrained Stochastic Shortest Path Planning via Cost Budget Allocation |  | 0
Federated Multi-Armed Bandits Under Byzantine Attacks |  | 0
Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | Code | 0
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers |  | 0
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Code | 0
Toward Policy Explanations for Multi-Agent Reinforcement Learning | Code | 0
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection |  | 0
GFCL: A GRU-based Federated Continual Learning Framework against Data Poisoning Attacks in IoV |  | 0
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics |  | 0
Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models |  | 0
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations |  | 0
Achieving Long-Term Fairness in Sequential Decision Making | Code | 0
Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes |  | 0
An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services |  | 0
NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty |  | 0
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects |  | 0
The price of unfairness in linear bandits with biased feedback |  | 0
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism |  | 0
A Trainable Approach to Zero-delay Smoothing Spline Interpolation |  | 0
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions |  | 0
Linear Stochastic Bandits over a Bit-Constrained Channel |  | 0
Hierarchical Reinforcement Learning with AI Planning Models | Code | 0
LISA: Learning Interpretable Skill Abstractions from Language | Code | 0
Simulating Network Paths with Recurrent Buffering Units |  | 0
Page 29 of 49

No leaderboard results yet.