SOTAVerified

Sequential Decision Making

Papers

Showing 10011025 of 1210 papers

TitleStatusHype
Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary SettingsCode0
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit RateCode0
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPsCode0
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resamplingCode0
Learning to Reach Goals via DiffusionCode0
Online Decision Making with History-Average Dependent Costs (Extended)Code0
Evolutionary Multi-Armed Bandits with Genetic Thompson SamplingCode0
Online Learning with Costly Features in Non-stationary EnvironmentsCode0
SOPE: Spectrum of Off-Policy EstimatorsCode0
Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot ActionsCode0
Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel DecodingCode0
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function ApproximationCode0
Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability ObjectivesCode0
Learning Versatile Skills with Curriculum MaskingCode0
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point ProcessesCode0
Reinforcement LearningCode0
TORE: Token Recycling in Vision Transformers for Efficient Active Visual ExplorationCode0
Lifelong Learning with a Changing Action SetCode0
TextAtari: 100K Frames Game Playing with Language AgentsCode0
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal ControlCode0
Adversarially Robust Decision TransformerCode0
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient SimulatorsCode0
Detecting Adversarial Attacks on Neural Network Policies with Visual ForesightCode0
SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing SurrogateCode0
Universal Off-Policy EvaluationCode0
Show:102550
← PrevPage 41 of 49Next →

No leaderboard results yet.