SOTAVerified

Sequential Decision Making

Papers

Showing 701–750 of 1210 papers

Title | Status | Hype
Representation Learning for Context-Dependent Decision-Making |  | 0
Hierarchical Constrained Stochastic Shortest Path Planning via Cost Budget Allocation |  | 0
Federated Multi-Armed Bandits Under Byzantine Attacks |  | 0
Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | Code | 0
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers |  | 0
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Code | 0
Toward Policy Explanations for Multi-Agent Reinforcement Learning | Code | 0
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection |  | 0
GFCL: A GRU-based Federated Continual Learning Framework against Data Poisoning Attacks in IoV |  | 0
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics |  | 0
Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models |  | 0
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations |  | 0
Achieving Long-Term Fairness in Sequential Decision Making | Code | 0
Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes |  | 0
An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services |  | 0
NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty |  | 0
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects |  | 0
The price of unfairness in linear bandits with biased feedback |  | 0
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism |  | 0
A Trainable Approach to Zero-delay Smoothing Spline Interpolation |  | 0
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions |  | 0
Linear Stochastic Bandits over a Bit-Constrained Channel |  | 0
Hierarchical Reinforcement Learning with AI Planning Models | Code | 0
LISA: Learning Interpretable Skill Abstractions from Language | Code | 0
Simulating Network Paths with Recurrent Buffering Units |  | 0
A Survey of Explainable Reinforcement Learning |  | 0
MuZero with Self-competition for Rate Control in VP9 Video Compression |  | 0
Sequential Bayesian experimental designs via reinforcement learning |  | 0
Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization |  | 0
Provable Reinforcement Learning with a Short-Term Memory |  | 0
Improved Regret for Differentially Private Exploration in Linear MDP |  | 0
Meta-Learning Hypothesis Spaces for Sequential Decision-making |  | 0
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Code | 0
Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device Placement | Code | 0
Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making |  | 0
Active Learning-Based Multistage Sequential Decision-Making Model with Application on Common Bile Duct Stone Evaluation |  | 0
Automated Reinforcement Learning: An Overview |  | 0
Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning |  | 0
Subgoal-Based Explanations for Unreliable Intelligent Decision Support Systems |  | 0
State of the Art of User Simulation approaches for conversational information retrieval |  | 0
Temporal Detection of Anomalies via Actor-Critic Based Controlled Sensing |  | 0
Socially-Optimal Mechanism Design for Incentivized Online Learning |  | 0
Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions |  | 0
A Survey on Interpretable Reinforcement Learning |  | 0
Revisiting Game Representations: The Hidden Costs of Efficiency in Sequential Decision-making Algorithms |  | 0
Differentially Private Regret Minimization in Episodic Markov Decision Processes | Code | 0
Application of Deep Reinforcement Learning to Payment Fraud |  | 0
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach |  | 0
MDPFuzz: Testing Models Solving Markov Decision Processes |  | 0
Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning |  | 0

No leaderboard results yet.