SOTAVerified

Sequential Decision Making

Papers

Showing 651700 of 1210 papers

TitleStatusHype
A Deep Reinforcement Learning Framework For Column GenerationCode0
Adaptive Robust Online Portfolio Selection0
A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs0
Robust Anytime Learning of Markov Decision ProcessesCode0
Multi-Agent Learning of Numerical Methods for Hyperbolic PDEs with Factored Dec-MDP0
Causal Explanations for Sequential Decision Making Under Uncertainty0
Adaptive Sampling for Discovery0
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence LearningCode0
Multi-Agent Reinforcement Learning is a Sequence Modeling ProblemCode2
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: CorrectionsCode0
Flow-based Recurrent Belief State Learning for POMDPs0
Survey on Fair Reinforcement Learning: Theory and Practice0
Marginal and Joint Cross-Entropies & Predictives for Online Bayesian Inference, Active Learning, and Active Sampling0
Representation Learning for Context-Dependent Decision-Making0
Hierarchical Constrained Stochastic Shortest Path Planning via Cost Budget Allocation0
Federated Multi-Armed Bandits Under Byzantine Attacks0
Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot ActionsCode0
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers0
Evolutionary Multi-Armed Bandits with Genetic Thompson SamplingCode0
Toward Policy Explanations for Multi-Agent Reinforcement LearningCode0
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection0
GFCL: A GRU-based Federated Continual Learning Framework against Data Poisoning Attacks in IoV0
Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply ChainsCode1
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics0
Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models0
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations0
Achieving Long-Term Fairness in Sequential Decision MakingCode0
Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes0
An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services0
NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty0
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects0
The Sandbox Environment for Generalizable Agent Research (SEGAR)Code1
The price of unfairness in linear bandits with biased feedback0
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism0
Curriculum-based Reinforcement Learning for Distribution System Critical Load RestorationCode1
A Trainable Approach to Zero-delay Smoothing Spline Interpolation0
Deep Reinforcement Learning for Entity AlignmentCode1
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions0
Linear Stochastic Bandits over a Bit-Constrained Channel0
Hierarchical Reinforcement Learning with AI Planning ModelsCode0
LISA: Learning Interpretable Skill Abstractions from LanguageCode0
Simulating Network Paths with Recurrent Buffering Units0
A Survey of Explainable Reinforcement Learning0
Sequential Bayesian experimental designs via reinforcement learning0
Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization0
MuZero with Self-competition for Rate Control in VP9 Video Compression0
Provable Reinforcement Learning with a Short-Term Memory0
Pre-Trained Language Models for Interactive Decision-MakingCode2
Improved Regret for Differentially Private Exploration in Linear MDP0
Meta-Learning Hypothesis Spaces for Sequential Decision-making0
Show:102550
← PrevPage 14 of 25Next →

No leaderboard results yet.