SOTAVerified

Sequential Decision Making

Papers

Showing 901950 of 1210 papers

TitleStatusHype
Bandits in Matching Markets: Ideas and Proposals for Peer Lending0
Towards Safe Policy Improvement for Non-Stationary MDPsCode0
What are the Statistical Limits of Offline RL with Linear Function Approximation?0
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking0
DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees0
Learning to Generalize for Sequential Decision MakingCode0
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games0
Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment0
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon0
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints0
Transfer Learning in Deep Reinforcement Learning: A Survey0
Causal Bandits without prior knowledge using separating sets0
Toward the Fundamental Limits of Imitation Learning0
Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes0
Inverse Policy Evaluation for Value-based Sequential Decision-making0
Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces0
A Survey of Knowledge-based Sequential Decision Making under Uncertainty0
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey0
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning0
Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version0
Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning0
Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning0
Data-efficient visuomotor policy training using reinforcement learning and generative models0
AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning0
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches0
Fast reinforcement learning with generalized policy updates0
GraphOpt: Learning Optimization Models of Graph Formation0
Learning "What-if" Explanations for Sequential Decision-Making0
Convex Regularization in Monte-Carlo Tree Search0
Falsification-Based Robust Adversarial Reinforcement Learning0
Model-based Reinforcement Learning: A Survey0
Enforcing Almost-Sure Reachability in POMDPsCode0
On Bellman's Optimality Principle for zs-POSGs0
A Unifying Framework for Reinforcement Learning and Planning0
Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks0
Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty0
Towards Tractable Optimism in Model-Based Reinforcement Learning0
Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence FunctionsCode0
Counterfactually Guided Off-policy Transfer in Clinical Settings0
Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect0
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework0
Mutual Information Based Knowledge Transfer Under State-Action Dimension MismatchCode0
On the Relationship Between Structure in Natural Language and Models of Sequential Decision Processes0
Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed EnvironmentsCode0
Group-Fair Online Allocation in Continuous Time0
When is Particle Filtering Efficient for Planning in Partially Observed Linear Dynamical Systems?0
Modeling Human Driving Behavior through Generative Adversarial Imitation Learning0
Stealing Deep Reinforcement Learning Models for Fun and Profit0
Sharp Thresholds of the Information Cascade Fragility Under a Mismatched Model0
When Does MAML Objective Have Benign Landscape?0
Show:102550
← PrevPage 19 of 25Next →

No leaderboard results yet.