SOTAVerified

Sequential Decision Making

Papers

Showing 901950 of 1210 papers

TitleStatusHype
Occupancy Anticipation for Efficient Exploration and NavigationCode1
A Survey of Knowledge-based Sequential Decision Making under Uncertainty0
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey0
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning0
Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version0
Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning0
Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning0
Data-efficient visuomotor policy training using reinforcement learning and generative models0
AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning0
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches0
Fast reinforcement learning with generalized policy updates0
GraphOpt: Learning Optimization Models of Graph Formation0
Learning "What-if" Explanations for Sequential Decision-Making0
Falsification-Based Robust Adversarial Reinforcement Learning0
Convex Regularization in Monte-Carlo Tree Search0
Enforcing Almost-Sure Reachability in POMDPsCode0
Model-based Reinforcement Learning: A Survey0
On Bellman's Optimality Principle for zs-POSGs0
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
A Unifying Framework for Reinforcement Learning and Planning0
Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks0
Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty0
Towards Tractable Optimism in Model-Based Reinforcement Learning0
Counterfactually Guided Off-policy Transfer in Clinical Settings0
Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence FunctionsCode0
Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect0
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework0
On the Relationship Between Structure in Natural Language and Models of Sequential Decision Processes0
Mutual Information Based Knowledge Transfer Under State-Action Dimension MismatchCode0
Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed EnvironmentsCode0
Group-Fair Online Allocation in Continuous Time0
Modeling Human Driving Behavior through Generative Adversarial Imitation Learning0
When is Particle Filtering Efficient for Planning in Partially Observed Linear Dynamical Systems?0
Stealing Deep Reinforcement Learning Models for Fun and Profit0
Sharp Thresholds of the Information Cascade Fragility Under a Mismatched Model0
When Does MAML Objective Have Benign Landscape?0
Reinforcement LearningCode0
Dynamic Bi-Objective Routing of Multiple Vehicles0
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Active Measure Reinforcement Learning for Observation Cost Minimization0
Causal Bayesian Optimization0
Implementability of Honest Multi-Agent Sequential Decision-Making with Dynamic Population0
Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement LearningCode0
Scalable First-Order Methods for Robust MDPs0
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RLCode1
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning0
iCORPP: Interleaved Commonsense Reasoning and Probabilistic Planning on Robots0
Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems0
Sequential Batch Learning in Finite-Action Linear Contextual Bandits0
Distributed Learning: Sequential Decision Making in Resource-Constrained Environments0
Show:102550
← PrevPage 19 of 25Next →

No leaderboard results yet.