SOTAVerified

Sequential Decision Making

Papers

Showing 1001–1050 of 1210 papers

Title | Status | Hype
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | | 0
Bandits in Matching Markets: Ideas and Proposals for Peer Lending | | 0
Bandits with Unobserved Confounders: A Causal Approach | | 0
Batched Neural Bandits | | 0
Batched Nonparametric Bandits via k-Nearest Neighbor UCB | | 0
Learning "What-if" Explanations for Sequential Decision-Making | | 0
Bayesian decision-making under misspecified priors with applications to meta-learning | | 0
Bayesian Exploration Networks | | 0
Federated Learning with Uncertainty via Distilled Predictive Distributions | | 0
Bayesian Graph Traversal | | 0
Bayesian Inverse Transition Learning for Offline Settings | | 0
Bayesian learning of the optimal action-value function in a Markov decision process | | 0
Bayesian optimization explains human active search | | 0
Be Considerate: Objectives, Side Effects, and Deciding How to Act | | 0
Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | | 0
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | | 0
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning | | 0
Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio | | 0
Beyond O(√T) Regret: Decoupling Learning and Decision-making in Online Linear Programming | | 0
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | | 0
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits | | 0
Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments | | 0
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | | 0
Boltzmann Exploration Done Right | | 0
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey | | 0
Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning | | 0
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning | | 0
Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language | | 0
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting | | 0
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | | 0
Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | | 0
Building Intelligent Autonomous Navigation Agents | | 0
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes | | 0
Can A User Anticipate What Her Followers Want? | | 0
Reinforcement Learning for Freight Booking Control Problems | | 0
Causal Bayesian Optimization | | 0
Causal Bandits without prior knowledge using separating sets | | 0
Causal Markov Decision Processes: Learning Good Interventions Efficiently | | 0
Chasing Ghosts: Competing with Stateful Policies | | 0
Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks | | 0
Code Models are Zero-shot Precondition Reasoners | | 0
Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective | | 0
Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | | 0
The f-Divergence Reinforcement Learning Framework | | 0
Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs | | 0
Communication-Control Codesign for Large-Scale Wireless Networked Control Systems | | 0
Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning | | 0
Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks | | 0
Computing Preimages of Deep Neural Networks with Applications to Safety | | 0
Consensus in Motion: A Case of Dynamic Rationality of Sequential Learning in Probability Aggregation | | 0
Page 21 of 25

No leaderboard results yet.