SOTAVerified

Sequential Decision Making

Papers

Showing 151200 of 1210 papers

TitleStatusHype
Action Set Based Policy Optimization for Safe Power Grid Management0
Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits0
Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps0
An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making0
Boltzmann Exploration Done Right0
Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics0
Composite Bayesian Optimization In Function Spaces Using NEON -- Neural Epistemic Operator Networks0
Computing Preimages of Deep Neural Networks with Applications to Safety0
Constrained Online Decision-Making: A Unified Framework0
Contextual Experience Replay for Self-Improvement of Language Agents0
Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning0
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits0
The f-Divergence Reinforcement Learning Framework0
A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs0
Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL0
Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning0
Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs0
An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits0
A Contextual Bandit Approach for Stream-Based Active Learning0
Code Models are Zero-shot Precondition Reasoners0
An Analysis of Frame-skipping in Reinforcement Learning0
Bayesian learning of the optimal action-value function in a Markov decision process0
Bayesian Inverse Transition Learning for Offline Settings0
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits0
Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds0
Collaborative and Federated Black-box Optimization: A Bayesian Optimization Perspective0
Bayesian optimization explains human active search0
Be Considerate: Objectives, Side Effects, and Deciding How to Act0
An Anytime Algorithm for Task and Motion MDPs0
Adaptive Robust Online Portfolio Selection0
Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations0
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay0
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning0
Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio0
Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming0
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods0
Communication-Control Codesign for Large-Scale Wireless Networked Control Systems0
Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments0
An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing0
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits0
Bayesian Graph Traversal0
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey0
Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning0
An Introduction to Quantum Reinforcement Learning (QRL)0
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning0
Federated Learning with Uncertainty via Distilled Predictive Distributions0
An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services0
Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language0
Bayesian Exploration Networks0
Bayesian decision-making under misspecified priors with applications to meta-learning0
Show:102550
← PrevPage 4 of 25Next →

No leaderboard results yet.