SOTAVerified

Sequential Decision Making

Papers

Showing 401-425 of 1210 papers

Title | Status | Hype
Self-evolving Autoencoder Embedded Q-Network | | 0
Probability Tools for Sequential Random Projection | | 0
Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Code | 0
Online Sequential Decision-Making with Unknown Delays | | 0
Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Code | 0
Auxiliary Reward Generation with Transition Distance Representation Learning | | 0
Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | | 0
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Code | 0
A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System | | 0
Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs | | 0
Vertical Symbolic Regression via Deep Policy Gradient | Code | 0
Zero-Shot Reinforcement Learning via Function Encoders | Code | 0
Regularized Q-Learning with Linear Function Approximation | | 0
Long-Term Fair Decision Making through Deep Generative Models | Code | 0
Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning | | 0
Learning Non-myopic Power Allocation in Constrained Scenarios | Code | 0
LLMs for Relational Reasoning: How Far are We? | | 0
Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback | | 0
DeLF: Designing Learning Environments with Foundation Models | Code | 0
Graph Q-Learning for Combinatorial Optimization | | 0
Interactions between dynamic team composition and coordination: An agent-based modeling approach | | 0
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | | 0
Decision Making in Non-Stationary Environments with Policy-Augmented Search | Code | 0
Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach | | 0
Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes | Code | 0

No leaderboard results yet.