SOTAVerified

Sequential Decision Making

Papers

Showing 401450 of 1210 papers

TitleStatusHype
Self-evolving Autoencoder Embedded Q-Network0
Probability Tools for Sequential Random Projection0
Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary SettingsCode0
Online Sequential Decision-Making with Unknown Delays0
Auxiliary Reward Generation with Transition Distance Representation Learning0
Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian OptimizationCode0
Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming0
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning AgentsCode0
A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System0
Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs0
Vertical Symbolic Regression via Deep Policy GradientCode0
Zero-Shot Reinforcement Learning via Function EncodersCode0
Regularized Q-Learning with Linear Function Approximation0
Long-Term Fair Decision Making through Deep Generative ModelsCode0
Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning0
Learning Non-myopic Power Allocation in Constrained ScenariosCode0
LLMs for Relational Reasoning: How Far are We?0
DeLF: Designing Learning Environments with Foundation ModelsCode0
Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback0
Interactions between dynamic team composition and coordination: An agent-based modeling approach0
Graph Q-Learning for Combinatorial Optimization0
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond0
Decision Making in Non-Stationary Environments with Policy-Augmented SearchCode0
Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach0
Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision ProcessesCode0
Harnessing the Power of Federated Learning in Federated Contextual BanditsCode0
Solving Long-run Average Reward Robust MDPs via Stochastic GamesCode0
Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score ClimbingCode0
Parameterized Projected Bellman OperatorCode0
Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge GraphsCode0
Robust Active Measuring under Model UncertaintyCode0
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints0
Risk-Aware Continuous Control with Neural Contextual BanditsCode0
A Customizable Generator for Comic-Style Visual Narrative0
Learning adaptive planning representations with natural language guidance0
Online Decision Making with History-Average Dependent Costs (Extended)Code0
Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision MakingCode0
A Review of Cooperation in Multi-agent Learning0
Distributed Optimization via Kernelized Multi-armed Bandits0
Generalization to New Sequential Decision Making Tasks with In-Context Learning0
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games0
Learning Curricula in Open-Ended Worlds0
AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities0
TORE: Token Recycling in Vision Transformers for Efficient Active Visual ExplorationCode0
History Filtering in Imperfect Information Games: Algorithms and Complexity0
Learning Dynamic Selection and Pricing of Out-of-Home DeliveriesCode0
Code Models are Zero-shot Precondition Reasoners0
An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making0
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability0
Likelihood Ratio Confidence Sets for Sequential Decision Making0
Show:102550
← PrevPage 9 of 25Next →

No leaderboard results yet.