SOTAVerified

Sequential Decision Making

Papers

Showing 326-350 of 1210 papers

| Title | Status | Hype |
| --- | --- | --- |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | | 0 |
| Self-evolving Autoencoder Embedded Q-Network | | 0 |
| Probability Tools for Sequential Random Projection | | 0 |
| PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Code | 1 |
| Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Code | 2 |
| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Code | 0 |
| Online Sequential Decision-Making with Unknown Delays | | 0 |
| Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Code | 0 |
| Auxiliary Reward Generation with Transition Distance Representation Learning | | 0 |
| Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Code | 1 |
| Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | | 0 |
| Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Code | 1 |
| Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Code | 0 |
| A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System | | 0 |
| Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills | Code | 1 |
| Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs | | 0 |
| Vertical Symbolic Regression via Deep Policy Gradient | Code | 0 |
| Zero-Shot Reinforcement Learning via Function Encoders | Code | 0 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Code | 1 |
| Regularized Q-Learning with Linear Function Approximation | | 0 |
| Long-Term Fair Decision Making through Deep Generative Models | Code | 0 |
| Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning | | 0 |
| Learning Non-myopic Power Allocation in Constrained Scenarios | Code | 0 |
| LLMs for Relational Reasoning: How Far are We? | | 0 |
| Towards Off-Policy Reinforcement Learning for Ranking Policies with Human Feedback | | 0 |