SOTAVerified

Sequential Decision Making

Papers

Showing 301325 of 1210 papers

TitleStatusHype
Fast Value Tracking for Deep Reinforcement Learning0
Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion0
State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards0
Supervised Fine-Tuning as Inverse Reinforcement Learning0
Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC0
Regret Minimization via Saddle Point Optimization0
AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents0
Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and TransformerCode1
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation0
LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem0
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned DecisionCode1
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem0
Cooperative Bayesian Optimization for Imperfect Agents0
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation0
Language Guided Exploration for RL Agents in Text Environments0
On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games0
Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds0
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections0
How Can LLM Guide RL? A Value-Based ApproachCode1
Reward Design for Justifiable Sequential Decision-MakingCode0
Information-Theoretic Safe Bayesian Optimization0
On the Performance of Empirical Risk Minimization with Smoothed Data0
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay0
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Toward TransfORmers: Revolutionizing the Solution of Mixed Integer Programs with Transformers0
Show:102550
← PrevPage 13 of 49Next →

No leaderboard results yet.