SOTAVerified

Sequential Decision Making

Papers

Showing 5175 of 1210 papers

TitleStatusHype
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningCode1
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning ApproachCode1
Multi-task Causal Learning with Gaussian ProcessesCode1
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource AllocationCode1
How Can LLM Guide RL? A Value-Based ApproachCode1
IQ-Learn: Inverse soft-Q Learning for ImitationCode1
Counterfactual Explanations in Sequential Decision Making Under UncertaintyCode1
PDDLGym: Gym Environments from PDDL ProblemsCode1
DataEnvGym: Data Generation Agents in Teacher Environments with Student FeedbackCode1
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in ControlCode1
Re-ReST: Reflection-Reinforced Self-Training for Language AgentsCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
Dynamic Causal Bayesian OptimizationCode1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted PrescriptionCode1
Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions ModelingCode1
Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply ChainsCode1
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt TuningCode1
An empirical evaluation of active inference in multi-armed banditsCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
An Alternative Softmax Operator for Reinforcement LearningCode1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
Deep Reinforcement Learning for Entity AlignmentCode1
Extracting Reward Functions from Diffusion ModelsCode1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
Show:102550
← PrevPage 3 of 49Next →

No leaderboard results yet.