SOTAVerified

Sequential Decision Making

Papers

Showing 2650 of 1210 papers

TitleStatusHype
LLF-Bench: Benchmark for Interactive Learning from Language FeedbackCode1
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?Code1
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systemsCode1
Learning Discrete World Models for Heuristic SearchCode1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI GymCode1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
Extracting Reward Functions from Diffusion ModelsCode1
Dynamic Causal Bayesian OptimizationCode1
Deep Reinforcement Learning for Entity AlignmentCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningCode1
Decision Stacks: Flexible Reinforcement Learning via Modular Generative ModelsCode1
AdaPlanner: Adaptive Planning from Feedback with Language ModelsCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
An Alternative Softmax Operator for Reinforcement LearningCode1
An empirical evaluation of active inference in multi-armed banditsCode1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted PrescriptionCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
DataEnvGym: Data Generation Agents in Teacher Environments with Student FeedbackCode1
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning ApproachCode1
Adaptive Stress Testing of Trajectory Predictions in Flight Management SystemsCode1
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop FeedbackCode1
Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand SystemsCode1
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State SpacesCode1
Show:102550
← PrevPage 2 of 49Next →

No leaderboard results yet.