SOTAVerified

Sequential Decision Making

Papers

Showing 76100 of 1210 papers

TitleStatusHype
AdaPlanner: Adaptive Planning from Feedback with Language ModelsCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted PrescriptionCode1
Extracting Reward Functions from Diffusion ModelsCode1
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningCode1
Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand SystemsCode1
Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control ProblemCode1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI GymCode1
Large Language Model as a Policy Teacher for Training Reinforcement Learning AgentsCode1
Layered and Staged Monte Carlo Tree Search for SMT Strategy SynthesisCode1
An Alternative Softmax Operator for Reinforcement LearningCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt TuningCode1
Markup-to-Image Diffusion Models with Scheduled SamplingCode1
Masked Trajectory Models for Prediction, Representation, and ControlCode1
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop FeedbackCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
An empirical evaluation of active inference in multi-armed banditsCode1
Multi-task Causal Learning with Gaussian ProcessesCode1
Decision Stacks: Flexible Reinforcement Learning via Modular Generative ModelsCode1
PDDLGym: Gym Environments from PDDL ProblemsCode1
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in ControlCode1
Pursuing Overall Welfare in Federated Learning through Sequential Decision MakingCode1
Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and TransformerCode1
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games0
Show:102550
← PrevPage 4 of 49Next →

No leaderboard results yet.