SOTAVerified

Sequential Decision Making

Papers

Showing 76100 of 1210 papers

TitleStatusHype
AdaPlanner: Adaptive Planning from Feedback with Language ModelsCode1
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?Code1
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systemsCode1
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource AllocationCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning PoliciesCode1
Dynamic Causal Bayesian OptimizationCode1
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt TuningCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
An Alternative Softmax Operator for Reinforcement LearningCode1
IQ-Learn: Inverse soft-Q Learning for ImitationCode1
Object-Aware Regularization for Addressing Causal Confusion in Imitation LearningCode1
On Generalization Across Environments In Multi-Objective Reinforcement LearningCode1
Out of the Cage: How Stochastic Parrots Win in Cyber Security EnvironmentsCode1
Counterfactual Explanations in Sequential Decision Making Under UncertaintyCode1
LLF-Bench: Benchmark for Interactive Learning from Language FeedbackCode1
An empirical evaluation of active inference in multi-armed banditsCode1
Curriculum-based Reinforcement Learning for Distribution System Critical Load RestorationCode1
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction GuaranteesCode1
Decision Stacks: Flexible Reinforcement Learning via Modular Generative ModelsCode1
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State SpacesCode1
Reinforcement learning with combinatorial actions for coupled restless banditsCode1
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement LearningCode1
Distance Weighted Supervised Learning for Offline Interaction DataCode0
Show:102550
← PrevPage 4 of 49Next →

No leaderboard results yet.