SOTAVerified

Sequential Decision Making

Papers

Showing 401450 of 1210 papers

TitleStatusHype
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control0
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPsCode0
Autonomous Tree-search Ability of Large Language Models0
Imitation Learning from Purified DemonstrationsCode0
Evaluating Explanation Methods for Vision-and-Language Navigation0
Global Convergence of Policy Gradient Methods in Reinforcement Learning, Games and Control0
Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach0
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods0
Learning to Reach Goals via DiffusionCode0
Towards a Unified Framework for Sequential Decision Making0
Learning to Make Adherence-Aware Advice0
TraCE: Trajectory Counterfactual Explanation ScoresCode0
State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding0
Delays in Reinforcement Learning0
Safe POMDP Online Planning via Shielding0
Interactively Teaching an Inverse Reinforcement Learner with Limited FeedbackCode0
Efficient quantum recurrent reinforcement learning via quantum reservoir computing0
Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning0
Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning0
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit RateCode0
INTAGS: Interactive Agent-Guided Simulation0
Pure Exploration under Mediators' Feedback0
Improving Generalization in Reinforcement Learning Training Regimes for Social Robot NavigationCode0
Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted AveragesCode0
Bayesian Exploration Networks0
Out of the Cage: How Stochastic Parrots Win in Cyber Security EnvironmentsCode1
LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient QueryingCode0
A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments0
Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes0
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making0
Value-Distributional Model-Based Reinforcement LearningCode0
Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception0
Bayesian Inverse Transition Learning for Offline Settings0
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial MasksCode0
Deep Reinforcement Learning for Robust Goal-Based Wealth Management0
DIP-RL: Demonstration-Inferred Preference Learning in Minecraft0
On the Expressivity of Multidimensional Markov Reward0
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors0
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement LearningCode0
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop FeedbackCode1
Online Learning with Costly Features in Non-stationary EnvironmentsCode0
Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards0
POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenanceCode0
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions0
Probabilistic Constrained Reinforcement Learning with Formal InterpretabilityCode0
FAIRO: Fairness-aware Adaptation in Sequential-Decision Making for Human-in-the-Loop Systems0
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits0
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource AllocationCode1
TGRL: An Algorithm for Teacher Guided Reinforcement Learning0
Generative Flow Networks: a Markov Chain Perspective0
Show:102550
← PrevPage 9 of 25Next →

No leaderboard results yet.