SOTAVerified

Sequential Decision Making

Papers

Showing 451–500 of 1210 papers

Title | Status | Hype
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search | | 0
Using General Value Functions to Learn Domain-Backed Inventory Management Policies | | 0
Safe Sequential Optimization for Switching Environments | | 0
Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Code | 0
Rethinking Decision Transformer via Hierarchical Reinforcement Learning | | 0
Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | | 0
Regret-Minimization Algorithms for Multi-Agent Cooperative Learning Systems | | 0
High-Dimensional Prediction for Sequential Decision Making | | 0
Robust Visual Imitation Learning with Inverse Dynamics Representations | | 0
O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models | | 0
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | | 0
Auction-Based Scheduling | | 0
Partially Observable Stochastic Games with Neural Perception Mechanisms | | 0
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | | 0
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Code | 0
Autonomous Tree-search Ability of Large Language Models | | 0
Imitation Learning from Purified Demonstrations | Code | 0
Evaluating Explanation Methods for Vision-and-Language Navigation | | 0
Global Convergence of Policy Gradient Methods in Reinforcement Learning, Games and Control | | 0
Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach | | 0
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | | 0
Learning to Reach Goals via Diffusion | Code | 0
Towards a Unified Framework for Sequential Decision Making | | 0
Learning to Make Adherence-Aware Advice | | 0
TraCE: Trajectory Counterfactual Explanation Scores | Code | 0
State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding | | 0
Delays in Reinforcement Learning | | 0
Safe POMDP Online Planning via Shielding | | 0
Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback | Code | 0
Efficient quantum recurrent reinforcement learning via quantum reservoir computing | | 0
Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning | | 0
Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning | | 0
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Code | 0
INTAGS: Interactive Agent-Guided Simulation | | 0
Improving Generalization in Reinforcement Learning Training Regimes for Social Robot Navigation | Code | 0
Pure Exploration under Mediators' Feedback | | 0
Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages | Code | 0
Bayesian Exploration Networks | | 0
LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying | Code | 0
A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments | | 0
Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes | | 0
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making | | 0
Value-Distributional Model-Based Reinforcement Learning | Code | 0
Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception | | 0
Bayesian Inverse Transition Learning for Offline Settings | | 0
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Code | 0
Deep Reinforcement Learning for Robust Goal-Based Wealth Management | | 0
DIP-RL: Demonstration-Inferred Preference Learning in Minecraft | | 0
On the Expressivity of Multidimensional Markov Reward | | 0
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Code | 0

No leaderboard results yet.