SOTAVerified

Sequential Decision Making

Papers

Showing 251300 of 1210 papers

TitleStatusHype
Cooperative Online Learning with Feedback GraphsCode0
Probabilistic Constrained Reinforcement Learning with Formal InterpretabilityCode0
Hindsight and Sequential Rationality of Correlated PlayCode0
SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing SurrogateCode0
A2-RL: Aesthetics Aware Reinforcement Learning for Image CroppingCode0
Harnessing the Power of Federated Learning in Federated Contextual BanditsCode0
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement LearningCode0
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial MasksCode0
Finding Counterfactually Optimal Action Sequences in Continuous State SpacesCode0
Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence FunctionsCode0
Fast reinforcement learning with generalized policy updatesCode0
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari GamesCode0
Federated Online Clustering of BanditsCode0
Generalization to New Sequential Decision Making Tasks with In-Context LearningCode0
Imitation Learning from Purified DemonstrationsCode0
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?Code0
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit RateCode0
Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary SettingsCode0
Evolutionary Multi-Armed Bandits with Genetic Thompson SamplingCode0
End-to-End Goal-Driven Web NavigationCode0
Efficient Symbolic Policy Learning with Differentiable Symbolic ExpressionCode0
Enforcing Almost-Sure Reachability in POMDPsCode0
Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot ActionsCode0
Ecole: A Library for Learning Inside MILP SolversCode0
Enhancing Heterogeneous Multi-Agent Cooperation in Decentralized MARL via GNN-driven Intrinsic RewardsCode0
Dynamic Real-time Multimodal Routing with Hierarchical Hybrid PlanningCode0
Hierarchical Reinforcement Learning with AI Planning ModelsCode0
Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical SystemsCode0
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form GamesCode0
Efficient Sequence Labeling with Actor-Critic TrainingCode0
Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian FrameworkCode0
Data Generation as Sequential Decision MakingCode0
Enhancing the Accuracy and Fairness of Human Decision MakingCode0
Decision Making in Non-Stationary Environments with Policy-Augmented SearchCode0
TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement LearningCode0
Dynamical Linear BanditsCode0
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: CorrectionsCode0
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient SimulatorsCode0
Algorithms for Fairness in Sequential Decision MakingCode0
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson SamplingCode0
AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive CrossbarsCode0
Distance Weighted Supervised Learning for Offline Interaction DataCode0
Doubly Inhomogeneous Reinforcement LearningCode0
FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear BanditsCode0
Differential Privacy in Cooperative Multiagent PlanningCode0
"Give Me an Example Like This": Episodic Active Reinforcement Learning from DemonstrationsCode0
Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With RenegingCode0
Deep Q-Network for Angry BirdsCode0
Deep Reinforcement Learning Algorithms for Option HedgingCode0
A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement LearningCode0
Show:102550
← PrevPage 6 of 25Next →

No leaderboard results yet.