SOTAVerified

Sequential Decision Making

Papers

Showing 1001-1050 of 1210 papers

Title | Status | Hype
Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Code | 0
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Code | 0
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Code | 0
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Code | 0
Learning to Reach Goals via Diffusion | Code | 0
Online Decision Making with History-Average Dependent Costs (Extended) | Code | 0
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Code | 0
Online Learning with Costly Features in Non-stationary Environments | Code | 0
SOPE: Spectrum of Off-Policy Estimators | Code | 0
Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | Code | 0
Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Code | 0
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation | Code | 0
Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability Objectives | Code | 0
Learning Versatile Skills with Curriculum Masking | Code | 0
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Code | 0
Reinforcement Learning | Code | 0
TORE: Token Recycling in Vision Transformers for Efficient Active Visual Exploration | Code | 0
Lifelong Learning with a Changing Action Set | Code | 0
TextAtari: 100K Frames Game Playing with Language Agents | Code | 0
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Code | 0
Adversarially Robust Decision Transformer | Code | 0
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Code | 0
Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Code | 0
SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Code | 0
Universal Off-Policy Evaluation | Code | 0
LISA: Learning Interpretable Skill Abstractions from Language | Code | 0
Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game Approach | Code | 0
Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework | Code | 0
Algorithms for Fairness in Sequential Decision Making | Code | 0
Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge Graphs | Code | 0
Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes | Code | 0
Adversarial Environment Generation for Learning to Navigate the Web | Code | 0
Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks | Code | 0
Locally Private Nonparametric Contextual Multi-armed Bandits | Code | 0
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Code | 0
Data Generation as Sequential Decision Making | Code | 0
Long-Term Fair Decision Making through Deep Generative Models | Code | 0
Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement | Code | 0
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Code | 0
Toward Policy Explanations for Multi-Agent Reinforcement Learning | Code | 0
Loss Bounds for Approximate Influence-Based Abstraction | Code | 0
Federated Online Clustering of Bandits | Code | 0
Batch Bayesian optimisation via density-ratio estimation with guarantees | Code | 0
Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Code | 0
Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Code | 0
SCALES: From Fairness Principles to Constrained Decision-Making | Code | 0
MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python | Code | 0
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Code | 0
FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits | Code | 0
Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | Code | 0
