SOTAVerified

Sequential Decision Making

Papers

Showing 11011150 of 1210 papers

TitleStatusHype
Hindsight and Sequential Rationality of Correlated PlayCode0
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement LearningCode0
Model-Free Episodic ControlCode0
Hindsight Learning for MDPs with Exogenous InputsCode0
Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version)Code0
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision MakingCode0
DeLF: Designing Learning Environments with Foundation ModelsCode0
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement LearningCode0
How Should We Represent History in Interpretable Models of Clinical Policies?Code0
What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement LearningCode0
Revelations: A Decidable Class of POMDPs with Omega-Regular ObjectivesCode0
Sequential Decision Making with Expert Demonstrations under Unobserved HeterogeneityCode0
Adaptive Sequence SubmodularityCode0
Policy Learning for Malaria ControlCode0
POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenanceCode0
PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear BanditsCode0
Sequential Monte Carlo BanditsCode0
Reward Design for Justifiable Sequential Decision-MakingCode0
Deep Variational Reinforcement Learning for POMDPsCode0
Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement LearningCode0
Contextual Bandits with Large Action Spaces: Made PracticalCode0
Computing the Feedback Capacity of Finite State Channels using Reinforcement LearningCode0
Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet ControlCode0
Reward Learning for Efficient Reinforcement Learning in Extractive Document SummarisationCode0
Prediction, Consistency, Curvature: Representation Learning for Locally-Linear ControlCode0
Towards Trustworthy GUI Agents: A SurveyCode0
Imitation Learning from Purified DemonstrationsCode0
Reward Machines for Deep RL in Noisy and Uncertain EnvironmentsCode0
Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaRCode0
Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document TraversalCode0
Causal Explanations for Sequential Decision-Making in Multi-Agent SystemsCode0
Preserving the Privacy of Reward Functions in MDPs through DeceptionCode0
Risk-Aware Continuous Control with Neural Contextual BanditsCode0
Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex NetworksCode0
Improving Generalization in Reinforcement Learning Training Regimes for Social Robot NavigationCode0
Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradientsCode0
Bridging by Word: Image Grounded Vocabulary Construction for Visual CaptioningCode0
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness RewardCode0
Mutual Information Based Knowledge Transfer Under State-Action Dimension MismatchCode0
Decomposition Methods with Deep Corrections for Reinforcement LearningCode0
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management StrategiesCode0
Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation StrategiesCode0
Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costsCode0
Automaton-Guided Curriculum Generation for Reinforcement Learning AgentsCode0
Information-Theoretic Safe Exploration with Gaussian ProcessesCode0
PlayBest: Professional Basketball Player Behavior Synthesis via Planning with DiffusionCode0
Instance Temperature Knowledge DistillationCode0
Promises and Pitfalls of the Linearized Laplace in Bayesian OptimizationCode0
Agent-State Construction with Auxiliary InputsCode0
Thompson Sampling via Local UncertaintyCode0
Show:102550
← PrevPage 23 of 25Next →

No leaderboard results yet.