SOTAVerified

Sequential Decision Making

Papers

Showing 301350 of 1210 papers

TitleStatusHype
Enhancing the Accuracy and Fairness of Human Decision MakingCode0
Curriculum Design for Teaching via Demonstrations: Theory and ApplicationsCode0
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form GamesCode0
Agent-State Construction with Auxiliary InputsCode0
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: CorrectionsCode0
Achieving Long-Term Fairness in Sequential Decision MakingCode0
Efficient Sequence Labeling with Actor-Critic TrainingCode0
Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus ErythematosusCode0
Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version)Code0
Learning Non-myopic Power Allocation in Constrained ScenariosCode0
Deep Reinforcement Learning for Surgical Gesture Segmentation and ClassificationCode0
Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic EnvironmentsCode0
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness RewardCode0
Learning to Generalize for Sequential Decision MakingCode0
Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary SettingsCode0
Learning Versatile Skills with Curriculum MaskingCode0
Deep Variational Reinforcement Learning for POMDPsCode0
Autoregressive BanditsCode0
Dynamic Real-time Multimodal Routing with Hierarchical Hybrid PlanningCode0
Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical SystemsCode0
Loss Bounds for Approximate Influence-Based AbstractionCode0
DeLF: Designing Learning Environments with Foundation ModelsCode0
Dynamical Linear BanditsCode0
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPsCode0
Ecole: A Library for Learning Inside MILP SolversCode0
Making Universal Policies UniversalCode0
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision MakingCode0
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resamplingCode0
Merging Decision Transformers: Weight Averaging for Forming Multi-Task PoliciesCode0
Differentially Private Regret Minimization in Episodic Markov Decision ProcessesCode0
Doubly Inhomogeneous Reinforcement LearningCode0
Co-training for Policy LearningCode0
A New Bandit Setting Balancing Information from State Evolution and Corrupted ContextCode0
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision MakingCode0
Batch Bayesian optimisation via density-ratio estimation with guaranteesCode0
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit RateCode0
Discrete-Time Distribution Steering using Monte Carlo Tree SearchCode0
Distance Weighted Supervised Learning for Offline Interaction DataCode0
Neural Contextual Bandits without RegretCode0
Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces0
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation0
A Survey of Continual Reinforcement Learning0
CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation0
A Sufficient Statistic for Influence in Structured Multiagent Environments0
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games0
Cooperative Bayesian Optimization for Imperfect Agents0
Convex Regularization in Monte-Carlo Tree Search0
A Strong Baseline for Batch Imitation Learning0
Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning0
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs0
Show:102550
← PrevPage 7 of 25Next →

No leaderboard results yet.