SOTAVerified

Sequential Decision Making

Papers

Showing 951–1000 of 1210 papers

| Title | Status | Hype |
|---|---|---|
| Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning | | 0 |
| Predicting Periodicity with Temporal Difference Learning | | 0 |
| Learning-to-defer for sequential medical decision-making under uncertainty | | 0 |
| Preference at First Sight | | 0 |
| Preference Optimization as Probabilistic Inference | | 0 |
| Reference Points, Risk-Taking Behavior, and Competitive Outcomes in Sequential Settings | | 0 |
| Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients | | 0 |
| Probabilistic DAG Search | | 0 |
| Probability Tools for Sequential Random Projection | | 0 |
| Proportional Aggregation of Preferences for Sequential Decision Making | | 0 |
| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | | 0 |
| Provable Reinforcement Learning with a Short-Term Memory | | 0 |
| PROVABLY BENEFITS OF DEEP HIERARCHICAL RL | | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | | 0 |
| Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | | 0 |
| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | | 0 |
| Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces | | 0 |
| Pure Exploration under Mediators' Feedback | | 0 |
| QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine | | 0 |
| Dynamical Linear Bandits | Code | 0 |
| Learning non-Markovian Decision-Making from State-only Sequences | Code | 0 |
| Doubly Inhomogeneous Reinforcement Learning | Code | 0 |
| Dynamic Real-time Multimodal Routing with Hierarchical Hybrid Planning | Code | 0 |
| Learning Non-myopic Power Allocation in Constrained Scenarios | Code | 0 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Code | 0 |
| Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning | Code | 0 |
| Ecole: A Library for Learning Inside MILP Solvers | Code | 0 |
| On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Code | 0 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Code | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Code | 0 |
| Probabilistic Constrained Reinforcement Learning with Formal Interpretability | Code | 0 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Code | 0 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | Code | 0 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Code | 0 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Code | 0 |
| Learning Structural Weight Uncertainty for Sequential Decision-Making | Code | 0 |
| Efficient Sequence Labeling with Actor-Critic Training | Code | 0 |
| Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning | Code | 0 |
| Learning to Follow Instructions in Text-Based Games | Code | 0 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Code | 0 |
| Differential Privacy in Cooperative Multiagent Planning | Code | 0 |
| Learning to Generalize for Sequential Decision Making | Code | 0 |
| Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game | Code | 0 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Code | 0 |
| End-to-End Goal-Driven Web Navigation | Code | 0 |
| Enforcing Almost-Sure Reachability in POMDPs | Code | 0 |
| Solving Long-run Average Reward Robust MDPs via Stochastic Games | Code | 0 |
| Enhancing the Accuracy and Fairness of Human Decision Making | Code | 0 |
| Scalable Exploration via Ensemble++ | Code | 0 |
Page 20 of 25

No leaderboard results yet.