SOTAVerified

Sequential Decision Making

Papers

Showing 201250 of 1210 papers

TitleStatusHype
Learning model-based planning from scratchCode0
End-to-End Goal-Driven Web NavigationCode0
Learning Sparse Rewarded Tasks from Sub-Optimal DemonstrationsCode0
Learning Structural Weight Uncertainty for Sequential Decision-MakingCode0
Learning to Follow Instructions in Text-Based GamesCode0
Enforcing Almost-Sure Reachability in POMDPsCode0
Efficient Symbolic Policy Learning with Differentiable Symbolic ExpressionCode0
Scalable Exploration via Ensemble++Code0
Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradientsCode0
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal ControlCode0
Enhancing the Accuracy and Fairness of Human Decision MakingCode0
Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge GraphsCode0
Causal Explanations for Sequential Decision-Making in Multi-Agent SystemsCode0
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: CorrectionsCode0
Back to the Future -- Sequential Alignment of Text RepresentationsCode0
AVID: Adapting Video Diffusion Models to World ModelsCode0
Classification with Costly Features as a Sequential Decision-Making ProblemCode0
Classification with Costly Features using Deep Reinforcement LearningCode0
Reinforcement Learning applied to Insurance Portfolio PursuitCode0
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form GamesCode0
Efficient Sequence Labeling with Actor-Critic TrainingCode0
Collaborative Comic Generation: Integrating Visual Narrative Theories with AI Models for Enhanced CreativityCode0
Adversarial Environment Generation for Learning to Navigate the WebCode0
Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary SettingsCode0
LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient QueryingCode0
Common Benchmarks Undervalue the Generalization Power of Programmatic PoliciesCode0
Adversarially Robust Decision TransformerCode0
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision MakingCode0
Autoregressive BanditsCode0
Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic EnvironmentsCode0
Dynamical Linear BanditsCode0
Dynamic Real-time Multimodal Routing with Hierarchical Hybrid PlanningCode0
Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation StrategiesCode0
Computing the Feedback Capacity of Finite State Channels using Reinforcement LearningCode0
Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version)Code0
Nonmyopic Global Optimisation via Approximate Dynamic ProgrammingCode0
Automaton-Guided Curriculum Generation for Reinforcement Learning AgentsCode0
Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent DemandCode0
Achieving Long-Term Fairness in Sequential Decision MakingCode0
Doubly Robust Off-policy Value Evaluation for Reinforcement LearningCode0
Contextual Bandits with Large Action Spaces: Made PracticalCode0
Doubly Robust Policy Evaluation and OptimizationCode0
On Learning Intrinsic Rewards for Policy Gradient MethodsCode0
Online Decision Making with History-Average Dependent Costs (Extended)Code0
Operator World Models for Reinforcement LearningCode0
Continuous Monte Carlo Graph SearchCode0
Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical SystemsCode0
Parameterized Projected Bellman OperatorCode0
Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot NavigationCode0
Discrete-Time Distribution Steering using Monte Carlo Tree SearchCode0
Show:102550
← PrevPage 5 of 25Next →

No leaderboard results yet.