SOTAVerified

Sequential Decision Making

Papers

Showing 10511100 of 1210 papers

TitleStatusHype
Operator World Models for Reinforcement LearningCode0
Curriculum Design for Teaching via Demonstrations: Theory and ApplicationsCode0
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from ObservationsCode0
Depth Matters: Multimodal RGB-D Perception for Robust Autonomous AgentsCode0
Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence FunctionsCode0
Making Universal Policies UniversalCode0
Optimal Control of Mechanical Ventilators with Learned Respiratory DynamicsCode0
Achieving Long-Term Fairness in Sequential Decision MakingCode0
Back to the Future -- Sequential Alignment of Text RepresentationsCode0
Reinforcement Learning of Risk-Constrained Policies in Markov Decision ProcessesCode0
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision MakingCode0
Co-training for Policy LearningCode0
Value Gradient Sampler: Sampling as Sequential Decision MakingCode0
A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement LearningCode0
Reinforcement Learning When All Actions are Not Always AvailableCode0
Autoregressive BanditsCode0
Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product SearchCode0
The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision MakingCode0
Vertical Symbolic Regression via Deep Policy GradientCode0
A New Bandit Setting Balancing Information from State Evolution and Corrupted ContextCode0
Merging Decision Transformers: Weight Averaging for Forming Multi-Task PoliciesCode0
"Give Me an Example Like This": Episodic Active Reinforcement Learning from DemonstrationsCode0
Reinforcement Learning applied to Insurance Portfolio PursuitCode0
ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive ModelsCode0
UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-MakingCode0
Classification with Costly Features using Deep Reinforcement LearningCode0
A Deep Reinforcement Learning Framework For Column GenerationCode0
PageRank Bandits for Link PredictionCode0
Zero-Shot Reinforcement Learning via Function EncodersCode0
Parameterized Projected Bellman OperatorCode0
Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision MakingCode0
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement LearningCode0
Structural Causal Bandits: Where to Intervene?Code0
Minimax-Bayes Reinforcement LearningCode0
The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human PriorsCode0
Harnessing the Power of Federated Learning in Federated Contextual BanditsCode0
A2-RL: Aesthetics Aware Reinforcement Learning for Image CroppingCode0
Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With RenegingCode0
Cooperative Online Learning with Feedback GraphsCode0
Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian AdaptationCode0
Continuous Monte Carlo Graph SearchCode0
Structured Control Nets for Deep Reinforcement LearningCode0
Classification with Costly Features as a Sequential Decision-Making ProblemCode0
Adaptive teachers for amortized samplersCode0
Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot NavigationCode0
Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and DetectionCode0
Towards Safe Policy Improvement for Non-Stationary MDPsCode0
Planning with Goal-Conditioned PoliciesCode0
Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily AssistantCode0
Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement LearningCode0
Show:102550
← PrevPage 22 of 25Next →

No leaderboard results yet.