SOTAVerified

Sequential Decision Making

Papers

Showing 201250 of 1210 papers

TitleStatusHype
Sliding-Window Thompson Sampling for Non-Stationary Settings0
A naive aggregation algorithm for improving generalization in a class of learning problems0
InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management0
A Sequential Decision-Making Model for Perimeter Identification0
Temporal Elections: Welfare, Strategyproofness, and Proportionality0
How to Measure Human-AI Prediction Accuracy in Explainable AI Systems0
Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation0
An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing0
Contextual Bandits for Unbounded Context Distributions0
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Enhancing Heterogeneous Multi-Agent Cooperation in Decentralized MARL via GNN-driven Intrinsic RewardsCode0
Meta Clustering of Neural Bandits0
Structure and Reduction of MCTS for Explainable-AI0
Non-maximizing policies that fulfill multi-criterion aspirations in expectation0
Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps0
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt TuningCode1
Reinforcement Learning applied to Insurance Portfolio PursuitCode0
How to Choose a Reinforcement-Learning Algorithm0
Reinforcement Learning for Sustainable Energy: A Survey0
Assessing AI Utility: The Random Guesser Test for Sequential Decision-Making Systems0
Adversarially Robust Decision TransformerCode0
Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning0
Reinforcement Learning Meets Visual OdometryCode3
Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning0
Scalable Exploration via Ensemble++Code0
Managing Risk using Rolling Forecasts in Energy-Limited and Stochastic Energy Systems0
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent0
Exploration Unbound0
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning0
Preserving the Privacy of Reward Functions in MDPs through DeceptionCode0
Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning0
Long-Term Fairness in Sequential Multi-Agent Selection with Positive ReinforcementCode0
MDP Geometry, Normalization and Reward Balancing Solvers0
Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs0
Maximizing utility in multi-agent environments by anticipating the behavior of other learners0
Short-Long Policy Evaluation with Novel Actions0
Exploring a Physics-Informed Decision Transformer for Distribution System Restoration: Methodology and Performance Analysis0
Tradeoffs When Considering Deep Reinforcement Learning for Contingency Management in Advanced Air Mobility0
Operator World Models for Reinforcement LearningCode0
Instance Temperature Knowledge DistillationCode0
UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models0
Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization0
Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradientsCode0
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency TradingCode2
Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond0
ARDuP: Active Region Video Diffusion for Universal Policies0
Efficient Sequential Decision Making with Large Language Models0
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms0
Model Adaptation for Time Constrained Embodied Control0
Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits0
Show:102550
← PrevPage 5 of 25Next →

No leaderboard results yet.