SOTAVerified

Sequential Decision Making

Papers

Showing 351400 of 1210 papers

TitleStatusHype
DeLF: Designing Learning Environments with Foundation ModelsCode0
Interactions between dynamic team composition and coordination: An agent-based modeling approach0
Graph Q-Learning for Combinatorial Optimization0
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond0
Decision Making in Non-Stationary Environments with Policy-Augmented SearchCode0
Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach0
Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision ProcessesCode0
Harnessing the Power of Federated Learning in Federated Contextual BanditsCode0
Solving Long-run Average Reward Robust MDPs via Stochastic GamesCode0
Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score ClimbingCode0
Parameterized Projected Bellman OperatorCode0
Robust Active Measuring under Model UncertaintyCode0
Evaluating and Enhancing Large Language Models for Conversational Reasoning on Knowledge GraphsCode0
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints0
Risk-Aware Continuous Control with Neural Contextual BanditsCode0
A Customizable Generator for Comic-Style Visual Narrative0
Learning adaptive planning representations with natural language guidance0
LLF-Bench: Benchmark for Interactive Learning from Language FeedbackCode1
Online Decision Making with History-Average Dependent Costs (Extended)Code0
A Review of Cooperation in Multi-agent Learning0
Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision MakingCode0
Distributed Optimization via Kernelized Multi-armed Bandits0
Generalization to New Sequential Decision Making Tasks with In-Context Learning0
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI GymCode1
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games0
Learning Curricula in Open-Ended Worlds0
AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities0
TORE: Token Recycling in Vision Transformers for Efficient Active Visual ExplorationCode0
History Filtering in Imperfect Information Games: Algorithms and Complexity0
Learning Dynamic Selection and Pricing of Out-of-Home DeliveriesCode0
Large Language Model as a Policy Teacher for Training Reinforcement Learning AgentsCode1
Code Models are Zero-shot Precondition Reasoners0
An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making0
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability0
Likelihood Ratio Confidence Sets for Sequential Decision Making0
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search0
Safe Sequential Optimization for Switching Environments0
Using General Value Functions to Learn Domain-Backed Inventory Management Policies0
Efficient Symbolic Policy Learning with Differentiable Symbolic ExpressionCode0
Rethinking Decision Transformer via Hierarchical Reinforcement Learning0
Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving0
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningCode1
Regret-Minimization Algorithms for Multi-Agent Cooperative Learning Systems0
High-Dimensional Prediction for Sequential Decision Making0
Robust Visual Imitation Learning with Inverse Dynamics Representations0
O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models0
Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes0
Eureka: Human-Level Reward Design via Coding Large Language ModelsCode4
Auction-Based Scheduling0
Partially Observable Stochastic Games with Neural Perception Mechanisms0
Show:102550
← PrevPage 8 of 25Next →

No leaderboard results yet.