SOTAVerified

Sequential Decision Making

Papers

Showing 526-550 of 1210 papers

Title | Status | Hype
A Survey on Model-based Reinforcement Learning | | 0
Language Guided Exploration for RL Agents in Text Environments | | 0
Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | | 0
Joint AP Probing and Scheduling: A Contextual Bandit Approach | | 0
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | | 0
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | | 0
Interactions between dynamic team composition and coordination: An agent-based modeling approach | | 0
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection | | 0
Data-Driven Online Model Selection With Regret Guarantees | | 0
Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | | 0
Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | | 0
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | | 0
A Survey on Interpretable Reinforcement Learning | | 0
Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | | 0
A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning | | 0
Data-Efficient Reinforcement Learning for Malaria Control | | 0
Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration | | 0
Knowledge-Based Sequential Decision-Making Under Uncertainty | | 0
Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes | | 0
A Survey on Explainable Deep Reinforcement Learning | | 0
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | | 0
Inverse Policy Evaluation for Value-based Sequential Decision-making | | 0
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | | 0
Investigating Order Effects in Multidimensional Relevance Judgment using Query Logs | | 0
InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | | 0

No leaderboard results yet.