SOTAVerified

Sequential Decision Making

Papers

Showing 701750 of 1210 papers

TitleStatusHype
How to Measure Human-AI Prediction Accuracy in Explainable AI Systems0
HSVI can solve zero-sum Partially Observable Stochastic Games0
HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties0
Human AI interaction loop training: New approach for interactive reinforcement learning0
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning0
Human collective intelligence as distributed Bayesian inference0
Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets0
Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI0
Hyperbolic Deep Reinforcement Learning0
Hyperparameter Transfer Learning with Adaptive Complexity0
Hyper-parameter Tuning under a Budget Constraint0
HyperQ-Opt: Q-learning for Hyperparameter Optimization0
iCORPP: Interleaved Commonsense Reasoning and Probabilistic Planning on Robots0
Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation0
IF2Net: Innately Forgetting-Free Networks for Continual Learning0
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making0
Implementability of Honest Multi-Agent Sequential Decision-Making with Dynamic Population0
Improved Regret for Differentially Private Exploration in Linear MDP0
Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic0
Improving Human Decision-Making by Discovering Efficient Strategies for Hierarchical Planning0
Improving Human Sequential Decision-Making with Reinforcement Learning0
Improving Online Rent-or-Buy Algorithms with Sequential Decision Making and ML Predictions0
Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shifts0
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions0
Inducing Point Allocation for Sparse Gaussian Processes in High-Throughput Bayesian Optimisation0
Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning0
Inference of Utilities and Time Preference in Sequential Decision-Making0
Information Avoidance and Overvaluation in Sequential Decision Making under Epistemic Constraints0
Information Directed Sampling for Linear Partial Monitoring0
Information-Theoretic Safe Bayesian Optimization0
InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management0
Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach0
Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents0
Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection0
Interactions between dynamic team composition and coordination: An agent-based modeling approach0
Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration0
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent0
Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes0
Invariant Lipschitz Bandits: A Side Observation Approach0
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem0
Inverse Policy Evaluation for Value-based Sequential Decision-making0
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning0
Investigating Order Effects in Multidimensional Relevance Judgment using Query Logs0
Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning0
Is Conditional Generative Modeling all you need for Decision-Making?0
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon0
Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning0
JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents0
Joint AP Probing and Scheduling: A Contextual Bandit Approach0
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control0
Show:102550
← PrevPage 15 of 25Next →

No leaderboard results yet.