SOTAVerified

Sequential Decision Making

Papers

Showing 501525 of 1210 papers

TitleStatusHype
Human AI interaction loop training: New approach for interactive reinforcement learning0
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning0
A Note on Sample Complexity of Interactive Imitation Learning with Log Loss0
Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets0
Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI0
Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach0
Hyperbolic Deep Reinforcement Learning0
Hyperparameter Transfer Learning with Adaptive Complexity0
Hyper-parameter Tuning under a Budget Constraint0
HyperQ-Opt: Q-learning for Hyperparameter Optimization0
Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams0
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting0
Explainable Reinforcement Learning via Temporal Policy Decomposition0
Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey0
An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks0
Adaptivity in Adaptive Submodularity0
Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation0
Explainable Reinforcement Learning Agents Using World Models0
Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language0
Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring0
Experimental analysis of data-driven control for a building heating system0
An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services0
Evaluating Explanation Methods for Vision-and-Language Navigation0
Evaluating Dynamic Conditional Quantile Treatment Effects with Applications in Ridesharing0
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning0
Show:102550
← PrevPage 21 of 49Next →

No leaderboard results yet.