SOTAVerified

Sequential Decision Making

Papers

Showing 501550 of 1210 papers

TitleStatusHype
Human AI interaction loop training: New approach for interactive reinforcement learning0
Human-Aware Robot Navigation via Reinforcement Learning with Hindsight Experience Replay and Curriculum Learning0
A Note on Sample Complexity of Interactive Imitation Learning with Log Loss0
Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets0
Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI0
Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach0
Hyperbolic Deep Reinforcement Learning0
Hyperparameter Transfer Learning with Adaptive Complexity0
Hyper-parameter Tuning under a Budget Constraint0
HyperQ-Opt: Q-learning for Hyperparameter Optimization0
Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams0
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting0
Explainable Reinforcement Learning via Temporal Policy Decomposition0
Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey0
An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks0
Adaptivity in Adaptive Submodularity0
Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation0
Explainable Reinforcement Learning Agents Using World Models0
Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language0
Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring0
Experimental analysis of data-driven control for a building heating system0
An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services0
Evaluating Explanation Methods for Vision-and-Language Navigation0
Evaluating Dynamic Conditional Quantile Treatment Effects with Applications in Ridesharing0
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning0
Annotation Efficiency: Identifying Hard Samples via Blocked Sparse Linear Bandits0
Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches0
Entropy Regularization for Population Estimation0
Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning0
Enhancing Q-Learning with Large Language Model Heuristics0
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey0
Boltzmann Exploration Done Right0
An Introduction to Quantum Reinforcement Learning (QRL)0
An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making0
Action Set Based Policy Optimization for Safe Power Grid Management0
Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics0
Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps0
Emergent Risk Awareness in Rational Agents under Resource Constraints0
Embodied Scene Understanding for Vision Language Models via MetaVQA0
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits0
Efficient Strategy Synthesis for MDPs via Hierarchical Block Decomposition0
Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments0
Efficient Sequential Decision Making with Large Language Models0
Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits0
An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing0
Efficient Reinforcement Learning with Large Language Model Priors0
Efficient quantum recurrent reinforcement learning via quantum reservoir computing0
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods0
Beyond O(T) Regret: Decoupling Learning and Decision-making in Online Linear Programming0
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability0
Show:102550
← PrevPage 11 of 25Next →

No leaderboard results yet.