SOTAVerified

Sequential Decision Making

Papers

Showing 801850 of 1210 papers

TitleStatusHype
Joint AP Probing and Scheduling: A Contextual Bandit Approach0
RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer InteractionsCode0
Lyapunov-based uncertainty-aware safe reinforcement learning0
On Blame Attribution for Accountable Multi-Agent Sequential Decision Making0
Robust Adaptive Submodular Maximization0
Reinforcement Learning Agent Training with Goals for Real World Tasks0
High-Accuracy Model-Based Reinforcement Learning, a Survey0
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks0
Metalearning Linear Bandits by Prior Update0
Neural Contextual Bandits without RegretCode0
Bayesian decision-making under misspecified priors with applications to meta-learning0
Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow0
Bounded rationality for relaxing best response and mutual consistency: The Quantal Hierarchy model of decision-makingCode0
Decision making with dynamic probabilistic forecasts0
UAV-assisted Online Machine Learning over Multi-Tiered Networks: A Hierarchical Nested Personalized Federated Learning Approach0
Action Set Based Policy Optimization for Safe Power Grid Management0
A Reinforcement Learning Approach for Sequential Spatial Transformer Networks0
Building Intelligent Autonomous Navigation Agents0
Not all users are the same: Providing personalized explanations for sequential decision making problems0
Lorenz System State Stability Identification using Neural Networks0
Probabilistic DAG Search0
Robust Reinforcement Learning Under Minimax Regret for Green SecurityCode0
A modular framework for object-based saccadic decisions in dynamic scenes0
Information Avoidance and Overvaluation in Sequential Decision Making under Epistemic Constraints0
Cooperative Online Learning with Feedback GraphsCode0
Curriculum Design for Teaching via Demonstrations: Theory and ApplicationsCode0
Meta-Learning Reliable Priors in the Function Space0
Heuristic-Guided Reinforcement Learning0
Be Considerate: Objectives, Side Effects, and Deciding How to Act0
Learning to schedule job-shop problems: Representation and policy learning using graph neural network and reinforcement learning0
Techniques Toward Optimizing Viewability in RTB Ad Campaigns Using Reinforcement Learning0
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning0
Robust optimal policies for team Markov games0
Bandit based centralized matching in two-sided markets for peer to peer lending0
Data-Efficient Reinforcement Learning for Malaria Control0
Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization0
Statistical Inference with M-Estimators on Adaptively Collected Data0
Universal Off-Policy EvaluationCode0
Reinforcement Learning using Guided Observability0
Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning0
Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images0
Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning0
Ecole: A Library for Learning Inside MILP SolversCode0
Learning-Based UAV Trajectory Optimization with Collision Avoidance and Connectivity Constraints0
Online Convex Optimization with Continuous Switching Constraint0
Forward and Backward Bellman equations improve the efficiency of EM algorithm for DEC-POMDP0
Situated Language Learning via Interactive Narratives0
Encrypted Linear Contextual Bandit0
Meta-Learning for Planning: Automatic Synthesis of Sample Based Planners0
Optimal sequential decision making with probabilistic digital twins0
Show:102550
← PrevPage 17 of 25Next →

No leaderboard results yet.