SOTAVerified

Sequential Decision Making

Papers

Showing 851900 of 1210 papers

TitleStatusHype
Deciding What to Learn: A Rate-Distortion Approach0
Reinforced Imitative Graph Representation Learning for Mobile User Profiling: An Adversarial Training Perspective0
Understanding and Leveraging Causal Relations in Deep Reinforcement Learning0
Computing Preimages of Deep Neural Networks with Applications to Safety0
Learning to Make Decisions via Submodular Regularization0
Learning to Recover from Failures using Memory0
Divide-and-Conquer Monte Carlo Tree Search0
Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients0
A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints0
Autonomous Charging of Electric Vehicle Fleets to Enhance Renewable Generation Dispatchability0
Off-Policy Optimization of Portfolio Allocation Policies under ConstraintsCode0
Demystify Painting with RL0
Learning Mobile Robot Navigation in the Dense Crowd with Deep Reinforcement Learning0
Hindsight and Sequential Rationality of Correlated PlayCode0
R-learning in actor-critic model offers a biologically relevant mechanism for sequential decision-making0
Improving Online Rent-or-Buy Algorithms with Sequential Decision Making and ML Predictions0
Planning with General Objective Functions: Going Beyond Total Rewards0
Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes0
On Efficiency in Hierarchical Reinforcement Learning0
Delay and Cooperation in Nonstochastic Linear Bandits0
TimeSHAP: Explaining Recurrent Models through Sequence PerturbationsCode1
LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization0
Modality-Buffet for Real-Time Object Detection0
A New Bandit Setting Balancing Information from State Evolution and Corrupted ContextCode0
Robust Batch Policy Learning in Markov Decision Processes0
Reliable Off-policy Evaluation for Reinforcement Learning0
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial0
Adaptive Stress Testing of Trajectory Predictions in Flight Management SystemsCode1
Loss Bounds for Approximate Influence-Based AbstractionCode0
Reinforcement Learning with Efficient Active Feature Acquisition0
Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach0
Bandits in Matching Markets: Ideas and Proposals for Peer Lending0
Towards Safe Policy Improvement for Non-Stationary MDPsCode0
What are the Statistical Limits of Offline RL with Linear Function Approximation?0
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking0
DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees0
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and BaselinesCode1
Learning to Generalize for Sequential Decision MakingCode0
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games0
Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment0
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon0
Multi-task Causal Learning with Gaussian ProcessesCode1
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints0
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Transfer Learning in Deep Reinforcement Learning: A Survey0
Causal Bandits without prior knowledge using separating sets0
Toward the Fundamental Limits of Imitation Learning0
Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes0
Inverse Policy Evaluation for Value-based Sequential Decision-making0
Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces0
Show:102550
← PrevPage 18 of 25Next →

No leaderboard results yet.