SOTAVerified

Sequential Decision Making

Papers

Showing 876900 of 1210 papers

TitleStatusHype
Reliable Off-policy Evaluation for Reinforcement Learning0
Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial0
Adaptive Stress Testing of Trajectory Predictions in Flight Management SystemsCode1
Loss Bounds for Approximate Influence-Based AbstractionCode0
Reinforcement Learning with Efficient Active Feature Acquisition0
Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach0
Bandits in Matching Markets: Ideas and Proposals for Peer Lending0
Towards Safe Policy Improvement for Non-Stationary MDPsCode0
What are the Statistical Limits of Offline RL with Linear Function Approximation?0
Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking0
DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees0
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and BaselinesCode1
Learning to Generalize for Sequential Decision MakingCode0
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games0
Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment0
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon0
Multi-task Causal Learning with Gaussian ProcessesCode1
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints0
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Transfer Learning in Deep Reinforcement Learning: A Survey0
Causal Bandits without prior knowledge using separating sets0
Toward the Fundamental Limits of Imitation Learning0
Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes0
Inverse Policy Evaluation for Value-based Sequential Decision-making0
Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces0
Show:102550
← PrevPage 36 of 49Next →

No leaderboard results yet.