| Reliable Off-policy Evaluation for Reinforcement Learning | Nov 8, 2020 | Decision MakingOff-policy evaluation | —Unverified | 0 |
| Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial | Nov 6, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Adaptive Stress Testing of Trajectory Predictions in Flight Management Systems | Nov 4, 2020 | Decision MakingManagement | CodeCode Available | 1 |
| Loss Bounds for Approximate Influence-Based Abstraction | Nov 3, 2020 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Reinforcement Learning with Efficient Active Feature Acquisition | Nov 2, 2020 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Multi-IRS-assisted Multi-Cell Uplink MIMO Communications under Imperfect CSI: A Deep Reinforcement Learning Approach | Nov 2, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Bandits in Matching Markets: Ideas and Proposals for Peer Lending | Oct 30, 2020 | Decision MakingFairness | —Unverified | 0 |
| Towards Safe Policy Improvement for Non-Stationary MDPs | Oct 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision MakingOffline RL | —Unverified | 0 |
| Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking | Oct 21, 2020 | Decision MakingFraud Detection | —Unverified | 0 |
| DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees | Oct 19, 2020 | AttributeDecision Making | —Unverified | 0 |
| Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | Oct 8, 2020 | Common Sense ReasoningCommonsense Reasoning for RL | CodeCode Available | 1 |
| Learning to Generalize for Sequential Decision Making | Oct 5, 2020 | Decision MakingImitation Learning | CodeCode Available | 0 |
| A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games | Oct 4, 2020 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | Oct 3, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon | Sep 28, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Multi-task Causal Learning with Gaussian Processes | Sep 27, 2020 | Active LearningBayesian Optimization | CodeCode Available | 1 |
| A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints | Sep 23, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Transfer Learning in Deep Reinforcement Learning: A Survey | Sep 16, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Causal Bandits without prior knowledge using separating sets | Sep 16, 2020 | Causal DiscoveryDecision Making | —Unverified | 0 |
| Toward the Fundamental Limits of Imitation Learning | Sep 13, 2020 | Decision MakingImitation Learning | —Unverified | 0 |
| Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes | Sep 9, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces | Aug 25, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |