| Bandits in Matching Markets: Ideas and Proposals for Peer Lending | Oct 30, 2020 | Decision Making, Fairness | Unverified | 0 |
| Towards Safe Policy Improvement for Non-Stationary MDPs | Oct 23, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision Making, Offline RL | Unverified | 0 |
| Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking | Oct 21, 2020 | Decision Making, Fraud Detection | Unverified | 0 |
| DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees | Oct 19, 2020 | Attribute, Decision Making | Unverified | 0 |
| Learning to Generalize for Sequential Decision Making | Oct 5, 2020 | Decision Making, Imitation Learning | Code Available | 0 |
| A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games | Oct 4, 2020 | BIG-bench Machine Learning, Decision Making | Unverified | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | Oct 3, 2020 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon | Sep 28, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints | Sep 23, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Transfer Learning in Deep Reinforcement Learning: A Survey | Sep 16, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Causal Bandits without prior knowledge using separating sets | Sep 16, 2020 | Causal Discovery, Decision Making | Unverified | 0 |
| Toward the Fundamental Limits of Imitation Learning | Sep 13, 2020 | Decision Making, Imitation Learning | Unverified | 0 |
| Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes | Sep 9, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces | Aug 25, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Survey of Knowledge-based Sequential Decision Making under Uncertainty | Aug 19, 2020 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey | Aug 11, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning | Aug 7, 2020 | Decision Making, reinforcement-learning | Unverified | 0 |
| Tracking the Race Between Deep Reinforcement Learning and Imitation Learning -- Extended Version | Aug 3, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning | Aug 3, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning | Jul 29, 2020 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Data-efficient visuomotor policy training using reinforcement learning and generative models | Jul 26, 2020 | Decision Making, Disentanglement | Unverified | 0 |
| AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning | Jul 13, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Approaches | Jul 12, 2020 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Fast reinforcement learning with generalized policy updates | Jul 9, 2020 | Decision Making, Problem Decomposition | Unverified | 0 |
| GraphOpt: Learning Optimization Models of Graph Formation | Jul 7, 2020 | Decision Making, Link Prediction | Unverified | 0 |
| Learning "What-if" Explanations for Sequential Decision-Making | Jul 2, 2020 | counterfactual, Counterfactual Reasoning | Unverified | 0 |
| Convex Regularization in Monte-Carlo Tree Search | Jul 1, 2020 | Atari Games, Decision Making | Unverified | 0 |
| Falsification-Based Robust Adversarial Reinforcement Learning | Jul 1, 2020 | Autonomous Vehicles, Decision Making | Unverified | 0 |
| Model-based Reinforcement Learning: A Survey | Jun 30, 2020 | Decision Making, model | Unverified | 0 |
| Enforcing Almost-Sure Reachability in POMDPs | Jun 30, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| On Bellman's Optimality Principle for zs-POSGs | Jun 29, 2020 | Decision Making, Heuristic Search | Unverified | 0 |
| A Unifying Framework for Reinforcement Learning and Planning | Jun 26, 2020 | Decision Making, reinforcement-learning | Unverified | 0 |
| Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks | Jun 24, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty | Jun 23, 2020 | Decision Making, Portfolio Optimization | Unverified | 0 |
| Towards Tractable Optimism in Model-Based Reinforcement Learning | Jun 21, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions | Jun 20, 2020 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Counterfactually Guided Off-policy Transfer in Clinical Settings | Jun 20, 2020 | counterfactual, Decision Making | Unverified | 0 |
| Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect | Jun 18, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework | Jun 17, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Jun 12, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| On the Relationship Between Structure in Natural Language and Models of Sequential Decision Processes | Jun 12, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed Environments | Jun 12, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| Group-Fair Online Allocation in Continuous Time | Jun 11, 2020 | Cloud Computing, Decision Making | Unverified | 0 |
| When is Particle Filtering Efficient for Planning in Partially Observed Linear Dynamical Systems? | Jun 10, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Modeling Human Driving Behavior through Generative Adversarial Imitation Learning | Jun 10, 2020 | Decision Making, Disentanglement | Unverified | 0 |
| Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Sharp Thresholds of the Information Cascade Fragility Under a Mismatched Model | Jun 7, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| When Does MAML Objective Have Benign Landscape? | May 31, 2020 | Decision Making, Meta-Learning | Unverified | 0 |