Exposure-Aware Recommendation using Contextual Bandits Sep 4, 2022 Multi-Armed Bandits Recommendation Systems
— Unverified 0Variational Inference for Model-Free and Model-Based Reinforcement Learning Sep 4, 2022 Bayesian Inference Bayesian Optimization
— Unverified 0Dynamic Global Sensitivity for Differentially Private Contextual Bandits Aug 30, 2022 Interactive Recommendation Multi-Armed Bandits
— Unverified 0A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits Aug 11, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits Aug 10, 2022 Management Multi-Armed Bandits
— Unverified 0Nonstationary Continuum-Armed Bandit Strategies for Automated Trading in a Simulated Financial Market Aug 4, 2022 Bayesian Optimisation Bayesian Optimization
Code Code Available 0Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits Jul 28, 2022 Model-based Reinforcement Learning Multi-Armed Bandits
— Unverified 0Towards Soft Fairness in Restless Multi-Armed Bandits Jul 27, 2022 Fairness Multi-Armed Bandits
— Unverified 0SPRT-based Efficient Best Arm Identification in Stochastic Bandits Jul 22, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Online Learning with Off-Policy Feedback Jul 18, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Parallel Best Arm Identification in Heterogeneous Environments Jul 16, 2022 Multi-Armed Bandits
— Unverified 0Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces Jul 12, 2022 continuous-control Continuous Control
Code Code Available 0Contextual Bandits with Large Action Spaces: Made Practical Jul 12, 2022 Decision Making Multi-Armed Bandits
Code Code Available 0Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling Jul 9, 2022 Bayesian Optimization Decision Making
Code Code Available 1Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback Jul 7, 2022 Computational Efficiency Movie Recommendation
Code Code Available 0Model Selection in Reinforcement Learning with General Function Approximations Jul 6, 2022 Model Selection Multi-Armed Bandits
— Unverified 0Instance-optimal PAC Algorithms for Contextual Bandits Jul 5, 2022 Multi-Armed Bandits
— Unverified 0Autonomous Drug Design with Multi-Armed Bandits Jul 4, 2022 Drug Design Multi-Armed Bandits
— Unverified 0Ranking In Generalized Linear Bandits Jun 30, 2022 Diversity Multi-Armed Bandits
Code Code Available 0Two-Stage Neural Contextual Bandits for Personalised News Recommendation Jun 26, 2022 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Joint Representation Training in Sequential Tasks with Shared Structure Jun 24, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Langevin Monte Carlo for Contextual Bandits Jun 22, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 1Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms Jun 17, 2022 Multi-Armed Bandits
— Unverified 0On Private Online Convex Optimization: Optimal Algorithms in _p-Geometry and High Dimensional Contextual Bandits Jun 16, 2022 Multi-Armed Bandits
Code Code Available 0A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification Jun 16, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Combinatorial Pure Exploration of Causal Bandits Jun 16, 2022 Causal Inference Multi-Armed Bandits
— Unverified 0Distributed Differential Privacy in Multi-Armed Bandits Jun 12, 2022 Multi-Armed Bandits
— Unverified 0Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits Jun 11, 2022 All Multi-Armed Bandits
— Unverified 0Communication Efficient Distributed Learning for Kernelized Contextual Bandits Jun 10, 2022 Multi-Armed Bandits
— Unverified 0Conformal Off-Policy Prediction in Contextual Bandits Jun 9, 2022 Conformal Prediction Multi-Armed Bandits
— Unverified 0Neural Bandit with Arm Group Graph Jun 8, 2022 Multi-Armed Bandits
— Unverified 0Efficient Resource Allocation with Fairness Constraints in Restless Multi-Armed Bandits Jun 8, 2022 Decision Making Fairness
— Unverified 0Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Group Meritocratic Fairness in Linear Contextual Bandits Jun 7, 2022 Fairness Multi-Armed Bandits
Code Code Available 0Robust Pareto Set Identification with Contaminated Bandit Feedback Jun 6, 2022 Management Multi-Armed Bandits
— Unverified 0Asymptotic Instance-Optimal Algorithms for Interactive Decision Making Jun 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Contextual Bandits with Knapsacks for a Conversion Model Jun 1, 2022 model Multi-Armed Bandits
— Unverified 0Provably and Practically Efficient Neural Contextual Bandits May 31, 2022 Multi-Armed Bandits
— Unverified 0Provable General Function Class Representation Learning in Multitask Bandits and MDPs May 31, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Online Meta-Learning in Adversarial Multi-Armed Bandits May 31, 2022 Meta-Learning Multi-Armed Bandits
— Unverified 0Optimistic Whittle Index Policy: Online Learning for Restless Bandits May 30, 2022 Multi-Armed Bandits
Code Code Available 0Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets May 30, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Federated Neural Bandits May 28, 2022 Multi-Armed Bandits
Code Code Available 0Fairness and Welfare Quantification for Regret in Multi-Armed Bandits May 27, 2022 Fairness Multi-Armed Bandits
— Unverified 0Meta-Learning Adversarial Bandits May 27, 2022 Meta-Learning Multi-Armed Bandits
— Unverified 0Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits May 27, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Exploration, Exploitation, and Engagement in Multi-Armed Bandits with Abandonment May 26, 2022 Multi-Armed Bandits Q-Learning
— Unverified 0Contextual Pandora's Box May 26, 2022 Multi-Armed Bandits Stochastic Optimization
— Unverified 0