Communication Efficient Distributed Learning for Kernelized Contextual Bandits Jun 10, 2022 Multi-Armed Bandits
— Unverified 0Conformal Off-Policy Prediction in Contextual Bandits Jun 9, 2022 Conformal Prediction Multi-Armed Bandits
— Unverified 0Efficient Resource Allocation with Fairness Constraints in Restless Multi-Armed Bandits Jun 8, 2022 Decision Making Fairness
— Unverified 0Neural Bandit with Arm Group Graph Jun 8, 2022 Multi-Armed Bandits
— Unverified 0Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Group Meritocratic Fairness in Linear Contextual Bandits Jun 7, 2022 Fairness Multi-Armed Bandits
Code Code Available 0Robust Pareto Set Identification with Contaminated Bandit Feedback Jun 6, 2022 Management Multi-Armed Bandits
— Unverified 0Asymptotic Instance-Optimal Algorithms for Interactive Decision Making Jun 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Contextual Bandits with Knapsacks for a Conversion Model Jun 1, 2022 model Multi-Armed Bandits
— Unverified 0Provable General Function Class Representation Learning in Multitask Bandits and MDPs May 31, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Online Meta-Learning in Adversarial Multi-Armed Bandits May 31, 2022 Meta-Learning Multi-Armed Bandits
— Unverified 0Provably and Practically Efficient Neural Contextual Bandits May 31, 2022 Multi-Armed Bandits
— Unverified 0Optimistic Whittle Index Policy: Online Learning for Restless Bandits May 30, 2022 Multi-Armed Bandits
Code Code Available 0Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets May 30, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Federated Neural Bandits May 28, 2022 Multi-Armed Bandits
Code Code Available 0Fairness and Welfare Quantification for Regret in Multi-Armed Bandits May 27, 2022 Fairness Multi-Armed Bandits
— Unverified 0Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits May 27, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Meta-Learning Adversarial Bandits May 27, 2022 Meta-Learning Multi-Armed Bandits
— Unverified 0Exploration, Exploitation, and Engagement in Multi-Armed Bandits with Abandonment May 26, 2022 Multi-Armed Bandits Q-Learning
— Unverified 0Contextual Pandora's Box May 26, 2022 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Neural Contextual Bandits Based Dynamic Sensor Selection for Low-Power Body-Area Networks May 24, 2022 Anomaly Detection Multi-Armed Bandits
— Unverified 0Information-Directed Selection for Top-Two Algorithms May 24, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs May 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Falsification of Multiple Requirements for Cyber-Physical Systems Using Online Generative Adversarial Networks and Multi-Armed Bandits May 23, 2022 Multi-Armed Bandits
— Unverified 0Contextual Information-Directed Sampling May 22, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Pessimism for Offline Linear Contextual Bandits using _p Confidence Sets May 21, 2022 Multi-Armed Bandits
— Unverified 0Stability Enforced Bandit Algorithms for Channel Selection in Remote State Estimation of Gauss-Markov Processes May 20, 2022 channel selection Multi-Armed Bandits
— Unverified 0Breaking the T Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits May 19, 2022 Multi-Armed Bandits parameter estimation
— Unverified 0Multi-Armed Bandits in Brain-Computer Interfaces May 19, 2022 Multi-Armed Bandits
Code Code Available 0Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs May 18, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization May 17, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses May 16, 2022 Multi-Armed Bandits
— Unverified 0Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions May 13, 2022 Multi-Armed Bandits
— Unverified 0A Survey of Risk-Aware Multi-Armed Bandits May 12, 2022 Multi-Armed Bandits Portfolio Optimization
— Unverified 0Federated Multi-Armed Bandits Under Byzantine Attacks May 9, 2022 Data Poisoning Decision Making
— Unverified 0Selectively Contextual Bandits May 9, 2022 Multi-Armed Bandits
— Unverified 0Multi-Player Multi-Armed Bandits with Finite Shareable Resources Arms: Learning Algorithms & Applications Apr 28, 2022 Edge-computing Multi-Armed Bandits
— Unverified 0Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling Apr 26, 2022 Decision Making Evolutionary Algorithms
Code Code Available 0Rate-Constrained Remote Contextual Bandits Apr 26, 2022 Marketing Multi-Armed Bandits
— Unverified 0Thompson Sampling for Bandit Learning in Matching Markets Apr 26, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations Apr 10, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Stochastic Multi-armed Bandits with Non-stationary Rewards Generated by a Linear Dynamical System Apr 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk Apr 1, 2022 Multi-Armed Bandits
— Unverified 0Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles Mar 30, 2022 Decision Making Heterogeneous Treatment Effect Estimation
— Unverified 0Best Arm Identification in Restless Markov Multi-Armed Bandits Mar 29, 2022 Multi-Armed Bandits
— Unverified 0On Kernelized Multi-Armed Bandits with Constraints Mar 29, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Modeling Attrition in Recommender Systems with Departing Bandits Mar 25, 2022 Multi-Armed Bandits Recommendation Systems
— Unverified 0Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking Mar 24, 2022 Bayesian Optimization Decision Making
Code Code Available 0Efficient Algorithms for Extreme Bandits Mar 21, 2022 Multi-Armed Bandits
Code Code Available 0