Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees Oct 24, 2022 Multi-Armed Bandits Representation Learning
— Unverified 0PAC-Bayesian Offline Contextual Bandits With Guarantees Oct 24, 2022 Generalization Bounds Multi-Armed Bandits
— Unverified 0Conditionally Risk-Averse Contextual Bandits Oct 24, 2022 Management Multi-Armed Bandits
Code Code Available 0Fast Beam Alignment via Pure Exploration in Multi-armed Bandits Oct 23, 2022 Multi-Armed Bandits
Code Code Available 0Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles Oct 21, 2022 Multi-Armed Bandits regression
Code Code Available 0Vertical Federated Linear Contextual Bandits Oct 20, 2022 Multi-Armed Bandits
— Unverified 0Contextual bandits with concave rewards, and an application to fair ranking Oct 18, 2022 Fairness Multi-Armed Bandits
— Unverified 0Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets Oct 12, 2022 Benchmarking Multi-Armed Bandits
Code Code Available 0Maximum entropy exploration in contextual bandits with neural networks and energy based models Oct 12, 2022 Multi-Armed Bandits
— Unverified 0Constant regret for sequence prediction with limited advice Oct 5, 2022 Multi-Armed Bandits Prediction
— Unverified 0Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs Oct 4, 2022 Multi-Armed Bandits
— Unverified 0ProtoBandit: Efficient Prototype Selection via Multi-Armed Bandits Oct 4, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Replicable Bandits Oct 4, 2022 Multi-Armed Bandits
— Unverified 0On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits Sep 30, 2022 Multi-Armed Bandits
— Unverified 0Off-Policy Risk Assessment in Markov Decision Processes Sep 21, 2022 Multi-Armed Bandits Safety Alignment
— Unverified 0Active Inference for Autonomous Decision-Making with Contextual Multi-Armed Bandits Sep 19, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Towards Robust Off-Policy Evaluation via Human Inputs Sep 18, 2022 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems Sep 17, 2022 Multi-Armed Bandits Self-Learning
— Unverified 0Risk-aware linear bandits with convex loss Sep 15, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Sep 15, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Risk-Averse Multi-Armed Bandits with Unobserved Confounders: A Case Study in Emotion Regulation in Mobile Health Sep 9, 2022 Multi-Armed Bandits Transfer Learning
— Unverified 0When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits Sep 6, 2022 Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Self-Information Rewards Sep 6, 2022 Multi-Armed Bandits
— Unverified 0Exposure-Aware Recommendation using Contextual Bandits Sep 4, 2022 Multi-Armed Bandits Recommendation Systems
— Unverified 0Variational Inference for Model-Free and Model-Based Reinforcement Learning Sep 4, 2022 Bayesian Inference Bayesian Optimization
— Unverified 0Dynamic Global Sensitivity for Differentially Private Contextual Bandits Aug 30, 2022 Interactive Recommendation Multi-Armed Bandits
— Unverified 0A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits Aug 11, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits Aug 10, 2022 Management Multi-Armed Bandits
— Unverified 0Nonstationary Continuum-Armed Bandit Strategies for Automated Trading in a Simulated Financial Market Aug 4, 2022 Bayesian Optimisation Bayesian Optimization
Code Code Available 0Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits Jul 28, 2022 Model-based Reinforcement Learning Multi-Armed Bandits
— Unverified 0Towards Soft Fairness in Restless Multi-Armed Bandits Jul 27, 2022 Fairness Multi-Armed Bandits
— Unverified 0SPRT-based Efficient Best Arm Identification in Stochastic Bandits Jul 22, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Online Learning with Off-Policy Feedback Jul 18, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Parallel Best Arm Identification in Heterogeneous Environments Jul 16, 2022 Multi-Armed Bandits
— Unverified 0Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces Jul 12, 2022 continuous-control Continuous Control
Code Code Available 0Contextual Bandits with Large Action Spaces: Made Practical Jul 12, 2022 Decision Making Multi-Armed Bandits
Code Code Available 0Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback Jul 7, 2022 Computational Efficiency Movie Recommendation
Code Code Available 0Model Selection in Reinforcement Learning with General Function Approximations Jul 6, 2022 Model Selection Multi-Armed Bandits
— Unverified 0Instance-optimal PAC Algorithms for Contextual Bandits Jul 5, 2022 Multi-Armed Bandits
— Unverified 0Autonomous Drug Design with Multi-Armed Bandits Jul 4, 2022 Drug Design Multi-Armed Bandits
— Unverified 0Ranking In Generalized Linear Bandits Jun 30, 2022 Diversity Multi-Armed Bandits
Code Code Available 0Two-Stage Neural Contextual Bandits for Personalised News Recommendation Jun 26, 2022 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Joint Representation Training in Sequential Tasks with Shared Structure Jun 24, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms Jun 17, 2022 Multi-Armed Bandits
— Unverified 0On Private Online Convex Optimization: Optimal Algorithms in _p-Geometry and High Dimensional Contextual Bandits Jun 16, 2022 Multi-Armed Bandits
Code Code Available 0Combinatorial Pure Exploration of Causal Bandits Jun 16, 2022 Causal Inference Multi-Armed Bandits
— Unverified 0A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification Jun 16, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Distributed Differential Privacy in Multi-Armed Bandits Jun 12, 2022 Multi-Armed Bandits
— Unverified 0Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits Jun 11, 2022 All Multi-Armed Bandits
— Unverified 0