Networked Restless Bandits with Positive Externalities Dec 9, 2022 Multi-Armed Bandits
Code Code Available 0Stochastic Rising Bandits Dec 7, 2022 Model Selection Multi-Armed Bandits
Code Code Available 0AC-Band: A Combinatorial Bandit-Based Approach to Algorithm Configuration Dec 1, 2022 Multi-Armed Bandits
Code Code Available 0On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits Nov 30, 2022 Multi-Armed Bandits
— Unverified 0Incorporating Multi-armed Bandit with Local Search for MaxSAT Nov 29, 2022 Multi-Armed Bandits
Code Code Available 0Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget Nov 27, 2022 Attribute Multi-Armed Bandits
— Unverified 0Contextual Decision-Making with Knapsacks Beyond the Worst Case Nov 25, 2022 Decision Making Management
— Unverified 0Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning Nov 22, 2022 Multi-Armed Bandits
— Unverified 0Transfer Learning for Contextual Multi-armed Bandits Nov 22, 2022 Multi-Armed Bandits Transfer Learning
— Unverified 0Causal Bandits: Online Decision-Making in Endogenous Settings Nov 16, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Bandit Algorithms for Prophet Inequality and Pandora's Box Nov 16, 2022 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Latent Bottlenecked Attentive Neural Processes Nov 15, 2022 Meta-Learning Multi-Armed Bandits
Code Code Available 0On Penalization in Stochastic Multi-armed Bandits Nov 15, 2022 Fairness Multi-Armed Bandits
— Unverified 0Multi-Player Bandits Robust to Adversarial Collisions Nov 15, 2022 Multi-Armed Bandits
— Unverified 0Hypothesis Transfer in Bandits by Weighted Models Nov 14, 2022 Multi-Armed Bandits Transfer Learning
— Unverified 0Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression Nov 14, 2022 Multi-Armed Bandits regression
— Unverified 0Generalizing distribution of partial rewards for multi-armed bandits with temporally-partitioned rewards Nov 13, 2022 Multi-Armed Bandits
— Unverified 0Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits Nov 11, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems: The ARTEO Algorithm Nov 10, 2022 Decision Making Decision Making Under Uncertainty
Code Code Available 0Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms Nov 8, 2022 Multi-Armed Bandits
— Unverified 0Adaptive Data Depth via Multi-Armed Bandits Nov 8, 2022 Multi-Armed Bandits
Code Code Available 0Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits Oct 31, 2022 Multi-Armed Bandits
Code Code Available 1Revisiting Simple Regret: Fast Rates for Returning a Good Arm Oct 30, 2022 Multi-Armed Bandits
— Unverified 0Robust Contextual Linear Bandits Oct 26, 2022 Multi-Armed Bandits
— Unverified 0Conditionally Risk-Averse Contextual Bandits Oct 24, 2022 Management Multi-Armed Bandits
Code Code Available 0Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions Oct 24, 2022 Metric Learning Multi-Armed Bandits
Code Code Available 0PAC-Bayesian Offline Contextual Bandits With Guarantees Oct 24, 2022 Generalization Bounds Multi-Armed Bandits
— Unverified 0Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees Oct 24, 2022 Multi-Armed Bandits Representation Learning
— Unverified 0Fast Beam Alignment via Pure Exploration in Multi-armed Bandits Oct 23, 2022 Multi-Armed Bandits
Code Code Available 0Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles Oct 21, 2022 Multi-Armed Bandits regression
Code Code Available 0Vertical Federated Linear Contextual Bandits Oct 20, 2022 Multi-Armed Bandits
— Unverified 0Anytime-valid off-policy inference for contextual bandits Oct 19, 2022 counterfactual Multi-Armed Bandits
Code Code Available 1Contextual bandits with concave rewards, and an application to fair ranking Oct 18, 2022 Fairness Multi-Armed Bandits
— Unverified 0Multi-agent Dynamic Algorithm Configuration Oct 13, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
Code Code Available 1Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets Oct 12, 2022 Benchmarking Multi-Armed Bandits
Code Code Available 0Maximum entropy exploration in contextual bandits with neural networks and energy based models Oct 12, 2022 Multi-Armed Bandits
— Unverified 0Constant regret for sequence prediction with limited advice Oct 5, 2022 Multi-Armed Bandits Prediction
— Unverified 0ProtoBandit: Efficient Prototype Selection via Multi-Armed Bandits Oct 4, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Replicable Bandits Oct 4, 2022 Multi-Armed Bandits
— Unverified 0Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs Oct 4, 2022 Multi-Armed Bandits
— Unverified 0On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits Sep 30, 2022 Multi-Armed Bandits
— Unverified 0Off-Policy Risk Assessment in Markov Decision Processes Sep 21, 2022 Multi-Armed Bandits Safety Alignment
— Unverified 0Active Inference for Autonomous Decision-Making with Contextual Multi-Armed Bandits Sep 19, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Towards Robust Off-Policy Evaluation via Human Inputs Sep 18, 2022 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems Sep 17, 2022 Multi-Armed Bandits Self-Learning
— Unverified 0Risk-aware linear bandits with convex loss Sep 15, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Sep 15, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Risk-Averse Multi-Armed Bandits with Unobserved Confounders: A Case Study in Emotion Regulation in Mobile Health Sep 9, 2022 Multi-Armed Bandits Transfer Learning
— Unverified 0When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits Sep 6, 2022 Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Self-Information Rewards Sep 6, 2022 Multi-Armed Bandits
— Unverified 0