Towards Fundamental Limits of Multi-armed Bandits with Random Walk Feedback Nov 3, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 0Rarely-switching linear bandits: optimization of causal effects for the real world May 30, 2019 Causal Inference Multi-Armed Bandits
— Unverified 0Rate-Constrained Remote Contextual Bandits Apr 26, 2022 Marketing Multi-Armed Bandits
— Unverified 0Reciprocal Learning Aug 12, 2024 Active Learning Multi-Armed Bandits
— Unverified 0Recommenadation aided Caching using Combinatorial Multi-armed Bandits Apr 30, 2024 Multi-Armed Bandits
— Unverified 0Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals Jun 12, 2023 Multi-Armed Bandits
Code Code Available 0Cascading Bandits for Large-Scale Recommendation Problems Mar 17, 2016 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Incorporating Multi-armed Bandit with Local Search for MaxSAT Nov 29, 2022 Multi-Armed Bandits
Code Code Available 0VITS : Variational Inference Thompson Sampling for contextual bandits Jul 19, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Causal Contextual Bandits with Adaptive Context May 28, 2024 Multi-Armed Bandits
Code Code Available 0Efficient Explorative Key-term Selection Strategies for Conversational Contextual Bandits Mar 1, 2023 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments Jun 17, 2025 Atari Games Board Games
Code Code Available 0Causally Abstracted Multi-armed Bandits Apr 26, 2024 Decision Making Multi-Armed Bandits
Code Code Available 0Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Sep 4, 2019 Multi-Armed Bandits
Code Code Available 0Online Learning for Function Placement in Serverless Computing Oct 17, 2024 Multi-Armed Bandits
Code Code Available 0Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems: The ARTEO Algorithm Nov 10, 2022 Decision Making Decision Making Under Uncertainty
Code Code Available 0Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles Oct 21, 2022 Multi-Armed Bandits regression
Code Code Available 0Efficient Kernel UCB for Contextual Bandits Feb 11, 2022 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking Mar 24, 2022 Bayesian Optimization Decision Making
Code Code Available 0Multi-Armed Bandits in Brain-Computer Interfaces May 19, 2022 Multi-Armed Bandits
Code Code Available 0Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction Feb 3, 2024 Marketing Multi-Armed Bandits
Code Code Available 0Off-Policy Evaluation Using Information Borrowing and Context-Based Switching Dec 18, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Infinite Action Contextual Bandits with Reusable Data Exhaust Feb 16, 2023 Model Selection Multi-Armed Bandits
Code Code Available 0Combinatorial Bandits under Strategic Manipulations Feb 25, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Adapting multi-armed bandits policies to contextual bandits scenarios Nov 11, 2018 Binary Classification Classification
Code Code Available 0Using Subjective Logic to Estimate Uncertainty in Multi-Armed Bandit Problems Aug 17, 2020 Decision Making Multi-Armed Bandits
Code Code Available 0Maximizing and Satisficing in Multi-armed Bandits with Graph Information Aug 2, 2021 Decision Making Multi-Armed Bandits
Code Code Available 0Combinatorial Multi-armed Bandits for Resource Allocation May 10, 2021 Multi-Armed Bandits
Code Code Available 0Empirical Likelihood for Contextual Bandits Jun 7, 2019 Multi-Armed Bandits
Code Code Available 0Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback Jul 7, 2022 Computational Efficiency Movie Recommendation
Code Code Available 0Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning Jun 9, 2019 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Introduction to Multi-Armed Bandits Apr 15, 2019 Multi-Armed Bandits
Code Code Available 0Invariant Policy Learning: A Causal Perspective Jun 1, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Equal Opportunity in Online Classification with Partial Feedback Feb 6, 2019 Classification Decision Making Under Uncertainty
Code Code Available 0Inverse Contextual Bandits: Learning How Behavior Evolves over Time Jul 13, 2021 Benchmarking Decision Making
Code Code Available 0An Experimental Design for Anytime-Valid Causal Inference on Multi-Armed Bandits Nov 9, 2023 Causal Inference Experimental Design
Code Code Available 0Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents Aug 6, 2024 Multi-Armed Bandits Sensitivity
Code Code Available 0Information-Directed Selection for Top-Two Algorithms May 24, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks Mar 9, 2023 Decision Making Multi-Armed Bandits
Code Code Available 0IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health Dec 11, 2024 Multi-Armed Bandits
Code Code Available 0Estimation of Warfarin Dosage with Reinforcement Learning Sep 15, 2021 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Evaluating Deep Vs. Wide & Deep Learners As Contextual Bandits For Personalized Email Promo Recommendations Jan 31, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Model selection for contextual bandits Jun 3, 2019 model Model Selection
Code Code Available 0Best Arm Identification with Fixed Budget: A Large Deviation Perspective Dec 19, 2023 Multi-Armed Bandits
Code Code Available 0Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling Apr 26, 2022 Decision Making Evolutionary Algorithms
Code Code Available 0Optimal Learning for Structured Bandits Jul 14, 2020 Decision Making Decision Making Under Uncertainty
Code Code Available 0Conditionally Risk-Averse Contextual Bandits Oct 24, 2022 Management Multi-Armed Bandits
Code Code Available 0Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits Jun 3, 2023 Multi-Armed Bandits Open-Ended Question Answering
Code Code Available 0Confidence Intervals for Policy Evaluation in Adaptive Experiments Nov 7, 2019 Experimental Design Multi-Armed Bandits
Code Code Available 0Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0