Policy Learning with Adaptively Collected Data May 5, 2021 Multi-Armed Bandits
Code Code Available 0Neural Contextual Bandits without Regret Jul 7, 2021 Decision Making Multi-Armed Bandits
Code Code Available 0Meta-in-context learning in large language models May 22, 2023 In-Context Learning Multi-Armed Bandits
Code Code Available 0Neural Contextual Bandits with UCB-based Exploration Nov 11, 2019 Efficient Exploration Multi-Armed Bandits
Code Code Available 0Adaptive Experimentation with Delayed Binary Feedback Feb 2, 2022 Multi-Armed Bandits valid
Code Code Available 0Group Meritocratic Fairness in Linear Contextual Bandits Jun 7, 2022 Fairness Multi-Armed Bandits
Code Code Available 0Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching Sep 25, 2019 Efficient Exploration Multi-Armed Bandits
Code Code Available 0Power Constrained Bandits Apr 13, 2020 Decision Making Multi-Armed Bandits
Code Code Available 0Batched Multi-armed Bandits Problem Apr 3, 2019 Multi-Armed Bandits
Code Code Available 0Harnessing the Power of Federated Learning in Federated Contextual Bandits Dec 26, 2023 Decision Making Federated Learning
Code Code Available 0Truncated LinUCB for Stochastic Linear Bandits Feb 23, 2022 Multi-Armed Bandits
Code Code Available 0Adaptive Estimator Selection for Off-Policy Evaluation Feb 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Practical Bayesian Learning of Neural Networks via Adaptive Optimisation Methods Nov 8, 2018 Multi-Armed Bandits Thompson Sampling
Code Code Available 0NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction Mar 20, 2025 Conformal Prediction Decision Making
Code Code Available 0Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization Oct 27, 2021 Efficient Exploration Multi-Armed Bandits
Code Code Available 0A Survey on Contextual Multi-armed Bandits Aug 13, 2015 Multi-Armed Bandits Survey
Code Code Available 0Practical Calculation of Gittins Indices for Multi-armed Bandits Sep 11, 2019 Multi-Armed Bandits
Code Code Available 0Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging Oct 29, 2018 Decision Making Multi-Armed Bandits
Code Code Available 0A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits Apr 16, 2023 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels Aug 10, 2024 Knowledge Tracing Multi-Armed Bandits
Code Code Available 0Towards the D-Optimal Online Experiment Design for Recommender Selection Oct 23, 2021 Multi-Armed Bandits
Code Code Available 0Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits Jan 21, 2024 Multi-Armed Bandits regression
Code Code Available 0When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective Nov 23, 2023 Large Language Model Multi-Armed Bandits
Code Code Available 0Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems May 29, 2019 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits Aug 8, 2024 Exposure Fairness Fairness
Code Code Available 0Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Oct 15, 2019 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Nonparametric Gaussian Mixture Models for the Multi-Armed Bandit Aug 8, 2018 Density Estimation Multi-Armed Bandits
Code Code Available 0Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits Feb 4, 2014 General Classification Multi-Armed Bandits
Code Code Available 0Two-Stage Neural Contextual Bandits for Personalised News Recommendation Jun 26, 2022 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Human in the Loop Adaptive Optimization for Improved Time Series Forecasting May 21, 2025 Language Modeling Language Modelling
Code Code Available 0Adversarial Attacks on Combinatorial Multi-Armed Bandits Oct 8, 2023 Multi-Armed Bandits
Code Code Available 0Machine Teaching of Active Sequential Learners Sep 8, 2018 Multi-Armed Bandits Probabilistic Programming
Code Code Available 0Doubly-Robust Lasso Bandit Jul 26, 2019 Multi-Armed Bandits Recommendation Systems
Code Code Available 0A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit Oct 2, 2015 Decision Making Multi-Armed Bandits
Code Code Available 0Thompson Sampling via Local Uncertainty Oct 30, 2019 Decision Making Multi-Armed Bandits
Code Code Available 0Identification of the Generalized Condorcet Winner in Multi-dueling Bandits Dec 1, 2021 Multi-Armed Bandits
Code Code Available 0SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits Sep 21, 2018 Multi-Armed Bandits
Code Code Available 0Doubly Robust Policy Evaluation and Learning Mar 23, 2011 Decision Making Multi-Armed Bandits
Code Code Available 0Dual-Mandate Patrols: Multi-Armed Bandits for Green Security Sep 14, 2020 Multi-Armed Bandits
Code Code Available 0Addressing the Long-term Impact of ML Decisions via Policy Regret Jun 2, 2021 Multi-Armed Bandits
Code Code Available 0Test-Time Scaling of Diffusion Models via Noise Trajectory Search May 24, 2025 Denoising Image Generation
Code Code Available 0Regulating Greed Over Time in Multi-Armed Bandits May 21, 2015 Multi-Armed Bandits Time Series Analysis
Code Code Available 0Safe Exploration for Optimizing Contextual Bandits Feb 2, 2020 counterfactual Information Retrieval
Code Code Available 0Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets Oct 12, 2022 Benchmarking Multi-Armed Bandits
Code Code Available 0Reinforcement Learning for Physical Layer Communications Jun 22, 2021 Deep Reinforcement Learning Multi-Armed Bandits
Code Code Available 0Simultaneously Achieving Group Exposure Fairness and Within-Group Meritocracy in Stochastic Bandits Feb 8, 2024 Attribute Exposure Fairness
Code Code Available 0Mostly Exploration-Free Algorithms for Contextual Bandits Apr 28, 2017 Diversity Multi-Armed Bandits
Code Code Available 0Scalable Exploration via Ensemble++ Jul 18, 2024 Computational Efficiency Decision Making
Code Code Available 0The Assistive Multi-Armed Bandit Jan 24, 2019 Multi-Armed Bandits
Code Code Available 0