Doubly-Robust Lasso Bandit Jul 26, 2019 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Human in the Loop Adaptive Optimization for Improved Time Series Forecasting May 21, 2025 Language Modeling Language Modelling
Code Code Available 05 A Convex Framework for Confounding Robust Inference Sep 21, 2023 Model Selection Multi-Armed Bandits
Code Code Available 05 Doubly Robust Policy Evaluation and Learning Mar 23, 2011 Decision Making Multi-Armed Bandits
Code Code Available 05 From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization Mar 7, 2019 compressed sensing Multi-Armed Bandits
Code Code Available 05 Infinite Action Contextual Bandits with Reusable Data Exhaust Feb 16, 2023 Model Selection Multi-Armed Bandits
Code Code Available 05 Introduction to Multi-Armed Bandits Apr 15, 2019 Multi-Armed Bandits
Code Code Available 05 Invariant Policy Learning: A Causal Perspective Jun 1, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Addressing the Long-term Impact of ML Decisions via Policy Regret Jun 2, 2021 Multi-Armed Bandits
Code Code Available 05 Antithetic Sampling for Top-k Shapley Identification Apr 2, 2025 Multi-Armed Bandits
Code Code Available 05 Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling Feb 26, 2018 Decision Making Deep Reinforcement Learning
Code Code Available 05 Decentralized Cooperative Stochastic Bandits Oct 10, 2018 Multi-Armed Bandits
Code Code Available 05 Latent Bottlenecked Attentive Neural Processes Nov 15, 2022 Meta-Learning Multi-Armed Bandits
Code Code Available 05 Learning Contextual Bandits in a Non-stationary Environment May 23, 2018 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Approximating a Target Distribution using Weight Queries Jun 24, 2020 Domain Adaptation Multi-Armed Bandits
Code Code Available 05 Let's Get It Started: Fostering the Discoverability of New Releases on Deezer Jan 5, 2024 Multi-Armed Bandits
Code Code Available 05 Adversarial Attacks on Combinatorial Multi-Armed Bandits Oct 8, 2023 Multi-Armed Bandits
Code Code Available 05 Locally Differentially Private (Contextual) Bandits Learning Jun 1, 2020 Multi-Armed Bandits Privacy Preserving Deep Learning
Code Code Available 05 AC-Band: A Combinatorial Bandit-Based Approach to Algorithm Configuration Dec 1, 2022 Multi-Armed Bandits
Code Code Available 05 MABSplit: Faster Forest Training Using Multi-Armed Bandits Dec 14, 2022 Feature Importance Multi-Armed Bandits
Code Code Available 05 Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints Aug 24, 2023 Diversity Multi-Armed Bandits
Code Code Available 05 Medoids in almost linear time via multi-armed bandits Nov 2, 2017 Multi-Armed Bandits
Code Code Available 05 A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit Oct 2, 2015 Decision Making Multi-Armed Bandits
Code Code Available 05 Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits Aug 8, 2024 Exposure Fairness Fairness
Code Code Available 05 A Survey on Contextual Multi-armed Bandits Aug 13, 2015 Multi-Armed Bandits Survey
Code Code Available 05 Machine Teaching of Active Sequential Learners Sep 8, 2018 Multi-Armed Bandits Probabilistic Programming
Code Code Available 05 A New Bandit Setting Balancing Information from State Evolution and Corrupted Context Nov 16, 2020 Decision Making Efficient Exploration
Code Code Available 05 Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits Jan 21, 2024 Multi-Armed Bandits regression
Code Code Available 05 Dual-Mandate Patrols: Multi-Armed Bandits for Green Security Sep 14, 2020 Multi-Armed Bandits
Code Code Available 05 Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces Jul 12, 2022 continuous-control Continuous Control
Code Code Available 05 Contextual bandits with entropy-based human feedback Feb 12, 2025 Multi-Armed Bandits
Code Code Available 05 Contextual Bandits with Stochastic Experts Feb 23, 2018 Multi-Armed Bandits
Code Code Available 05 Conditionally Risk-Averse Contextual Bandits Oct 24, 2022 Management Multi-Armed Bandits
Code Code Available 05 Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments Jun 17, 2025 Atari Games Board Games
Code Code Available 05 Confidence Intervals for Policy Evaluation in Adaptive Experiments Nov 7, 2019 Experimental Design Multi-Armed Bandits
Code Code Available 05 Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks Mar 9, 2023 Decision Making Multi-Armed Bandits
Code Code Available 05 A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits Apr 16, 2023 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Constrained regret minimization for multi-criterion multi-armed bandits Jun 17, 2020 Attribute Multi-Armed Bandits
Code Code Available 05 Balanced off-policy evaluation in general action spaces Jun 9, 2019 Binary Classification counterfactual
Code Code Available 05 Contextual Bandits with Large Action Spaces: Made Practical Jul 12, 2022 Decision Making Multi-Armed Bandits
Code Code Available 05 Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles Mar 3, 2017 Multi-Armed Bandits
Code Code Available 05 Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Sep 4, 2019 Multi-Armed Bandits
Code Code Available 05 Correlated Multi-armed Bandits with a Latent Random Source Aug 17, 2018 Multi-Armed Bandits
Code Code Available 05 Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Code Code Available 05 RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Causally Abstracted Multi-armed Bandits Apr 26, 2024 Decision Making Multi-Armed Bandits
Code Code Available 05 Combinatorial Bandits under Strategic Manipulations Feb 25, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Cascading Bandits for Large-Scale Recommendation Problems Mar 17, 2016 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Causal Contextual Bandits with Adaptive Context May 28, 2024 Multi-Armed Bandits
Code Code Available 05