Efficient Explorative Key-term Selection Strategies for Conversational Contextual Bandits Mar 1, 2023 Computational Efficiency Multi-Armed Bandits
From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization Mar 7, 2019 compressed sensing Multi-Armed Bandits
Efficient Kernel UCB for Contextual Bandits Feb 11, 2022 Computational Efficiency Multi-Armed Bandits
Addressing the Long-term Impact of ML Decisions via Policy Regret Jun 2, 2021 Multi-Armed Bandits
Estimation of Warfarin Dosage with Reinforcement Learning Sep 15, 2021 Multi-Armed Bandits reinforcement-learning
Evaluating Deep Vs. Wide & Deep Learners As Contextual Bandits For Personalized Email Promo Recommendations Jan 31, 2022 Multi-Armed Bandits Thompson Sampling
Fairness of Exposure in Online Restless Multi-armed Bandits Feb 9, 2024 Fairness Multi-Armed Bandits
Adversarial Attacks on Combinatorial Multi-Armed Bandits Oct 8, 2023 Multi-Armed Bandits
Adapting multi-armed bandits policies to contextual bandits scenarios Nov 11, 2018 Binary Classification Classification
Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards Jun 3, 2019 Multi-Armed Bandits
Federated Neural Bandits May 28, 2022 Multi-Armed Bandits
Finding All ε-Good Arms in Stochastic Bandits Jun 16, 2020 All Multi-Armed Bandits
Empirical analysis of representation learning and exploration in neural kernel bandits Nov 5, 2021 Bayesian Inference Decision Making
Doubly robust off-policy evaluation with shrinkage Jul 22, 2019 Model Selection Multi-Armed Bandits
Doubly Robust Policy Evaluation and Learning Mar 23, 2011 Decision Making Multi-Armed Bandits
Gaussian Gated Linear Networks Jun 10, 2020 Denoising Density Estimation
An Empirical Evaluation of Federated Contextual Bandit Algorithms Mar 17, 2023 Federated Learning Multi-Armed Bandits
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling Feb 26, 2018 Decision Making Deep Reinforcement Learning
Decentralized Cooperative Stochastic Bandits Oct 10, 2018 Multi-Armed Bandits
Correlated Multi-armed Bandits with a Latent Random Source Aug 17, 2018 Multi-Armed Bandits
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits Jan 21, 2024 Multi-Armed Bandits regression
Contextual Bandits with Stochastic Experts Feb 23, 2018 Multi-Armed Bandits
Active Feature Selection for the Mutual Information Criterion Dec 13, 2020 feature selection Multi-Armed Bandits
Contextual Linear Bandits under Noisy Features: Towards Bayesian Oracles Mar 3, 2017 Multi-Armed Bandits
Contextual bandits with entropy-based human feedback Feb 12, 2025 Multi-Armed Bandits
Confidence Intervals for Policy Evaluation in Adaptive Experiments Nov 7, 2019 Experimental Design Multi-Armed Bandits
Conditionally Risk-Averse Contextual Bandits Oct 24, 2022 Management Multi-Armed Bandits
Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Contextual Bandits with Large Action Spaces: Made Practical Jul 12, 2022 Decision Making Multi-Armed Bandits
(Almost) Free Incentivized Exploration from Decentralized Learning Agents Oct 27, 2021 Multi-Armed Bandits
Constrained regret minimization for multi-criterion multi-armed bandits Jun 17, 2020 Attribute Multi-Armed Bandits
Adaptive Estimator Selection for Off-Policy Evaluation Feb 18, 2020 Multi-Armed Bandits Off-policy evaluation
Corralling a Band of Bandit Algorithms Dec 19, 2016 Multi-Armed Bandits
Adaptive Experimentation with Delayed Binary Feedback Feb 2, 2022 Multi-Armed Bandits valid
Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces Jul 12, 2022 Continuous Control
Doubly-Robust Lasso Bandit Jul 26, 2019 Multi-Armed Bandits Recommendation Systems
A New Bandit Setting Balancing Information from State Evolution and Corrupted Context Nov 16, 2020 Decision Making Efficient Exploration
Scalable Exploration via Ensemble++ Jul 18, 2024 Computational Efficiency Decision Making
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Combinatorial Bandits under Strategic Manipulations Feb 25, 2021 Multi-Armed Bandits Recommendation Systems
Adaptive Data Depth via Multi-Armed Bandits Nov 8, 2022 Multi-Armed Bandits
Combinatorial Multi-armed Bandits for Resource Allocation May 10, 2021 Multi-Armed Bandits
Adaptive Linear Estimating Equations Jul 14, 2023 Multi-Armed Bandits
Causally Abstracted Multi-armed Bandits Apr 26, 2024 Decision Making Multi-Armed Bandits
A Convex Framework for Confounding Robust Inference Sep 21, 2023 Model Selection Multi-Armed Bandits
Bandit-Based Monte Carlo Optimization for Nearest Neighbors May 21, 2018 Clustering Multi-Armed Bandits
An Experimental Design for Anytime-Valid Causal Inference on Multi-Armed Bandits Nov 9, 2023 Causal Inference Experimental Design
Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems: The ARTEO Algorithm Nov 10, 2022 Decision Making Decision Making Under Uncertainty
Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Sep 4, 2019 Multi-Armed Bandits