NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction Mar 20, 2025 Conformal Prediction Decision Making
Code Code Available 05 Nonparametric Gaussian Mixture Models for the Multi-Armed Bandit Aug 8, 2018 Density Estimation Multi-Armed Bandits
Code Code Available 05 Constrained regret minimization for multi-criterion multi-armed bandits Jun 17, 2020 Attribute Multi-Armed Bandits
Code Code Available 05 Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction Feb 3, 2024 Marketing Multi-Armed Bandits
Code Code Available 05 Model selection for contextual bandits Jun 3, 2019 model Model Selection
Code Code Available 05 On-line Adaptative Curriculum Learning for GANs Jul 31, 2018 Multi-Armed Bandits Stochastic Optimization
Code Code Available 05 On Private Online Convex Optimization: Optimal Algorithms in ℓ_p-Geometry and High Dimensional Contextual Bandits Jun 16, 2022 Multi-Armed Bandits
Code Code Available 05 Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals Jun 12, 2023 Multi-Armed Bandits
Code Code Available 05 The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms Feb 24, 2020 Multi-Armed Bandits
Code Code Available 05 Optimal Baseline Corrections for Off-Policy Contextual Bandits May 9, 2024 Decision Making Multi-Armed Bandits
Code Code Available 05 From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization Mar 7, 2019 compressed sensing Multi-Armed Bandits
Code Code Available 05 Information-Directed Selection for Top-Two Algorithms May 24, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 05 Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework Jan 31, 2022 Bayesian Inference Multi-Armed Bandits
Code Code Available 05 Conditionally Risk-Averse Contextual Bandits Oct 24, 2022 Management Multi-Armed Bandits
Code Code Available 05 Cascading Bandits for Large-Scale Recommendation Problems Mar 17, 2016 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Offline Contextual Bandits with Overparameterized Models Jun 27, 2020 Multi-Armed Bandits Q-Learning
Code Code Available 05 Contextual bandits with entropy-based human feedback Feb 12, 2025 Multi-Armed Bandits
Code Code Available 05 Causal Contextual Bandits with Adaptive Context May 28, 2024 Multi-Armed Bandits
Code Code Available 05 Addressing the Long-term Impact of ML Decisions via Policy Regret Jun 2, 2021 Multi-Armed Bandits
Code Code Available 05 Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Code Code Available 05 Causally Abstracted Multi-armed Bandits Apr 26, 2024 Decision Making Multi-Armed Bandits
Code Code Available 05 Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Sep 4, 2019 Multi-Armed Bandits
Code Code Available 05 Doubly Robust Policy Evaluation and Optimization Mar 10, 2015 Decision Making Multi-Armed Bandits
Code Code Available 05 Maximizing and Satisficing in Multi-armed Bandits with Graph Information Aug 2, 2021 Decision Making Multi-Armed Bandits
Code Code Available 05 Quantile Bandits for Best Arms Identification Oct 22, 2020 Decision Making Multi-Armed Bandits
Code Code Available 05 Quantum exploration algorithms for multi-armed bandits Jul 14, 2020 Multi-Armed Bandits
Code Code Available 05 Recurrent Neural-Linear Posterior Sampling for Nonstationary Contextual Bandits Jul 9, 2020 Multi-Armed Bandits
Code Code Available 05 Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems May 29, 2019 Multi-Armed Bandits Thompson Sampling
Code Code Available 05 Reinforcement Learning for Physical Layer Communications Jun 22, 2021 Deep Reinforcement Learning Multi-Armed Bandits
Code Code Available 05 Relational Boosted Bandits Dec 16, 2020 Attribute Descriptive
Code Code Available 05 Group Meritocratic Fairness in Linear Contextual Bandits Jun 7, 2022 Fairness Multi-Armed Bandits
Code Code Available 05 Combinatorial Bandits under Strategic Manipulations Feb 25, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Semiparametric Contextual Bandits Mar 12, 2018 Multi-Armed Bandits
Code Code Available 05 Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity Apr 10, 2024 Decision Making Meta Reinforcement Learning
Code Code Available 05 Adversarial Attacks on Combinatorial Multi-Armed Bandits Oct 8, 2023 Multi-Armed Bandits
Code Code Available 05 Combinatorial Multi-armed Bandits for Resource Allocation May 10, 2021 Multi-Armed Bandits
Code Code Available 05 Simultaneously Achieving Group Exposure Fairness and Within-Group Meritocracy in Stochastic Bandits Feb 8, 2024 Attribute Exposure Fairness
Code Code Available 05 Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes Sep 5, 2019 Multi-Armed Bandits
Code Code Available 05 Adapting multi-armed bandits policies to contextual bandits scenarios Nov 11, 2018 Binary Classification Classification
Code Code Available 05 Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning Dec 1, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Test-Time Scaling of Diffusion Models via Noise Trajectory Search May 24, 2025 Denoising Image Generation
Code Code Available 05 The Assistive Multi-Armed Bandit Jan 24, 2019 Multi-Armed Bandits
Code Code Available 05 Thompson Sampling for Contextual Bandits with Linear Payoffs Sep 15, 2012 Multi-Armed Bandits Thompson Sampling
Code Code Available 05 Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits Nov 11, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 05 Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents Aug 6, 2024 Multi-Armed Bandits Sensitivity
Code Code Available 05 Thompson Sampling for Multinomial Logit Contextual Bandits Dec 1, 2019 Multi-Armed Bandits Thompson Sampling
Code Code Available 05 Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks Mar 9, 2023 Decision Making Multi-Armed Bandits
Code Code Available 05 A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit Oct 2, 2015 Decision Making Multi-Armed Bandits
Code Code Available 05 Multi-Armed Bandits with Correlated Arms Nov 6, 2019 Multi-Armed Bandits
Code Code Available 05 Variational inference for the multi-armed contextual bandit Sep 10, 2017 Multi-Armed Bandits Reinforcement Learning
Code Code Available 05