A Convex Framework for Confounding Robust Inference Sep 21, 2023 Model Selection Multi-Armed Bandits
Contextual bandits with entropy-based human feedback Feb 12, 2025 Multi-Armed Bandits
Contextual Bandits with Stochastic Experts Feb 23, 2018 Multi-Armed Bandits
Efficient Explorative Key-term Selection Strategies for Conversational Contextual Bandits Mar 1, 2023 Computational Efficiency Multi-Armed Bandits
Adaptive Data Depth via Multi-Armed Bandits Nov 8, 2022 Multi-Armed Bandits
Empirical Likelihood for Contextual Bandits Jun 7, 2019 Multi-Armed Bandits
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Estimation of Warfarin Dosage with Reinforcement Learning Sep 15, 2021 Multi-Armed Bandits reinforcement-learning
Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks Mar 9, 2023 Decision Making Multi-Armed Bandits
Federated Multi-armed Bandits with Personalization Feb 25, 2021 Federated Learning Multi-Armed Bandits
Finding All ε-Good Arms in Stochastic Bandits Jun 16, 2020 All Multi-Armed Bandits
Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits Jul 23, 2021 Multi-Armed Bandits
Conditionally Risk-Averse Contextual Bandits Oct 24, 2022 Management Multi-Armed Bandits
Gaussian Gated Linear Networks Jun 10, 2020 Denoising Density Estimation
Group Meritocratic Fairness in Linear Contextual Bandits Jun 7, 2022 Fairness Multi-Armed Bandits
Batched Multi-armed Bandits Problem Apr 3, 2019 Multi-Armed Bandits
Combinatorial Multi-armed Bandits for Resource Allocation May 10, 2021 Multi-Armed Bandits
Combinatorial Bandits under Strategic Manipulations Feb 25, 2021 Multi-Armed Bandits Recommendation Systems
Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents Aug 6, 2024 Multi-Armed Bandits Sensitivity
Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels Aug 10, 2024 Knowledge Tracing Multi-Armed Bandits
Confidence Intervals for Policy Evaluation in Adaptive Experiments Nov 7, 2019 Experimental Design Multi-Armed Bandits
Identification of the Generalized Condorcet Winner in Multi-dueling Bandits Dec 1, 2021 Multi-Armed Bandits
Causal Contextual Bandits with Adaptive Context May 28, 2024 Multi-Armed Bandits
Cascading Bandits for Large-Scale Recommendation Problems Mar 17, 2016 Multi-Armed Bandits Recommendation Systems
Introduction to Multi-Armed Bandits Apr 15, 2019 Multi-Armed Bandits
Bayesian Design Principles for Frequentist Sequential Learning Oct 1, 2023 Multi-Armed Bandits reinforcement-learning
Bayesian Optimisation over Multiple Continuous and Categorical Inputs Jun 20, 2019 Bayesian Optimisation Diversity
Inverse Contextual Bandits: Learning How Behavior Evolves over Time Jul 13, 2021 Benchmarking Decision Making
Scalable Exploration via Ensemble++ Jul 18, 2024 Computational Efficiency Decision Making
Kernel Conditional Moment Constraints for Confounding Robust Inference Feb 26, 2023 Multi-Armed Bandits Sensitivity
Causally Abstracted Multi-armed Bandits Apr 26, 2024 Decision Making Multi-Armed Bandits
Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals Jun 12, 2023 Multi-Armed Bandits
Nonstationary Continuum-Armed Bandit Strategies for Automated Trading in a Simulated Financial Market Aug 4, 2022 Bayesian Optimisation Bayesian Optimization
Let's Get It Started: Fostering the Discoverability of New Releases on Deezer Jan 5, 2024 Multi-Armed Bandits
Model selection for contextual bandits Jun 3, 2019 model Model Selection
Locally Differentially Private (Contextual) Bandits Learning Jun 1, 2020 Multi-Armed Bandits Privacy Preserving Deep Learning
An Empirical Evaluation of Federated Contextual Bandit Algorithms Mar 17, 2023 Federated Learning Multi-Armed Bandits
Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery Feb 24, 2024 Multi-Armed Bandits
Best Arm Identification with Fixed Budget: A Large Deviation Perspective Dec 19, 2023 Multi-Armed Bandits
Adaptive Linear Estimating Equations Jul 14, 2023 Multi-Armed Bandits
Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback Sep 4, 2019 Multi-Armed Bandits
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Oct 15, 2019 Multi-Armed Bandits reinforcement-learning
Distribution oblivious, risk-aware algorithms for multi-armed bandits with unbounded rewards Jun 3, 2019 Multi-Armed Bandits
More Robust Doubly Robust Off-policy Evaluation Feb 10, 2018 Multi-Armed Bandits Off-policy evaluation
Bandit-Based Monte Carlo Optimization for Nearest Neighbors May 21, 2018 Clustering Multi-Armed Bandits
Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness Feb 21, 2025 Fairness Multi-Armed Bandits
An Experimental Design for Anytime-Valid Causal Inference on Multi-Armed Bandits Nov 9, 2023 Causal Inference Experimental Design
Multi-Armed Bandits in Brain-Computer Interfaces May 19, 2022 Multi-Armed Bandits
Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting Jun 18, 2020 Multi-Armed Bandits Off-policy evaluation
Decentralized Cooperative Stochastic Bandits Oct 10, 2018 Multi-Armed Bandits