Information-Directed Selection for Top-Two Algorithms May 24, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Neural Contextual Bandits Based Dynamic Sensor Selection for Low-Power Body-Area Networks May 24, 2022 Anomaly Detection Multi-Armed Bandits
— Unverified 0Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs May 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Falsification of Multiple Requirements for Cyber-Physical Systems Using Online Generative Adversarial Networks and Multi-Armed Bandits May 23, 2022 Multi-Armed Bandits
— Unverified 0Contextual Information-Directed Sampling May 22, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Pessimism for Offline Linear Contextual Bandits using _p Confidence Sets May 21, 2022 Multi-Armed Bandits
— Unverified 0SplitPlace: AI Augmented Splitting and Placement of Large-Scale Neural Networks in Mobile Edge Environments May 21, 2022 Edge-computing Multi-Armed Bandits
Code Code Available 1Stability Enforced Bandit Algorithms for Channel Selection in Remote State Estimation of Gauss-Markov Processes May 20, 2022 channel selection Multi-Armed Bandits
— Unverified 0Breaking the T Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits May 19, 2022 Multi-Armed Bandits parameter estimation
— Unverified 0Multi-Armed Bandits in Brain-Computer Interfaces May 19, 2022 Multi-Armed Bandits
Code Code Available 0Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs May 18, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization May 17, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses May 16, 2022 Multi-Armed Bandits
— Unverified 0Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions May 13, 2022 Multi-Armed Bandits
— Unverified 0A Survey of Risk-Aware Multi-Armed Bandits May 12, 2022 Multi-Armed Bandits Portfolio Optimization
— Unverified 0Selectively Contextual Bandits May 9, 2022 Multi-Armed Bandits
— Unverified 0Federated Multi-Armed Bandits Under Byzantine Attacks May 9, 2022 Data Poisoning Decision Making
— Unverified 0Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces May 8, 2022 BIG-bench Machine Learning Deep Reinforcement Learning
Code Code Available 1Multi-Player Multi-Armed Bandits with Finite Shareable Resources Arms: Learning Algorithms & Applications Apr 28, 2022 Edge-computing Multi-Armed Bandits
— Unverified 0Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling Apr 26, 2022 Decision Making Evolutionary Algorithms
Code Code Available 0Thompson Sampling for Bandit Learning in Matching Markets Apr 26, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Rate-Constrained Remote Contextual Bandits Apr 26, 2022 Marketing Multi-Armed Bandits
— Unverified 0Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations Apr 10, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Stochastic Multi-armed Bandits with Non-stationary Rewards Generated by a Linear Dynamical System Apr 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk Apr 1, 2022 Multi-Armed Bandits
— Unverified 0Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles Mar 30, 2022 Decision Making Heterogeneous Treatment Effect Estimation
— Unverified 0Best Arm Identification in Restless Markov Multi-Armed Bandits Mar 29, 2022 Multi-Armed Bandits
— Unverified 0On Kernelized Multi-Armed Bandits with Constraints Mar 29, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Modeling Attrition in Recommender Systems with Departing Bandits Mar 25, 2022 Multi-Armed Bandits Recommendation Systems
— Unverified 0Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking Mar 24, 2022 Bayesian Optimization Decision Making
Code Code Available 0Efficient Algorithms for Extreme Bandits Mar 21, 2022 Multi-Armed Bandits
Code Code Available 0Approximate Function Evaluation via Multi-Armed Bandits Mar 18, 2022 Multi-Armed Bandits
— Unverified 0Reinforced Meta Active Learning Mar 9, 2022 Active Learning Informativeness
— Unverified 0Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits Mar 8, 2022 Multi-Armed Bandits
— Unverified 0PAC-Bayesian Lifelong Learning For Multi-Armed Bandits Mar 7, 2022 Lifelong learning Multi-Armed Bandits
— Unverified 0Restless Multi-Armed Bandits under Exogenous Global Markov Process Feb 28, 2022 Multi-Armed Bandits
— Unverified 0Federated Online Sparse Decision Making Feb 27, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Truncated LinUCB for Stochastic Linear Bandits Feb 23, 2022 Multi-Armed Bandits
Code Code Available 0The Pareto Frontier of Instance-Dependent Guarantees in Multi-Player Multi-Armed Bandits with no Communication Feb 19, 2022 Multi-Armed Bandits
— Unverified 0Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits Feb 16, 2022 Multi-Armed Bandits
— Unverified 0Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences Feb 14, 2022 Multi-Armed Bandits
— Unverified 0Off-Policy Evaluation for Large Action Spaces via Embeddings Feb 13, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 2Shuffle Private Linear Contextual Bandits Feb 11, 2022 Multi-Armed Bandits
— Unverified 0Efficient Kernel UCB for Contextual Bandits Feb 11, 2022 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Remote Contextual Bandits Feb 10, 2022 Marketing Multi-Armed Bandits
— Unverified 0Settling the Communication Complexity for Distributed Offline Reinforcement Learning Feb 10, 2022 Multi-Armed Bandits Offline RL
— Unverified 0Smoothed Online Learning is as Easy as Statistical Learning Feb 9, 2022 Learning Theory Multi-Armed Bandits
— Unverified 0Budgeted Combinatorial Multi-Armed Bandits Feb 8, 2022 Multi-Armed Bandits
— Unverified 0Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits Feb 3, 2022 counterfactual Multi-Armed Bandits
— Unverified 0Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model Feb 3, 2022 Multi-Armed Bandits Off-policy evaluation
Code Code Available 2