Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Feb 18, 2023 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 0Approximate Thompson Sampling via Epistemic Neural Networks Feb 18, 2023 Thompson Sampling
Code Code Available 1A Bandit Approach to Online Pricing for Heterogeneous Edge Resource Allocation Feb 14, 2023 Edge-computing Thompson Sampling
— Unverified 0Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration Feb 8, 2023 Bayesian Inference Thompson Sampling
— Unverified 0Leveraging Demonstrations to Improve Online Learning: Quality Matters Feb 7, 2023 Thompson Sampling
— Unverified 0Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits Feb 3, 2023 Thompson Sampling
— Unverified 0Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints Jan 24, 2023 Thompson Sampling
Code Code Available 0Differentially Private Online Bayesian Estimation With Adaptive Truncation Jan 19, 2023 Privacy Preserving Sensitivity
Code Code Available 0A Combinatorial Semi-Bandit Approach to Charging Station Selection for Electric Vehicles Jan 17, 2023 Combinatorial Optimization Thompson Sampling
— Unverified 0Thompson Sampling with Diffusion Generative Prior Jan 12, 2023 Decision Making Denoising
— Unverified 0Reinforcement Learning in Credit Scoring and Underwriting Dec 15, 2022 Decision Making Efficient Exploration
— Unverified 0Neural Bandits for Data Mining: Searching for Dangerous Polypharmacy Dec 10, 2022 Thompson Sampling
Code Code Available 0Online Learning-based Waveform Selection for Improved Vehicle Recognition in Automotive Radar Dec 1, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning Nov 23, 2022 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits Nov 11, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Atlas: Automate Online Service Configuration in Network Slicing Oct 30, 2022 Bayesian Optimization Safe Exploration
Code Code Available 0Optimizing Pessimism in Dynamic Treatment Regimes: A Bayesian Learning Approach Oct 26, 2022 Thompson Sampling Variational Inference
Code Code Available 0Meta Learning of Interface Conditions for Multi-Domain Physics-Informed Neural Networks Oct 23, 2022 Meta-Learning Thompson Sampling
— Unverified 0Sample-Then-Optimize Batch Neural Thompson Sampling Oct 13, 2022 AutoML Bayesian Optimization
Code Code Available 1Deep Active Ensemble Sampling For Image Classification Oct 11, 2022 Active Learning Classification
— Unverified 0The Typical Behavior of Bandit Algorithms Oct 11, 2022 Thompson Sampling
— Unverified 0Cost Aware Asynchronous Multi-Agent Active Search Oct 5, 2022 Decision Making Thompson Sampling
— Unverified 0Thompson Sampling with Virtual Helping Agents Sep 16, 2022 Decision Making Sequential Decision Making
— Unverified 0Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits Sep 15, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Nonparametric Contextual Bandit with Arm-level Eligibility Control for Customer Service Routing Sep 8, 2022 Thompson Sampling
— Unverified 0Sample Efficient Learning of Factored Embeddings of Tensor Fields Sep 1, 2022 Recommendation Systems Thompson Sampling
— Unverified 0Causal Bandits for Linear Structural Equation Models Aug 26, 2022 Thompson Sampling
Code Code Available 0Dynamic collaborative filtering Thompson Sampling for cross-domain advertisements recommendation Aug 25, 2022 Collaborative Filtering Recommendation Systems
— Unverified 0A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0Non-Stationary Dynamic Pricing Via Actor-Critic Information-Directed Pricing Aug 19, 2022 Thompson Sampling
— Unverified 0Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits Aug 10, 2022 Management Multi-Armed Bandits
— Unverified 0Using Adaptive Experiments to Rapidly Help Students Aug 10, 2022 Thompson Sampling
— Unverified 0Bayesian Optimization-Based Beam Alignment for MmWave MIMO Communication Systems Jul 28, 2022 Bayesian Optimization Thompson Sampling
— Unverified 0SPRT-based Efficient Best Arm Identification in Stochastic Bandits Jul 22, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level Synthesis Jul 3, 2022 Active Learning Descriptive
— Unverified 0Ranking In Generalized Linear Bandits Jun 30, 2022 Diversity Multi-Armed Bandits
Code Code Available 0Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs Jun 24, 2022 Thompson Sampling
— Unverified 0Langevin Monte Carlo for Contextual Bandits Jun 22, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 1Analysis of Thompson Sampling for Controlling Unknown Linear Diffusion Processes Jun 20, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Thompson Sampling for (Combinatorial) Pure Exploration Jun 18, 2022 Thompson Sampling
— Unverified 0Thompson Sampling for Robust Transfer in Multi-Task Bandits Jun 17, 2022 Multi-Task Learning Thompson Sampling
Code Code Available 0Thompson Sampling Achieves O(T) Regret in Linear Quadratic Control Jun 17, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification Jun 16, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0On Provably Robust Meta-Bayesian Optimization Jun 14, 2022 Bayesian Optimization Meta-Learning
Code Code Available 0Top Two Algorithms Revisited Jun 13, 2022 Thompson Sampling Vocal Bursts Valence Prediction
— Unverified 0Regret Bounds for Information-Directed Reinforcement Learning Jun 9, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 0Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization Jun 5, 2022 BIG-bench Machine Learning Evolutionary Algorithms
— Unverified 0Incentivizing Combinatorial Bandit Exploration Jun 1, 2022 Thompson Sampling
— Unverified 0