Chimera: A Hybrid Machine Learning Driven Multi-Objective Design Space Exploration Tool for FPGA High-Level Synthesis Jul 3, 2022 Active Learning Descriptive
— Unverified 0Code Repair with LLMs gives an Exploration-Exploitation Tradeoff May 26, 2024 Code Repair Language Modeling
— Unverified 0Bayesian Analysis of Combinatorial Gaussian Process Bandits Dec 20, 2023 Bayesian Inference Informativeness
— Unverified 0Combinatorial Multi-armed Bandits: Arm Selection via Group Testing Oct 14, 2024 Multi-Armed Bandits parameter estimation
— Unverified 0BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems Nov 15, 2017 Deep Reinforcement Learning Efficient Exploration
— Unverified 0Combinatorial Neural Bandits May 31, 2023 Thompson Sampling
— Unverified 0Combining Bayesian Optimization and Lipschitz Optimization Oct 10, 2018 Bayesian Optimization global-optimization
— Unverified 0Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs Jun 5, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Connecting Thompson Sampling and UCB: Towards More Efficient Trade-offs Between Privacy and Regret May 5, 2025 Thompson Sampling
— Unverified 0Connections Between Mirror Descent, Thompson Sampling and the Information Ratio May 28, 2019 Thompson Sampling
— Unverified 0Constrained Contextual Bandit Learning for Adaptive Radar Waveform Selection Mar 9, 2021 Thompson Sampling
— Unverified 0Constrained Thompson Sampling for Real-Time Electricity Pricing with Grid Reliability Constraints Jun 17, 2020 Thompson Sampling
— Unverified 0Constrained Thompson Sampling for Wireless Link Optimization Feb 28, 2019 Thompson Sampling
— Unverified 0A Reinforcement Learning based Reset Policy for CDCL SAT Solvers Apr 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems Aug 19, 2021 Thompson Sampling
— Unverified 0Context Attentive Bandits: Contextual Bandit with Restricted Context May 10, 2017 Recommendation Systems Thompson Sampling
— Unverified 0Context Attribution with Multi-Armed Bandit Optimization Jun 24, 2025 Thompson Sampling
— Unverified 0Adaptive Portfolio by Solving Multi-armed Bandit via Thompson Sampling Nov 13, 2019 Decision Making Management
— Unverified 0Contextual Bandits for Advertising Budget Allocation Aug 22, 2020 Marketing Multi-Armed Bandits
— Unverified 0Contextual Bandits with Non-Stationary Correlated Rewards for User Association in MmWave Vehicular Networks Oct 8, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Contextual Bandit with Herding Effects: Algorithms and Recommendation Applications Aug 26, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model Jan 31, 2019 Recommendation Systems Thompson Sampling
— Unverified 0Contextual Multi-Armed Bandits for Causal Marketing Oct 2, 2018 Causal Inference counterfactual
— Unverified 0Contextual Thompson Sampling via Generation of Missing Data Feb 10, 2025 Decision Making Fairness
— Unverified 0Convergence Rates of Posterior Distributions in Markov Decision Process Jul 22, 2019 Thompson Sampling
— Unverified 0Convolutional Monte Carlo Rollouts in Go Dec 10, 2015 GPU Thompson Sampling
— Unverified 0Cost Aware Asynchronous Multi-Agent Active Search Oct 5, 2022 Decision Making Thompson Sampling
— Unverified 0Cost-efficient Knowledge-based Question Answering with Large Language Models May 27, 2024 Knowledge Graphs Model Selection
— Unverified 0Asymptotically Optimal Bandits under Weighted Information May 28, 2021 Thompson Sampling
— Unverified 0Counterfactual Data-Fusion for Online Reinforcement Learners Aug 1, 2017 counterfactual Decision Making
— Unverified 0Counterfactual Inference under Thompson Sampling Apr 3, 2025 Causal Inference counterfactual
— Unverified 0Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits Feb 23, 2024 Thompson Sampling
— Unverified 0Cover Tree Bayesian Reinforcement Learning May 8, 2013 reinforcement-learning Reinforcement Learning
— Unverified 0Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models Nov 22, 2017 Multi-Armed Bandits Response Generation
— Unverified 0Asymptotic Convergence of Thompson Sampling Nov 8, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Debiasing Samples from Online Learning Using Bootstrap Jul 31, 2021 Off-policy evaluation Thompson Sampling
— Unverified 0Decentralized Multi-Agent Active Search and Tracking when Targets Outnumber Agents Jan 6, 2024 Decision Making Thompson Sampling
— Unverified 0Deciding What to Learn: A Rate-Distortion Approach Jan 15, 2021 Decision Making Sequential Decision Making
— Unverified 0Deconfounded Warm-Start Thompson Sampling with Applications to Precision Medicine May 22, 2025 Thompson Sampling
— Unverified 0Deep Active Ensemble Sampling For Image Classification Oct 11, 2022 Active Learning Classification
— Unverified 0Bayesian Quantile and Expectile Optimisation Jan 12, 2020 Bayesian Optimisation Gaussian Processes
— Unverified 0An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits Dec 3, 2024 Thompson Sampling
— Unverified 0Deep Contextual Multi-armed Bandits Jul 25, 2018 Marketing Multi-Armed Bandits
— Unverified 0Deep Exploration for Recommendation Systems Sep 26, 2021 Recommendation Systems Thompson Sampling
— Unverified 0Deep Hierarchy in Bandits Feb 3, 2022 Thompson Sampling
— Unverified 0Delay-Adaptive Learning in Generalized Linear Contextual Bandits Mar 11, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adaptively Optimize Content Recommendation Using Multi Armed Bandit Algorithms in E-commerce Jul 30, 2021 Thompson Sampling
— Unverified 0Differentially Private Federated Bayesian Optimization with Distributed Exploration Oct 27, 2021 Bayesian Optimization Federated Learning
— Unverified 0Diffusion Approximations for Thompson Sampling May 19, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0A Copula approach for hyperparameter transfer learning Sep 25, 2019 Bayesian Optimization Thompson Sampling
— Unverified 0