| Neural Contextual Bandits without Regret | Jul 7, 2021 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Interactively Learning Preference Constraints in Linear Bandits | Jun 10, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback | Sep 16, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| Interactive Machine Comprehension with Information Seeking Agents | Aug 27, 2019 | Decision MakingInformation Retrieval | CodeCode Available | 0 |
| Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing | Dec 21, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| TraCE: Trajectory Counterfactual Explanation Scores | Sep 27, 2023 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations | Oct 19, 2021 | Decision MakingModel Selection | CodeCode Available | 0 |
| AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive Crossbars | Nov 15, 2021 | CPUDecision Making | CodeCode Available | 0 |
| Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Feb 12, 2024 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Value-Distributional Model-Based Reinforcement Learning | Aug 12, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |