| Neural Contextual Bandits without Regret | Jul 7, 2021 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Interactively Learning Preference Constraints in Linear Bandits | Jun 10, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback | Sep 16, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| Interactive Machine Comprehension with Information Seeking Agents | Aug 27, 2019 | Decision MakingInformation Retrieval | CodeCode Available | 0 |
| Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing | Dec 21, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| TraCE: Trajectory Counterfactual Explanation Scores | Sep 27, 2023 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations | Oct 19, 2021 | Decision MakingModel Selection | CodeCode Available | 0 |
| AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive Crossbars | Nov 15, 2021 | CPUDecision Making | CodeCode Available | 0 |
| Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Feb 12, 2024 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Value-Distributional Model-Based Reinforcement Learning | Aug 12, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions | Jul 31, 2021 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem | Sep 24, 2022 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action SegmentationClassification | CodeCode Available | 0 |
| Nonmyopic Global Optimisation via Approximate Dynamic Programming | Dec 6, 2024 | Bayesian OptimisationGaussian Processes | CodeCode Available | 0 |
| Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages | Aug 28, 2023 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | May 20, 2024 | Atari GamesMamba | CodeCode Available | 0 |
| Robust Active Measuring under Model Uncertainty | Dec 18, 2023 | Decision Makingmodel | CodeCode Available | 0 |
| Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus Erythematosus | Apr 9, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 |
| Bounded rationality for relaxing best response and mutual consistency: The Quantal Hierarchy model of decision-making | Jun 30, 2021 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach | Nov 28, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Robust Anytime Learning of Markov Decision Processes | May 31, 2022 | Bayesian InferenceDecision Making | CodeCode Available | 0 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 |
| Common Benchmarks Undervalue the Generalization Power of Programmatic Policies | Jun 17, 2025 | Sequential Decision Making | CodeCode Available | 0 |
| Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device Placement | Jan 21, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |