| Neural Contextual Bandits without Regret | Jul 7, 2021 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Interactively Learning Preference Constraints in Linear Bandits | Jun 10, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback | Sep 16, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| Interactive Machine Comprehension with Information Seeking Agents | Aug 27, 2019 | Decision MakingInformation Retrieval | CodeCode Available | 0 |
| Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing | Dec 21, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| TraCE: Trajectory Counterfactual Explanation Scores | Sep 27, 2023 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations | Oct 19, 2021 | Decision MakingModel Selection | CodeCode Available | 0 |
| AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive Crossbars | Nov 15, 2021 | CPUDecision Making | CodeCode Available | 0 |
| Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Feb 12, 2024 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Value-Distributional Model-Based Reinforcement Learning | Aug 12, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions | Jul 31, 2021 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem | Sep 24, 2022 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action SegmentationClassification | CodeCode Available | 0 |
| Nonmyopic Global Optimisation via Approximate Dynamic Programming | Dec 6, 2024 | Bayesian OptimisationGaussian Processes | CodeCode Available | 0 |
| Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages | Aug 28, 2023 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | May 20, 2024 | Atari GamesMamba | CodeCode Available | 0 |
| Robust Active Measuring under Model Uncertainty | Dec 18, 2023 | Decision Makingmodel | CodeCode Available | 0 |
| Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus Erythematosus | Apr 9, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Imbalanced Classification | Jan 5, 2019 | ClassificationDecision Making | CodeCode Available | 0 |
| Bounded rationality for relaxing best response and mutual consistency: The Quantal Hierarchy model of decision-making | Jun 30, 2021 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach | Nov 28, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Robust Anytime Learning of Markov Decision Processes | May 31, 2022 | Bayesian InferenceDecision Making | CodeCode Available | 0 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 |
| Common Benchmarks Undervalue the Generalization Power of Programmatic Policies | Jun 17, 2025 | Sequential Decision Making | CodeCode Available | 0 |
| Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device Placement | Jan 21, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying | Aug 21, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Combining Experimental and Historical Data for Policy Evaluation | Jun 1, 2024 | Data IntegrationDecision Making | CodeCode Available | 0 |
| Quantization-Free Autoregressive Action Transformer | Mar 18, 2025 | Imitation LearningQuantization | CodeCode Available | 0 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Deep Q-Network for Angry Birds | Oct 4, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand | Apr 14, 2025 | Sequential Decision MakingSurvival Analysis | CodeCode Available | 0 |
| Robust Reinforcement Learning Under Minimax Regret for Green Security | Jun 15, 2021 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Active Sampling for MRI-based Sequential Decision Making | May 7, 2025 | Decision MakingDiagnostic | CodeCode Available | 0 |
| Quizbowl: The Case for Incremental Question Answering | Apr 9, 2019 | BIG-bench Machine LearningDecision Making | CodeCode Available | 0 |
| Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations | Jun 13, 2023 | Decision MakingDisentanglement | CodeCode Available | 0 |
| Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos | Jan 21, 2019 | Decision MakingMulti-Task Learning | CodeCode Available | 0 |
| Off-Policy Evaluation for Action-Dependent Non-Stationary Environments | Jan 24, 2023 | counterfactualCounterfactual Reasoning | CodeCode Available | 0 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement LearningSequential Decision Making | CodeCode Available | 0 |
| Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory | Oct 3, 2024 | Representation LearningSequential Decision Making | CodeCode Available | 0 |
| Off-Policy Optimization of Portfolio Allocation Policies under Constraints | Dec 21, 2020 | Decision MakingPortfolio Optimization | CodeCode Available | 0 |
| Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding | Mar 12, 2020 | Decision MakingManagement | CodeCode Available | 0 |
| Hierarchical Reinforcement Learning with AI Planning Models | Mar 1, 2022 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 0 |
| TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning | Jun 11, 2025 | Deep Reinforcement LearningSequential Decision Making | CodeCode Available | 0 |
| Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule Propagation | Jan 30, 2023 | Decision MakingGraph Attention | CodeCode Available | 0 |
| Temporal Shift Reinforcement Learning | Sep 5, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Learning Discrete State Abstractions With Deep Variational Inference | Mar 9, 2020 | Decision MakingMulti-Goal Reinforcement Learning | CodeCode Available | 0 |
| Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games | Dec 1, 2024 | Atari GamesDecision Making | CodeCode Available | 0 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Jan 6, 2024 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Learning Dynamic Selection and Pricing of Out-of-Home Deliveries | Nov 23, 2023 | BenchmarkingDecision Making | CodeCode Available | 0 |