| Enhancing the Accuracy and Fairness of Human Decision Making | May 25, 2018 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Curriculum Design for Teaching via Demonstrations: Theory and Applications | Jun 8, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Feb 13, 2021 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Agent-State Construction with Auxiliary Inputs | Nov 15, 2022 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | May 24, 2022 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Efficient Sequence Labeling with Actor-Critic Training | Sep 30, 2018 | Decision MakingNER | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Personalized Diagnostic Decision Pathways Using Electronic Health Records: A Comparative Study on Anemia and Systemic Lupus Erythematosus | Apr 9, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active LearningDecision Making | CodeCode Available | 0 | 5 |
| Learning Non-myopic Power Allocation in Constrained Scenarios | Jan 18, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification | Jun 21, 2018 | Action SegmentationClassification | CodeCode Available | 0 | 5 |
| Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments | Jun 17, 2025 | Atari GamesBoard Games | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Learning to Generalize for Sequential Decision Making | Oct 5, 2020 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Feb 13, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision MakingOffline RL | CodeCode Available | 0 | 5 |
| Deep Variational Reinforcement Learning for POMDPs | Jun 6, 2018 | Decision MakingInductive Bias | CodeCode Available | 0 | 5 |
| Autoregressive Bandits | Dec 12, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Dynamic Real-time Multimodal Routing with Hierarchical Hybrid Planning | Feb 5, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 | 5 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Feb 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Loss Bounds for Approximate Influence-Based Abstraction | Nov 3, 2020 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| DeLF: Designing Learning Environments with Foundation Models | Jan 17, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Oct 17, 2023 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Ecole: A Library for Learning Inside MILP Solvers | Apr 6, 2021 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 | 5 |
| Making Universal Policies Universal | Feb 20, 2025 | Imitation LearningSequential Decision Making | CodeCode Available | 0 | 5 |
| Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Oct 16, 2024 | Attributecounterfactual | CodeCode Available | 0 | 5 |
| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision MakingMuJoCo | CodeCode Available | 0 | 5 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Dec 20, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Doubly Inhomogeneous Reinforcement Learning | Nov 8, 2022 | Change Point DetectionClustering | CodeCode Available | 0 | 5 |
| Co-training for Policy Learning | Jul 3, 2019 | Combinatorial Optimizationcontinuous-control | CodeCode Available | 0 | 5 |
| A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Nov 16, 2020 | Decision MakingEfficient Exploration | CodeCode Available | 0 | 5 |
| Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making | May 27, 2023 | Adversarial AttackDecision Making | CodeCode Available | 0 | 5 |
| Batch Bayesian optimisation via density-ratio estimation with guarantees | Sep 22, 2022 | Bayesian InferenceBayesian Optimisation | CodeCode Available | 0 | 5 |
| Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Sep 7, 2023 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Neural Contextual Bandits without Regret | Jul 7, 2021 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 | 5 |
| Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces | Jul 8, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation SystemsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| A Survey of Continual Reinforcement Learning | Jun 27, 2025 | Continual LearningDecision Making | —Unverified | 0 | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| A Sufficient Statistic for Influence in Structured Multiagent Environments | Jul 22, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games | Oct 4, 2020 | BIG-bench Machine LearningDecision Making | —Unverified | 0 | 0 |
| Cooperative Bayesian Optimization for Imperfect Agents | Mar 7, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| Convex Regularization in Monte-Carlo Tree Search | Jul 1, 2020 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| A Strong Baseline for Batch Imitation Learning | Feb 6, 2023 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning | Oct 22, 2024 | Decision MakingDiversity | —Unverified | 0 | 0 |
| Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs | Jun 6, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |