| Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning | Jul 12, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Predicting Periodicity with Temporal Difference Learning | Sep 20, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Learning-to-defer for sequential medical decision-making under uncertainty | Sep 13, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Preference at First Sight | Jun 24, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Preference Optimization as Probabilistic Inference | Oct 5, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reference Points, Risk-Taking Behavior, and Competitive Outcomes in Sequential Settings | Sep 20, 2024 | counterfactualDecision Making | —Unverified | 0 |
| Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients | Dec 30, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Probabilistic DAG Search | Jun 16, 2021 | Decision Makingfeature selection | —Unverified | 0 |
| Probability Tools for Sequential Random Projection | Feb 16, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Proportional Aggregation of Preferences for Sequential Decision Making | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | Oct 20, 2023 | Decision MakingMulti-Task Learning | —Unverified | 0 |
| Provable Reinforcement Learning with a Short-Term Memory | Feb 8, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| PROVABLY BENEFITS OF DEEP HIERARCHICAL RL | Sep 25, 2019 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | May 2, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | Jul 1, 2023 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces | May 26, 2014 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Pure Exploration under Mediators' Feedback | Aug 29, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine | Jun 8, 2025 | Decision MakingQuantization | —Unverified | 0 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Doubly Inhomogeneous Reinforcement Learning | Nov 8, 2022 | Change Point DetectionClustering | CodeCode Available | 0 |
| Dynamic Real-time Multimodal Routing with Hierarchical Hybrid Planning | Feb 5, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Learning Non-myopic Power Allocation in Constrained Scenarios | Jan 18, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Feb 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning | May 30, 2022 | Decision MakingInductive Bias | CodeCode Available | 0 |
| Ecole: A Library for Learning Inside MILP Solvers | Apr 6, 2021 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 |
| On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression | Jan 16, 2025 | Autonomous DrivingClustering | CodeCode Available | 0 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari GamesDecision Making | CodeCode Available | 0 |
| Probabilistic Constrained Reinforcement Learning with Formal Interpretability | Jul 13, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Feb 13, 2021 | counterfactualDecision Making | CodeCode Available | 0 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | May 24, 2022 | counterfactualDecision Making | CodeCode Available | 0 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Apr 1, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Learning Structural Weight Uncertainty for Sequential Decision-Making | Dec 30, 2017 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Efficient Sequence Labeling with Actor-Critic Training | Sep 30, 2018 | Decision MakingNER | CodeCode Available | 0 |
| Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning | May 27, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Learning to Follow Instructions in Text-Based Games | Nov 8, 2022 | Decision MakingInstruction Following | CodeCode Available | 0 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Learning to Generalize for Sequential Decision Making | Oct 5, 2020 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game | Jul 17, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Dec 20, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| End-to-End Goal-Driven Web Navigation | Feb 6, 2016 | Decision MakingQuestion Answering | CodeCode Available | 0 |
| Enforcing Almost-Sure Reachability in POMDPs | Jun 30, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Solving Long-run Average Reward Robust MDPs via Stochastic Games | Dec 21, 2023 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Enhancing the Accuracy and Fairness of Human Decision Making | May 25, 2018 | Decision MakingFairness | CodeCode Available | 0 |
| Scalable Exploration via Ensemble++ | Jul 18, 2024 | Computational EfficiencyDecision Making | CodeCode Available | 0 |