| Operator World Models for Reinforcement Learning | Jun 28, 2024 | Decision Making, reinforcement-learning | Code Available | 0 |
| Curriculum Design for Teaching via Demonstrations: Theory and Applications | Jun 8, 2021 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| Depth Matters: Multimodal RGB-D Perception for Robust Autonomous Agents | Mar 20, 2025 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions | Jun 20, 2020 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Making Universal Policies Universal | Feb 20, 2025 | Imitation Learning, Sequential Decision Making | Code Available | 0 |
| Optimal Control of Mechanical Ventilators with Learned Respiratory Dynamics | Nov 12, 2024 | Decision Making, Management | Code Available | 0 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision Making, Fairness | Code Available | 0 |
| Back to the Future -- Sequential Alignment of Text Representations | Sep 8, 2019 | Decision Making, Rumour Detection | Code Available | 0 |
| Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes | Feb 27, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Oct 16, 2024 | Attribute, counterfactual | Code Available | 0 |
| Co-training for Policy Learning | Jul 3, 2019 | Combinatorial Optimization, continuous-control | Code Available | 0 |
| Value Gradient Sampler: Sampling as Sequential Decision Making | Feb 18, 2025 | Anomaly Detection, Decision Making | Code Available | 0 |
| A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement Learning | Jun 20, 2019 | Autonomous Driving, Decision Making | Code Available | 0 |
| Reinforcement Learning When All Actions are Not Always Available | Jun 5, 2019 | All, Decision Making | Code Available | 0 |
| Autoregressive Bandits | Dec 12, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search | May 20, 2024 | Clustering, Sequential Decision Making | Code Available | 0 |
| The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision Making | Jan 21, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Vertical Symbolic Regression via Deep Policy Gradient | Feb 1, 2024 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Nov 16, 2020 | Decision Making, Efficient Exploration | Code Available | 0 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision Making, MuJoCo | Code Available | 0 |
| "Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations | Jun 5, 2024 | Active Learning, Reinforcement Learning (RL) | Code Available | 0 |
| Reinforcement Learning applied to Insurance Portfolio Pursuit | Aug 1, 2024 | Decision Making, reinforcement-learning | Code Available | 0 |
| ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models | Oct 22, 2021 | counterfactual, Decision Making | Code Available | 0 |
| UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-Making | Jun 20, 2025 | Decision Making, Question Answering | Code Available | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | Classification, Classification with Costly Features | Code Available | 0 |
| A Deep Reinforcement Learning Framework For Column Generation | Jun 3, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| PageRank Bandits for Link Prediction | Nov 3, 2024 | Decision Making, Graph Learning | Code Available | 0 |
| Zero-Shot Reinforcement Learning via Function Encoders | Jan 30, 2024 | Decision Making, reinforcement-learning | Code Available | 0 |
| Parameterized Projected Bellman Operator | Dec 20, 2023 | Decision Making, Reinforcement Learning (RL) | Code Available | 0 |
| Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making | Dec 8, 2023 | Decision Making, Fairness | Code Available | 0 |
| Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning | Apr 29, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Structural Causal Bandits: Where to Intervene? | Dec 1, 2018 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Minimax-Bayes Reinforcement Learning | Feb 21, 2023 | Decision Making, Decision Making Under Uncertainty | Code Available | 0 |
| The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human Priors | Apr 22, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Harnessing the Power of Federated Learning in Federated Contextual Bandits | Dec 26, 2023 | Decision Making, Federated Learning | Code Available | 0 |
| A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping | Sep 14, 2017 | Decision Making, Image Cropping | Code Available | 0 |
| Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging | Oct 29, 2018 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Cooperative Online Learning with Feedback Graphs | Jun 9, 2021 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian Adaptation | Mar 24, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Continuous Monte Carlo Graph Search | Oct 4, 2022 | continuous-control, Continuous Control | Code Available | 0 |
| Structured Control Nets for Deep Reinforcement Learning | Feb 22, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Classification with Costly Features as a Sequential Decision-Making Problem | Sep 5, 2019 | Classification, Classification with Costly Features | Code Available | 0 |
| Adaptive teachers for amortized samplers | Oct 2, 2024 | Decision Making, Efficient Exploration | Code Available | 0 |
| Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot Navigation | Mar 26, 2025 | Decision Making, Model-based Reinforcement Learning | Code Available | 0 |
| Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection | Apr 10, 2024 | Out-of-Distribution Detection, Out of Distribution (OOD) Detection | Code Available | 0 |
| Towards Safe Policy Improvement for Non-Stationary MDPs | Oct 23, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| Planning with Goal-Conditioned Policies | Nov 19, 2019 | Decision Making, reinforcement-learning | Code Available | 0 |
| Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant | Feb 3, 2025 | Sequential Decision Making | Code Available | 0 |
| Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning | Jan 20, 2023 | Decision Making, model | Code Available | 0 |