| Cooperative Online Learning with Feedback Graphs | Jun 9, 2021 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Probabilistic Constrained Reinforcement Learning with Formal Interpretability | Jul 13, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Hindsight and Sequential Rationality of Correlated Play | Dec 10, 2020 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Nov 13, 2024 | Decision MakingGPU | CodeCode Available | 0 | 5 |
| A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping | Sep 14, 2017 | Decision MakingImage Cropping | CodeCode Available | 0 | 5 |
| Harnessing the Power of Federated Learning in Federated Contextual Bandits | Dec 26, 2023 | Decision MakingFederated Learning | CodeCode Available | 0 | 5 |
| Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Jul 21, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Jun 6, 2023 | Causal InferenceDecision Making | CodeCode Available | 0 | 5 |
| Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions | Jun 20, 2020 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Fast reinforcement learning with generalized policy updates | Jul 9, 2020 | Decision MakingProblem Decomposition | CodeCode Available | 0 | 5 |
| Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games | Dec 1, 2024 | Atari GamesDecision Making | CodeCode Available | 0 | 5 |
| Federated Online Clustering of Bandits | Aug 31, 2022 | ClusteringDecision Making | CodeCode Available | 0 | 5 |
| Generalization to New Sequential Decision Making Tasks with In-Context Learning | Dec 6, 2023 | Decision MakingDiversity | CodeCode Available | 0 | 5 |
| Imitation Learning from Purified Demonstrations | Oct 11, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | May 20, 2024 | Atari GamesMamba | CodeCode Available | 0 | 5 |
| Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Sep 7, 2023 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Feb 13, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Apr 26, 2022 | Decision MakingEvolutionary Algorithms | CodeCode Available | 0 | 5 |
| End-to-End Goal-Driven Web Navigation | Feb 6, 2016 | Decision MakingQuestion Answering | CodeCode Available | 0 | 5 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Enforcing Almost-Sure Reachability in POMDPs | Jun 30, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | May 4, 2022 | Decision MakingGraph Embedding | CodeCode Available | 0 | 5 |
| Ecole: A Library for Learning Inside MILP Solvers | Apr 6, 2021 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 | 5 |
| Enhancing Heterogeneous Multi-Agent Cooperation in Decentralized MARL via GNN-driven Intrinsic Rewards | Aug 12, 2024 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dynamic Real-time Multimodal Routing with Hierarchical Hybrid Planning | Feb 5, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 | 5 |
| Hierarchical Reinforcement Learning with AI Planning Models | Mar 1, 2022 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Feb 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Feb 13, 2021 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Efficient Sequence Labeling with Actor-Critic Training | Sep 30, 2018 | Decision MakingNER | CodeCode Available | 0 | 5 |
| Data Mixture Optimization: A Multi-fidelity Multi-scale Bayesian Framework | Mar 26, 2025 | Bayesian OptimizationSequential Decision Making | CodeCode Available | 0 | 5 |
| Data Generation as Sequential Decision Making | Jun 10, 2015 | Decision MakingImputation | CodeCode Available | 0 | 5 |
| Enhancing the Accuracy and Fairness of Human Decision Making | May 25, 2018 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Decision Making in Non-Stationary Environments with Policy-Augmented Search | Jan 6, 2024 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 | 5 |
| TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning | Jun 11, 2025 | Deep Reinforcement LearningSequential Decision Making | CodeCode Available | 0 | 5 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | May 24, 2022 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Jan 16, 2025 | DiagnosticSequential Decision Making | CodeCode Available | 0 | 5 |
| Algorithms for Fairness in Sequential Decision Making | Jan 24, 2019 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| AutoGMap: Learning to Map Large-scale Sparse Graphs on Memristive Crossbars | Nov 15, 2021 | CPUDecision Making | CodeCode Available | 0 | 5 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Doubly Inhomogeneous Reinforcement Learning | Nov 8, 2022 | Change Point DetectionClustering | CodeCode Available | 0 | 5 |
| FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits | May 22, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| "Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations | Jun 5, 2024 | Active LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging | Oct 29, 2018 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 | 5 |
| Deep Q-Network for Angry Birds | Oct 4, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Deep Reinforcement Learning Algorithms for Option Hedging | Apr 7, 2025 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement Learning | Jun 20, 2019 | Autonomous DrivingDecision Making | CodeCode Available | 0 | 5 |