| FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits | May 22, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Anderson Acceleration for Partially Observable Markov Decision Processes: A Maximum Entropy Approach | Nov 28, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Jun 6, 2023 | Causal InferenceDecision Making | CodeCode Available | 0 | 5 |
| Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging | Oct 29, 2018 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 | 5 |
| Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | May 4, 2022 | Decision MakingGraph Embedding | CodeCode Available | 0 | 5 |
| Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Apr 26, 2022 | Decision MakingEvolutionary Algorithms | CodeCode Available | 0 | 5 |
| Algorithms for Fairness in Sequential Decision Making | Jan 24, 2019 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Sep 7, 2023 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Nov 11, 2019 | BenchmarkingDecision Making | CodeCode Available | 0 | 5 |
| Enforcing Almost-Sure Reachability in POMDPs | Jun 30, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Fast reinforcement learning with generalized policy updates | Jul 9, 2020 | Decision MakingProblem Decomposition | CodeCode Available | 0 | 5 |
| Hindsight and Sequential Rationality of Correlated Play | Dec 10, 2020 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Efficient Sequence Labeling with Actor-Critic Training | Sep 30, 2018 | Decision MakingNER | CodeCode Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | May 24, 2022 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Nov 2, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dynamic Real-time Multimodal Routing with Hierarchical Hybrid Planning | Feb 5, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 | 5 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Feb 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement LearningSequential Decision Making | CodeCode Available | 0 | 5 |
| Batch Bayesian optimisation via density-ratio estimation with guarantees | Sep 22, 2022 | Bayesian InferenceBayesian Optimisation | CodeCode Available | 0 | 5 |
| Ecole: A Library for Learning Inside MILP Solvers | Apr 6, 2021 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 | 5 |
| Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | Nov 11, 2015 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Feb 13, 2021 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Doubly Inhomogeneous Reinforcement Learning | Nov 8, 2022 | Change Point DetectionClustering | CodeCode Available | 0 | 5 |
| Doubly Robust Policy Evaluation and Optimization | Mar 10, 2015 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 | 5 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| End-to-End Goal-Driven Web Navigation | Feb 6, 2016 | Decision MakingQuestion Answering | CodeCode Available | 0 | 5 |
| Enhancing the Accuracy and Fairness of Human Decision Making | May 25, 2018 | Decision MakingFairness | CodeCode Available | 0 | 5 |
| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Feb 13, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Jan 29, 2022 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 | 5 |
| Best Arm Identification for Stochastic Rising Bandits | Feb 15, 2023 | Decision MakingModel Selection | CodeCode Available | 0 | 5 |
| Scalable Exploration via Ensemble++ | Jul 18, 2024 | Computational EfficiencyDecision Making | CodeCode Available | 0 | 5 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Jan 16, 2025 | DiagnosticSequential Decision Making | CodeCode Available | 0 | 5 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Dec 20, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Federated Online Clustering of Bandits | Aug 31, 2022 | ClusteringDecision Making | CodeCode Available | 0 | 5 |
| Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes | Jan 3, 2024 | Decision MakingHeuristic Search | CodeCode Available | 0 | 5 |
| FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks | Jul 27, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Back to the Future -- Sequential Alignment of Text Representations | Sep 8, 2019 | Decision MakingRumour Detection | CodeCode Available | 0 | 5 |
| Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions | Jun 20, 2020 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| β-Multivariational Autoencoder for Entangled Representation Learning in Video Frames | Nov 22, 2022 | Decision MakingObject | CodeCode Available | 0 | 5 |
| Adaptive Sequence Submodularity | Feb 15, 2019 | Decision MakingLink Prediction | CodeCode Available | 0 | 5 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| AVID: Adapting Video Diffusion Models to World Models | Oct 1, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Reinforcement Learning applied to Insurance Portfolio Pursuit | Aug 1, 2024 | Decision Makingreinforcement-learning | CodeCode Available | 0 | 5 |
| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Adaptive teachers for amortized samplers | Oct 2, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 0 | 5 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning | Jul 1, 2019 | Decision MakingImage Captioning | CodeCode Available | 0 | 5 |
| Dynamical Linear Bandits | Nov 16, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |
| Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Jul 21, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Autoregressive Bandits | Dec 12, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 | 5 |