| A Deep Reinforcement Learning Framework For Column Generation | Jun 3, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Adaptive Robust Online Portfolio Selection | Jun 2, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Jun 1, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Robust Anytime Learning of Markov Decision Processes | May 31, 2022 | Bayesian InferenceDecision Making | CodeCode Available | 0 |
| Multi-Agent Learning of Numerical Methods for Hyperbolic PDEs with Factored Dec-MDP | May 31, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Causal Explanations for Sequential Decision Making Under Uncertainty | May 30, 2022 | Causal InferenceDecision Making | —Unverified | 0 |
| Adaptive Sampling for Discovery | May 30, 2022 | Decision MakingDrug Discovery | —Unverified | 0 |
| Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning | May 30, 2022 | Decision MakingInductive Bias | CodeCode Available | 0 |
| Multi-Agent Reinforcement Learning is a Sequence Modeling Problem | May 30, 2022 | Decision MakingMuJoCo | CodeCode Available | 2 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | May 24, 2022 | counterfactualDecision Making | CodeCode Available | 0 |
| Flow-based Recurrent Belief State Learning for POMDPs | May 23, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Survey on Fair Reinforcement Learning: Theory and Practice | May 20, 2022 | ArticlesDecision Making | —Unverified | 0 |
| Marginal and Joint Cross-Entropies & Predictives for Online Bayesian Inference, Active Learning, and Active Sampling | May 18, 2022 | Active LearningBayesian Inference | —Unverified | 0 |
| Representation Learning for Context-Dependent Decision-Making | May 12, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Hierarchical Constrained Stochastic Shortest Path Planning via Cost Budget Allocation | May 11, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Federated Multi-Armed Bandits Under Byzantine Attacks | May 9, 2022 | Data PoisoningDecision Making | —Unverified | 0 |
| Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | May 4, 2022 | Decision MakingGraph Embedding | CodeCode Available | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | Apr 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Apr 26, 2022 | Decision MakingEvolutionary Algorithms | CodeCode Available | 0 |
| Toward Policy Explanations for Multi-Agent Reinforcement Learning | Apr 26, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 0 |
| Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection | Apr 25, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| GFCL: A GRU-based Federated Continual Learning Framework against Data Poisoning Attacks in IoV | Apr 23, 2022 | Anomaly DetectionContinual Learning | —Unverified | 0 |
| Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply Chains | Apr 20, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics | Apr 20, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models | Apr 18, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations | Apr 10, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision MakingFairness | CodeCode Available | 0 |
| Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes | Apr 1, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services | Mar 28, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty | Mar 23, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects | Mar 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| The Sandbox Environment for Generalizable Agent Research (SEGAR) | Mar 19, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| The price of unfairness in linear bandits with biased feedback | Mar 18, 2022 | AttributeDecision Making | —Unverified | 0 |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | Mar 11, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Curriculum-based Reinforcement Learning for Distribution System Critical Load Restoration | Mar 8, 2022 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| A Trainable Approach to Zero-delay Smoothing Spline Interpolation | Mar 7, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions | Mar 4, 2022 | Causal InferenceDecision Making | —Unverified | 0 |
| Linear Stochastic Bandits over a Bit-Constrained Channel | Mar 2, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Hierarchical Reinforcement Learning with AI Planning Models | Mar 1, 2022 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 0 |
| LISA: Learning Interpretable Skill Abstractions from Language | Feb 28, 2022 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Simulating Network Paths with Recurrent Buffering Units | Feb 23, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Survey of Explainable Reinforcement Learning | Feb 17, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Sequential Bayesian experimental designs via reinforcement learning | Feb 14, 2022 | Bayesian InferenceDecision Making | —Unverified | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| MuZero with Self-competition for Rate Control in VP9 Video Compression | Feb 14, 2022 | Decision MakingQuantization | —Unverified | 0 |
| Provable Reinforcement Learning with a Short-Term Memory | Feb 8, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Pre-Trained Language Models for Interactive Decision-Making | Feb 3, 2022 | Decision MakingImitation Learning | CodeCode Available | 2 |
| Improved Regret for Differentially Private Exploration in Linear MDP | Feb 2, 2022 | Decision MakingPrivacy Preserving | —Unverified | 0 |
| Meta-Learning Hypothesis Spaces for Sequential Decision-making | Feb 1, 2022 | Bayesian OptimizationDecision Making | —Unverified | 0 |