| Representation Learning for Context-Dependent Decision-Making | May 12, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Hierarchical Constrained Stochastic Shortest Path Planning via Cost Budget Allocation | May 11, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Federated Multi-Armed Bandits Under Byzantine Attacks | May 9, 2022 | Data PoisoningDecision Making | —Unverified | 0 |
| Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | May 4, 2022 | Decision MakingGraph Embedding | CodeCode Available | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | Apr 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Apr 26, 2022 | Decision MakingEvolutionary Algorithms | CodeCode Available | 0 |
| Toward Policy Explanations for Multi-Agent Reinforcement Learning | Apr 26, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 0 |
| Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection | Apr 25, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| GFCL: A GRU-based Federated Continual Learning Framework against Data Poisoning Attacks in IoV | Apr 23, 2022 | Anomaly DetectionContinual Learning | —Unverified | 0 |
| SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics | Apr 20, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models | Apr 18, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations | Apr 10, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Achieving Long-Term Fairness in Sequential Decision Making | Apr 4, 2022 | Decision MakingFairness | CodeCode Available | 0 |
| Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes | Apr 1, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| An Online Approach to Solve the Dynamic Vehicle Routing Problem with Stochastic Trip Requests for Paratransit Services | Mar 28, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty | Mar 23, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects | Mar 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| The price of unfairness in linear bandits with biased feedback | Mar 18, 2022 | AttributeDecision Making | —Unverified | 0 |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | Mar 11, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| A Trainable Approach to Zero-delay Smoothing Spline Interpolation | Mar 7, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions | Mar 4, 2022 | Causal InferenceDecision Making | —Unverified | 0 |
| Linear Stochastic Bandits over a Bit-Constrained Channel | Mar 2, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Hierarchical Reinforcement Learning with AI Planning Models | Mar 1, 2022 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 0 |
| LISA: Learning Interpretable Skill Abstractions from Language | Feb 28, 2022 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Simulating Network Paths with Recurrent Buffering Units | Feb 23, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Survey of Explainable Reinforcement Learning | Feb 17, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| MuZero with Self-competition for Rate Control in VP9 Video Compression | Feb 14, 2022 | Decision MakingQuantization | —Unverified | 0 |
| Sequential Bayesian experimental designs via reinforcement learning | Feb 14, 2022 | Bayesian InferenceDecision Making | —Unverified | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Provable Reinforcement Learning with a Short-Term Memory | Feb 8, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Improved Regret for Differentially Private Exploration in Linear MDP | Feb 2, 2022 | Decision MakingPrivacy Preserving | —Unverified | 0 |
| Meta-Learning Hypothesis Spaces for Sequential Decision-making | Feb 1, 2022 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Jan 29, 2022 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 |
| Accelerate Model Parallel Training by Using Efficient Graph Traversal Order in Device Placement | Jan 21, 2022 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making | Jan 20, 2022 | counterfactualDecision Making | —Unverified | 0 |
| Active Learning-Based Multistage Sequential Decision-Making Model with Application on Common Bile Duct Stone Evaluation | Jan 13, 2022 | Active LearningDecision Making | —Unverified | 0 |
| Automated Reinforcement Learning: An Overview | Jan 13, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning | Jan 12, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Subgoal-Based Explanations for Unreliable Intelligent Decision Support Systems | Jan 11, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| State of the Art of User Simulation approaches for conversational information retrieval | Jan 10, 2022 | Decision MakingInformation Retrieval | —Unverified | 0 |
| Temporal Detection of Anomalies via Actor-Critic Based Controlled Sensing | Jan 3, 2022 | Anomaly DetectionDecision Making | —Unverified | 0 |
| Socially-Optimal Mechanism Design for Incentivized Online Learning | Dec 29, 2021 | Decision MakingEdge-computing | —Unverified | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision MakingGeneral Reinforcement Learning | —Unverified | 0 |
| A Survey on Interpretable Reinforcement Learning | Dec 24, 2021 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Revisiting Game Representations: The Hidden Costs of Efficiency in Sequential Decision-making Algorithms | Dec 20, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Differentially Private Regret Minimization in Episodic Markov Decision Processes | Dec 20, 2021 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Application of Deep Reinforcement Learning to Payment Fraud | Dec 8, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach | Dec 7, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 |
| MDPFuzz: Testing Models Solving Markov Decision Processes | Dec 6, 2021 | Autonomous DrivingCollision Avoidance | —Unverified | 0 |
| Temporal-Spatial Causal Interpretations for Vision-Based Reinforcement Learning | Dec 6, 2021 | Causal DiscoveryDecision Making | —Unverified | 0 |